I have a task that involves extracting specific values from a TMT PNG image of a table. Depending on the required output, I need to extract either a specific value from a table cell or some text from a report within the image.
Can you suggest the existing machine learning models and any libraries capable of extracting text, numbers, or special symbols from a specific location within an image and outputting them into an Excel file? Additionally, please break down the procedure into step-by-step tasks to achieve the desired output.
Currently, I have started working on libraries,
Oytesseract, opencv-python-headless, Pandas, Pillow, and Openpyxl.
!sudo apt-get install tesseract-ocr
And I need to know if there are any ways to explore, and what I should learn to achieve those new methods, and to solve this problem explain it to me so I can do research on it.
Pranay Ganagalla is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.