For a project, I want to extract the text from a PDF in a Java application on the computer and share it with an Android application. I tried converting the whole PDF to JSON, but if the file contains images the resulting JSON becomes very large. When I tried extracting only the text from the PDF instead, it did not lead to the desired output. I used Apache PDFBox and the PDF.co library and API.
I have written several packages (PDF to text, PDF to JSON, and even an image extractor). What I want is this: if the PDF contains images, the conversion should automatically skip them and convert only the text to JSON.
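To make the goal concrete, here is a minimal sketch of the output I am aiming for. It assumes the per-page text has already been extracted, for example with PDFBox's `PDFTextStripper`, which as far as I understand returns only text and no image data. The class name `PdfTextJson`, the method names, and the `pages`/`page`/`text` JSON layout are my own placeholders, not part of any library.

```java
import java.util.List;

// Hypothetical helper: turns already-extracted page text into a small JSON document.
// The extraction itself is assumed to happen elsewhere, e.g. with PDFBox:
//   PDFTextStripper stripper = new PDFTextStripper();
//   stripper.setStartPage(n); stripper.setEndPage(n);
//   String pageText = stripper.getText(document);
final class PdfTextJson {

    // Escape a string so it can be embedded as a JSON string value.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n");  break;
                case '\r': sb.append("\\r");  break;
                case '\t': sb.append("\\t");  break;
                default:
                    if (c < 0x20) {
                        // Remaining control characters must be written as \\uXXXX.
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.toString();
    }

    // Build {"pages":[{"page":1,"text":"..."}, ...]} from one string per page.
    static String toJson(List<String> pageTexts) {
        StringBuilder sb = new StringBuilder("{\"pages\":[");
        for (int i = 0; i < pageTexts.size(); i++) {
            if (i > 0) {
                sb.append(',');
            }
            sb.append("{\"page\":").append(i + 1)
              .append(",\"text\":\"").append(escape(pageTexts.get(i)))
              .append("\"}");
        }
        return sb.append("]}").toString();
    }
}
```

Since only strings go into the JSON, its size depends on the amount of text and not on any images in the PDF. In a real project a JSON library would probably be a better choice than hand-written escaping.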