I was using tabula-py to extract tabular data from a PDF and store it in .csv files, but it fails to understand the structure of the table.

(Image: screenshot of the PDF used as the dataset)
(Image: real structure of the table)
(Image: data extracted using tabula)

The structure extracted by the tabula module does not match the real structure of the table.
Also, if you look at the screenshot of the PDF, you will find the college name and course name, which sit outside the table structure. I was able to extract those and store them in a .csv file easily, but that was done using PyPDF2 rather than tabula.
I want to create a .csv file with one row per course and the columns College name, Course name, GOPENS cutoff,……. for all categories.
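To make the target layout concrete, here is a minimal sketch of the .csv I am after, using only the standard library. It assumes the college name, course name, and per-category cutoffs have already been extracted somehow. The function names, the record layout, the `CATEGORY_2` column, and all values are placeholders for illustration, not data from the PDF.

```python
import csv
import io


def flatten_records(records, categories):
    """Turn nested (college, course, cutoffs) records into flat CSV rows."""
    header = ["College name", "Course name"] + [f"{c} cutoff" for c in categories]
    rows = [header]
    for rec in records:
        cutoffs = rec["cutoffs"]
        # A category missing for a course becomes an empty cell.
        rows.append(
            [rec["college"], rec["course"]] + [cutoffs.get(c, "") for c in categories]
        )
    return rows


def rows_to_csv(rows):
    """Serialize rows to CSV text (swap the buffer for open(path, 'w', newline='') to write a file)."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


# Placeholder input: hypothetical names and values, not taken from the PDF.
records = [
    {
        "college": "Example College",
        "course": "Example Course",
        "cutoffs": {"GOPENS": "90.5"},
    }
]
categories = ["GOPENS", "CATEGORY_2"]

csv_text = rows_to_csv(flatten_records(records, categories))
print(csv_text)
```

The part I am stuck on is producing the `cutoffs` mapping reliably from the table. tabula-py's `read_pdf` does accept `lattice=True` and `stream=True`, which might be worth trying, but I have not confirmed that either one recovers this table's structure.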