Data extraction from an image of a .pdf file using Python in Colab
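
One common way to approach this, offered as a minimal sketch and not a definitive solution, is to rasterize each PDF page to an image and run OCR on it. The sketch assumes the `pdf2image` and `pytesseract` packages (with the Poppler and Tesseract system tools) are installed in the Colab runtime. The file name `scanned.pdf` and the function names `extract_text_from_scanned_pdf` and `nonempty_lines` are hypothetical, chosen only for illustration.

```python
def extract_text_from_scanned_pdf(pdf_path, dpi=300, lang="eng"):
    """Rasterize every page of a PDF and OCR it; returns one string per page."""
    # Imported lazily so the text helper below works without the OCR stack.
    # Assumption: pdf2image (needs Poppler) and pytesseract (needs Tesseract)
    # are installed in the runtime.
    from pdf2image import convert_from_path
    import pytesseract

    pages = convert_from_path(pdf_path, dpi=dpi)  # list of PIL images, one per page
    return [pytesseract.image_to_string(page, lang=lang) for page in pages]


def nonempty_lines(text):
    """Split OCR output into stripped lines, dropping blank ones."""
    return [line.strip() for line in text.splitlines() if line.strip()]


# Hypothetical usage in a Colab cell (the path is a placeholder):
# for page_text in extract_text_from_scanned_pdf("scanned.pdf"):
#     print(nonempty_lines(page_text))
```

In Colab, the dependencies can typically be installed first with `!apt-get install -y poppler-utils tesseract-ocr` followed by `!pip install pdf2image pytesseract`. A higher `dpi` usually improves recognition of small print at the cost of slower processing.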