Can I tell textract to isolate runs rather than paragraphs?
I’m extracting information from DOCX documents via Python, and sometimes I have to take into account the layout of textual content as it appears in MS Word.
I’m extracting information from DOCX documents via Python, and sometimes I have to take into account the layout of textual content as it appears in MS Word.