Tika (2.x) unable to detect CSV correctly for Excel output format (semicolon separated)
I’m trying to integrate TIKA to detect file types in content management system.
Unfortunately, it fails to detect CSV format.
I’ve inspected it detailed, and it seems, it can detect CSV if separator is coma or tab, but it fails for semicolon (which unfortunately is Excel ‘standard’):