poi

Apache POI - the Java library for reading and writing Microsoft Office formats (DOCX, XLSX, PPTX, and their legacy equivalents DOC, XLS, PPT); the library Apache Tika delegates to internally when parsing Office documents
apache, parsing, office
tika