Some PowerPoint documents fail to parse
Some of my PowerPoint documents fail to parse and are reported as type "not supported". Therefore, I have added some additional entries to content_type_to_keep, files_to_keep, and files_to_ommit.
content_type_to_keep:
'application/vnd.openxmlformats-officedocument.presentationml.presentation.main+xml',
'application/vnd.openxmlformats-officedocument.presentationml.presProps+xml',
'application/vnd.openxmlformats-officedocument.presentationml.presentation.slideLayout+xml',
'application/vnd.openxmlformats-officedocument.presentationml.tableStyles+xml',
'application/vnd.openxmlformats-officedocument.presentationml.viewProps+xml',
'application/vnd.openxmlformats-officedocument.theme+xml',
'application/vnd.openxmlformats-officedocument.presentationml.slideLayout+xml',
'application/vnd.openxmlformats-officedocument.presentationml.tags+xml',
'application/vnd.openxmlformats-officedocument.presentationml.slide+xml',
'application/vnd.openxmlformats-officedocument.presentationml.slideMaster+xml',
'application/vnd.openxmlformats-officedocument.presentationml.notesMaster+xml',
'application/vnd.openxmlformats-officedocument.presentationml.handoutMaster+xml',
'application/vnd.openxmlformats-officedocument.presentationml.notesSlide+xml',
'application/vnd.openxmlformats-officedocument.presentationml.Relationships+xml',
'application/vnd.openxmlformats-officedocument.presentationml.docProps+xml',
files_to_keep:
r'^ppt/slideLayouts/_rels/slideLayout[0-9]*\.xml\.rels$',
r'^ppt/slideMasters/_rels/slideMaster[0-9]*\.xml\.rels$',
r'^ppt/slides/_rels/slide[0-9]*\.xml\.rels$',
r'^ppt/notesSlides/_rels/notesSlide[0-9]*\.xml\.rels$',
r'^ppt/handoutMasters/_rels/handoutMaster[0-9]*\.xml\.rels$',
r'^ppt/notesMasters/_rels/notesMaster[0-9]*\.xml\.rels$',
r'^ppt/_rels/presentation\.xml\.rels$',
files_to_ommit:
r'^ppt/_rels/', r'^ppt/printerSettings/',
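
To check how these patterns interact, here is a minimal standalone sketch that applies a subset of them to archive member names. It is not the tool's own code: the helper names classify and matches_both are hypothetical, and it assumes that keep patterns are checked before omit patterns, which I have not verified. It shows that ppt/_rels/presentation.xml.rels matches both a keep pattern and the broader omit pattern r'^ppt/_rels/', so the result for that file depends on the order in which the lists are evaluated.

```python
import re

# Subset of the patterns listed above, compiled once.
files_to_keep = [re.compile(p) for p in (
    r'^ppt/slides/_rels/slide[0-9]*\.xml\.rels$',
    r'^ppt/_rels/presentation\.xml\.rels$',
)]
files_to_ommit = [re.compile(p) for p in (
    r'^ppt/_rels/',
    r'^ppt/printerSettings/',
)]


def classify(name):
    """Hypothetical helper: decide what happens to one archive member.

    Assumption: keep patterns take precedence over omit patterns.
    """
    if any(p.search(name) for p in files_to_keep):
        return 'keep'
    if any(p.search(name) for p in files_to_ommit):
        return 'omit'
    return 'unknown'


def matches_both(name):
    """Hypothetical helper: True if a name is matched by both lists."""
    kept = any(p.search(name) for p in files_to_keep)
    omitted = any(p.search(name) for p in files_to_ommit)
    return kept and omitted


if __name__ == '__main__':
    for member in (
        'ppt/_rels/presentation.xml.rels',
        'ppt/printerSettings/printerSettings1.bin',
        'ppt/slides/slide1.xml',
    ):
        print(member, classify(member), matches_both(member))
```

If the real evaluation order is the reverse, the r'^ppt/_rels/presentation\.xml\.rels$' entry in files_to_keep would have no effect.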
Is this the correct way to add additional "parsing" of PowerPoint documents?