What is PdfItDown?
PdfItDown is a Python package that relies on markitdown by Microsoft, markdown_pdf, and img2pdf to convert text-based files, images, and unstructured documents to PDF. It supports the following file formats:
- Markdown
- PowerPoint
- Word
- Excel
- HTML
- Text-based formats (CSV, XML, JSON)
- ZIP files (iterates over contents)
- Image files (PNG, JPG)
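
To make the division of labor between the three libraries concrete, here is a minimal sketch of how a file might be routed by extension. It is not PdfItDown's actual code: the function name `pick_backend`, the suffix sets, and the routing itself are assumptions made for illustration (images to img2pdf, Markdown straight to markdown_pdf, everything else through markitdown first).

```python
from pathlib import Path

# Hypothetical suffix groups, for illustration only.
# They mirror the format list above, not PdfItDown's real internals.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}
MARKDOWN_SUFFIXES = {".md", ".markdown"}
DOCUMENT_SUFFIXES = {".pptx", ".docx", ".xlsx", ".html", ".csv", ".xml", ".json", ".zip"}


def pick_backend(path: str) -> str:
    """Return a label for the library chain assumed to handle this file."""
    suffix = Path(path).suffix.lower()  # compare case-insensitively
    if suffix in IMAGE_SUFFIXES:
        # Assumption: images are embedded into a PDF directly.
        return "img2pdf"
    if suffix in MARKDOWN_SUFFIXES:
        # Assumption: Markdown needs no intermediate conversion.
        return "markdown_pdf"
    if suffix in DOCUMENT_SUFFIXES:
        # Assumption: other formats are first turned into Markdown.
        return "markitdown -> markdown_pdf"
    raise ValueError(f"Unsupported file format: {suffix or path}")


if __name__ == "__main__":
    for name in ("slides.pptx", "notes.md", "photo.PNG"):
        print(name, "=>", pick_backend(name))
```

The function only returns a label, so it runs without any of the three libraries installed; a real converter would call into them at each branch.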