What is PdfItDown?
PdfItDown is a Python package that relies on markitdown by Microsoft, markdown_pdf, and img2pdf to convert text-based files and images to PDF. PdfItDown supports the following file formats:
- Markdown
- PowerPoint
- Word
- Excel
- HTML
- Text-based formats (CSV, XML, JSON)
- ZIP files (iterates over contents)
- Image files (PNG, JPG)
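
To make the division of labour between the three libraries concrete, here is a minimal sketch of how a file could be routed by its extension. It is not PdfItDown's actual code or API: the function name `pick_backend`, the extension sets, and the returned labels are all hypothetical, and the routing itself (images to img2pdf, Markdown straight to markdown_pdf, everything else through markitdown first) is an assumption inferred from the description above.

```python
from pathlib import Path

# Hypothetical extension sets, derived from the format list above.
# Including ".jpeg" alongside ".jpg" is an assumption.
IMAGE_EXTS = {".png", ".jpg", ".jpeg"}
MARKDOWN_EXTS = {".md"}
CONVERTIBLE_EXTS = {".pptx", ".docx", ".xlsx", ".html", ".csv", ".xml", ".json", ".zip"}


def pick_backend(path: str) -> str:
    """Return a label for the library chain assumed to handle `path`.

    This is an illustrative helper, not part of PdfItDown.
    """
    ext = Path(path).suffix.lower()  # case-insensitive match on the extension
    if ext in IMAGE_EXTS:
        # Assumed: images are wrapped into a PDF directly.
        return "img2pdf"
    if ext in MARKDOWN_EXTS:
        # Assumed: Markdown needs no intermediate conversion.
        return "markdown_pdf"
    if ext in CONVERTIBLE_EXTS:
        # Assumed: other formats are first turned into Markdown, then rendered.
        return "markitdown -> markdown_pdf"
    raise ValueError(f"Unsupported file type: {ext or '(no extension)'}")


if __name__ == "__main__":
    for name in ["slides.pptx", "notes.md", "photo.PNG"]:
        print(name, "=>", pick_backend(name))
```

Keeping the routing in a single lookup like this means supporting a new format only requires adding its extension to one of the sets, provided the underlying library can read it.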