Skip to main content
Hero Light

What is PdfItDown?

PdfItDown is a python package that relies on markitdown by Microsoft, markdown_pdf and img2pdf to carry out the conversion of text-based files, images and unstructured documents to PDF. PdfItDown is applicable to the following file formats:
  • Markdown
  • PowerPoint
  • Word
  • Excel
  • HTML
  • Text-based formats (CSV, XML, JSON)
  • ZIP files (iterates over contents)
  • Image files (PNG, JPG)

Setting up

To set uo PdfItDown, it is good practice to create an isolated development environment
# you can use all python versions above 3.10 and below 3.14
uv venv -p 3.13
uv add pdfitdown

Choose how to use it

Once you have PdfItDown set up, you can choose how to use it: