Convert between 8 formats (PDF, DOCX, PPTX, XLSX, TXT, CSV, MD, HTML). Best-effort text extraction, batch processing, and document format transformation.
Run best-effort extraction and rebuild workflows across common document formats. Preserve clean structure, not pixel-perfect layout.
pdf
docx
pptx
xlsx
txt
csv
md
html
scripts/convert.py
scripts/batch_convert.py
scripts/pdf_toolkit.py
scripts/table_extractor.py
scripts/form_filler.py
references/conversion_matrix.md
references/limitations.md