ActivePythonMIT

llm-ocr

Convert PDFs to clean markdown using LLMs with vision capabilities. No traditional OCR, just send pages to a vision model and get structured output.

$ pip install llm-ocr

Source Code PyPI

About

Traditional OCR struggles with mixed layouts, tables, and mathematical notation. This tool renders PDF pages as images and uses vision-capable LLMs to produce clean, properly formatted markdown. Works especially well for academic papers and technical docs.

0Stars

0Forks

4Technologies

Tech Stack

PythonLLMVisionPDF