← Back to projects
ActivePythonMIT

llm-ocr

Convert PDFs to clean markdown using LLMs with vision capabilities. No traditional OCR — just send pages to a vision model and get structured output.

$ pip install llm-ocr

About

Traditional OCR struggles with mixed layouts, tables, and mathematical notation. This tool renders PDF pages as images and uses vision-capable LLMs to produce clean, properly formatted markdown. Works especially well for academic papers and technical docs.

0Stars
0Forks
4Technologies

Tech Stack

PythonLLMVisionPDF