Docling
IBM's document conversion tool for AI pipelines
β15,000
Data ToolsFree (open-source)
About Docling
Docling by IBM Research converts documents (PDFs, DOCX, PPTX, images) into clean, structured formats optimized for AI. It handles tables, OCR, and complex layouts β designed for RAG pipelines.
Features
β¦PDF conversion
β¦Table extraction
β¦OCR
β¦Markdown output
β¦LlamaIndex integration
Pros & Cons
Pros
- +Excellent PDF parsing
- +Table extraction
- +OCR capability
- +IBM Research quality
- +LlamaIndex integration
Cons
- βHeavy dependencies
- βCan be slow on large docs
- βPython only
- βComplex output format
Platforms
LinuxmacOSWindows
Tags
Similar Tools
Qdrant
High-performance vector database for AI applications
Free (open-source) + CloudFirecrawl
Turn websites into LLM-ready markdown or structured data
Free (open-source) + CloudChromaDB
Open-source embedding database for AI applications
Free (open-source)Weaviate
Open-source vector database with built-in AI modules
Free (open-source) + CloudNeed help choosing?
Compare Docling with alternatives side by side
Compare Tools β