Docling

IBM's document conversion tool for AI pipelines

⭐15,000
Data ToolsFree (open-source)

About Docling

Docling by IBM Research converts documents (PDFs, DOCX, PPTX, images) into clean, structured formats optimized for AI. It handles tables, OCR, and complex layouts β€” designed for RAG pipelines.

Features

✦PDF conversion
✦Table extraction
✦OCR
✦Markdown output
✦LlamaIndex integration

Pros & Cons

Pros

  • +Excellent PDF parsing
  • +Table extraction
  • +OCR capability
  • +IBM Research quality
  • +LlamaIndex integration

Cons

  • βˆ’Heavy dependencies
  • βˆ’Can be slow on large docs
  • βˆ’Python only
  • βˆ’Complex output format

Platforms

LinuxmacOSWindows

Tags

Similar Tools

Need help choosing?

Compare Docling with alternatives side by side

Compare Tools β†’