Back to Docling

Data Prep Kit

docs/integrations/data_prep_kit.md

2.92.0740 B
Original Source

Docling is used by the Data Prep Kit open-source toolkit for preparing unstructured data for LLM application development ranging from laptop scale to datacenter scale.

Components

PDF ingestion to Parquet

Document chunking