Skip to content

Add text extraction with Docling #814

@davidmezzetti

Description

@davidmezzetti

Docling looks like a promising text extraction library that could possibly augment or replace Apache Tika.

Update: Docling added 3.9 support, this is a go!
The main integration issue is that it only supports Python 3.10+.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions