-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Open
0 / 60 of 6 issues completedLabels
Description
Search before asking
- I had searched in the feature and found no similar feature requirement.
Description
As a multimodal data integration tool, we hope that SeaTunnel can support parsing complex file types, converting their contents into structured file streams, and ultimately writing them into a vector library through embedding. This issue tracks related tasks.
For chunking please refer
Please refer https://docs.dify.ai/en/guides/knowledge-base/create-knowledge-and-upload-documents/chunking-and-cleaning-text
and
https://docs.llamaindex.ai/en/stable/examples/node_parsers/semantic_chunking/
Usage Scenario
No response
Related issues
No response
Are you willing to submit a PR?
- Yes I am willing to submit a PR!
Code of Conduct
- I agree to follow this project's Code of Conduct
liugddx, zhangshenghang, davidzollo and loupipalien
Sub-issues
Metadata
Metadata
Assignees
Labels
Type
Projects
Status
Todo