Improve data onboarding speed:  `ipfs add` and `ipfs dag import|export`

### Checklist

- [X] My issue is specific & actionable.
- [X] I am not suggesting a protocol enhancement.
- [X] I have searched on the [issue tracker](https://github.com/ipfs/kubo/issues?q=is%3Aissue) for my issue.

### Description

This is a followup on user @endomorphosis's [comment](https://github.com/filecoin-project/community/discussions/696#discussioncomment-8967972) in the Filecoin community discussions about IPFS hashing being slow.

> I noticed that when trying to index large ML models that the IPFS daemon hashing seems to be single threaded, and therefore somewhat slow when indexing large files. If this is funded, it is my hope that someone in your org can try to create a new spec, to parallelize the hashing of large files.

Per @lidel in an ipfs-steering conversation on 2 April 2024:
<blockquote>
In my mind this is not about inventing new hashing specifications, this is about making the most popular implementation majority of ecosystem uses for data onboarding (Kubo) better. My translation:

_the IPFS daemon hashing [..] slow when indexing large files_
→ Kubo's commands like  `ipfs add`  are not as fast as they "should be", when comparing with `sha256sum` over the number of chunks

_parallelize the hashing of large files._
→ improve implementation, make core commands like `ipfs dag import|export` and  `ipfs add` as fast as possible (we know they are not)

Once we have reference implementation, we can add some rules of thumb how to implement UnixFs hashing and chunking to "notes for implementers" section of [wip Unix specification](https://github.com/ipfs/specs/pull/331).</blockquote>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Improve data onboarding speed: `ipfs add` and `ipfs dag import|export` #10383

Checklist

Description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Improve data onboarding speed: ipfs add and ipfs dag import|export #10383

Description

Checklist

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Improve data onboarding speed: `ipfs add` and `ipfs dag import|export` #10383