-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Comparing changes
Open a pull request
base repository: huggingface/datasets
base: 3.3.2
head repository: huggingface/datasets
compare: 3.4.0
- 11 commits
- 75 files changed
- 6 contributors
Commits on Feb 20, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 14233c0 - Browse repository at this point
Copy the full SHA 14233c0View commit details
Commits on Mar 4, 2025
-
fix: None default with bool type on load creates typing error (#7426)
* fix typing on load * fix docstring
Configuration menu - View commit details
-
Copy full SHA for 6631dc0 - Browse repository at this point
Copy the full SHA 6631dc0View commit details
Commits on Mar 5, 2025
-
Use pyupgrade --py39-plus (#7428)
* Use pyupgrade --py39-plus * Make style
Configuration menu - View commit details
-
Copy full SHA for 26379d5 - Browse repository at this point
Copy the full SHA 26379d5View commit details -
Faster folder based builder + parquet support + allow repeated media …
…+ use torchvideo (#7424) * faster folder based builder + parquet support + allow repeated media * add _visit_with_path in features * support image/audio/video in nested data * docs * use filters even without metadata * minor * replace decord by torchcodec * switch to torchvision * update video docs * minor * fix tests * fix tests * fix tests * better webdataset docs * style * fix
Configuration menu - View commit details
-
Copy full SHA for 5c8869f - Browse repository at this point
Copy the full SHA 5c8869fView commit details
Commits on Mar 7, 2025
-
Add with_split to DatasetDict.map (#7368)
* Add: with_split * Add: support for 'with_split' parameter in DatasetDict.map method * Refactor: simplify dataset mapping in DatasetDict * Refactor: DatasetDict to bind function with split parameter * rm breakpoint * Enhance DatasetDict and IterableDatasetDict to support function binding with split parameter * Add: unbind * fix ci --------- Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for f693f4e - Browse repository at this point
Copy the full SHA f693f4eView commit details
Commits on Mar 12, 2025
-
Refactor
string_to_dict
to returnNone
if there is no match inste……ad of raising `ValueError` (#7435) * Refactor string_to_dict to return None if there is no match instead of raising ValueError instead of having the pattern of using try-except to handle when there is no match, we can instead check if the return value is None; we can also assert that the return value should not be None if we know that should be true * Allow for source_url_fields to be None they can be local file paths here https://github.com/huggingface/datasets/actions/runs/13683185040/job/38380924390?pr=7435#step:10:9731
Matthew Hoffman authoredMar 12, 2025 Configuration menu - View commit details
-
Copy full SHA for 67ffdfb - Browse repository at this point
Copy the full SHA 67ffdfbView commit details
Commits on Mar 13, 2025
-
Fix small bugs with async map (#7445)
* fix async map resuming * fix with_indices * fix tests * fix tests * again
Configuration menu - View commit details
-
Copy full SHA for f09db01 - Browse repository at this point
Copy the full SHA f09db01View commit details
Commits on Mar 14, 2025
-
Add IterableDataset.decode with multithreading (#7450)
* add IterableDataset.decode with multithreading * graceful async ends * test * docs * fix tests
Configuration menu - View commit details
-
Copy full SHA for 7ad7379 - Browse repository at this point
Copy the full SHA 7ad7379View commit details -
Fix resuming after
ds.set_epoch(new_epoch)
(#7451)* fix resuming with new epoch * more readable states * add test * make style
Configuration menu - View commit details
-
Copy full SHA for e8ee24a - Browse repository at this point
Copy the full SHA e8ee24aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 97ff626 - Browse repository at this point
Copy the full SHA 97ff626View commit details -
Configuration menu - View commit details
-
Copy full SHA for 14fb15a - Browse repository at this point
Copy the full SHA 14fb15aView commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff 3.3.2...3.4.0