Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: huggingface/datasets
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: 3.4.1
Choose a base ref
...
head repository: huggingface/datasets
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: 3.5.0
Choose a head ref
  • 6 commits
  • 19 files changed
  • 4 contributors

Commits on Mar 17, 2025

  1. set dev version (#7462)

    lhoestq authored Mar 17, 2025
    Configuration menu
    Copy the full SHA
    0f6e35f View commit details
    Browse the repository at this point in the history

Commits on Mar 18, 2025

  1. Introduce pdf support (#7318) (#7325)

    * added option to check for pdfplumber library
    
    * added pdf to init
    
    * added pdf to features.py
    
    * added pdf to init
    
    * added pdf to features.py
    
    * first version of the Pdf feature
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Andrea Francis Soria Jimenez <andrea@huggingface.co>
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Andrea Francis Soria Jimenez <andrea@huggingface.co>
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Andrea Francis Soria Jimenez <andrea@huggingface.co>
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Andrea Francis Soria Jimenez <andrea@huggingface.co>
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Andrea Francis Soria Jimenez <andrea@huggingface.co>
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Andrea Francis Soria Jimenez <andrea@huggingface.co>
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Andrea Francis Soria Jimenez <andrea@huggingface.co>
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
    
    * Update src/datasets/features/pdf.py
    
    Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
    
    * added packages required for PDF support
    
    * created decorator for requirement of pdfplumber
    
    * added a simple pdf with images and plots for testing pdf support
    
    * first version of tests for pdf
    
    * update to pdf feature
    
    * fix Pdf feature
    
    * add PdfFolder
    
    * docs
    
    * fix docs
    
    * a bit more docs
    
    ---------
    
    Co-authored-by: Andrea Francis Soria Jimenez <andrea@huggingface.co>
    Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
    Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>
    4 people authored Mar 18, 2025
    Configuration menu
    Copy the full SHA
    d6d0281 View commit details
    Browse the repository at this point in the history
  2. Fix local pdf loading (#7466)

    fix local pdf loading
    lhoestq authored Mar 18, 2025
    Configuration menu
    Copy the full SHA
    781ce85 View commit details
    Browse the repository at this point in the history
  3. Minor fix for metadata files in extension counter (#7464)

    minor fix metadata files
    lhoestq authored Mar 18, 2025
    Configuration menu
    Copy the full SHA
    a256b85 View commit details
    Browse the repository at this point in the history

Commits on Mar 25, 2025

  1. Priotitize json (#7476)

    priotitize json
    lhoestq authored Mar 25, 2025
    2 Configuration menu
    Copy the full SHA
    5f8d2ad View commit details
    Browse the repository at this point in the history

Commits on Mar 27, 2025

  1. release: 3.5.0 (#7484)

    lhoestq authored Mar 27, 2025
    Configuration menu
    Copy the full SHA
    0b5998a View commit details
    Browse the repository at this point in the history
Loading