Releases: neuml/paperetl
v2.5.1
This release adds the following enhancements and bug fixes:
- Make Elasticsearch import conditional (#67)
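For readers unfamiliar with the pattern, a conditional import of an optional dependency can look roughly like the sketch below. This is an illustration only, not the code from #67: the names `ELASTICSEARCH_AVAILABLE` and `create_client` are hypothetical, and it assumes the optional dependency is the `elasticsearch` Python client.

```python
import importlib.util

# Hypothetical flag (not from paperetl): True only if the optional
# "elasticsearch" package can be found in the current environment.
ELASTICSEARCH_AVAILABLE = importlib.util.find_spec("elasticsearch") is not None


def create_client(url):
    """Hypothetical helper: build a client only when the package is installed."""
    if not ELASTICSEARCH_AVAILABLE:
        raise ImportError(
            'Elasticsearch is not available - install the "elasticsearch" package to use it'
        )

    # Deferred import, so this module still loads without the optional dependency
    from elasticsearch import Elasticsearch

    return Elasticsearch(url)
```

With this shape, code paths that never touch Elasticsearch work without the package installed, and the error only surfaces when an Elasticsearch client is actually requested.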
v2.5.0
This release adds the following enhancements and bug fixes:
- Remove CORD-19 parser (#63)
- Add citation references for TEI (#64)
- Make Elasticsearch optional (#65)
v2.4.0
This release adds the following enhancements and bug fixes:
- Improve file processing performance (#59)
- Add keyword filter to PMB and modify filtering logic (#60)
- Require Python >= 3.10 (#61)
- Add citation reference support (#62)
v2.3.0
This release adds the following enhancements and bug fixes:
- Use figure index rather than xml:id attribute, as the attribute is not always present (#46, #51)
- Add note on GROBID concurrency configuration to README (#50, #52)
- Fix issue where not all data could be inserted into SQLite database (#56)
- Close processes at end of Execute.run method (#57)
- Require Python >= 3.9 (#58)
v2.2.1
This release adds the following enhancements and bug fixes:
- Update setup.py to only show standard image on PyPI (#48)
v2.2.0
This release adds the following enhancements and bug fixes:
- Add example notebook (#43)
- Update CORD-19 scripts (#44)
- Update minimum Python version to 3.8 (#47)
v2.1.0
This release adds the following enhancements and bug fixes:
- Fix issue processing into Elasticsearch (#41)
- Improve PMB filtering logic (#42)
v2.0.0
This release adds the following enhancements and bug fixes:
- Add PubMed as source (#16)
- Add arXiv as source (#17)
- Detect month changes in CORD-19 entry date process (#33)
- Remove study attribute and design models and all related dependencies (#34)
- Add pre-commit checks (#35)
- Remove legacy merge logic (#36)
- Add database flag to determine if database should be replaced (#37)
- Add multiprocessing support to files process (#38)
- Support reading compressed files (#39)
- Require Python 3.7+ (#40)
v1.6.0
This release adds the following enhancements and bug fixes:
- Improve sample size extraction (#29)
- Add generic CSV source (#30)
- Add common method for accessing Grammar object (#31)
- Update CORD-19 entry dates source (#32)
- Limit Docker and setup.py to spaCy 2.x until attribute/design models are rebuilt
v1.5.0
This release adds the following enhancements and bug fixes:
- Add Dockerfile for building paperetl environment (#9)
- Add component to build entry-dates.csv (#18)
- Add pre-trained study design models to GitHub (#19)
- Update README to correct and improve documentation (#20)
- Ensure length of sections is less than max nlp length (#27)