Releases: neuml/paperetl

v2.5.1

02 Aug 12:18

This release adds the following enhancements and bug fixes:

  • Make Elasticsearch import conditional (#67)
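A conditional import of an optional dependency is commonly written along the following lines. This is a minimal sketch of the general pattern, not paperetl's actual code: the `ELASTICSEARCH` flag and the `connect` helper are hypothetical names introduced only for illustration.

```python
# Illustrative pattern only; not taken from the paperetl source.
try:
    # Optional dependency: only importable when the elasticsearch package is installed
    from elasticsearch import Elasticsearch

    ELASTICSEARCH = True
except ImportError:
    Elasticsearch = None
    ELASTICSEARCH = False


def connect(url):
    """Hypothetical helper: create a client, or fail with a clear message."""
    if not ELASTICSEARCH:
        raise ImportError(
            "Elasticsearch is not available; install the elasticsearch package to use this backend"
        )
    return Elasticsearch(url)
```

With this shape, importing the module succeeds without the optional package, and the error only surfaces when the Elasticsearch backend is actually requested.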

v2.5.0

01 Jul 17:37

This release adds the following enhancements and bug fixes:

  • Remove CORD-19 parser (#63)
  • Add citation references for TEI (#64)
  • Make Elasticsearch optional (#65)

v2.4.0

11 Jun 12:08

This release adds the following enhancements and bug fixes:

  • Improve file processing performance (#59)
  • Add keyword filter to PMB and modify filtering logic (#60)
  • Require Python >= 3.10 (#61)
  • Add citation reference support (#62)

v2.3.0

28 Dec 19:31

This release adds the following enhancements and bug fixes:

  • Use figure index rather than xml:id attribute, which is not always present (#46, #51)
  • Added note on grobid concurrency configuration to README (#50, #52)
  • Fix inability to insert all data into SQLite database (#56)
  • Close processes at end of Execute.run method (#57)
  • Require Python >= 3.9 (#58)

v2.2.1

18 Sep 21:16

This release adds the following enhancements and bug fixes:

  • Update setup.py to only show standard image on PyPI (#48)

v2.2.0

18 Sep 20:30

This release adds the following enhancements and bug fixes:

  • Add example notebook (#43)
  • Update CORD-19 scripts (#44)
  • Update minimum Python version to 3.8 (#47)

v2.1.0

20 Jan 14:24

This release adds the following enhancements and bug fixes:

  • Fix issue processing into Elasticsearch (#41)
  • Improve PMB filtering logic (#42)

v2.0.0

12 Mar 01:05

This release adds the following enhancements and bug fixes:

  • Add PubMed as source (#16)
  • Add arXiv as source (#17)
  • Detect month changes in CORD-19 entry date process (#33)
  • Remove study attribute and design models and all related dependencies (#34)
  • Add pre-commit checks (#35)
  • Remove legacy merge logic (#36)
  • Add database flag to determine if database should be replaced (#37)
  • Add multiprocessing support to files process (#38)
  • Support reading compressed files (#39)
  • Require Python 3.7+ (#40)
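For the compressed-file support listed above (#39), one common way to read plain and gzip-compressed input through the same code path is sketched below. It assumes gzip compression selected by a `.gz` extension; the `open_text` helper is a hypothetical name and may not match how paperetl handles this.

```python
import gzip


def open_text(path):
    """Open a text file for reading, decompressing transparently when it ends in .gz."""
    if path.endswith(".gz"):
        # "rt" opens the gzip stream in text mode
        return gzip.open(path, "rt", encoding="utf-8")
    return open(path, "r", encoding="utf-8")
```

Callers can then use `with open_text(path) as f:` without caring whether the file on disk is compressed.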

v1.6.0

23 Apr 12:12

This release adds the following enhancements and bug fixes:

  • Improve sample size extraction (#29)
  • Add generic CSV source (#30)
  • Add common method for accessing Grammar object (#31)
  • Update CORD-19 entry dates source (#32)
  • Limit Docker and setup.py to spaCy 2.x until attribute/design models are rebuilt

v1.5.0

21 Jan 18:43

This release adds the following enhancements and bug fixes:

  • Add dockerfile for building paperetl environment (#9)
  • Add component to build entry-dates.csv (#18)
  • Add pre-trained study design models to GitHub (#19)
  • Update README to correct and improve documentation (#20)
  • Ensure length of sections is less than max nlp length (#27)