Skip to content

Conversation

bdewilde
Copy link
Collaborator

@bdewilde bdewilde commented Aug 25, 2020

Description

  • refactored text_stats module into a sub-package with the same name and top-level API, but with underlying functionality broken out into separate modules
  • improved performance, API, and documentation on the core TextStats class, plus improved documentation on the individual stats functions
  • added new readability tests -- Automatic Arabic Readability Index (for Arabic texts), µ-legibility and perspecuity index (for Spanish texts), and a Turkish formulation for Flesch Reading Ease -- and a new basic stat -- word entropy

TODO:

  • add tests for the new and revised functionality

Motivation and Context

This was an under-nurtured part of the code base, with some weird corners and clunky API decisions.

How Has This Been Tested?

All tests pass!

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation, and I have updated it accordingly.

@bdewilde bdewilde marked this pull request as ready for review August 25, 2020 04:12
@bdewilde bdewilde merged commit 3cb61a0 into master Aug 26, 2020
@bdewilde bdewilde deleted the improve-expand-textstats branch August 26, 2020 00:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant