Skip to content

Conversation

bertsky
Copy link
Contributor

@bertsky bertsky commented Mar 5, 2024

IIUC, this is used when a language model / dict / dawg is in place to normalise the characters before applying hyphenation and quoting rules (which are not all covered by punctuation dawgs etc).

Copy link
Member

@stweil stweil left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you.

@stweil stweil merged commit a08a6ac into tesseract-ocr:main Mar 5, 2024
@stweil
Copy link
Member

stweil commented Mar 5, 2024

The 2nd commit was not strictly necessary. It's quite common that lists are terminated like that because adding more entries at the end is easier.

@bertsky
Copy link
Contributor Author

bertsky commented Mar 5, 2024

sorry, I forgot to increase the constants for the number of initialisers, hence the failing build.

will make a new PR with that fix right now

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants