Releases: segment-any-text/wtpsplit
Release 2.1.6
Release 2.1.5
Changelog
Release 2.1.4
- Introduce optional hat weighting by @lsorber
- Clarify LoRA adaptation
- Clarify treat_newline_as_space: renamed to split_on_input_newlines. treat_newline_as_space will be deprecated in a future release.
Release 2.1.2
- Fixes #142: AssertionError when the string consists only of newlines or whitespace, or is an empty string.
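For callers pinned to a version before this fix, one way to avoid the failure is to screen out blank input before it reaches the splitter. The following is a minimal sketch under assumptions: `split_non_blank` and its `split_fn` argument are hypothetical names introduced here, and returning an empty list for blank input is a choice made for illustration, not necessarily what the library returns.

```python
def split_non_blank(text, split_fn):
    """Call split_fn(text) unless text is empty or whitespace-only.

    Hypothetical helper: split_fn stands in for any splitter callable.
    """
    if not text.strip():
        # Assumption: blank input yields no segments.
        return []
    return list(split_fn(text))
```

Any callable that maps a string to an iterable of segments can be passed as `split_fn`.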
Release 2.1.1
- Change default behaviour for newlines in SaT.split.
- Now, while the model ignores them, they will be used to split the text as a simple post-processing step.
- Small bugfixes for LoRA training
- Update README for advanced usage
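The newline behaviour described for this release can be pictured as a pass over the segments the model has already produced. The sketch below is an illustration only, not the library's code: `split_segments_on_newlines` is a hypothetical helper, and keeping each newline attached to the piece it terminates is an assumption made so that the output joins back to the original text.

```python
def split_segments_on_newlines(segments):
    """Further split each segment after every "\n" character.

    Hypothetical post-processing step: the newline stays at the end of the
    piece it terminates, so "".join(result) equals "".join(segments).
    """
    pieces = []
    for segment in segments:
        start = 0
        for i, ch in enumerate(segment):
            if ch == "\n":
                # Close the current piece right after the newline.
                pieces.append(segment[start:i + 1])
                start = i + 1
        if start < len(segment):
            # Keep any trailing text that has no newline after it.
            pieces.append(segment[start:])
    return pieces
```

Because the split happens after the model has run, it does not depend on how the model itself treats newline characters.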
Release 2.1.0
- Adds ONNX support for SaT models.
- Including export scripts and an updated README.
- This improves inference time on GPU by 50%.
Release 2.0.8
- Fix splitting of short sequences into individual characters (#127)
Release 2.0.7
- Allow numpy>=2.0
- Fix adaptation code
- Add some comments
Release 2.0.5
- Fixes potential CUDA device error when the input has exactly 511 tokens (#121).
Release 2.0.4
- Fix a speed issue with SaT (#118). Now it is (as expected) ~6x faster than WtP.