Releases: apmoore1/stanza-batch
Token Misc Fix
PyPi Release
PyPi Release
No updates in this release, the release has only come about due to the last version (v0.2.0) failing to upload to PyPi due to an error in the github workflow script, which should now be fixed.
Stanza 1.2.0 and 1.1.1
Compatible with Stanza 1.2.0 and 1.1.1
Updates:
- Compatible with Stanza 1.2.0 and 1.1.1
- The code now has a version number through
stanza_batch.__version__
- The code will be available through PyPi
- GitHub actions will publish the code to PyPi due to the test-action.yml new job
publish
- README now has a quick links section.
Stable version of version 0.1.0
Updates
- Limited the version of Stanza to version 1.1.1, before this update the dependency requirements allowed
Stanza>=1.1.1
Stable version for Stanza v1.1.1
Stable version for Stanza v1.1.1
This is the first release of a batching utility for Stanza specifically it works for v1.1.1 of Stanza. It makes processing documents/texts with Stanza quicker and easier due to the batching wrapper that this code contains.
The current recommendation for batching by Stanza is to concatenate documents together with each document separated by a blank line (\n\n
). This way of batching has one main drawback:
- The return of processing this document is one Stanza Document with lots of sentences, thus you don't know where one document ends and another starts, easily.
This batching utility solves this problem, when given a list of documents, it will return a list of corresponding processed Stanza documents. For more details see the README which contains an example.