Skip to content

Releases: apmoore1/stanza-batch

Token Misc Fix

20 Mar 22:29
Compare
Choose a tag to compare

This release contains the fix from pull request #5 by @ales-t this fixes the problem where tokens (sub-tokens) do not contain the "misc" attribute as they are part of a multi-word token. See pull request #5 for more details.

PyPi Release

08 Feb 09:27
Compare
Choose a tag to compare

PyPi Release

No updates in this release, the release has only come about due to the last version (v0.2.0) failing to upload to PyPi due to an error in the github workflow script, which should now be fixed.

Stanza 1.2.0 and 1.1.1

07 Feb 17:27
Compare
Choose a tag to compare

Compatible with Stanza 1.2.0 and 1.1.1

Updates:

  1. Compatible with Stanza 1.2.0 and 1.1.1
  2. The code now has a version number through stanza_batch.__version__
  3. The code will be available through PyPi
  4. GitHub actions will publish the code to PyPi due to the test-action.yml new job publish
  5. README now has a quick links section.

Stable version of version 0.1.0

04 Feb 17:19
Compare
Choose a tag to compare

Updates

  1. Limited the version of Stanza to version 1.1.1, before this update the dependency requirements allowed Stanza>=1.1.1

Stable version for Stanza v1.1.1

04 Feb 15:06
Compare
Choose a tag to compare

Stable version for Stanza v1.1.1

This is the first release of a batching utility for Stanza specifically it works for v1.1.1 of Stanza. It makes processing documents/texts with Stanza quicker and easier due to the batching wrapper that this code contains.

The current recommendation for batching by Stanza is to concatenate documents together with each document separated by a blank line (\n\n). This way of batching has one main drawback:

  1. The return of processing this document is one Stanza Document with lots of sentences, thus you don't know where one document ends and another starts, easily.

This batching utility solves this problem, when given a list of documents, it will return a list of corresponding processed Stanza documents. For more details see the README which contains an example.