20 Mar 22:29

apmoore1

4b914cb

Token Misc Fix Latest

Latest

This release contains the fix from pull request #5 by @ales-t this fixes the problem where tokens (sub-tokens) do not contain the "misc" attribute as they are part of a multi-word token. See pull request #5 for more details.

Assets 2

08 Feb 09:27

apmoore1

v0.2.1

8236524

PyPi Release

No updates in this release, the release has only come about due to the last version (v0.2.0) failing to upload to PyPi due to an error in the github workflow script, which should now be fixed.

Assets 2

07 Feb 17:27

apmoore1

v0.2.0

f89c419

Stanza 1.2.0 and 1.1.1

Compatible with Stanza 1.2.0 and 1.1.1

Updates:

Compatible with Stanza 1.2.0 and 1.1.1
The code now has a version number through stanza_batch.__version__
The code will be available through PyPi
GitHub actions will publish the code to PyPi due to the test-action.yml new job publish
README now has a quick links section.

Assets 2

04 Feb 17:19

apmoore1

v0.1.1

7af84ec

Stable version of version 0.1.0

Updates

Limited the version of Stanza to version 1.1.1, before this update the dependency requirements allowed Stanza>=1.1.1

Assets 2

04 Feb 15:06

apmoore1

v0.1.0

871b56b

Stable version for Stanza v1.1.1

This is the first release of a batching utility for Stanza specifically it works for v1.1.1 of Stanza. It makes processing documents/texts with Stanza quicker and easier due to the batching wrapper that this code contains.

The current recommendation for batching by Stanza is to concatenate documents together with each document separated by a blank line (\n\n). This way of batching has one main drawback:

The return of processing this document is one Stanza Document with lots of sentences, thus you don't know where one document ends and another starts, easily.

This batching utility solves this problem, when given a list of documents, it will return a list of corresponding processed Stanza documents. For more details see the README which contains an example.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PyPi Release

Compatible with Stanza 1.2.0 and 1.1.1

Updates

Stable version for Stanza v1.1.1

Releases: apmoore1/stanza-batch

Token Misc Fix

PyPi Release

PyPi Release

Stanza 1.2.0 and 1.1.1

Compatible with Stanza 1.2.0 and 1.1.1

Stable version of version 0.1.0

Updates

Stable version for Stanza v1.1.1

Stable version for Stanza v1.1.1