
Can we train the Emformer stateless transducer with a phoneme-based lexicon? #585

Answered by yfyeung
trangtv57 asked this question in Q&A

Actually, icefall supports training models with a phoneme lexicon.
Take LibriSpeech as an example:
To prepare data: https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/prepare.sh#L161-L189
To generate a unique lexicon: https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/local/generate_unique_lexicon.py
To use the unique lexicon during training or decoding: https://github.com/k2-fsa/icefall/blob/master/icefall/lexicon.py#L200
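
For intuition, here is a minimal, self-contained sketch of what a "unique lexicon" amounts to: one pronunciation kept per word, which then gives a deterministic word-to-phoneme mapping for building training targets. This is not icefall's code. The helper names (`parse_lexicon`, `keep_first_pronunciation`, `words_to_phones`), the toy entries, the `<UNK> SPN` line, and the choice to keep the first pronunciation are all assumptions made for illustration; check the linked scripts for what the recipe really does.

```python
from typing import Dict, Iterable, List, Tuple


def parse_lexicon(lines: Iterable[str]) -> List[Tuple[str, List[str]]]:
    """Parse lines of the form 'WORD PHONE1 PHONE2 ...' into (word, phones) pairs."""
    entries = []
    for line in lines:
        parts = line.split()
        if len(parts) < 2:
            continue  # skip empty or malformed lines
        entries.append((parts[0], parts[1:]))
    return entries


def keep_first_pronunciation(
    entries: Iterable[Tuple[str, List[str]]]
) -> Dict[str, List[str]]:
    """Keep only the first pronunciation seen for each word (assumed policy)."""
    unique: Dict[str, List[str]] = {}
    for word, phones in entries:
        if word not in unique:
            unique[word] = phones
    return unique


def words_to_phones(
    text: str, lexicon: Dict[str, List[str]], oov: str = "<UNK>"
) -> List[str]:
    """Map a transcript to a phoneme sequence, falling back to the OOV entry."""
    phones: List[str] = []
    for word in text.split():
        phones.extend(lexicon.get(word, lexicon[oov]))
    return phones


# Toy lexicon (hypothetical data): READ has two pronunciations.
toy_lexicon = [
    "<UNK> SPN",
    "READ R EH1 D",
    "READ R IY1 D",
    "BOOKS B UH1 K S",
]

unique_lexicon = keep_first_pronunciation(parse_lexicon(toy_lexicon))
phones = words_to_phones("READ BOOKS DAILY", unique_lexicon)
print(phones)
```

With one pronunciation per word, every transcript maps to exactly one phoneme sequence, so the phonemes can serve directly as the transducer's output units in place of BPE tokens.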

Answer selected by marcoyang1998