Tokotron: Tokenized TTS for the SpeechBrain benchmark (single speaker) #37

Merged
merged 144 commits into from
Jul 25, 2024

Conversation

flexthink
Contributor

No description provided.

Collaborator

@poonehmousavi poonehmousavi left a comment

Since util is only used in TTS, it would be better to move it to the TTS folder.

Collaborator

@poonehmousavi poonehmousavi left a comment

The wrappers added for all tokenizers are only used in TTS, so it may be better to create a custom_model specific to TTS so it is less confusing. The idea is really good, though. In the next refactoring phase, I will try to apply the changes across all tokenizers.

from huggingface_hub import snapshot_download

try:
    from speechtokenizer import SpeechTokenizer
Collaborator

Pre-commit error for an unused library import.

Contributor Author

No longer needed, since some of the code has been moved out. Will update.

@poonehmousavi
Collaborator

Thanks @flexthink, everything looks really neat. I did the review and fetched the latest update from the DASB main branch. These are the few comments I have before merging the PR:

  1. There are some pre-commit errors, mostly related to unused libraries.
  2. The utils and custom_model should be moved to the TTS folder. In particular, custom_model should be renamed, or the link to the main custom_model avoided, because the wrappers are not used for other tasks and it could be confusing. We could add your changes to all tasks later, but for now let's keep them TTS-only.
  3. I have added run_generative_benchmark.sh for SE and SS. Please complete it for TTS and make sure that it works.

@poonehmousavi poonehmousavi merged commit c3d3e89 into speechbrain:DASB Jul 25, 2024
1 check passed
@poonehmousavi
Collaborator

Thanks @flexthink for this PR. Everything works now. It is merged.

Labels
enhancement New feature or request