Tokotron: Tokenized TTS for the SpeechBrain benchmark (single speaker) #37

flexthink · 2024-06-11T18:20:51Z

No description provided.

…e fixed for the benchmark

poonehmousavi

since util is only used in TTS.. it would be better to move it to TTS folder

poonehmousavi

the wrappers added for all tokenzier are only used in TTS.. so maybe better to cerate the custom_model specific to TTS so it is less confusing.. but the diea is really good . in the next refactoring phase, I will try to apply teh changes across all tokenizers

poonehmousavi · 2024-07-22T19:31:11Z

benchmarks/DASB/model/custom_model.py

+from huggingface_hub import snapshot_download
+
+try:
+    from speechtokenizer import SpeechTokenizer


precommit error for unused lib

No longer needed given some of the code has been moved out, will update

poonehmousavi · 2024-07-22T19:50:26Z

Thanks @flexthink, everything looks really neat .. I did the review... I have fetched the latest update from the DASB main branch... These are the few comments that I have before merging the PR:

There is some recommit error mostly related to unused lib.
The utils and Custim_model should be moved to TTS folder, especially custom_model should be renamed or avoid the link to the main custim_model because the wrappers ae not used for other tasks and it could be confusing,.. we could add your changes later to all tasks, but for now let's keep it only for TTS.
I have added the run_generative_benchmark.sh for SE and SS, please complete it for TTS and make sure that it works.

…nchmarks into DASB-tokotron-clean

poonehmousavi · 2024-07-25T15:55:15Z

Thanks @flexthink for this PR... everything works now..It merged

flexthink added 30 commits May 7, 2024 15:41

Tokotron: Initial import for the Benchmark

d76a3b9

Tokotron: Update hyperparameter defaults

1dcc20c

Tokotron: Add a workaround for concat_padded_features, which cannot b…

7e1ae9b

…e fixed for the benchmark

Tokotron: Update the default EOS mode

75509fc

Tokotron: Add multispeaker support with LibriTTS

d55a23d

Tokotron: Fix defaults

9dcaf55

Tokotron: LibriTTS fixes

c23f6ce

Tokotron: Fixes for feature extraction

1e187e1

Tokotron: Add support for Encodec

6d1326b

Tokotron: Fix for Encodec

65e0754

Tokotron: Add normalization to embeddings, change the injection strategy

4afb395

DASB: Implement SSL Tacotron (continuous baseline)

0e84349

DASB: Tacotron: Update ljspeech parameters

bc29dfa

DASB: Tacotron: Add a "freezer"

3f9d4bf

DASB: Tacotron: Fixes

3f19048

Tokotron: Add Encodec support

ad5991b

DASB: Tacotron: Fix splits

4fe3a10

DASB: Tacotron device fix

6ac1a13

Tokotron: Fix audio_num_tokens for encodec

31b6080

DASB: Add a MSTacotron2 recipe

ea0afb1

Tokotron: Update LJSpeech to use the new vocoder

4229ef9

Tokotron: Update for the latest vocoders

b456a79

DASB: MSTacotron2: Update for all possible model types

c4a30a2

MSTacotron2: Fixes

2142488

DASB: MSTacotron2: Update a bad reference

bbdd9dc

DASB: MSTacotron2: Fixes

4ec2af8

DASB: MSTacotron: Undo a temporary debugging change

dce0563

MSTacotron2: Remove the test set

7d65efe

DASB: Tokotron: Update hparams

d296345

DASB: Tokotron: Update multispeaker for the latest vocoder

fad4c1c

flexthink added 3 commits July 16, 2024 18:55

DASB: Tokotron: Device fix

e2f4b51

DASB: Tokotron: Fix a typo

c32c709

DASB: Tokotron: Remove ST duplication

c4389bd

poonehmousavi requested changes Jul 22, 2024

View reviewed changes

poonehmousavi reviewed Jul 22, 2024

View reviewed changes

Merge remote-tracking branch 'upstream/DASB' into DASB-tokotron-clean

a80c1b6

flexthink and others added 20 commits July 23, 2024 10:29

DASB: Tokotron: Fix flake8 errors

5fc43bc

DASB: Tokotron: Minor refactoring

d50c0df

DASB: Tokotron: Update generative script

29e3bab

DASB: Tokotron: Update script, fix a missing import

e1f6690

DASB: Update the script to allow arbitrary overrides

ea28166

fix discriminative script

a306d3c

DASB: Tokotron: Refactoring

1ba7afd

Merge branch 'DASB-tokotron-clean' of https://github.com/flexthink/be…

586a7a5

…nchmarks into DASB-tokotron-clean

DASB: Tokotron: Fixes

e767da5

modify bash

6ae2d02

fix bash scripts

a1493a8

add print to script

7f92e6f

DASB: Tokotron: Add the UTMOS path to the shell script

dc1daf3

Merge branch 'DASB-tokotron-clean' of https://github.com/flexthink/be…

f74c40e

…nchmarks into DASB-tokotron-clean

DASB: Tokotron: Update README for UTMOS

4522953

FIX INDENT IN README

b11c1d2

DASB: Tokotron: Update TTS args

dc0bf29

remove comments

1318aee

fix precommit

e89de0e

fix main readme with main branch

2939ae4

poonehmousavi merged commit c3d3e89 into speechbrain:DASB Jul 25, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tokotron: Tokenized TTS for the SpeechBrain benchmark (single speaker) #37

Tokotron: Tokenized TTS for the SpeechBrain benchmark (single speaker) #37

flexthink commented Jun 11, 2024

poonehmousavi left a comment •

edited

Loading

poonehmousavi left a comment

poonehmousavi Jul 22, 2024

flexthink Jul 23, 2024

poonehmousavi commented Jul 22, 2024

poonehmousavi commented Jul 25, 2024

Tokotron: Tokenized TTS for the SpeechBrain benchmark (single speaker) #37

Tokotron: Tokenized TTS for the SpeechBrain benchmark (single speaker) #37

Conversation

flexthink commented Jun 11, 2024

poonehmousavi left a comment • edited Loading

Choose a reason for hiding this comment

poonehmousavi left a comment

Choose a reason for hiding this comment

poonehmousavi Jul 22, 2024

Choose a reason for hiding this comment

flexthink Jul 23, 2024

Choose a reason for hiding this comment

poonehmousavi commented Jul 22, 2024

poonehmousavi commented Jul 25, 2024

poonehmousavi left a comment •

edited

Loading