-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
32kHz Vocos Multi Speaker Model Training Log #48
Comments
Do you have a standard tensorboard logs? It is interesting to compare. |
@patriotyk Sorry, I've change the code to log on WandB server. I have no local logging files nor tensorboard logs. |
What is your validation loss on the last checkpoint? It is encoded in to the checkpoint file name. I am training 44100 for an almost a week already and loss still goes down. |
TKS for your work,could your share 32k model training detail like: |
I'm sry for your confuse. But I have plans to train Mel-Encodec?(Mel Spectrogram to RVQ Encoder, and Vocos Decoder for Various Speech data) in the future. |
I estimated mel loss, and Generator loss with newly gained dataset. and each was 0.0942 and 2.82. how about your model output's quality? any artifacts? |
I am still training(third week). It is very slow. I will update with my results when finish. |
how much data do we need for training |
@LEECHOONGHO I have published my model here https://huggingface.co/patriotyk/vocos-mel-hifigan-compat-44100khz |
Great work! @patriotyk, Thank you so much |
I'm new to this... Could you please tell me what's the purpose of sharing the model? I mean, when I try to use it with a wav file, the output is very close to the original input file... So I'm confused here. Thank you |
This model generates audio from mel spectrograms. The functionality that you tried just generates mel from audio and then back audio from mel. But real tts systmes generate mels directly from text then vocoder generates audio. |
Ah ok so generating mel from audio is different from what tts systems do? Is there any code snippet that would let me test the model you trained (ans possibly others)? Thank you! |
Training Loss, Generated Outputs.
I hope this will be a reference for model training.
https://api.wandb.ai/links/xi-speech-team/k0kdfwch
The text was updated successfully, but these errors were encountered: