
Implementation performance #16

Open
LeyuanQu opened this issue Jan 12, 2023 · 0 comments

Comments

@LeyuanQu

Hi,
thank you very much for your great work!

I was wondering whether you have conducted any evaluations of model performance and voice quality for the multi-speaker results, e.g. MOS or sMOS?

After listening to the demos you provided, I found that the generated voices for speakers p257 and p250 are quite similar. (I assume p250-265.wav and p257-243.wav come from different speakers.)
p250
demo/VCTK)/shallow_diffusion_400k/demo_VCTK_shallow_diffusion_400k_p250-265.wav

p257
demo/VCTK)/shallow_diffusion_400k/demo_VCTK_shallow_diffusion_400k_p257-243.wav

Could you please give me a hint as to why these two voices sound so similar?

Thanks!
