Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How would you train for BW extension? #397

Open
kelseyjd opened this issue Mar 2, 2023 · 2 comments
Open

How would you train for BW extension? #397

kelseyjd opened this issue Mar 2, 2023 · 2 comments
Labels
question Further information is requested

Comments

@kelseyjd
Copy link

kelseyjd commented Mar 2, 2023

I'm interested in training to convert 24 kHz mel spectrograms to 48 kHz waveforms (like HIFI-GAN2). Might not work without changing the architecture, but that's ok. How would you modify the config files to do this? I've already run the recipe through stage 1 to extract features with downsampled VCTK. Now I'm hesitating on how to modify the generator parameters to produce 2x length waveform with the HIFI gan config

@kan-bayashi
Copy link
Owner

You can simply increase upsample scale here.

upsample_scales: [8, 8, 2, 2] # Upsampling scales.
upsample_kernel_sizes: [16, 16, 4, 4] # Kernel size for upsampling layers.

E.g.,

 upsample_scales: [8, 8, 4, 2]         # Upsampling scales. 
 upsample_kernel_sizes: [16, 16, 8, 4] # Kernel size for upsampling layers.

@kan-bayashi kan-bayashi added the question Further information is requested label Mar 4, 2023
@kelseyjd
Copy link
Author

kelseyjd commented Mar 7, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants