Is there a way to randomly apply codec compression as a data augmentation when training speaker embeddings? Is this already done for the current pre-trained models?
For example Opus or MP3, but also telephony codecs such as A-law and mu-law.
Thanks.
No, we did not do any codec augmentation in our training data pipeline.
It would be great if you could contribute the code along with experimental results!
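As a starting point for such a contribution, here is a minimal sketch of a random mu-law augmentation using only NumPy. It is not part of the existing training pipeline. The function names `mu_law_roundtrip` and `random_codec_augment` and the `prob` parameter are hypothetical, the companding uses the continuous mu-law formula rather than a bit-exact G.711 implementation, and the waveform is assumed to be a float array in [-1, 1]. Resampling to 8 kHz, which real telephony audio would also involve, is left out.

```python
import numpy as np


def mu_law_roundtrip(wav, mu=255):
    """Compress to 8-bit mu-law codes and expand back.

    Uses the continuous mu-law formula (an approximation, not bit-exact G.711).
    """
    wav = np.clip(wav, -1.0, 1.0)
    # Compress: map [-1, 1] to [-1, 1] on a logarithmic scale.
    compressed = np.sign(wav) * np.log1p(mu * np.abs(wav)) / np.log1p(mu)
    # Quantize to integer codes 0..mu (256 levels for mu=255).
    codes = np.round((compressed + 1.0) / 2.0 * mu)
    # Expand: invert the quantization scaling, then the logarithmic mapping.
    decoded = codes / mu * 2.0 - 1.0
    return np.sign(decoded) * np.expm1(np.abs(decoded) * np.log1p(mu)) / mu


def random_codec_augment(wav, prob=0.5, rng=None):
    """Apply the mu-law round trip with probability `prob` (hypothetical helper)."""
    if rng is None:
        rng = np.random.default_rng()
    if rng.random() < prob:
        return mu_law_roundtrip(wav)
    return wav


# Usage: augment one second of a 440 Hz tone sampled at 16 kHz.
t = np.arange(16000) / 16000.0
wav = 0.5 * np.sin(2 * np.pi * 440.0 * t)
augmented = random_codec_augment(wav, prob=0.5)
```

Lossy codecs such as Opus or MP3 cannot be reproduced with a closed-form formula like this and would need an external encoder and decoder in the same round-trip position.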