Has emotion support only existed for coqui API users? #41
Replies: 3 comments
-
Yes, the Line 243 in a682fa8 |
Beta Was this translation helpful? Give feedback.
-
Theoretically XTTSv2 should be able to do this because it is a child of the tortoise model and this is where that emotion control comes from. I've been trying to decipher this over the last few days and I found my way here to see if anyone else has any information. Along with some other things it appears intentionally obfuscated/omitted. The tokenizer/vocab show little to help. I am maybe jumping to conclusions though, if anyone has any insight please let me know. |
Beta Was this translation helpful? Give feedback.
-
An option could be adding another embedder for tags, similar to a speaker embedding, only for emotions. This embedder would simply generate an embedding for a label instead of using another neural model for computing this feature, using this, for example. A small embedding could do it, since no more than 10 emotions will be used in a dataset (hopefully). The embedding dimesion could be updated in the model configuration. |
Beta Was this translation helpful? Give feedback.
-
What would it take to get emotion control out of this?
Beta Was this translation helpful? Give feedback.
All reactions