[Feature request] Server api to return available model names and speaker id; load and unload downloaded model #37

chigkim · 2024-05-30T03:35:36Z

Right now, you can ask the server to return audio by fetching:

http://localhost:5002/api/tts?text={text}&speaker_id={speaker}

Could we also have API endpoints that return the available model names and speaker IDs (for multi-speaker models), as well as endpoints to load and unload downloaded models?
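For reference, a minimal client-side sketch of how the existing request URL can be built with proper percent-encoding. The host, port, path, and parameter names come from the URL above; the helper name `build_tts_url` and the speaker ID `p225` are illustrative assumptions, not part of the server.

```python
from urllib.parse import urlencode

# Host and port as in the example URL above; adjust to your setup.
BASE_URL = "http://localhost:5002"


def build_tts_url(text, speaker_id=None, base_url=BASE_URL):
    """Return the /api/tts URL with the query parameters percent-encoded."""
    params = {"text": text}
    if speaker_id is not None:
        # Only multi-speaker models need a speaker_id.
        params["speaker_id"] = speaker_id
    return f"{base_url}/api/tts?{urlencode(params)}"


# "p225" is a hypothetical speaker ID used only for illustration.
print(build_tts_url("Hello, world!", "p225"))
```

The resulting URL can then be fetched with any HTTP client to retrieve the audio.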

Thanks for your consideration!

eginhard · 2024-05-30T08:07:27Z

Seems sensible, at least listing models/languages/speakers should be straightforward. I won't implement this myself since we don't use this server, but would merge a PR.

Roy6250 · 2024-07-24T09:57:56Z

Hi @eginhard, can I work on this issue?
Thanks

eginhard · 2024-07-24T10:19:53Z

@Roy6250 Sure, thank you! I'd suggest leaving out the part about (un)loading models for now to keep it simple. We can discuss it at a later stage.

Roy6250 · 2024-07-24T10:45:53Z

Sure, thanks.

Roy6250 · 2024-07-27T11:08:13Z

Hi @eginhard, I went through the repo and completed the setup. Before proceeding, I would like to verify that I am on the right track.

Requirement: List the available model names, languages and speaker IDs.

Solution: I will get the model names and languages from the .models.json file in the TTS directory, but I have not been able to find where the speaker IDs come from. It would be helpful if you could point me in the right direction.

Thanks

Roy6250 · 2024-07-31T14:30:06Z

Hi @eginhard please let me know your views. Thanks

eginhard · 2024-07-31T14:37:54Z

@Roy6250 You don't need to parse the .models.json file yourself. There are helper functions for this already in the ModelManager class. Also see how the CLI is implemented, e.g. to get speaker IDs for a model:

coqui-ai-TTS/TTS/bin/synthesize.py

Line 425 in 20bbb41

if args.list_speaker_idxs:

Roy6250 · 2024-08-02T12:34:47Z

Thanks for the help @eginhard. Using the helper functions, I was able to fetch all the models and languages.

With the following, I can fetch the speaker names for a particular model:

speakers = synthesizer.tts_model.speaker_manager.name_to_id

However, this requires downloading the model first, so the approach doesn't seem viable for arbitrary models. Should I preprocess the speaker names, store them in JSON format, and serve them from there on a GET request?

eginhard · 2024-08-02T14:10:22Z

@Roy6250 I would only return the speaker names for the currently loaded model and not for any arbitrary one to keep it simple.
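A minimal sketch of that idea, assuming the server holds a single loaded `synthesizer` object exposing the `tts_model.speaker_manager.name_to_id` mapping mentioned above. The helper name `loaded_speaker_names` is hypothetical, and the `SimpleNamespace` objects are stand-ins so the example runs without downloading a model.

```python
from types import SimpleNamespace


def loaded_speaker_names(synthesizer):
    """Return sorted speaker names for the loaded model, or [] if there are none."""
    tts_model = getattr(synthesizer, "tts_model", None)
    speaker_manager = getattr(tts_model, "speaker_manager", None)
    name_to_id = getattr(speaker_manager, "name_to_id", None)
    if not name_to_id:
        # Single-speaker model, or no model loaded.
        return []
    return sorted(name_to_id)


# Stand-in objects (assumptions for illustration, not real TTS classes).
multi_speaker = SimpleNamespace(
    tts_model=SimpleNamespace(
        speaker_manager=SimpleNamespace(name_to_id={"p226": 1, "p225": 0})
    )
)
single_speaker = SimpleNamespace(tts_model=SimpleNamespace(speaker_manager=None))

print(loaded_speaker_names(multi_speaker))   # ['p225', 'p226']
print(loaded_speaker_names(single_speaker))  # []
```

Guarding each attribute lookup keeps the endpoint from failing on single-speaker models, where no speaker manager may be present.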

Roy6250 · 2024-08-02T15:15:32Z

@eginhard Sure, got it. Just one final question, about the API structure:

Request type: GET
Params: none

Response: {
model_names: [...], # list of all model names
languages: [...], # list of languages
speaker_ids: [...] # list of speaker IDs if a model is loaded; the response will also name that model
}

eginhard · 2024-08-05T09:48:28Z

@Roy6250 I'd suggest creating separate endpoints for each of these. Also check what is already available; for example, I see that there is

coqui-ai-TTS/TTS/server/server.py

Line 216 in 19fce2c

@app.route("/locales", methods=["GET"])

and

coqui-ai-TTS/TTS/server/server.py

Line 227 in 19fce2c

@app.route("/voices", methods=["GET"])
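To illustrate the "one endpoint per resource" suggestion, here is a framework-free sketch that maps each path to its own small handler. The paths `/models` and `/speakers`, the handler names, and the `STATE` contents are all assumptions for illustration; in the actual server these would be Flask routes alongside the existing `/locales` and `/voices`.

```python
import json


def models_payload(state):
    """Payload for a hypothetical /models endpoint."""
    return {"models": state["available_models"]}


def speakers_payload(state):
    """Payload for a hypothetical /speakers endpoint (loaded model only)."""
    return {"model": state["loaded_model"], "speakers": state["loaded_speakers"]}


# One handler per path, instead of a single combined response.
ROUTES = {"/models": models_payload, "/speakers": speakers_payload}


def handle_get(path, state):
    """Return (status_code, json_body) for a GET request to `path`."""
    handler = ROUTES.get(path)
    if handler is None:
        return 404, json.dumps({"error": "not found"})
    return 200, json.dumps(handler(state))


# Example values only; a real server would fill these from its loaded model.
STATE = {
    "available_models": ["tts_models/en/vctk/vits"],
    "loaded_model": "tts_models/en/vctk/vits",
    "loaded_speakers": ["p225", "p226"],
}

print(handle_get("/speakers", STATE))
```

Keeping the endpoints separate means a client that only needs speaker names does not pay for the full model listing, and each response shape stays simple.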

eginhard added the "enhancement" (New feature or request) and "good first issue" (Good for newcomers) labels May 30, 2024

eginhard assigned Roy6250 Jul 24, 2024
