-
Notifications
You must be signed in to change notification settings - Fork 0
This pipeline processes voice queries by detecting voice activity with VAD, converting speech to text, generating concise responses using a Large Language Model (LLM), and converting text back to speech. It features low latency, limits responses to 2 sentences, and allows customization of pitch, voice gender, and speed.
Adityaswami05/Voice_Pipeline
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
About
This pipeline processes voice queries by detecting voice activity with VAD, converting speech to text, generating concise responses using a Large Language Model (LLM), and converting text back to speech. It features low latency, limits responses to 2 sentences, and allows customization of pitch, voice gender, and speed.
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published