
This pipeline processes voice queries by detecting voice activity with VAD, converting speech to text, generating concise responses using a Large Language Model (LLM), and converting text back to speech. It features low latency, limits responses to 2 sentences, and allows customization of pitch, voice gender, and speed.
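The four stages described above can be sketched as a chain of pluggable callables. This is a minimal illustration and not the repository's actual code: `VoiceSettings`, `limit_sentences`, `run_pipeline`, and all parameter names and defaults are hypothetical, and the VAD, speech-to-text, LLM, and text-to-speech back ends are passed in as plain functions so that no particular library is assumed.

```python
import re
from dataclasses import dataclass
from typing import Any, Callable, Optional


@dataclass
class VoiceSettings:
    # Hypothetical field names and defaults; the repository's real options may differ.
    pitch: float = 1.0
    gender: str = "female"
    speed: float = 1.0


def limit_sentences(text: str, max_sentences: int = 2) -> str:
    """Keep at most `max_sentences` sentences.

    Sentences are split naively on '.', '!' or '?' followed by whitespace,
    so abbreviations such as "Dr. Smith" are treated as sentence ends.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return " ".join(sentences[:max_sentences])


def run_pipeline(
    audio: Any,
    is_speech: Callable[[Any], bool],                 # VAD stage
    transcribe: Callable[[Any], str],                 # speech-to-text stage
    generate: Callable[[str], str],                   # LLM stage
    synthesize: Callable[[str, VoiceSettings], Any],  # text-to-speech stage
    settings: Optional[VoiceSettings] = None,
) -> Optional[Any]:
    """Run one voice query through the stages; return None if no speech is detected."""
    settings = settings or VoiceSettings()
    if not is_speech(audio):
        return None  # skip the slower stages when the input contains no speech
    query = transcribe(audio)
    reply = limit_sentences(generate(query), max_sentences=2)
    return synthesize(reply, settings)
```

Running the VAD check first lets the sketch return early on silence, which is one plausible way to keep latency low. Truncating the LLM output after generation is only one way to enforce the two-sentence limit; the same limit could also be requested in the prompt.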


Adityaswami05/Voice_Pipeline

