Controllable TalkNet is a web application that lets you synthesize speech that mimics the pitch and pacing of an existing audio clip. It's based on NVIDIA's implementation of TalkNet 2, with some changes to support singing synthesis and higher audio quality.
- A Google account to run Colab, or...
- An NVIDIA GPU with 4+ GB of VRAM
- 10 GB of free space
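
For a local install, the GPU and disk requirements above can be checked before you start. This is a minimal sketch, not part of TalkNet: the helper names are hypothetical, it assumes `nvidia-smi` is on your PATH, and it reads "4+ GB" as 4096 MiB and "10 GB" as 10 GiB, so loosen the thresholds if your card reports slightly less.

```python
import shutil
import subprocess

MIN_VRAM_MIB = 4096             # assumption: "4+ GB" read as 4096 MiB
MIN_FREE_BYTES = 10 * 1024**3   # assumption: "10 GB" read as 10 GiB


def parse_vram_mib(nvidia_smi_output: str) -> list[int]:
    """Parse one integer (MiB) per GPU, skipping blank or non-numeric lines."""
    values = []
    for line in nvidia_smi_output.splitlines():
        text = line.strip()
        if text.isdigit():
            values.append(int(text))
    return values


def has_enough_vram(nvidia_smi_output: str, min_mib: int = MIN_VRAM_MIB) -> bool:
    """True if at least one listed GPU reports min_mib or more."""
    return any(mib >= min_mib for mib in parse_vram_mib(nvidia_smi_output))


def has_enough_disk(path: str = ".", min_bytes: int = MIN_FREE_BYTES) -> bool:
    """True if the filesystem containing path has min_bytes or more free."""
    return shutil.disk_usage(path).free >= min_bytes


def query_vram() -> str:
    """Ask nvidia-smi for total memory per GPU; return "" if it is unavailable."""
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=memory.total",
             "--format=csv,noheader,nounits"],
            capture_output=True, text=True, check=True,
        )
    except (FileNotFoundError, subprocess.CalledProcessError):
        return ""
    return result.stdout


if __name__ == "__main__":
    print("GPU with enough VRAM:", has_enough_vram(query_vram()))
    print("Enough free disk space:", has_enough_disk("."))
```

Run it from the folder where you plan to install TalkNet so the disk check looks at the right drive.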
- Go to the Colab notebook and follow the instructions.
- Download the setup script and extract it to a folder.
- Run setup.bat. The initial setup will take about 20 minutes.
- When it's done, run talknet.bat to start TalkNet on http://127.0.0.1:8050/. To download updates, run update.bat.
- Install Docker and NVIDIA Container Toolkit.
- Download the Dockerfile. Open a terminal, and navigate to the directory where you saved it.
- Run `docker build -t talknet-offline .` to build the image. Add `sudo` if you're not using rootless Docker.
- Run `docker run -it --gpus all -p 8050:8050 talknet-offline` to start TalkNet on http://127.0.0.1:8050/.
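
Once TalkNet has been started, you can confirm that the web interface is reachable. This is a minimal sketch that assumes only that the server answers HTTP requests at the address given above. `is_reachable` is a hypothetical helper, not part of TalkNet.

```python
import urllib.error
import urllib.request

TALKNET_URL = "http://127.0.0.1:8050/"  # address from the steps above


def is_reachable(url: str = TALKNET_URL, timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at url, False if the connection fails."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a success status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timed out, or otherwise unreachable.
        return False


if __name__ == "__main__":
    print("TalkNet reachable:", is_reachable())
```

If this reports `False` right after launch, the server may still be starting; wait a moment and run it again.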