Speech enhancement models: MLP, auto-encoder, GAN
The dataset is the speech enhancement dataset built by the University of Edinburgh, hosted on DataShare.
- PyTorch
conda install pytorch torchvision -c pytorch
- librosa
pip install librosa
The audio files should be sliced into segments of equal duration. A Short-Time Fourier Transform (STFT) is then applied to each segment, turning it into a 2D matrix (frequency bins by time frames). A CNN is then used to extract features from these matrices.
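The slicing and STFT steps can be sketched roughly as follows. This is a minimal illustration, not the repository's actual preprocessing code: it uses plain NumPy instead of librosa so that it runs without extra dependencies, and the helper names (`slice_audio`, `stft_magnitude`), the 16 kHz sample rate, the 1-second segment length, and the `n_fft`/`hop` values are all assumptions chosen for the example. A synthetic sine wave stands in for a real recording.

```python
import numpy as np


def slice_audio(signal, segment_len):
    """Split a 1-D signal into equal-length segments, dropping the trailing remainder."""
    n_segments = len(signal) // segment_len
    return signal[: n_segments * segment_len].reshape(n_segments, segment_len)


def stft_magnitude(segment, n_fft=512, hop=128):
    """Magnitude STFT with a Hann window; returns shape (n_fft // 2 + 1, n_frames)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(segment) - n_fft) // hop
    # Cut the segment into overlapping windowed frames, one row per frame.
    frames = np.stack(
        [segment[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    # FFT of each frame, then transpose so rows are frequency bins.
    return np.abs(np.fft.rfft(frames, axis=1)).T


# Synthetic stand-in for a recording: a 440 Hz tone, a bit over 3 seconds long.
sr = 16000  # assumed sample rate
t = np.arange(3 * sr + 1234) / sr
signal = np.sin(2 * np.pi * 440.0 * t).astype(np.float32)

segments = slice_audio(signal, segment_len=sr)  # 1-second pieces (assumed length)
spectrograms = np.stack([stft_magnitude(s) for s in segments])
print(segments.shape, spectrograms.shape)
```

Each entry of `spectrograms` is one 2D matrix per segment, which is the form a CNN would take as a single-channel input. With real data, `librosa.load` and `librosa.stft` could replace the synthetic signal and the hand-written STFT.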