occm

This repository is my attempt to apply one-class learning to the detection of synthetic speech. In more formal terms, I am trying to turn a hyperplane classifier into a hypersphere that encloses the positive samples.
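To make the sphere idea concrete, here is a minimal sketch in plain Python. It is not the loss used in this repository. It only illustrates the geometry, assuming the embeddings are already computed. The names `fit_hypersphere`, `anomaly_score`, and `coverage` are hypothetical and chosen for illustration.

```python
import math


def fit_hypersphere(embeddings, coverage=0.95):
    """Fit a center and radius enclosing roughly `coverage` of the positive samples."""
    n = len(embeddings)
    dim = len(embeddings[0])
    # Center: the mean of the positive embeddings (an assumption of this sketch).
    center = [sum(e[i] for e in embeddings) / n for i in range(dim)]
    # Radius: the distance of the sample at the requested coverage rank.
    dists = sorted(math.dist(e, center) for e in embeddings)
    idx = min(n - 1, max(0, math.ceil(coverage * n) - 1))
    return center, dists[idx]


def anomaly_score(embedding, center, radius):
    """Score above zero: outside the sphere. Zero or below: inside or on it."""
    return math.dist(embedding, center) - radius


# Toy 2-D "embeddings" of positive samples, made up for this example.
positives = [(0.0, 0.1), (0.1, -0.1), (-0.1, 0.0), (0.05, 0.05)]
center, radius = fit_hypersphere(positives, coverage=1.0)

inside = anomaly_score((0.0, 0.0), center, radius)
outside = anomaly_score((3.0, 3.0), center, radius)
```

A hyperplane classifier needs examples from both classes to place its boundary. The sphere is fitted from positive samples alone, and anything far from the center is flagged, which is the appeal of one-class learning when unseen synthesis methods appear at test time.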

For feature extraction (the frontend), I use the wav2vec model from Meta and fine-tune it on a subset of real and synthetic samples. For classification (the backend), I work with several models, including SE-ResNet and AASIST. This work is in progress, and all suggestions are welcome.

Note

Install this pinned version of fairseq:

pip install git+https://github.com/facebookresearch/fairseq.git@a54021305d6b3c4c5959ac9395135f63202db8f1

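Fix the numpy package if necessary. Recent NumPy releases no longer provide the np.float alias, which this fairseq revision may still reference. If you hit that error, replace the alias in the affected file:

np.float -> np.float64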
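As an illustration of the float -> float64 fix, the snippet below shows the kind of one-line change involved. It assumes the error comes from the `np.float` alias, which was removed in NumPy 1.24. The variable name `buf` is made up for this example, and the exact file to patch depends on your traceback.

```python
import numpy as np

# Before (fails on NumPy >= 1.24 with an AttributeError):
#     buf = np.zeros(4, dtype=np.float)

# After: name the 64-bit float type explicitly.
buf = np.zeros(4, dtype=np.float64)
```

`np.float` was only an alias for Python's built-in `float`, which NumPy maps to `float64`, so the replacement keeps the same precision.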


nguyenvulong/occm

Folders and files

Latest commit

History

Repository files navigation

occm

Note

install this version of fairseq

fix numpy package if necessary

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages