GitHub - marisbasha/vak: A neural network framework for researchers studying animal acoustic communication

A neural network framework for animal acoustic communication and bioacoustics

🚧 vak version 1.0.0 is in development! 🚧 📣 Test out the alpha release: `pip install vak==1.0.0a1`. 📣 For more info, please see this forum post.

vak is a Python framework for neural network models, designed for researchers studying animal acoustic communication and bioacoustics. Many people will be familiar with work in this area on animal vocalizations such as birdsong, bat calls, and even human speech. Neural network models have provided a powerful new tool for researchers in this area, as in many other fields.

The library has two main goals:

Make it easier for researchers studying animal vocalizations to apply neural network algorithms to their data
Provide a common framework that will facilitate benchmarking neural network algorithms on tasks related to animal vocalizations

Currently, the main use is an automatic annotation of vocalizations and other animal sounds. By annotation, we mean something like the example of annotated birdsong shown below:

You give vak training data in the form of audio or spectrogram files with annotations, and then vak helps you train neural network models and use the trained models to predict annotations for new files.

We developed vak to benchmark a neural network model we call tweetynet.
Please see the eLife article here: https://elifesciences.org/articles/63853

For more background on animal acoustic communication and deep learning, and how these intersect with related fields like computational ethology and neuroscience, please see the "About" section below.

Installation

Short version:

with `pip`

$ pip install vak

with `conda`

$ conda install vak -c pytorch -c conda-forge
$ #                  ^ notice additional channel!

Notice that for conda you specify two channels, and that the pytorch channel should come first, so it takes priority when installing the dependencies pytorch and torchvision.

For more details, please see:
https://vak.readthedocs.io/en/latest/get_started/installation.html

We test vak on Ubuntu and MacOS. We have run on Windows and know of other users successfully running vak on that operating system, but installation on Windows may require some troubleshooting. A good place to start is by searching the issues.

Usage

Tutorial

Currently the easiest way to work with vak is through the command line.

You run it with configuration files, using one of a handful of commands.

For more details, please see the "autoannotate" tutorial here:
https://vak.readthedocs.io/en/latest/get_started/autoannotate.html

How can I use my data with `vak`?

Please see the How-To Guides in the documentation here:
https://vak.readthedocs.io/en/latest/howto/index.html

Support / Contributing

For help, please begin by checking out the Frequently Asked Questions:
https://vak.readthedocs.io/en/latest/faq.html.

To ask a question about vak, discuss its development, or share how you are using it, please start a new "Q&A" topic on the VocalPy forum with the vak tag:
https://forum.vocalpy.org/

To report a bug, or to request a feature, please use the issue tracker on GitHub:
https://github.com/vocalpy/vak/issues

For a guide on how you can contribute to vak, please see: https://vak.readthedocs.io/en/latest/development/index.html

Citation

If you use vak for a publication, please cite its DOI:

License

is here.

About

Are humans unique among animals? We speak languages, but is speech somehow like other animal behaviors, such as birdsong? Questions like these are answered by studying how animals communicate with sound. This research requires cutting edge computational methods and big team science across a wide range of disciplines, including ecology, ethology, bioacoustics, psychology, neuroscience, linguistics, and genomics ¹²³. As in many other domains, this research is being revolutionized by deep learning algorithms ¹²³. Deep neural network models enable answering questions that were previously impossible to address, in part because these models automate analysis of very large datasets. Within the study of animal acoustic communication, multiple models have been proposed for similar tasks, often implemented as research code with different libraries, such as Keras and Pytorch. This situation has created a real need for a framework that allows researchers to easily benchmark models and apply trained models to their own data. To address this need, we developed vak. We originally developed vak to benchmark a neural network model, TweetyNet ⁴⁵, that automates annotation of birdsong by segmenting spectrograms. TweetyNet and vak have been used in both neuroscience ⁶⁷⁸ and bioacoustics ⁹. For additional background and papers that have used vak, please see: https://vak.readthedocs.io/en/latest/reference/about.html

"Why this name, vak?"

It has only three letters, so it is quick to type, and it wasn't taken on pypi yet. Also I guess it has something to do with speech. "vak" rhymes with "squawk" and "talk".

Does your library have any poems?

Yes.

Contributors ✨

Thanks goes to these wonderful people (emoji key):

_avanikop 🐛	_{Luke Poeppel} 📖	_{yardencsGitHub} 💻 🤔 📢 📓 💬	_{David Nicholson} 🐛 💻 🔣 📖 💡 🤔 🚇 🚧 🧑‍🏫 📆 👀 💬 📢 ⚠️ ✅	_marichard123 📖	_{Therese Koch} 📖 🐛	_alyndanoel 🤔
_adamfishbein 📖	_vivinastase 🐛 📓	_kaiyaprovost 💻 🤔	_ymk12345 🐛 📖	_neuronalX 🐛 📖	_Khoa 📖	_sthaar 📖 🐛 🤔
_{yangzheng-121} 🐛 🤔	_lmpascual 📖	_{ItamarFruchter} 📖	_{Hjalmar K. Turesson} 🐛 🤔	_nhoglen 🐛	_Ja-sonYun 💻

This project follows the all-contributors specification. Contributions of any kind welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 2,726 Commits
.github		.github
.vscode		.vscode
doc		doc
src		src
tests		tests
.all-contributorsrc		.all-contributorsrc
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
CITATION.cff		CITATION.cff
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
noxfile.py		noxfile.py
pyproject.toml		pyproject.toml
test_vae.ipynb		test_vae.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A neural network framework for animal acoustic communication and bioacoustics