
Spoken_Language_Recognition_Tensorflow_Embedded

An artificial neural network that uses mel spectrograms to recognize the language spoken in human conversations, running with TensorFlow Lite Micro on an Arduino Nano 33 BLE Sense.

Project for the course Hardware Architecture for Embedded and Edge AI at Politecnico di Milano, A.Y. 2022-2023

Developed by:

  • Simone Giampà
  • Claudio Galimberti

Project summary

The goal of this project is to develop a neural network able to recognize the language spoken by a person, given a short audio clip of their voice. The network is trained on a dataset that we produced ourselves, containing audio clips of people speaking in 3 languages: Italian, English and French. The network is trained on MFCCs (mel-frequency cepstral coefficients), a compact representation derived from the mel spectrograms of the audio clips, and it is able to tell the 3 languages apart. This convolutional neural network runs on an Arduino Nano 33 BLE Sense, using the TensorFlow Lite for Microcontrollers library.

The dataset is kept private for privacy reasons.
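For orientation, the snippet below is a minimal sketch of how a TFLite Micro model of this kind is typically loaded and invoked on the Arduino; it is not the exact code in this repository. The array name g_model, the tensor arena size, and the assumed int8 MFCC input layout are illustrative assumptions, and the interpreter constructor arguments vary slightly between library versions.

```cpp
// Minimal TFLite Micro inference skeleton (illustrative sketch, not the repo's exact code).
// Assumptions: the model is exported as a C byte array named g_model, the input tensor
// holds a quantized MFCC matrix, and the output holds 3 per-language scores.
#include <TensorFlowLite.h>
#include "tensorflow/lite/micro/all_ops_resolver.h"
#include "tensorflow/lite/micro/micro_interpreter.h"
#include "tensorflow/lite/schema/schema_generated.h"
#include "model.h"  // C byte array produced from the .tflite file

namespace {
constexpr int kTensorArenaSize = 60 * 1024;  // tune to the model's real memory needs
uint8_t tensor_arena[kTensorArenaSize];
const tflite::Model* model = nullptr;
tflite::MicroInterpreter* interpreter = nullptr;
TfLiteTensor* input = nullptr;
TfLiteTensor* output = nullptr;
}  // namespace

void setup() {
  Serial.begin(115200);

  model = tflite::GetModel(g_model);       // g_model: assumed name of the byte array
  static tflite::AllOpsResolver resolver;  // registers all built-in operators
  // Older library versions also take an ErrorReporter as a final constructor argument.
  static tflite::MicroInterpreter static_interpreter(
      model, resolver, tensor_arena, kTensorArenaSize);
  interpreter = &static_interpreter;

  interpreter->AllocateTensors();
  input = interpreter->input(0);
  output = interpreter->output(0);
}

void loop() {
  // 1. Record a short audio clip and compute its MFCC matrix (arduinoMFCC library).
  // 2. Copy the (quantized) coefficients into input->data.int8.
  // 3. Run the model and read the per-language scores.
  if (interpreter->Invoke() == kTfLiteOk) {
    // output->data.int8[0..2]: scores for Italian, English, French (assumed order)
  }
  delay(1000);
}
```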

Project structure

The project is structured as follows:

  • Arduino audio recorder: contains Arduino code meant to record audio data and either store it on an SD card connected to the Arduino Nano 33 BLE Sense or send it to a computer via the serial port (a minimal recording sketch is shown after this list)
  • Audio recording notebooks: contains Jupyter notebooks used to transform the binary data into WAV audio files and process them. They can also listen to a serial port, record the audio data coming from it, and save the raw data to a file
  • Course Notebooks: contains Jupyter notebooks used to train and test the neural network, provided by the professor of our university course
  • arduinoMFCC library: contains the Arduino library used to extract MFCCs from audio data. The library can run both in an Arduino environment and on a Linux computer
  • C++ MFCC scripts: contains a C++ script that uses the arduinoMFCC library to extract MFCCs from audio data. It runs in a Linux environment
  • Arduino TFMicro Inference with MFCC: contains an Arduino sketch that uses the arduinoMFCC library to extract MFCCs from audio data and runs a TFMicro model to perform inference on a short audio clip recorded by the microcontroller itself. This folder contains the core of the project
  • Models: contains the TensorFlow model, the TensorFlow Lite model and the TFMicro model (C byte array) used for inference on the Arduino Nano 33 BLE Sense
  • dataset_creator: Jupyter notebook used to create the dataset and split it into training and validation sets
  • training_network: Jupyter notebook used to train the neural network, quantize the model to TFLite and convert it to TFMicro. It then evaluates the model on the different test sets created ad hoc
  • Report: report of the project
  • Presentation: presentation of the project
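As a reference for the audio recorder mentioned above, the following is a minimal sketch of how raw audio can be captured with the onboard PDM microphone of the Nano 33 BLE Sense and streamed over the serial port. The 16 kHz mono configuration and the raw-byte framing are assumptions; the actual recorder in this repository may use different settings or write to the SD card instead.

```cpp
// Minimal PDM capture sketch for the Arduino Nano 33 BLE Sense
// (illustrative, not the repo's exact recorder).
#include <PDM.h>

constexpr int kSampleRate = 16000;   // assumed capture rate (mono, 16-bit)
short sampleBuffer[512];             // holds one chunk of PDM samples
volatile int samplesRead = 0;

// Called by the PDM library whenever new microphone data is available
void onPDMdata() {
  int bytesAvailable = PDM.available();
  PDM.read(sampleBuffer, bytesAvailable);
  samplesRead = bytesAvailable / 2;  // two bytes per 16-bit sample
}

void setup() {
  Serial.begin(115200);
  while (!Serial);

  PDM.onReceive(onPDMdata);
  if (!PDM.begin(1, kSampleRate)) {  // one channel, 16 kHz
    Serial.println("Failed to start the PDM microphone!");
    while (1);
  }
}

void loop() {
  if (samplesRead) {
    // Stream the raw 16-bit samples to the host; a notebook on the computer
    // side collects the bytes and converts them into a WAV file.
    Serial.write((uint8_t*)sampleBuffer, samplesRead * 2);
    samplesRead = 0;
  }
}
```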
