Questions regarding the use of Sopare #77
Comments
First, check whether your hardware limits the input. Otherwise, you may be using a different configuration when you plot. Or it is a bug ;).
I noticed that you spend a lot of time on the frequency domain in your YouTube videos. How do I get the frequency-domain representation of a WAV file?
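For anyone with the same question, here is a minimal sketch of one way to get a magnitude spectrum from a WAV file using only the Python standard library. It is not SoPaRe's own code: the function names (`read_mono_wav`, `fft`, `spectrum`) are made up for illustration, and it assumes a 16-bit PCM file. In practice a library such as NumPy (`numpy.fft.rfft`) would be the usual choice; the hand-written radix-2 FFT is only there to keep the example dependency-free.

```python
import cmath
import struct
import wave


def read_mono_wav(path):
    """Return (samples, sample_rate) for a 16-bit PCM WAV file (first channel only)."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        channels = wav.getnchannels()
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())
    values = struct.unpack("<%dh" % (len(frames) // 2), frames)
    # Interleaved channels: take every `channels`-th value to keep channel 0.
    return list(values[::channels]), rate


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def spectrum(samples, rate):
    """Return (frequencies_hz, magnitudes) up to the Nyquist frequency."""
    if not samples:
        raise ValueError("no samples")
    # Truncate to the largest power of two so the radix-2 FFT applies.
    n = 1
    while n * 2 <= len(samples):
        n *= 2
    bins = fft(samples[:n])
    freqs = [k * rate / n for k in range(n // 2 + 1)]
    mags = [abs(b) for b in bins[: n // 2 + 1]]
    return freqs, mags
```

Calling `spectrum(*read_mono_wav("recording.wav"))` (the file name is a placeholder) gives two lists that can be plotted against each other. The frequency resolution is `rate / n`, so longer inputs give finer bins.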
The question about frequency range would interest me as well. Any insights?
What question about frequency range is unanswered?
The question of how the following is possible:
(see 3 comments above in #77 (comment))
Only the full FFT graph shows all frequencies. Single token graphs are, as the word states, tokenized and contain only part of the frequencies. Just as a single piece of cake doesn't contain all the ingredients of the full cake...
That hadn't occurred to me. Thanks for the clarification and a great project overall!
Hello again, still a big fan. I've been tinkering around a bit with your program, and I've got some questions.
By default, the frequency range used in the speech analysis is 20-600 Hz, even though human speech goes much higher than that. Does increasing this range have an effect on the accuracy of the analysis? Also, it seems that the x axis of the built-in FFT plot function caps out at 2000 Hz by default, even if the frequency range is e.g. 20-5000 Hz. Does the actual analysis still take the entire range into account, and can the axis limits be increased?
I noticed the master branch hasn't had a commit since January 2018. Have there been any significant improvements in the testing branch that would warrant using that over the master branch for my own project?
In my own project, I would like my Raspberry to recognise simple voice commands from a handful of different people. I can't necessarily get word samples from all of them, so I'm wondering what settings in the config file I should tinker with in order to improve the rate of success.
Thanks in advance, have a good day.
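To make the frequency-range part of the question concrete, here is a small sketch of what a band limit such as 20-600 Hz means in terms of FFT bins. It is an illustration only and does not reproduce SoPaRe's analysis: `bin_frequencies` and `band_limit` are hypothetical helper names, and the sample rate and FFT size in the usage note are arbitrary assumptions rather than SoPaRe defaults.

```python
def bin_frequencies(sample_rate, fft_size):
    """Centre frequency in Hz of each FFT bin from 0 up to the Nyquist frequency."""
    return [k * sample_rate / fft_size for k in range(fft_size // 2 + 1)]


def band_limit(freqs, low_hz, high_hz):
    """Indices of the bins whose frequency lies within [low_hz, high_hz]."""
    return [k for k, f in enumerate(freqs) if low_hz <= f <= high_hz]


def clamp_to_nyquist(high_hz, sample_rate):
    """An upper limit above sample_rate / 2 cannot be represented, so cap it."""
    return min(high_hz, sample_rate / 2)
```

With an assumed sample rate of 8000 Hz and an FFT size of 1024, each bin is 7.8125 Hz wide and `band_limit(bin_frequencies(8000, 1024), 20, 600)` selects the bins between those two frequencies. Widening the band to 20-5000 Hz adds bins only up to the 4000 Hz Nyquist limit, which is one way the recording hardware or sample rate can cap the usable range.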