Use pyannote-audio for speaker diarization #10

Open
saharmor opened this issue Dec 8, 2022 · 2 comments
Labels
enhancement (New feature or request), help wanted (Extra attention is needed)

Comments

saharmor (Owner) commented Dec 8, 2022

The logic will be to combine the Whisper transcript with the pyannote.audio speaker turns by matching their timestamps, so the output looks something like:

Person A: Hi
Person B: Hello, how are you
Person A: I'm good, and you?
....
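
A minimal sketch of the timestamp matching described above, not a tested integration. It assumes Whisper's `transcribe()` result provides `segments` with `start`, `end`, and `text`, and that the pyannote.audio diarization has already been flattened into `(start, end, speaker)` tuples (for example via `itertracks(yield_label=True)`). The helper names `assign_speakers` and `format_transcript`, the sample data, and the "Person A" / "Person B" labels are hypothetical; pyannote usually emits labels such as `SPEAKER_00`, which would need renaming.

```python
from typing import Dict, List, Tuple

# (start_sec, end_sec, speaker_label): assumed shape of flattened diarization turns
Turn = Tuple[float, float, str]


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(segments: List[Dict], turns: List[Turn]) -> List[Tuple[str, str]]:
    """Label each transcript segment with the speaker who overlaps it the most."""
    labeled = []
    for seg in segments:
        totals: Dict[str, float] = {}
        for start, end, speaker in turns:
            shared = overlap(seg["start"], seg["end"], start, end)
            if shared > 0:
                totals[speaker] = totals.get(speaker, 0.0) + shared
        # Fall back to "Unknown" when no diarization turn overlaps the segment.
        speaker = max(totals, key=totals.get) if totals else "Unknown"
        labeled.append((speaker, seg["text"].strip()))
    return labeled


def format_transcript(labeled: List[Tuple[str, str]]) -> str:
    """Merge consecutive segments from the same speaker into one line each."""
    merged: List[List[str]] = []
    for speaker, text in labeled:
        if merged and merged[-1][0] == speaker:
            merged[-1][1] += " " + text
        else:
            merged.append([speaker, text])
    return "\n".join(f"{speaker}: {text}" for speaker, text in merged)


# Hypothetical sample data standing in for real Whisper / pyannote output.
SEGMENTS = [
    {"start": 0.0, "end": 1.0, "text": " Hi"},
    {"start": 1.2, "end": 3.0, "text": " Hello, how are you"},
    {"start": 3.2, "end": 5.0, "text": " I'm good, and you?"},
]
TURNS = [
    (0.0, 1.1, "Person A"),
    (1.1, 3.1, "Person B"),
    (3.1, 5.2, "Person A"),
]

print(format_transcript(assign_speakers(SEGMENTS, TURNS)))
```

Picking the speaker with the largest overlap is only one possible rule; segments that span a speaker change would need word-level timestamps or splitting to be attributed correctly.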
saharmor added the enhancement and help wanted labels Dec 8, 2022
remic33 commented Dec 14, 2022

I am working on the same subject. You can find work done by Majdoddin here: https://github.com/Majdoddin/nlp
It's not perfect, but it's a good way to start. I'll push my solution when it's done.

saharmor (Owner, Author) commented

@remic33 Cool! Any plans to build this into Whisper Playground?
