Use pyannote-audio for speaker diarization #10

saharmor · 2022-12-08T04:56:28Z

The plan is to combine Whisper and pyannote.audio, aligning their outputs by timestamp, to produce something along the lines of:

Person A: Hi
Person B: Hello, how are you
Person A: I'm good, and you?
....
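
The timestamp-based merge described above could be sketched roughly as follows. This is a minimal illustration, not the project's implementation: it assumes the transcription has already been reduced to timed text segments and the diarization to timed speaker turns. The `Segment` and `Turn` classes, `assign_speakers`, and `format_transcript` are hypothetical names introduced here, and no Whisper or pyannote.audio calls are made. Each segment is given the speaker whose turn overlaps it the most.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A transcribed span of audio (hypothetical shape for ASR output)."""
    start: float
    end: float
    text: str


@dataclass
class Turn:
    """A diarized span of audio for one speaker (hypothetical shape)."""
    start: float
    end: float
    speaker: str


def overlap(a_start, a_end, b_start, b_end):
    """Return the length in seconds shared by two intervals (0.0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(segments, turns, unknown="Unknown"):
    """Label each segment with the speaker whose turn overlaps it the most."""
    labeled = []
    for seg in segments:
        best_speaker, best_overlap = unknown, 0.0
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        labeled.append((best_speaker, seg.text))
    return labeled


def format_transcript(labeled):
    """Merge consecutive segments from the same speaker into one line each."""
    merged = []
    for speaker, text in labeled:
        if merged and merged[-1][0] == speaker:
            merged[-1] = (speaker, merged[-1][1] + " " + text)
        else:
            merged.append((speaker, text))
    return "\n".join(f"{speaker}: {text}" for speaker, text in merged)


if __name__ == "__main__":
    # Made-up timings for illustration only.
    segments = [
        Segment(0.0, 1.0, "Hi"),
        Segment(1.2, 3.0, "Hello, how are you"),
        Segment(3.1, 5.0, "I'm good, and you?"),
    ]
    turns = [
        Turn(0.0, 1.1, "Person A"),
        Turn(1.1, 3.05, "Person B"),
        Turn(3.05, 5.2, "Person A"),
    ]
    print(format_transcript(assign_speakers(segments, turns)))
```

Picking the turn with the largest overlap is only one possible rule. A segment that spans a speaker change would still get a single label, so splitting at word-level timestamps may work better if they are available.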


remic33 · 2022-12-14T16:52:44Z

I am working on the same subject. You can find work done by Majdoddin here: https://github.com/Majdoddin/nlp
It's not perfect, but it's a good place to start. I'll push my solution when it's done.

saharmor · 2023-01-11T13:07:02Z

@remic33 Cool - any plans to build into Whisper Playground?

saharmor added the "enhancement" (New feature or request) and "help wanted" (Extra attention is needed) labels on Dec 8, 2022
