Version 2.1.1
Version 2.1.x
introduces a major overhaul of pyannote.audio
default speaker diarization pipeline, made of three main stages:
- neural speaker segmentation applied to a short sliding window;
- neural speaker embedding of each (local) speakers;
- (global) agglomerative clustering.
More details in the attached technical report.