Version 2.1.1

hbredin released this 31 Jan 13:50

· 227 commits to develop since this release

Version 2.1.x introduces a major overhaul of pyannote.audio default speaker diarization pipeline, made of three main stages:

neural speaker segmentation applied to a short sliding window;
neural speaker embedding of each (local) speakers;
(global) agglomerative clustering.

More details in the attached technical report.

Assets 3