Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, Epiciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

The Speed Submission to DIHARD II: Contributions & Lessons Learned

Abstract : This paper describes the speaker diarization systems developed for the Second DIHARD Speech Diarization Challenge (DIHARD II) by the Speed team. Besides describing the system, which considerably outperformed the challenge baselines, we also focus on the lessons learned from numerous approaches that we tried for single and multi-channel systems. We present several components of our diarization system, including categorization of domains, speech enhancement, speech activity detection, speaker embeddings, clustering methods, resegmentation, and system fusion. We analyze and discuss the effect of each such component on the overall diarization performance within the realistic settings of the challenge.
Complete list of metadata

Cited literature [36 references]  Display  Hide  Download
Contributor : Md Sahidullah Connect in order to contact the contributor
Submitted on : Tuesday, June 30, 2020 - 7:04:00 PM
Last modification on : Saturday, June 25, 2022 - 10:49:43 PM


Files produced by the author(s)


  • HAL Id : hal-02352840, version 2


Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing yin, Sunit Sivasankaran, et al.. The Speed Submission to DIHARD II: Contributions & Lessons Learned. 2019. ⟨hal-02352840v2⟩



Record views


Files downloads