Learnable MFCCs for Speaker Verification

Xuechen Liu; Md Sahidullah; Tomi Kinnunen

doi:10.1109/ISCAS51556.2021.9401593

Communication Dans Un Congrès Année : 2021

Learnable MFCCs for Speaker Verification

(1, 2) , (1) , (2)

1
2

Xuechen Liu

Fonction : Auteur
PersonId : 1090826

Speech Modeling for Facilitating Oral-Based Communication

University of Eastern Finland

Md Sahidullah

Fonction : Auteur
PersonId : 737397
IdHAL : sahid

Speech Modeling for Facilitating Oral-Based Communication

Tomi Kinnunen

Fonction : Auteur

University of Eastern Finland

Résumé

We propose a learnable mel-frequency cepstral coefficients (MFCCs) front-end architecture for deep neural network (DNN) based automatic speaker verification. Our architecture retains the simplicity and interpretability of MFCC-based features while allowing the model to be adapted to data flexibly. In practice, we formulate data-driven version of four linear transforms in a standard MFCC extractor-windowing, discrete Fourier transform (DFT), mel filterbank and discrete cosine transform (DCT). Results reported reach up to 6.7% (VoxCeleb1) and 9.7% (SITW) relative improvement in term of equal error rate (EER) from static MFCCs, without additional tuning effort. Index Terms-Speaker verification, feature extraction, melfrequency cesptral coefficients (MFCCs).

Domaines

Traitement du signal et de l'image [eess.SP] Apprentissage [cs.LG] Intelligence artificielle [cs.AI]

Fichier principal

ISCAS_2021_Xuechen.pdf (311.35 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Md Sahidullah : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03139532

Soumis le : vendredi 12 février 2021-09:58:56

Dernière modification le : lundi 11 septembre 2023-17:41:19

Archivage à long terme le : jeudi 13 mai 2021-18:29:02

Dates et versions

hal-03139532 , version 1 (12-02-2021)

Identifiants

HAL Id : hal-03139532 , version 1
DOI : 10.1109/ISCAS51556.2021.9401593

Citer

Xuechen Liu, Md Sahidullah, Tomi Kinnunen. Learnable MFCCs for Speaker Verification. ISCAS 2021 - IEEE International Symposium on Circuits and Systems, May 2021, Daegu, South Korea. ⟨10.1109/ISCAS51556.2021.9401593⟩. ⟨hal-03139532⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

198 Consultations

474 Téléchargements

Learnable MFCCs for Speaker Verification

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager