The Perceptimatic English Benchmark for Speech Perception Models

Juliette Millet; Ewan Dunbar

Communication Dans Un Congrès Année : 2020

The Perceptimatic English Benchmark for Speech Perception Models

(1, 2, 3) , (1, 3)

1
2
3

Juliette Millet

Fonction : Auteur
PersonId : 1053078

Laboratoire de Linguistique Formelle

Ecole Doctorale Frontiere de l’Innovation en Recherche et Education

Apprentissage machine et développement cognitif

Ewan Dunbar

Fonction : Auteur
PersonId : 1078898

Laboratoire de Linguistique Formelle

Apprentissage machine et développement cognitif

Résumé

We present the Perceptimatic English Benchmark, an open experimental benchmark for evaluating quantitative models of speech perception in English. The benchmark consists of ABX stimuli along with the responses of 91 American Englishspeaking listeners. The stimuli test discrimination of a large number of English and French phonemic contrasts. They are extracted directly from corpora of read speech, making them appropriate for evaluating statistical acoustic models (such as those used in automatic speech recognition) trained on typical speech data sets. We show that phone discrimination is correlated with several types of models, and give recommendations for researchers seeking easily calculated norms of acoustic distance on experimental stimuli. We show that DeepSpeech, a standard English speech recognizer, is more specialized on English phoneme discrimination than English listeners, and is poorly correlated with their behaviour, even though it yields a low error on the decision task given to humans.

Mots clés

Benchmarks Speech perception Acoustic distance Speech recognition

Domaines

Sciences cognitives Informatique [cs]

Fichier principal

Cog_Sci_2020___Predicting_phoneme_confusions__Juliette_.pdf (196.44 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Ewan Dunbar : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03087248

Soumis le : mercredi 23 décembre 2020-15:53:42

Dernière modification le : vendredi 19 avril 2024-16:18:59

Dates et versions

hal-03087248 , version 1 (23-12-2020)

Identifiants

HAL Id : hal-03087248 , version 1

Citer

Juliette Millet, Ewan Dunbar. The Perceptimatic English Benchmark for Speech Perception Models. CogSci 2020 - 42nd Annual Virtual Meeting of the Cognitive Science Society, Jul 2020, Toronto / Virtual, Canada. ⟨hal-03087248⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS CNRS INRIA EHESS LLF LSCP DEC INRIA2 CAMPUS-AAR AAI PSL AMIDEX UP-SOCIETES-HUMANITES ANR PRAIRIE-IA

58 Consultations

57 Téléchargements

The Perceptimatic English Benchmark for Speech Perception Models

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager