Perceptimatic: A human speech perception benchmark for unsupervised subword modelling - Inria - Institut national de recherche en sciences et technologies du numérique
Conference paper Year: 2020

Perceptimatic: A human speech perception benchmark for unsupervised subword modelling

Abstract

In this paper, we present a data set and methods for comparing speech processing models with human behaviour on a phone discrimination task. We provide Perceptimatic, an open data set consisting of French and English speech stimuli, together with the responses of 91 English- and 93 French-speaking listeners. The stimuli test a wide range of French and English contrasts and are extracted directly from corpora of natural running read speech used for the 2017 Zero Resource Speech Challenge. We provide a method for comparing humans' perceptual space with models' representational space, and we apply it to models previously submitted to the Challenge. We show that, unlike unsupervised models and supervised multilingual models, a standard supervised monolingual HMM-GMM phone recognition system, while good at discriminating phones, yields a representational space very different from that of human native listeners.
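The comparison rests on a phone discrimination (ABX-style) setup: for each triplet, a model is judged on whether the probe stimulus X is closer, in its representation space, to the stimulus from the same phone category (A) than to the one from the other category (B). As a rough illustration only (the paper's exact distance and scoring procedure may differ), the sketch below computes a delta value from DTW-aligned frame-wise cosine distances; all function names here are hypothetical:

```python
import numpy as np

def dtw_distance(x, y):
    """Dynamic-time-warping distance between two feature sequences
    (frames x dims), using frame-wise cosine distance."""
    def cos_dist(a, b):
        return 1.0 - np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12)
    n, m = len(x), len(y)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            c = cos_dist(x[i - 1], y[j - 1])
            D[i, j] = c + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    # Normalise by the sequence lengths so stimuli of different
    # durations yield comparable distances.
    return D[n, m] / (n + m)

def abx_delta(A, B, X):
    """Delta value for one ABX triplet: positive when X (same phone
    category as A) is closer to A than to B in representation space."""
    return dtw_distance(B, X) - dtw_distance(A, X)
```

Delta values of this kind can then be compared against listeners' discrimination responses to assess how well a model's representational space matches human perception.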
Main file
Interspeech_2020_Native_Perceptimatic__a_human_benchmark_for_the_Zerospeech_challenges.pdf (500.24 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03087252, version 1 (23-12-2020)

Identifiers

  • HAL Id: hal-03087252, version 1

Cite

Juliette Millet, Ewan Dunbar. Perceptimatic: A human speech perception benchmark for unsupervised subword modelling. Interspeech 2020 - 21st Annual Conference of the International Speech Communication Association, Oct 2020, Shanghai / Virtual, China. ⟨hal-03087252⟩
