STOP: A dataset for spoken task oriented semantic parsing - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2023

STOP: A dataset for spoken task oriented semantic parsing

Résumé

End-to-end spoken language understanding (SLU) predicts intent directly from audio using a single model. It promises to improve the performance of assistant systems by leveraging acoustic information lost in the intermediate textual representation and preventing cascading errors from Automatic Speech Recognition (ASR). Further, having one unified model has efficiency advantages when deploying assistant systems on-device. However, the limited number of public audio datasets with semantic parse labels hinders the research progress in this area. In this paper, we release the Spoken Task-Oriented semantic Parsing (STOP) dataset 1 , the largest and most complex SLU dataset publicly available. Additionally, we define low-resource splits to establish a benchmark for improving SLU when limited labeled data is available. Furthermore, in addition to the human-recorded audio, we are releasing a TTS-generated versions to benchmark the performance for low-resource and domain adaptation of end-to-end SLU systems.

Domaines

Linguistique
Fichier principal
Vignette du fichier
2207.10643.pdf (791.11 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03989829 , version 1 (15-02-2023)

Identifiants

  • HAL Id : hal-03989829 , version 1

Citer

Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, et al.. STOP: A dataset for spoken task oriented semantic parsing. SLT-2022 - IEEE Spoken Language Technology Workshop, Jan 2023, Doha, Qatar. ⟨hal-03989829⟩
33 Consultations
82 Téléchargements

Partager

Gmail Facebook X LinkedIn More