Conference paper, Year: 2022

Textless-lib: a Library for Textless Spoken Language Processing

Abstract

Textless spoken language processing research aims to extend the applicability of the standard NLP toolset to spoken language and to languages with few or no textual resources. In this paper, we introduce textless-lib, a PyTorch-based library aimed at facilitating research in this area. We describe the building blocks that the library provides and demonstrate its usability by discussing three different use-case examples: (i) speaker probing, (ii) speech resynthesis and compression, and (iii) speech continuation. We believe that textless-lib substantially simplifies research in the textless setting and will be handy not only for speech researchers but also for the NLP community at large. The code, documentation, and pre-trained models are available at https://github.com/facebookresearch/textlesslib/.
Main file
textlesslib_paper.pdf (1.23 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03831838, version 1 (15-02-2023)

Identifiers

  • HAL Id: hal-03831838, version 1

Cite

Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, et al. Textless-lib: a Library for Textless Spoken Language Processing. NAACL 2022 - Annual Conference of the North American Chapter of the Association for Computational Linguistics, Jul 2022, Seattle, United States. pp. 1-9. ⟨hal-03831838⟩
