T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation

Paul-Ambroise Duquenne; Hongyu Gong; Benoît Sagot; Holger Schwenk

Communication Dans Un Congrès Année : 2022

T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation

(1, 2) , (1) , (2) , (1)

1
2

Paul-Ambroise Duquenne

Fonction : Auteur

Meta AI

Automatic Language Modelling and ANAlysis & Computational Humanities

Hongyu Gong

Fonction : Auteur

Meta AI

Benoît Sagot

Fonction : Auteur
PersonId : 1461
IdHAL : bsagot
ORCID : 0000-0002-0107-8526
IdRef : 177454229

Automatic Language Modelling and ANAlysis & Computational Humanities

Holger Schwenk

Fonction : Auteur
PersonId : 1180405

Meta AI

Résumé

We present a new approach to perform zeroshot cross-modal transfer between speech and text for translation tasks. Multilingual speech and text are encoded in a joint fixed-size representation space. Then, we compare different approaches to decode these multimodal and multilingual fixed-size representations, enabling zero-shot translation between languages and modalities. All our models are trained without the need of cross-modal labeled translation data. Despite a fixed-size representation, we achieve very competitive results on several text and speech translation tasks. In particular, we outperform the state of the art for zero-shot speech translation on Must-C. We also introduce the first results for zero-shot direct speechto-speech and text-to-speech translation.

Domaines

Intelligence artificielle [cs.AI] Informatique et langage [cs.CL]

Fichier principal

T_modules___EMNLP_2022___8_pages-3.pdf (767.66 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Benoît Sagot : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03834732

Soumis le : dimanche 30 octobre 2022-23:45:07

Dernière modification le : mardi 3 octobre 2023-17:18:04

Archivage à long terme le : mardi 31 janvier 2023-18:22:25

Dates et versions

hal-03834732 , version 1 (30-10-2022)

Identifiants

HAL Id : hal-03834732 , version 1

Citer

Paul-Ambroise Duquenne, Hongyu Gong, Benoît Sagot, Holger Schwenk. T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation. EMNLP 2022 - 2022 Conference on Empirical Methods in Natural Language Processing, Dec 2022, Abu Dhabi, United Arab Emirates. ⟨hal-03834732⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INRIA2

45 Consultations

92 Téléchargements

T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager