Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection

Tomi Kinnunen; Rosa González Hautamäki; Ville Vestman; Md Sahidullah

Communication Dans Un Congrès Année : 2019

Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection

(1) , (1) , (1) , (2)

1
2

Tomi Kinnunen

Fonction : Auteur

University of Eastern Finland

Rosa González Hautamäki

Fonction : Auteur

University of Eastern Finland

Ville Vestman

Fonction : Auteur

University of Eastern Finland

Md Sahidullah

Fonction : Auteur
PersonId : 737397
IdHAL : sahid

Speech Modeling for Facilitating Oral-Based Communication

Résumé

We consider technology-assisted mimicry attacks in the context of automatic speaker verification (ASV). We use ASV itself to select targeted speakers to be attacked by human-based mimicry. We recorded 6 naive mimics for whom we select target celebrities from VoxCeleb1 and VoxCeleb2 corpora (7,365 potential targets) using an i-vector system. The attacker attempts to mimic the selected target, with the utterances subjected to ASV tests using an independently developed x-vector system. Our main finding is negative: even if some of the attacker scores against the target speakers were slightly increased, our mimics did not succeed in spoofing the x-vector system. Interestingly, however, the relative ordering of the selected targets (closest, furthest, median) are consistent between the systems, which suggests some level of transferability between the systems.

Mots clés

spoofing mimicry Speaker verification

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV] Intelligence artificielle [cs.AI] Interface homme-machine [cs.HC] Apprentissage [cs.LG] Multimédia [cs.MM] Acoustique [physics.class-ph] Traitement du signal et de l'image [eess.SP] Linguistique

Fichier principal

ICASSP19_Manuscript_UEF_Inria.pdf (390.92 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Md Sahidullah : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02051701

Soumis le : jeudi 28 février 2019-04:46:54

Dernière modification le : lundi 11 septembre 2023-17:41:19

Archivage à long terme le : mercredi 29 mai 2019-17:30:49

Dates et versions

hal-02051701 , version 1 (28-02-2019)

Identifiants

HAL Id : hal-02051701 , version 1

Citer

Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, Md Sahidullah. Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection. ICASSP 2019 – 44th International Conference on Acoustics, Speech, and Signal Processing, May 2019, Brighton, United Kingdom. ⟨hal-02051701⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

90 Consultations

131 Téléchargements

Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager