Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs

Nathan Trouvain; Xavier Hinaut

doi:10.1007/978-3-030-86383-8_6

Conference paper, Year: 2021

Nathan Trouvain

Role: Author

Mnemonic Synergy

Xavier Hinaut

Role: Author
PersonId : 8171
IdHAL : xavier-hinaut
ORCID : 0000-0002-1924-1184
IdRef : 22823218X

Mnemonic Synergy

Abstract

Domestic canaries produce complex vocal patterns embedded in various levels of abstraction. Studying such temporal organization is of particular relevance to understanding how animal brains represent and process vocal inputs such as language. However, this requires a large amount of annotated data. We propose a fast and easy-to-train transducer model based on RNN architectures to automate parts of the annotation process. This is similar to a speech recognition task. We demonstrate that RNN architectures can be efficiently applied to spectral features (MFCC) to annotate songs at the time-frame level and at the phrase level. We achieved around 95% accuracy at the frame level on particularly complex canary songs, and ESNs achieved a word error rate (WER) of around 5% at the phrase level. Moreover, we are able to build this model using only around 13 to 20 minutes of annotated songs. Training the ESN takes only 35 seconds on 2 hours and 40 minutes of data, which allows experiments to be run quickly without powerful hardware.
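The two metrics quoted in the abstract, frame-level accuracy and phrase-level word error rate, can be illustrated with a minimal sketch. The function names, the toy label sequences, and the step that collapses runs of identical frame labels into phrases are assumptions made for illustration only. The record does not describe how the authors derive phrases from frame predictions or how they compute their scores.

```python
def frame_accuracy(predicted, reference):
    """Fraction of time frames whose predicted label matches the reference."""
    if len(predicted) != len(reference) or not reference:
        raise ValueError("sequences must be non-empty and of equal length")
    correct = sum(p == r for p, r in zip(predicted, reference))
    return correct / len(reference)


def collapse_frames(frames):
    """Merge runs of identical consecutive frame labels into one phrase label.

    Assumption: a phrase is a maximal run of frames sharing the same label.
    """
    phrases = []
    for label in frames:
        if not phrases or phrases[-1] != label:
            phrases.append(label)
    return phrases


def word_error_rate(predicted, reference):
    """Levenshtein distance between label sequences, divided by reference length."""
    if not reference:
        raise ValueError("reference must be non-empty")
    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j predicted labels.
    prev = list(range(len(predicted) + 1))
    for i, ref_label in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_label in enumerate(predicted, start=1):
            substitution = prev[j - 1] + (ref_label != hyp_label)
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, substitution))
        prev = curr
    return prev[-1] / len(reference)


# Toy frame-level annotations (hypothetical labels, not data from the paper).
reference_frames = ["A", "A", "A", "B", "B", "C", "C", "C"]
predicted_frames = ["A", "A", "B", "B", "B", "C", "A", "C"]

print(frame_accuracy(predicted_frames, reference_frames))
print(word_error_rate(collapse_frames(predicted_frames),
                      collapse_frames(reference_frames)))
```

In this toy case a single mislabeled frame inside the last phrase splits it into three, so the phrase-level error is much larger than the frame-level error. This shows why the two levels are reported separately.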

Keywords

Birdsong; Echo State Networks; Long Short-Term Memory; RNN; Audio Classification; MFCC

Domains

Artificial intelligence [cs.AI]; Neural networks [cs.NE]; Machine learning [cs.LG]; Neuroscience [q-bio.NC]; Linguistics

Main file

TrouvainHinaut2021_ICANN_Canary-decoder_HAL-v2.pdf (457.04 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-03203374

Submitted on: Thursday, December 23, 2021, 19:44:35

Last modified on: Thursday, February 15, 2024, 03:31:08

Dates and versions

hal-03203374 , version 1 (20-04-2021)

hal-03203374 , version 2 (23-12-2021)

Identifiers

HAL Id : hal-03203374 , version 2
DOI : 10.1007/978-3-030-86383-8_6

Cite

Nathan Trouvain, Xavier Hinaut. Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs. ICANN 2021 - 30th International Conference on Artificial Neural Networks, Sep 2021, Bratislava, Slovakia. pp. 71-82, ⟨10.1007/978-3-030-86383-8_6⟩. ⟨hal-03203374v2⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

229 views

438 downloads
