A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds

Zafar Rafii (1), Antoine Liutkus (2, 3), Bryan Pardo (4)
2 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
3 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: Repetition is a fundamental element in generating and perceiving structure in audio. Especially in music, structures tend to be composed of patterns that repeat through time (e.g., rhythmic elements in a musical accompaniment) and also through frequency (e.g., different notes of the same instrument). The auditory system has the remarkable ability to parse such patterns by identifying repetitions within the audio mixture. On this basis, we propose a simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds. A user selects a region in the log-frequency spectrogram of an audio recording from which she/he wishes to recover a repeating pattern masked by an undesired element (e.g., a note masked by a cough). The selected region is then cross-correlated with the spectrogram to identify similar regions where the underlying pattern repeats. Finally, the identified regions are averaged over their repetitions to recover the repeating pattern.
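The select, cross-correlate, and average steps described in the abstract can be sketched as follows. This is a minimal illustration and not the authors' implementation: the function name `recover_repeating_pattern`, its parameters, the use of cosine similarity as the normalized cross-correlation, the greedy non-overlapping peak picking, and the median as the averaging operator are all assumptions made for the example. The log-frequency spectrogram is assumed to be already computed and passed in as a non-negative 2D array.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def recover_repeating_pattern(spec, f0, t0, height, width, num_matches=5):
    """Estimate the repeating pattern underlying a selected spectrogram region.

    spec          : 2D non-negative array (log-frequency bins x time frames).
    (f0, t0)      : top-left corner of the user-selected region.
    height, width : size of the selected region in bins and frames.
    num_matches   : maximum number of similar regions to combine (hypothetical knob).
    """
    spec = np.asarray(spec, dtype=float)
    patch = spec[f0:f0 + height, t0:t0 + width]

    # Every candidate region of the same size: shape (F-h+1, T-w+1, h, w).
    windows = sliding_window_view(spec, (height, width))

    # Normalized cross-correlation (cosine similarity) of the patch with each window.
    num = np.einsum("ijkl,kl->ij", windows, patch)
    norms = np.sqrt(np.einsum("ijkl,ijkl->ij", windows, windows)) * np.linalg.norm(patch)
    similarity = num / np.maximum(norms, 1e-12)

    # Greedy peak picking: take the best match, then suppress every window
    # that would overlap it, and repeat.
    sim = similarity.copy()
    matches = []
    for _ in range(num_matches):
        i, j = np.unravel_index(np.argmax(sim), sim.shape)
        if sim[i, j] <= 0:
            break
        matches.append((int(i), int(j)))
        sim[max(0, i - height + 1):i + height, max(0, j - width + 1):j + width] = -np.inf

    if not matches:
        return patch.copy(), matches

    # Combine the repetitions bin by bin. The median is an assumption here:
    # it keeps what most repetitions agree on and discards the masking element.
    stack = np.stack([windows[i, j] for i, j in matches])
    return np.median(stack, axis=0), matches
```

The selected region always matches itself perfectly, so it is included among the combined repetitions. With a plain mean instead of the median, part of the masking element would leak into the recovered pattern.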
Document type:
Conference paper
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. 2015
Cited literature: 19 references

https://hal.inria.fr/hal-01116689
Contributor: Antoine Liutkus
Submitted on: Friday, February 13, 2015 - 23:46:51
Last modified on: Wednesday, February 21, 2018 - 07:50:09
Document(s) archived on: Saturday, September 12, 2015 - 13:30:33

File

Rafii-Liutkus-Pardo - A Simple...
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01116689, version 1

Citation

Zafar Rafii, Antoine Liutkus, Bryan Pardo. A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. 2015. 〈hal-01116689〉
