A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design

Résumé

Audio fingerprinting, also named as audio hashing, has been well-known as a powerful technique to perform audio identification and synchronization. It basically involves two major steps: fingerprint (voice pattern) design and matching search. While the first step concerns the derivation of a robust and compact audio signature, the second step usually requires knowledge about database and quick-search algorithms. Though this technique offers a wide range of real-world applications, to the best of the authors' knowledge, a comprehensive survey of existing algorithms appeared more than eight years ago. Thus, in this paper, we present a more up-to-date review and, for emphasizing on the audio signal processing aspect, we focus our state-of-the-art survey on the fingerprint design step for which various audio features and their tractable statistical models are discussed.
Fichier principal
Vignette du fichier
bare_conf.pdf (281.71 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01119503 , version 1 (24-02-2015)

Identifiants

Citer

Ngoc Q. K. Duong, Hien-Thanh Duong. A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design. Seventh International Conferences on Pervasive Patterns and Applications (PATTERNS 2015), Mar 2015, Nice, France. ⟨hal-01119503⟩
47 Consultations
266 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More