Statistics Based Features for Unvoiced Sound Classification

Sunit Sivasankaran; Kmm Prabhu

doi:10.1109/MLSP.2013.6661986

Conference paper, Year: 2013

Sunit Sivasankaran

Role: Author

Speech Modeling for Facilitating Oral-Based Communication

Indian Institute of Technology Madras

Kmm Prabhu

Role: Author

Indian Institute of Technology Madras

Abstract

Unvoiced phonemes are common in spoken English. They are hard to classify because of their weak energy and lack of periodicity. Sound textures, such as the sound of a flowing stream or of falling raindrops, share the aperiodic temporal properties of unvoiced phonemes, yet the human ear differentiates them easily. Recent studies on sound texture analysis and synthesis have shown that the human auditory system perceives sound textures using simple statistics. These statistics are obtained by decomposing sounds with a filter bank and computing the moments of the filter responses, along with their correlation values. In this work, we investigate whether these statistics, which are easy to extract, can also serve as features for classifying unvoiced sounds. To incorporate the moments and correlation values as features, a framework containing multiple classifiers is proposed. Experiments conducted on the TIMIT dataset gave an accuracy on par with the latest reported in the literature, at a lower computational cost.
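The statistics described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `subband_statistics`, the equal-width ideal band-pass filter bank, the number of bands, and the choice to compute the statistics on sub-band envelopes are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def subband_statistics(signal, sample_rate, n_bands=8):
    """Return per-band moments and the band-to-band correlation matrix.

    Hypothetical helper: the filter bank and the use of envelopes are
    illustrative assumptions, not details taken from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    spectrum = np.fft.fft(signal)
    freqs = np.fft.fftfreq(signal.size, d=1.0 / sample_rate)

    # Assumed filter bank: n_bands equal-width ideal band-pass filters
    # covering 0 Hz up to (but not including) the Nyquist frequency.
    edges = np.linspace(0.0, sample_rate / 2.0, n_bands + 1)
    envelopes = []
    for low, high in zip(edges[:-1], edges[1:]):
        # Keep only positive frequencies in [low, high) and double them,
        # which yields the analytic signal of the sub-band.
        weights = 2.0 * ((freqs >= low) & (freqs < high))
        weights[freqs == 0.0] *= 0.5  # the DC bin is counted once
        analytic = np.fft.ifft(spectrum * weights)
        envelopes.append(np.abs(analytic))  # sub-band envelope
    envelopes = np.array(envelopes)  # shape: (n_bands, n_samples)

    # First four moments of each envelope.
    mean = envelopes.mean(axis=1)
    centered = envelopes - mean[:, None]
    variance = (centered ** 2).mean(axis=1)
    std = np.sqrt(variance)
    skewness = (centered ** 3).mean(axis=1) / std ** 3
    kurtosis = (centered ** 4).mean(axis=1) / std ** 4
    moments = np.stack([mean, variance, skewness, kurtosis], axis=1)

    # Pearson correlation between every pair of band envelopes.
    correlations = np.corrcoef(envelopes)
    return moments, correlations


# Usage on synthetic white noise standing in for an unvoiced segment.
rng = np.random.default_rng(0)
noise = rng.standard_normal(4000)
moments, correlations = subband_statistics(noise, sample_rate=16000, n_bands=8)
print(moments.shape, correlations.shape)
```

Each row of `moments` and the entries of `correlations` could then be passed to separate classifiers, in the spirit of the multiple-classifier framework the abstract mentions; how the paper actually combines them is not stated here.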

Keywords

Gaussian Mixture Model (GMM); features; statistics; sound textures; unvoiced phonemes; Linear Prediction Coefficients (LPC)

Domains

Machine Learning [cs.LG]; Sound [cs.SD]

https://inria.hal.science/hal-01801021

Submitted on: Monday, May 28, 2018, 10:54:41

Last modified on: Monday, September 11, 2023, 17:41:19

Dates and versions

hal-01801021, version 1 (28-05-2018)

Identifiers

HAL Id: hal-01801021, version 1
DOI: 10.1109/MLSP.2013.6661986

Cite

Sunit Sivasankaran, Kmm Prabhu. Statistics Based Features for Unvoiced Sound Classification. MLSP 2013 - IEEE International Workshop on Machine Learning for Signal Processing, Sep 2013, Southampton, United Kingdom. ⟨10.1109/MLSP.2013.6661986⟩. ⟨hal-01801021⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD