An Extraction Method of Lip Movement Images from Successive Image Frames in the Speech Activity Extraction Process

Abstract : In this paper, we propose an extraction method of lip movement images from successive image frames and present the possibility to utilize lip movement images in the speech activity extraction process of speech recognition phase. The image frames are acquired from the PC image camera with the assumption that facial movement is limited during talking. First of all, one new lip movement image frame is generated with comparing two successive image frames each other. Second, the fine image noises are removed. Each fitness rate is calculated by comparing the lip feature data as objectly separated images. It is analyzed whether or not there is the lip movement image through verification to the objects and three images which have higher rates in their fitnesses. As a result of linking the speech & image processing system, the interworking rate shows 99.3% even in the various illumination environments. It was visually confirmed that lip movement images are tracked and can be utilized in speech activity extraction process.
Type de document :
Communication dans un congrès
Hyun Seung Yang; Rainer Malaka; Junichi Hoshino; Jung Hyun Han. 9th International Conference on Entertainment Computing (ICEC), Sep 2010, Seoul, South Korea. Springer, Lecture Notes in Computer Science, LNCS-6243, pp.317-325, 2010, Entertainment Computing - ICEC 2010. 〈10.1007/978-3-642-15399-0_33〉
Liste complète des métadonnées

Littérature citée [6 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01055628
Contributeur : Hal Ifip <>
Soumis le : mercredi 13 août 2014 - 13:50:50
Dernière modification le : mercredi 16 août 2017 - 17:33:07
Document(s) archivé(s) le : jeudi 27 novembre 2014 - 00:02:08

Fichier

icec2010_submission_11.pdf
Fichiers produits par l'(les) auteur(s)

Licence


Distributed under a Creative Commons Paternité 4.0 International License

Identifiants

Citation

Eung-Kyeu Kim, Soo-Jong Lee, Nohpill Park. An Extraction Method of Lip Movement Images from Successive Image Frames in the Speech Activity Extraction Process. Hyun Seung Yang; Rainer Malaka; Junichi Hoshino; Jung Hyun Han. 9th International Conference on Entertainment Computing (ICEC), Sep 2010, Seoul, South Korea. Springer, Lecture Notes in Computer Science, LNCS-6243, pp.317-325, 2010, Entertainment Computing - ICEC 2010. 〈10.1007/978-3-642-15399-0_33〉. 〈hal-01055628〉

Partager

Métriques

Consultations de la notice

49

Téléchargements de fichiers

395