Voice Activity Detection. Fundamentals and Speech Recognition System Robustness, 2007. ,
Audio-visual automatic speech recognition: An overview, 2004. ,
Voice activity detection using visual information, ICASSP, 2004. ,
Visual voice activity detection in the wild, IEEE TMM, 2016. ,
Visual lip activity detection and speaker detection using mouth region intensities, IEEE TCSVT, 2009. ,
Interference reduction in reverberant speech separation with visual voice activity detection, IEEE TMM, 2014. ,
An analysis of visual speech information applied to voice activity detection, ICASSP, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-00361750
Two novel visual voice activity detectors based on appearance models and retinal filtering, 2007. ,
A visual voice activity detection method with adaboosting, IET SSPD, 2011. ,
Toward visual voice activity detection for unconstrained videos, ICIP, 2019. ,
Cuave: A new audio-visual database for multimodal human-computer interface research, ICASSP, 2002. ,
Robust visual speakingness detection using bi-level hmm, Pattern Recognition, 2012. ,
Simultaneous-speaker voice activity detection and localization using mid-fusion of svm and hmms, IEEE TMM, 2014. ,
An audio-visual corpus for speech perception and automatic speech recognition, JASA, 2006. ,
How far are we from solving the 2d & 3d face alignment problem? (and a dataset of 230,000 3d facial landmarks), ICCV, 2017. ,
Framewise phoneme classification with bidirectional lstm and other neural network architectures, Neural Networks, 2005. ,
Batch normalization: Accelerating deep network training by reducing internal covariate shift, ICML, 2015. ,
P-cnn: Pose-based cnn features for action recognition, IEEE ICCV, 2015. ,
, , 2015.
Very deep convolutional networks for large-scale image recognition, 2014. ,
Youtube-8m: A large-scale video classification benchmark, 2016. ,
Voice activity detector (VAD) based on long-term mel frequency band features, TSD, 2016. ,
Dlib-ml: A machine learning toolkit, JMLR, 2009. ,
Adam: A method for stochastic optimization, 2015. ,
A comprehensive analysis of deep regression, IEEE TPAMI, 2020. ,
On space-time interest points, 2005. ,
Action recognition by dense trajectories, CVPR, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00583818
Visual speech detection using mouth region intensities, 2006. ,
Dynamic visual features for visual-speech activity detection, 2010. ,
Visual voice activity detection using frontal versus profile views, 2011. ,
Neural network based reinforcement learning for audio-visual gaze control in humanrobot interaction, Pattern Recognition Letters, 2019. ,