H. Bourlard and S. Dupont, A new ASR approach based on independent processing and recombination of partial frequency bands, Proc. ICSLP, 1996.

T. Stephenson, M. Mathew, and H. Bourlard, Modeling auxiliary information in bayesian network based ASR, Proc. Eurospeech, 2001.

G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior, Recent advances in the automatic recognition of audiovisual speech, Proc. IEEE, pp.1306-1326, 2003.

C. Chibelushi, J. Mason, and F. Deravi, Integration of acoustic and visual speech for speaker recognition, Proc. EUROSPEECH, 1993.

S. Dupont and J. Luettin, Audio-visual speech modeling for continuous speech recognition, IEEE Transactions on Multimedia, vol.2, issue.3, pp.141-151, 2000.
DOI : 10.1109/6046.865479

M. Heckmann, F. Berthommier, K. Kroschel, and J. , Noise Adaptive Stream Weighting in Audio-Visual Speech Recognition, EURASIP Journal on Advances in Signal Processing, vol.2002, issue.11
DOI : 10.1155/S1110865702206150

A. Adjoudani and C. Benoit, On the Integration of Auditory and Visual Parameters in an HMM-based ASR, Series F: Comput. Syst. Sci, vol.150, pp.465-472, 1996.
DOI : 10.1007/978-3-662-13015-5_35

J. Luettin, G. Potamianos, and C. Neti, Asynchronous stream modeling for large vocabulary audio-visual speech recognition, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.169-172, 2001.
DOI : 10.1109/ICASSP.2001.940794

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.22.8576

J. Hernando, Maximum likelihood weighting of dynamic speech features for CDHMM speech recognition, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1267-1270, 1997.
DOI : 10.1109/ICASSP.1997.596176

G. Potamianos and H. P. Graf, Discrimative training of HMM stream exponents for audio-visual speech recognition, Proc. ICASSP, pp.3733-3736, 1998.

C. Miyajima, K. Tokuda, and T. Kitamura, Audio visual speech recognition using MCE-based HMMs and model dependent stream weights, Proc. ICSLP, 2000.

G. Potamianos, J. Luettin, and C. Neti, Hierarchical discriminant features for audio-visual LVCSR, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.165-168, 2001.
DOI : 10.1109/ICASSP.2001.940793

S. Nakamura, K. Kumatani, and S. Tamura, Robust bi-modal speech recognition based on state synchronous modeling stream weight optimization, Proc. ICASSP, pp.309-312, 2002.

A. Rogozan, P. Deléglise, and M. Alissali, Adaptive determination of audio and visual weights for automatic speech recognition, Proc. Workshop Audio-Visual Speech Process, 1997.
URL : https://hal.archives-ouvertes.fr/hal-01437207

H. Glotin, D. Vergyri, C. Neti, G. Potamianos, and J. Luettin, Weighting schemes for audio-visual fusion in speech recognition, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), 2001.
DOI : 10.1109/ICASSP.2001.940795

S. Nakamura, H. Ito, and K. Shikano, Stream weight optimization of speech and lip image sequence for audio-visul speech recognition Potamianos and C. Neti, " Stream confidence estimation for audio-visual speech recognition, Proc. ICSLP Proc. ICSLP, 2000.

G. Potamianos and C. Neti, Stream confidence estimation for audiovisual speech recognition, Proc. ICSLP, 2000.

S. Tamura, K. Iwano, and S. Furui, A Stream-Weight Optimization Method for Multi-Stream HMMS Based on Likelihood Value Normalization, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.469-472, 2005.
DOI : 10.1109/ICASSP.2005.1415152

A. Potamianos, E. Sánchez-soto, and K. Daoudi, Stream Weight Computation for Multi-Stream Classifiers, 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings, pp.353-356, 2006.
DOI : 10.1109/ICASSP.2006.1660030

URL : https://hal.archives-ouvertes.fr/inria-00439207

E. Sánchez-soto, A. Potamianos, and K. Daoudi, Unsupervised stream weight computation using anti-models, Proc. ICASSP, pp.365-368, 2007.

M. Rahim, C. Lee, and B. Juang, Discriminative utterance verification for connected digits recognition, IEEE Transactions on Speech and Audio Processing, vol.5, issue.3, pp.266-277, 1997.
DOI : 10.1109/89.568733

E. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, CUAVE: A new audio-visual database for multimodal human-computer interface research, Proc. ICASSP, pp.2017-2020, 2002.

G. Potamianos and P. Scanlon, Exploiting low face symmetry in appearance-based automatic speechreading, Proc. Workshop Audio- Visual Speech Process, 2005.