I. Sheikh, I. Illina, D. Fohr, and G. Linarès, OOV Proper Name retrieval using topic and lexical context models, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.5291, 2015.
DOI : 10.1109/ICASSP.2015.7178981

A. Rastrow, A. Sethy, B. Ramabhadran, and F. Jelinek, Towards using hybrid word and fragment units for vocabulary independent LVCSR systems, ISCA INTERSPEECH, pp.1931-1934, 2009.

L. Qin and A. Rudnicky, OOV word detection using hybrid models with mixed types of fragments, ISCA INTERSPEECH, pp.2450-2453, 2012.

C. Parada, M. Dredze, D. Filimonov, and F. Jelinek, Contextual information improves OOV detection in speech, Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp.216-224, 2010.

S. Kombrink, M. Hannemann, and L. Burget, Out-of-Vocabulary Word Detection and Beyond, Detection and Identification of Rare Audiovisual Cues, pp.57-65

W. Chen, S. Ananthakrishnan, R. Prasad, and P. Natarajan, Variable-span out-of-vocabulary named entity detection, ISCA INTERSPEECH, pp.3761-3765, 2013.

M. Bisani and H. Ney, Open vocabulary speech recognition with flat hybrid models, ISCA INTERSPEECH, pp.725-728, 2005.

M. A. Shaik, A. E. Mousa, S. Hahn, R. Schlüter, and H. Ney, Improved strategies for a zero OOV rate LVCSR system, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.5048, 2015.
DOI : 10.1109/ICASSP.2015.7178932

A. Allauzen and J. Gauvain, Diachronic vocabulary adaptation for broadcast news transcription, 9th European Conference on Speech Communication and Technology (INTERSPEECH'2005 - Eurospeech), pp.1305-1308, 2005.

C. Liu, K. Thambiratnam, and F. Seide, Online vocabulary adaptation using limited adaptation data, ISCA INTERSPEECH, pp.1821-1824, 2007.

D. Jouvet and D. Langlois, A Machine Learning Based Approach for Vocabulary Selection for Speech Transcription, 16th International Conference on Text, Speech, and Dialogue (TSD), pp.60-67, 2013.
DOI : 10.1007/978-3-642-40585-3_9

URL : https://hal.archives-ouvertes.fr/hal-00834302

M. Sun, Y.-N. Chen, and A. I. Rudnicky, Learning OOV through semantic relatedness in spoken dialog systems, ISCA INTERSPEECH, pp.1453-1457, 2015.

C. Martins, A. Teixeira, and J. Neto, Dynamic language modeling for a daily broadcast news transcription system, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pp.165-170, 2007.
DOI : 10.1109/ASRU.2007.4430103

O. Scharenborg and S. Seneff, A two-pass strategy for handling OOVs in a large vocabulary recognition task, ISCA INTERSPEECH, pp.1669-1672, 2005.

S. Oger, G. Linarès, F. Béchet, and P. Nocera, On-demand new word learning using world wide web, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4305-4308, 2008.
DOI : 10.1109/ICASSP.2008.4518607

URL : https://hal.archives-ouvertes.fr/hal-01319857

S. Meng, L. Wang, Y. Lin, G. Li, K. Thambiratnam et al., Vocabulary and language model adaptation using just one speech file, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5410-5413, 2010.
DOI : 10.1109/ICASSP.2010.5494929

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.186.9571

P. Maergner, A. Waibel, and I. Lane, Unsupervised vocabulary selection for real-time speech recognition of lectures, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.4417, 2012.
DOI : 10.1109/ICASSP.2012.6288899

I. Nkairi, I. Illina, G. Linarès, and D. Fohr, Exploring temporal context in diachronic text documents for automatic OOV proper name retrieval, Language & Technology Conference, pp.540-544, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00924696

I. Sheikh, I. Illina, D. Fohr, and G. Linarès, Document level semantic context for retrieving OOV proper names, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.6050, 2016.
DOI : 10.1109/ICASSP.2016.7472839

URL : https://hal.archives-ouvertes.fr/hal-01331716

D. M. Blei, A. Y. Ng, and M. I. Jordan, Latent Dirichlet allocation, Journal of Machine Learning Research, vol.3, pp.993-1022, 2003.

I. Sheikh, I. Illina, D. Fohr, and G. Linarès, Improved neural bag-of-words model to retrieve out-of-vocabulary words in speech recognition, ISCA INTERSPEECH, pp.675-679, 2016.

M. Iyyer, V. Manjunatha, J. Boyd-Graber, and H. Daumé III, Deep Unordered Composition Rivals Syntactic Methods for Text Classification, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), p.1681, 2015.
DOI : 10.3115/v1/P15-1162

P. D. Turney and P. Pantel, From frequency to meaning: Vector space models of semantics, Journal of Artificial Intelligence Research, vol.37, issue.1, pp.141-188, 2010.

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman, Indexing by latent semantic analysis, Journal of the American Society for Information Science, vol.41, issue.6, pp.391-407, 1990.
DOI : 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.108.8490

T. L. Griffiths, J. B. Tenenbaum, and M. Steyvers, Topics in semantic representation, Psychological Review, vol.114, issue.2, pp.211-244, 2007.
DOI : 10.1037/0033-295X.114.2.211

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.408.7420

T. L. Griffiths and M. Steyvers, Finding scientific topics, Proceedings of the National Academy of Sciences, vol.101, issue.Supplement 1, pp.5228-5235, 2004.
DOI : 10.1073/pnas.0307752101

URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC387300

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, Advances in Neural Information Processing Systems, pp.3111-3119, 2013.

J. Pennington, R. Socher, and C. Manning, GloVe: Global Vectors for Word Representation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1532-1543, 2014.
DOI : 10.3115/v1/D14-1162

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.645.8863

M. Baroni, G. Dinu, and G. Kruszewski, Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), p.238, 2014.
DOI : 10.3115/v1/P14-1023

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 1301.

Y. Goldberg and O. Levy, word2vec explained: deriving Mikolov et al.'s negative-sampling word-embedding method, 1402.

A. O. Bayer and G. Riccardi, Semantic language models for Automatic Speech Recognition, 2014 IEEE Spoken Language Technology Workshop (SLT), pp.7-12, 2014.
DOI : 10.1109/SLT.2014.7078541

G. Senay, B. Bigot, R. Dufour, G. Linarès, and C. Fredouille, Person name spotting by combining acoustic matching and LDA topic models, 14th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp.1584-1588, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01340026

B. Bigot, G. Senay, G. Linarès, C. Fredouille, and R. Dufour, Person name recognition in ASR outputs using continuous context models, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, p.8470, 2013.
DOI : 10.1109/ICASSP.2013.6639318

URL : https://hal.archives-ouvertes.fr/hal-01314411

I. Sheikh, I. Illina, and D. Fohr, Study of entity-topic models for OOV proper name retrieval, ISCA INTERSPEECH, pp.3506-3510, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01184955

D. Fohr and I. Illina, Continuous word representation using neural networks for proper name retrieval from diachronic documents, ISCA INTERSPEECH, pp.1344-1348, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01184951

J. Nam, J. Kim, E. Loza Mencía, I. Gurevych, and J. Fürnkranz, Large-scale multi-label text classification - revisiting neural networks, Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD-14), Part 2, pp.437-452, 2014.
DOI : 10.1007/978-3-662-44851-9_28

URL : http://arxiv.org/abs/1312.5419

Y. Kim, Convolutional Neural Networks for Sentence Classification, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1746-1751, 2014.
DOI : 10.3115/v1/D14-1181

URL : http://arxiv.org/abs/1408.5882

R. Johnson and T. Zhang, Effective Use of Word Order for Text Categorization with Convolutional Neural Networks, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.103-112, 2015.
DOI : 10.3115/v1/N15-1011

P. Wang, J. Xu, B. Xu, C. Liu, H. Zhang et al., Semantic Clustering and Convolutional Neural Network for Short Text Categorization, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), p.352, 2015.
DOI : 10.3115/v1/P15-2058

R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning et al., Recursive deep models for semantic compositionality over a sentiment treebank, Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1631-1642, 2013.

K. M. Hermann and P. Blunsom, The Role of Syntax in Vector Space Models of Compositional Semantics, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, p.894, 2013.

L. Dong, F. Wei, C. Tan, D. Tang, M. Zhou et al., Adaptive Recursive Neural Network for Target-dependent Twitter Sentiment Classification, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp.49-54, 2014.
DOI : 10.3115/v1/P14-2009

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.636.225

K. S. Tai, R. Socher, and C. D. Manning, Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), p.1556, 2015.
DOI : 10.3115/v1/P15-1150

URL : http://arxiv.org/abs/1503.00075

A. M. Dai and Q. V. Le, Semi-supervised sequence learning, Advances in Neural Information Processing Systems, pp.3079-3087, 2015.

Y. Zhang and B. Wallace, A sensitivity analysis of (and practitioners' guide to) convolutional neural networks for sentence classification, 1510.

W. Ling, Y. Tsvetkov, S. Amir, R. Fermandez, C. Dyer et al., Not All Contexts Are Created Equal: Better Word Representations with Variable Attention, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp.1367-1372, 2015.
DOI : 10.18653/v1/D15-1161

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.697.6860

D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

W. Chan, N. Jaitly, Q. V. Le, and O. Vinyals, Listen, attend and spell, 1211.
DOI : 10.1109/icassp.2016.7472621

K. Xu, J. Ba, R. Kiros, K. Cho, A. C. Courville et al., Show, attend and tell: Neural image caption generation with visual attention, Proceedings of International Conference on Machine Learning (ICML), pp.2048-2057, 2015.

S. K. Sønderby, C. K. Sønderby, H. Nielsen, and O. Winther, Convolutional LSTM Networks for Subcellular Localization of Proteins, Proceedings of the 2nd International Conference on Algorithms for Computational Biology, pp.68-80, 2015.
DOI : 10.1007/978-3-319-21233-3_6

N. Kalchbrenner, E. Grefenstette, and P. Blunsom, A Convolutional Neural Network for Modelling Sentences, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), p.655, 2014.
DOI : 10.3115/v1/P14-1062

URL : http://arxiv.org/abs/1404.2188

Y. Goldberg, A primer on neural network models for natural language processing, 1510.

I. A. Sheikh, I. Illina, D. Fohr, and G. Linarès, Learning to retrieve out-of-vocabulary words in speech recognition, 1511.
DOI : 10.21437/interspeech.2016-1219

URL : https://hal.archives-ouvertes.fr/hal-01384488/document

A. Allauzen and H. Bonneau-Maynard, Training and evaluation of POS taggers on the French MULTITAG corpus, Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC), 2008.

I. Sheikh, I. Illina, and D. Fohr, How diachronic text corpora affect context based retrieval of OOV proper names for audio news, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), pp.3851-3855, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01331714

I. Illina, D. Fohr, O. Mella, and C. Cerisara, The Automatic News Transcription System: ANTS some Real Time experiments, 8th International Conference on Spoken Language Processing, pp.377-380, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00100043

A. Lee and T. Kawahara, Recent development of open-source speech recognition engine Julius, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.131-137, 2009.

A. Stolcke, SRILM - an extensible language modeling toolkit, Proceedings International Conference on Spoken Language Processing, pp.257-286, 2002.

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The Kaldi speech recognition toolkit, IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2011.

C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, 2008.
DOI : 10.1017/CBO9780511809071

C. Parada, A. Sethy, M. Dredze, and F. Jelinek, A spoken term detection framework for recovering out-of-vocabulary words using the web, 11th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp.1269-1272, 2010.

M. D. Smucker, J. Allan, and B. Carterette, A comparison of statistical significance tests for information retrieval evaluation, Proceedings of the sixteenth ACM Conference on Information and Knowledge Management, CIKM '07, pp.623-632, 2007.
DOI : 10.1145/1321440.1321528

H. M. Wallach, D. M. Mimno, and A. McCallum, Rethinking LDA: Why priors matter, Advances in Neural Information Processing Systems, pp.1973-1981, 2009.

H. Larochelle, Y. Bengio, J. Louradour, and P. Lamblin, Exploring strategies for training deep neural networks, Journal of Machine Learning Research, vol.10, pp.1-40, 2009.

M. D. Zeiler, ADADELTA: an adaptive learning rate method, 1212.

Y. Bengio, Practical Recommendations for Gradient-Based Training of Deep Architectures, 1206.
DOI : 10.1162/089976602317318938

N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, Dropout: A simple way to prevent neural networks from overfitting, Journal of Machine Learning Research, vol.15, pp.1929-1958, 2014.

M. Bisani and H. Ney, Joint-sequence models for grapheme-to-phoneme conversion, Speech Communication, vol.50, issue.5, pp.434-451, 2008.
DOI : 10.1016/j.specom.2008.01.002

URL : https://hal.archives-ouvertes.fr/hal-00499203

L. Orosanu and D. Jouvet, Adding new words into a language model using parameters of known words with similar behavior, International Conference on Natural Language and Speech Processing, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01184194

L. Qin, Learning out-of-vocabulary words in automatic speech recognition, 2013.

G. Lecorvé, G. Gravier, and P. Sébillot, Automatically finding semantically consistent n-grams to add new words in LVCSR systems, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4676-4679, 2011.
DOI : 10.1109/ICASSP.2011.5947398

A. Allauzen and J. Gauvain, Open Vocabulary ASR for Audiovisual Document Indexation, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05), pp.1013-1016, 2005.
DOI : 10.1109/ICASSP.2005.1415288

A. Pražák, P. Ircing, and L. Müller, Language model adaptation using different class-based models, SPECOM 2007 Proceedings, pp.449-454, 2007.

W. Naptali, M. Tsuchiya, and S. Nakagawa, Class-Based N-Gram Language Model for New Words Using Out-of-Vocabulary to In-Vocabulary Similarity, IEICE Transactions on Information and Systems, vol.95, issue.9, pp.2308-2317, 2012.
DOI : 10.1587/transinf.E95.D.2308

C. Troncoso and T. Kawahara, Trigger-Based Language Model Adaptation for Automatic Transcription of Panel Discussions, IEICE Transactions on Information and Systems, vol.89, issue.3
DOI : 10.1093/ietisy/e89-d.3.1024