R. Kubota, A. , and T. Zhang, A framework for learning predictive structures from multiple tasks and unlabeled data, Journal of Machine Learning Research, vol.6, pp.1817-1853, 2005.

F. Collin, . Baker, J. Charles, J. B. Fillmore, and . Lowe, The Berkeley FrameNet project, ACL, 1998.

M. Ballesteros, C. Dyer, and N. A. Smith, Improved Transition-based Parsing by Modeling Characters instead of Words with LSTMs, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015.
DOI : 10.18653/v1/D15-1041

J. Bjerva, B. Plank, and J. Bos, Semantic tagging with deep residual networks, COLING, 2016.

C. Braud, B. Plank, and A. Søgaard, Multi-view and multi-task training of rst discourse parsers, COLING, 2016.

R. Caruana, Multitask Learning, Learning to learn, pp.95-133, 1997.
DOI : 10.1007/978-1-4615-5529-2_5

H. Cheng, H. Fang, and M. Ostendorf, Open-Domain Name Error Detection using a Multi-Task RNN, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015.
DOI : 10.18653/v1/D15-1085
URL : http://aclweb.org/anthology/D/D15/D15-1085.pdf

K. Cho, Natural Language Understanding with Distributed Representation. ArXiv, abs/1511, p.7916, 2015.

M. Ciaramita and Y. Altun, Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger, Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, EMNLP '06, 2006.
DOI : 10.3115/1610075.1610158

T. Cohn and L. Specia, Modelling annotator bias with multi-task gaussian processes: An application to machine translation quality estimation, ACL, 2013.

R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu et al., Natural language processing (almost) from scratch, Journal of Machine Learning Research, vol.12, pp.2493-2537, 2011.

D. Cruse, Lexical Semantics, 1986.
DOI : 10.1016/B0-08-043076-7/02990-9

L. Deng and J. Wiebe, MPQA 3.0: An Entity/Event-Level Sentiment Corpus, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2015.
DOI : 10.3115/v1/N15-1146

C. Dyer, M. Ballesteros, W. Ling, A. Matthews, and N. A. Smith, Transition-Based Dependency Parsing with Stack Long Short-Term Memory, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2015.
DOI : 10.3115/v1/P15-1033
URL : http://arxiv.org/pdf/1505.08075

L. Jeffrey and . Elman, Finding structure in time, Cognitive science, vol.14, issue.2, pp.179-211, 1990.

A. Graves and J. Schmidhuber, Framewise phoneme classification with bidirectional LSTM and other neural network architectures, Neural Networks, vol.18, issue.5-6, pp.602-610, 2005.
DOI : 10.1016/j.neunet.2005.06.042
URL : http://www6.in.tum.de/pub/Main/Publications/Graves2005a.pdf

P. Gupta, H. Schütze, and B. Andrassy, Table Filling Multi-Task Recurrent Neural Network for Joint Entity and Relation Extraction, COLING, 2016.

K. Moritz-hermann, D. Das, J. Weston, and K. Ganchev, Semantic frame identification with distributed word representations, ACL, 2014.

S. Hochreiter and J. Schmidhuber, Long Short-Term Memory, Neural Computation, vol.4, issue.8, pp.1735-1780, 1997.
DOI : 10.1016/0893-6080(88)90007-X

Z. Huang, W. Xu, and K. Yu, Bidirectional LSTM-CRF models for sequence tagging, 2015.

E. Kiperwasser and Y. Goldberg, Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations, 2016.

M. Kshirsagar, S. Thomson, N. Schneider, J. Carbonell, A. Noah et al., Frame-Semantic Role Labeling with Heterogeneous Annotations, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), 2015.
DOI : 10.3115/v1/P15-2036

G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, and C. Dyer, Neural Architectures for Named Entity Recognition, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2016.
DOI : 10.18653/v1/N16-1030
URL : http://arxiv.org/pdf/1603.01360

P. Liu, S. Joty, and H. Meng, Finegrained opinion mining with recurrent neural networks and word embeddings, EMNLP, 2015.
DOI : 10.18653/v1/d15-1168
URL : http://aclweb.org/anthology/D/D15/D15-1168.pdf

M. Luong, Q. V. Le, I. Sutskever, O. Vinyals, and L. Kaiser, Multi-task sequence to sequence learning, ICLR

X. Ma and E. Hovy, End-to-end Sequence Labeling via Bi-directional LSTM-CNNs- CRF. arXiv preprint, 2016.
DOI : 10.18653/v1/p16-1101
URL : http://arxiv.org/pdf/1603.01354

P. Mitchell, M. A. Marcus, B. Marcinkiewicz, and . Santorini, Building a Large Annotated Corpus of English: The Penn Treebank, Comput . Linguist, vol.19, issue.2, pp.313-330, 1993.

A. George, C. Miller, R. Leacock, . Tengi, T. Ross et al., A semantic concordance, Proceedings of the workshop on Human Language Technology, 1993.

J. Nivre, M. De-marneffe, F. Ginter, Y. Goldberg-hajic, D. Christopher et al., Universal dependencies v1: A multilingual treebank collection, LREC, 2016.

B. Plank, A. Søgaard, and Y. Goldberg, Multilingual Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Models and Auxiliary Loss, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2016.
DOI : 10.18653/v1/P16-2067

B. Plank, Keystroke dynamics as signal for shallow syntactic parsing, COLING, 2016.

R. Rosenthal, The file drawer problem and tolerance for null results., Psychological Bulletin, vol.86, issue.3, p.638, 1979.
DOI : 10.1037/0033-2909.86.3.638

K. Shah and L. Specia, Large-scale Multitask Learning for Machine Translation Quality Estimation, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2016.
DOI : 10.18653/v1/N16-1069

A. Søgaard and Y. Goldberg, Deep multi-task learning with low level tasks supervised at lower layers, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2016.
DOI : 10.18653/v1/P16-2038

C. Sutton, A. Mccallum, and K. Rohanimanesh, Dynamic conditional random fields, Twenty-first international conference on Machine learning , ICML '04, pp.693-723, 2007.
DOI : 10.1145/1015330.1015422

F. Erik, K. Sang, and F. De-meulder, Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition, HLT-NAACL, pp.142-147, 2003.

P. Vossen, L. Bloksma, H. Rodriguez, S. Climent, N. Calzolari et al., Antonietta Alonge, and Wim Peters. 1998. The eurowordnet base concepts and top ontology, p.36