F. Bach, R. Jenatton, J. Mairal, and G. Obozinski, Optimization with Sparsity-Inducing Penalties, Foundations and Trends in Machine Learning, pp.1-106, 2011.
DOI : 10.1561/2200000015

URL : https://hal.archives-ouvertes.fr/hal-00613125

A. Bordes, X. Glorot, J. Weston, and Y. Bengio, A semantic matching energy function for learning with multi-relational data, Machine Learning, 2012.
DOI : 10.1007/s10994-013-5363-6

URL : https://hal.archives-ouvertes.fr/hal-00835282

L. Bottou and Y. LeCun, Large scale online learning, Advances in Neural Information Processing Systems, pp.217-224, 2004.

W. Chu and Z. Ghahramani, Probabilistic models for incomplete multi-dimensional arrays, Journal of Machine Learning Research - Proceedings Track, vol.5, pp.89-96, 2009.

R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu et al., Natural language processing (almost) from scratch, JMLR, vol.12, pp.2493-2537, 2011.

W. Denham, The detection of patterns in Alyawarra nonverbal behavior, 1973.

L. Getoor and B. Taskar, Introduction to Statistical Relational Learning (Adaptive Computation and Machine Learning), 2007.

R. A. Harshman and M. E. Lundy, PARAFAC: Parallel factor analysis, Computational Statistics & Data Analysis, vol.18, issue.1, pp.39-72, 1994.
DOI : 10.1016/0167-9473(94)90132-5

C. Kemp, J. B. Tenenbaum, T. L. Griffiths, T. Yamada, and N. Ueda, Learning systems of concepts with an infinite relational model, Proc. of AAAI, pp.381-388, 2006.

S. Kok and P. Domingos, Statistical predicate invention, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.433-440, 2007.
DOI : 10.1145/1273496.1273551

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.143.4617

A. Korhonen, Y. Krymolowski, and T. Briscoe, A large subcategorization lexicon for natural language processing applications, Proceedings of LREC, 2006.

A. T. McCray, An Upper-Level Ontology for the Biomedical Domain, Comparative and Functional Genomics, vol.4, issue.1, pp.80-88, 2003.
DOI : 10.1002/cfg.255

G. Miller, WordNet: a lexical database for English, Communications of the ACM, vol.38, issue.11, pp.39-41, 1995.
DOI : 10.1145/219717.219748

K. Miller, T. Griffiths, and M. Jordan, Nonparametric latent feature models for link prediction, Advances in Neural Information Processing Systems 22, pp.1276-1284, 2009.

M. Nickel, V. Tresp, and H. Kriegel, A three-way model for collective learning on multi-relational data, Proceedings of the 28th Intl Conf. on Mach. Learn, pp.809-816, 2011.

M. Nickel, V. Tresp, and H. Kriegel, Factorizing YAGO, Proceedings of the 21st international conference on World Wide Web, WWW '12, pp.271-280, 2012.
DOI : 10.1145/2187836.2187874

K. Nowicki and T. A. Snijders, Estimation and Prediction for Stochastic Blockstructures, Journal of the American Statistical Association, vol.96, issue.455, pp.1077-1087, 2001.
DOI : 10.1198/016214501753208735

A. Paccanaro and G. Hinton, Learning distributed representations of concepts using linear relational embedding, IEEE Transactions on Knowledge and Data Engineering, vol.13, issue.2, pp.232-244, 2001.
DOI : 10.1109/69.917563

T. Pedersen, S. Patwardhan, and J. Michelizzi, WordNet::Similarity, Demonstration Papers at HLT-NAACL 2004, HLT-NAACL '04, pp.38-41, 2004.
DOI : 10.3115/1614025.1614037

H. Poon and P. Domingos, Unsupervised ontology induction from text, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp.296-305, 2010.

R. J. Rummel, Dimensionality of nations project: Attributes of nations and behavior of nation dyads, 1950-1965, ICPSR data file, 1999.

D. Shen, J. Sun, H. Li, Q. Yang, and Z. Chen, Document summarization using conditional random fields, Proc. of the 20th Intl Joint Conf. on Artif. Intel, pp.2862-2867, 2007.

A. P. Singh and G. J. Gordon, Relational learning via collective matrix factorization, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '08, pp.650-658, 2008.
DOI : 10.1145/1401890.1401969

I. Sutskever, R. Salakhutdinov, and J. Tenenbaum, Modelling relational data using bayesian clustered tensor factorization, Adv. in Neur. Inf. Proc. Syst. 22, 2009.

L. R. Tucker, Some mathematical notes on three-mode factor analysis, Psychometrika, vol.31, issue.3, pp.279-311, 1966.
DOI : 10.1007/BF02289464

Y. J. Wang and G. Y. Wong, Stochastic Blockmodels for Directed Graphs, Journal of the American Statistical Association, vol.82, issue.397, 1987.
DOI : 10.1080/01621459.1987.10478406

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.143.322

D. Yang and D. M. Powers, Verb similarity on the taxonomy of WordNet, Proceedings of GWC-06, pp.121-128, 2006.

J. Zhu, Max-margin nonparametric latent feature models for link prediction, Proceedings of the 29th Intl Conference on Machine Learning, 2012.