F. Bach and Z. Harchaoui, DIFFRAC: a discriminative and flexible framework for clustering, NIPS, 2007.

A. Beck and M. Teboulle, A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems, SIAM Journal on Imaging Sciences, vol.2, issue.1, 2009.
DOI : 10.1137/080716542

K. Bellare and A. McCallum, Learning extractors from unlabeled text using relevant databases, IIWeb, 2007.

P. Bojanowski, F. Bach, I. Laptev, J. Ponce, C. Schmid et al., Finding Actors and Actions in Movies, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.283
URL : https://hal.archives-ouvertes.fr/hal-00904991

S. Brin, Extracting Patterns and Relations from the World Wide Web, The World Wide Web and Databases, 1999.
DOI : 10.1007/10704656_11

A. Carlson, J. Betteridge, B. Kisiel, B. Settles, E. R. Hruschka Jr. et al., Toward an architecture for never-ending language learning, AAAI, 2010.

M. Collins and Y. Singer, Unsupervised models for named entity classification, EMNLP, 1999.

M. Craven and J. Kumlien, Constructing biological knowledge bases by extracting information from text sources, ISMB, 1999.

J. R. Curran, T. Murphy, and B. Scholz, Minimising semantic drift with mutual exclusion bootstrapping, PACLING, 2007.

O. Etzioni, M. Cafarella, D. Downey, A. Popescu, T. Shaked et al., Unsupervised named-entity extraction from the Web: An experimental study, Artificial Intelligence, vol.165, issue.1, 2005.
DOI : 10.1016/j.artint.2005.03.001

J. Finkel, T. Grenager, and C. Manning, Incorporating non-local information into information extraction systems by Gibbs sampling, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL '05, 2005.
DOI : 10.3115/1219840.1219885
URL : http://acl.ldc.upenn.edu/p/p05/p05-1045.pdf

E. Grave, A convex relaxation for weakly supervised relation extraction, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014.
DOI : 10.3115/v1/D14-1166
URL : https://hal.archives-ouvertes.fr/hal-01080310

E. Grave, G. Obozinski, and F. Bach, A Markovian approach to distributional semantics with application to semantic compositionality, COLING, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01080309

A. Joulin, F. Bach, and J. Ponce, Discriminative clustering for image co-segmentation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539868

B. Liu, Y. Dai, X. Li, W. S. Lee, and P. S. Yu, Building text classifiers using positive and unlabeled examples, Third IEEE International Conference on Data Mining, 2003.
DOI : 10.1109/ICDM.2003.1250918
URL : http://array.bioengr.uic.edu/~yangdai/pub/liub_classifiers.pdf

B. Liu, W. S. Lee, P. S. Yu, and X. Li, Partially supervised classification of text documents, ICML, 2002.

M. Mintz, S. Bills, R. Snow, and D. Jurafsky, Distant supervision for relation extraction without labeled data, Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2, ACL-IJCNLP '09, 2009.
DOI : 10.3115/1690219.1690287

Y. Nesterov, Gradient methods for minimizing composite objective function, 2007.
DOI : 10.1007/s10107-012-0629-5

V. Ramanathan, A. Joulin, P. Liang, and L. Fei-Fei, Linking People in Videos with "Their" Names Using Coreference Resolution, ECCV, 2014.
DOI : 10.1007/978-3-319-10590-1_7

S. Riedel, L. Yao, and A. McCallum, Modeling Relations and Their Mentions without Labeled Text, ECML / PKDD, 2010.
DOI : 10.1007/978-3-642-15939-8_10
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

E. Riloff and R. Jones, Learning dictionaries for information extraction by multi-level bootstrapping, AAAI, 1999.

A. Ritter, S. Clark, and O. Etzioni, Named entity recognition in tweets: an experimental study, EMNLP, 2011.

A. Ritter, L. Zettlemoyer, Mausam, and O. Etzioni, Modeling missing data in distant supervision for information extraction, 2013.

P. P. Talukdar and F. Pereira, Experiments in graph-based semi-supervised learning methods for class-instance acquisition, ACL, 2010.

K. Toutanova, D. Klein, C. D. Manning, and Y. Singer, Feature-rich part-of-speech tagging with a cyclic dependency network, Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, NAACL '03, 2003.
DOI : 10.3115/1073445.1073478

F. Wu and D. S. Weld, Autonomously semantifying Wikipedia, Proceedings of the sixteenth ACM Conference on information and knowledge management, CIKM '07, 2007.
DOI : 10.1145/1321440.1321449

L. Xu, J. Neufeld, B. Larson, and D. Schuurmans, Maximum margin clustering, NIPS, 2004.