S. Abiteboul, Y. Amsterdamer, D. Deutch, T. Milo, and P. Senellart, Finding optimal probabilistic generators for XML collections, Proceedings of the 15th International Conference on Database Theory, ICDT '12, 2012.
DOI : 10.1145/2274576.2274591

URL : https://hal.archives-ouvertes.fr/hal-00765545

S. Abiteboul, Y. Amsterdamer, T. Milo, and P. Senellart, Auto-completion learning for XML Demonstration, SIGMOD Conference, pp.669-672, 2012.

S. Abiteboul, O. Benjelloun, and T. Milo, The Active XML project: an overview, The VLDB Journal, vol.7, issue.4, 2008.
DOI : 10.1007/s00778-007-0049-y

S. Abiteboul, P. Bourhis, A. Galland, and B. Marinoiu, The AXML Artifact Model, 2009 16th International Symposium on Temporal Representation and Reasoning, 2009.
DOI : 10.1109/TIME.2009.9

URL : https://hal.archives-ouvertes.fr/inria-00447694

S. Abiteboul, T. H. Chan, E. Kharlamov, W. Nutt, and P. Senellart, Aggregate queries for discrete and continuous probabilistic XML, Proceedings of the 13th International Conference on Database Theory, ICDT '10, 2010.
DOI : 10.1145/1804669.1804679

URL : https://hal.archives-ouvertes.fr/inria-00537632

S. Abiteboul, B. Kimelfeld, Y. Sagiv, and P. Senellart, On the expressiveness of probabilistic XML models, The VLDB Journal, vol.31, issue.4, 2009.
DOI : 10.1007/s00778-009-0146-1

URL : https://hal.archives-ouvertes.fr/inria-00429498

T. Antonopoulos, F. Geerts, W. Martens, and F. Neven, Generating, sampling and counting subclasses of regular tree languages, ICDT, 2011.

D. Barbosa, A. O. Mendelzon, J. Keenleyside, and K. A. Lyons, ToXgene, Proceedings of the 2002 ACM SIGMOD international conference on Management of data , SIGMOD '02, 2002.
DOI : 10.1145/564691.564769

G. J. Bex, W. Gelade, F. Neven, and S. Vansummeren, Learning deterministic regular expressions for the inference of schemas from XML data, WWW, 2008.

G. J. Bex, F. Neven, T. Schwentick, and K. Tuyls, Inference of concise DTDs from XML data, VLDB, 2006.

G. J. Bex, F. Neven, and S. Vansummeren, Inferring XML schema definitions from XML data, VLDB, 2007.

C. M. Bishop, Pattern Recognition and Machine Learning, 2006.

Z. Chi and S. Geman, Estimation of probabilistic context-free grammars, Comput. Linguist, vol.24, issue.2, 1998.

S. Cohen, Generating XML structure using examples and constraints, Proceedings of the VLDB Endowment, vol.1, issue.1, 2008.
DOI : 10.14778/1453856.1453910

S. Cohen, B. Kimelfeld, and Y. Sagiv, Incorporating constraints in probabilistic XML, PODS, 2008.

C. David, L. Libkin, and T. Tan, Efficient reasoning about data trees via integer linear programming, ICDT, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00835833

K. Etessami and M. Yannakakis, Recursive Markov chains, stochastic grammars, and monotone systems of nonlinear equations, JACM, vol.56, issue.1, 2009.

W. Fan and L. Libkin, On XML integrity constraints in the presence of DTDs, JACM, vol.49, issue.3, 2002.

M. Garofalakis, A. Gionis, R. Rastogi, S. Seshadri, and K. Shim, XTRACT: a system for extracting document type descriptors from XML documents, SIGMOD, 2000.

W. Gelade, T. Idziaszek, W. Martens, and F. Neven, Simplifying XML schema, Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems of data, PODS '10, 2010.
DOI : 10.1145/1807085.1807118

G. Grahne and J. Zhu, Discovering approximate keys in XML data, Proceedings of the eleventh international conference on Information and knowledge management , CIKM '02, 2002.
DOI : 10.1145/584792.584867

R. Kosala, H. Blockeel, M. Bruynooghe, and J. Van-den-bussche, Information extraction from structured documents using k-testable tree automaton inference, Data & Knowledge Engineering, vol.58, issue.2, 2006.
DOI : 10.1016/j.datak.2005.05.002

K. Lary and S. J. Young, The estimation of stochastic context-free grammars using the inside-outside algrithm, Computer Speech and Language, vol.4, 1990.

W. Martens, F. Neven, T. Schwentick, and G. J. Bex, Expressiveness and complexity of XML Schema, ACM Transactions on Database Systems, vol.31, issue.3, p.31, 2006.
DOI : 10.1145/1166074.1166076

W. Martens and J. Niehren, On the minimization of XML Schemas and tree automata for unranked trees, Journal of Computer and System Sciences, vol.73, issue.4, 2007.
DOI : 10.1016/j.jcss.2006.10.021

URL : https://hal.archives-ouvertes.fr/inria-00088406

T. Milo and D. Suciu, Type inference for queries on semistructured data, Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems , PODS '99, 1999.
DOI : 10.1145/303976.303998

M. Murata, D. Lee, M. Mani, and K. Kawaguchi, Taxonomy of XML schema languages using formal language theory, ACM Transactions on Internet Technology, vol.5, issue.4, 2005.
DOI : 10.1145/1111627.1111631

S. Nestorov, S. Abiteboul, and R. Motwani, Extracting schema from semistructured data, SIGMOD, 1998.

Y. Papakonstantinou and V. Vianu, DTD inference for views of XML data, Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems , PODS '00, 2000.
DOI : 10.1145/335168.335173