N. K. Ahmed, J. Neville, R. A. Rossi, and N. Duffield, Efficient graphlet counting for large networks, Data Mining (ICDM), 2015 IEEE International Conference on, pp.1-10, 2015.

A. , Website traffic, statistics and analytics, 2017.

N. Alon and S. Gutner, Balanced families of perfect hash functions and their applications, ACM Trans. Algorithms, vol.6, issue.3, 2010.

N. Alon, R. Yuster, and U. Zwick, Color-coding, Journal of the ACM (JACM), vol.42, issue.4, pp.844-856, 1995.

A. Z. Broder, Identifying and filtering near-duplicate documents, Combinatorial Pattern Matching, 11th Annual Symposium, pp.1-10, 2000.
DOI : 10.1007/3-540-45123-4_1

URL : http://www.cs.princeton.edu/courses/archive/spring05/cos598E/bib/CPM 2000.pdf

D. Eppstein and J. Wang, Fast approximation of centrality, Proceedings of the twelfth annual ACM-SIAM symposium on Discrete algorithms, pp.228-229, 2001.
DOI : 10.1142/9789812773289_0004

URL : http://www.ics.uci.edu/~eppstein/pubs/EppWan-SODA-01.pdf

Y. Fang, W. Lin, V. W. Zheng, M. Wu, K. C.-C. Chang et al., Semantic proximity search on graphs with metagraph-based learning, Data Engineering (ICDE), 2016 IEEE 32nd International Conference on, pp.277-288, 2016.
DOI : 10.1109/icde.2016.7498247

J. Flum and M. Grohe, The parameterized complexity of counting problems, SIAM Journal on Computing, vol.33, issue.4, pp.892-922, 2004.
DOI : 10.1109/sfcs.2002.1181978

M. R. Garey and D. S. Johnson, Computers and intractability: a guide to the theory of NP-completeness, 1979.

P. Giscard and R. C. Wilson, The all-paths and cycles graph kernel, 2017.

IMDb, 2017.

S. Ioffe, Improved consistent sampling, weighted minhash and l1 sketching, 10th International Conference on Data Mining (ICDM), pp.246-255, 2010.
DOI : 10.1109/icdm.2010.80

URL : http://research.google.com/pubs/archive/36928.pdf

G. Jeh and J. Widom, Simrank: a measure of structural-context similarity, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pp.538-543, 2002.

G. Jeh and J. Widom, Scaling personalized web search, Proceedings of the 12th international conference on World Wide Web, pp.271-279, 2003.
DOI : 10.1145/775152.775191

L. Katz, A new status index derived from sociometric analysis, Psychometrika, vol.18, issue.1, pp.39-43, 1953.
DOI : 10.1007/bf02289026

P. Legendre and L. F. Legendre, Numerical Ecology. Developments in Environmental Modelling, 1998.
URL : https://hal.archives-ouvertes.fr/hal-00530195

E. A. Leicht, P. Holme, and M. E. J. Newman, Vertex similarity in networks, Physical Review E, vol.73, issue.2, p.026120, 2006.

J. Leskovec and J. McAuley, Learning to discover social circles in ego networks, Advances in neural information processing systems, pp.539-547, 2012.

W. Liu and L. Lü, Link prediction based on local random walk, EPL (Europhysics Letters), vol.89, issue.5, p.58007, 2010.
DOI : 10.1209/0295-5075/89/58007

URL : http://doc.rero.ch/record/20273/files/liu_lpb.pdf

Z. Liu, V. W. Zheng, Z. Zhao, F. Zhu, K. C.-C. Chang et al., Semantic proximity search on heterogeneous graph by proximity embedding, AAAI, pp.154-160, 2017.
DOI : 10.1145/3219819.3219953

L. Lü and T. Zhou, Link prediction in complex networks: A survey, Physica A: Statistical Mechanics and its Applications, vol.390, issue.6, pp.1150-1170, 2011.

S. Rothe and H. Schütze, Cosimrank: A flexible & efficient graph-theoretic similarity measure, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1392-1402, 2014.
DOI : 10.3115/v1/p14-1131

URL : https://doi.org/10.3115/v1/p14-1131

C. Shi, X. Kong, Y. Huang, P. S. Yu, and B. Wu, Hetesim: A general framework for relevance measure in heterogeneous networks, IEEE Transactions on Knowledge and Data Engineering, vol.26, issue.10, pp.2479-2492, 2014.

C. Shi, Y. Li, J. Zhang, Y. Sun, and P. S. Yu, A survey of heterogeneous information network analysis, IEEE Transactions on Knowledge and Data Engineering, vol.29, issue.1, pp.17-37, 2017.
DOI : 10.1109/tkde.2016.2598561

URL : https://doi.org/10.1109/tkde.2016.2598561

C. Shi, C. Zhou, X. Kong, P. S. Yu, G. Liu et al., Heterecom: a semantic-based recommendation system in heterogeneous networks, Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, pp.1552-1555, 2012.

S. Netinf, , 2017.

Y. Sun, J. Han, X. Yan, P. S. Yu, and T. Wu, Pathsim: Meta path-based top-k similarity search in heterogeneous information networks, Proceedings of the VLDB Endowment, vol.4, pp.992-1003, 2011.

The Google Ngram Viewer Team, part of Google Research, Google Books Ngram Viewer, 2018.

H. Tong, C. Faloutsos, and J. Pan, Fast random walk with restart and its applications, Proceedings of the Sixth International Conference on Data Mining, ICDM '06, pp.613-622, 2006.

G. Wang, Q. Hu, and P. Yu, Influence and similarity on heterogeneous networks, Proceedings of the 21st ACM international conference on Information and knowledge management, pp.1462-1466, 2012.

W. Wu, B. Li, L. Chen, and C. Zhang, Consistent weighted sampling made more practical, Proceedings of the 26th International Conference on World Wide Web, pp.1035-1043, 2017.

Y. Xiong, Y. Zhu, and P. S. Yu, Top-k similarity join in heterogeneous information networks, IEEE Transactions on Knowledge and Data Engineering, vol.27, issue.6, pp.1710-1723, 2015.