H. Borges and M. T. Valente, What's in a github star? understanding repository starring practices in a social coding platform, Journal of Systems and Software, vol.146, pp.112-129, 2018.

L. Breiman, Random forests, Machine learning, vol.45, issue.1, pp.5-32, 2001.

A. Capiluppi and M. F. Michlmayr, From the cathedral to the bazaar: An empirical study of the lifecycle of volunteer community projects, IFIP International Federation for Information Processing, vol.234, pp.31-44, 2007.

I. N. Chengalur-smith, A. Sidorova, and S. L. Daniel, Sustainability of free/libre open source projects: A longitudinal study, Journal of the Association for Information Systems, vol.11, issue.11, pp.657-683, 2010.

J. Colazo and Y. Fang, Impact of license choice on open source software development activity, Journal of the American Society for Information Science and Technology, vol.60, issue.5, pp.997-1011, 2009.

L. F. Dias, I. Steinmacher, and G. Pinto, Who drives company-owned OSS projects: internal or external members?, J. Braz. Comp. Soc, vol.24, issue.1, p.17, 2018.

F. Figueiredo, On the prediction of popularity of trends and hits for user generated videos, Proceedings of the sixth ACM international conference on Web search and data mining, pp.741-746, 2013.

F. Figueiredo, J. M. Almeida, M. A. Gonçalves, and F. Benevenuto, On the dynamics of social media popularity: A youtube case study, ACM Transactions on Internet Technology (TOIT), vol.14, issue.4, p.24, 2014.

G. Gousios, M. Pinzger, and A. V. Deursen, An exploratory study of the pull-based software development model, 36th International Conference on Software Engineering, vol.2014, pp.345-355, 2014.

Y. Gupta, Y. Khan, K. Gallaba, and S. Mcintosh, The impact of the adoption of continuous integration on developer attraction and retention, 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pp.491-494, 2017.

J. A. Hartigan, Clustering algorithms, 1975.

J. Hauke and T. Kossowski, Comparison of values of pearson's and spearman's correlation coefficients on the same sets of data, Quaestiones geographicae, vol.30, issue.2, pp.87-93, 2011.

W. Ke and P. Zhang, The effects of extrinsic motivations and satisfaction in open source software development, Journal of the Association for Information Systems, vol.11, issue.12, pp.784-808, 2010.

G. Louppe, L. Wehenkel, A. Sutera, and P. Geurts, Understanding variable importances in forests of randomized trees, Advances in neural information processing systems, pp.431-439, 2013.

W. Maalej, H. J. Happel, and A. Rashid, When users become collaborators: towards continuous and context-aware user input, Proceeding of the 24th ACM SIG-PLAN Conference Companion on Object Oriented Programming Systems Languages and Applications, 2009.

P. Meirelles, C. Santos, J. Miranda, F. Kon, A. Terceiro et al., A study of the relationships between source code metrics and attractiveness in free software projects, 2010 Brazilian Symposium on Software Engineering, 2010.

D. A. Menasce and V. A. Almeida, Capacity Planning for Web Services: metrics, models, and methods, 2002.

K. Nakakoji, Y. Yamamoto, Y. Nishinaka, K. Kishida, and Y. Ye, Evolution patterns of open-source software systems and communities, Proceedings of the International Workshop on Principles of Software Evolution, pp.76-85, 2002.

I. Qureshi and Y. Fang, Socialization in open source software projects: A growth mixture modeling approach, Organizational Research Methods, vol.14, issue.1, pp.208-238, 2011.

B. Ray, D. Posnett, V. Filkov, and P. Devanbu, A large scale study of programming languages and code quality in github, Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, pp.155-165, 2014.

M. Robnik-?ikonja, Improving random forests, European conference on machine learning, pp.359-370, 2004.

C. Santos, G. Kuk, F. Kon, and J. Pearson, The attraction of contributors in free and open source software projects, The Journal of Strategic Information Systems, vol.22, issue.1, pp.26-45, 2013.

M. R. Segal, Machine learning benchmarks and random forest regression, 2004.

S. K. Shah, Motivation, governance, and the viability of hybrid forms in open source software development, Management Science, vol.52, issue.7, pp.1000-1014, 2006.

T. Shi and S. Horvath, Unsupervised learning with random forest predictors, Journal of Computational and Graphical Statistics, vol.15, issue.1, pp.118-138, 2006.

M. Sokolova, N. Japkowicz, and S. Szpakowicz, Beyond accuracy, f-score and roc: a family of discriminant measures for performance evaluation, Australasian joint conference on artificial intelligence, pp.1015-1021, 2006.

M. Sokolova and G. Lapalme, A systematic analysis of performance measures for classification tasks, Information Processing & Management, vol.45, issue.4, pp.427-437, 2009.

I. Steinmacher, T. Conte, M. A. Gerosa, and D. Redmiles, Social barriers faced by newcomers placing their first contribution in open source software projects, Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, vol.15, pp.1379-1392, 2015.

I. Steinmacher, T. U. Conte, C. Treude, and M. A. Gerosa, Overcoming open source project entry barriers with a portal for newcomers, Proceedings of the 38th International Conference on Software Engineering, pp.273-284, 2016.

I. Steinmacher, M. A. Gerosa, and D. Redmiles, Attracting, onboarding, and retaining newcomer developers in open source software projects, Proceedings of the Workshop on Global Software Development in a CSCW Perspective. CSCW '14 Workshops, 2014.

G. Von-krogh, S. Spaeth, and K. R. Lakhani, Community, joining, and specialization in open source software innovation: A case study, Research Policy, vol.32, issue.7, pp.1217-1241, 2003.

J. Yang and J. Leskovec, Patterns of temporal variation in online media, Proceedings of the fourth ACM international conference on Web search and data mining, pp.177-186, 2011.

Y. Ye and K. Kishida, Toward an understanding of the motivation open source software developers, 25th International Conference on Software Engineering, pp.419-429, 2003.