P. Temple, M. Acher, and J. Jézéquel, Poster: Multimorphic Testing, ACM/IEEE 40th International Conference on Software Engineering: Companion Proceedings, pp.1-2, 2018.

K. Bak, K. Czarnecki, and A. Wasowski, Feature and meta-models in clafer: mixed, specialized, and coupled, SLE'10, 2011.

S. Apel, D. Batory, C. Kästner, and G. Saake, FeatureOriented Software Product Lines: Concepts and Implementation, 2013.

B. W. Silverman, Density estimation for statistics and data analysis. Routledge, 2018.

D. W. Scott, On optimal and data-based histograms, Biometrika, vol.66, issue.3, pp.605-610, 1979.

, Multivariate density estimation: theory, practice, and visualization, 2015.

M. Acher, P. Collet, P. Lahire, and R. France, Familiar: A domain-specific language for large scale management of feature models, Science of Computer Programming (SCP), vol.78, issue.6, pp.657-681, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00767175

T. Thüm, C. Kstner, F. Benduhn, J. Meinicke, G. Saake et al., Featureide: An extensible framework for feature-oriented software development, Science of Computer Programming, 2012.

J. A. Galindo, M. Alférez, M. Acher, B. Baudry, and D. Benavides, A variability-based testing approach for synthesizing video sequences, International Symposium on Software Testing and Analysis, pp.293-303, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01003148

M. Alférez, M. Acher, J. A. Galindo, B. Baudry, and D. Benavides, Modeling Variability in the Video Domain: Language and Experience Report, Software Quality Journal, pp.1-28, 2018.

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., Imagenet: A large-scale hierarchical image database, Computer Vision and Pattern Recognition, pp.248-255, 2009.

T. Lin, M. Maire, S. J. Belongie, L. D. Bourdev, R. B. Girshick et al., Microsoft COCO: common objects in context, CoRR, 2014.

, Pets 2016 dataset

G. Griffin, A. Holub, and P. Perona, Caltech256 image dataset, 2006.

D. ?trekelj, H. Leventi?, and I. Gali?, Performance overhead of haxe programming language for crossplatform game development, International Journal of Electrical and Computer Engineering Systems, vol.6, issue.1, pp.9-13, 2015.

M. Boussaa, O. Barais, B. Baudry, and G. Sunyé, Automatic non-functional testing of code generators families, Proceedings of the 2016 ACM SIGPLAN International Conference on Generative Programming: Concepts and Experiences, pp.202-212, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01356849

M. Boussaa, Automatic Non-functional Testing and Tuning of Configurable Generators
URL : https://hal.archives-ouvertes.fr/tel-01598821

M. Johnson-roberson, C. Barto, R. Mehta, S. N. Sridhar, and R. Vasudevan, Driving in the matrix: Can virtual worlds replace human-generated annotations for real world tasks, CoRR, 2016.

M. Patrick, M. D. Castle, R. O. Stutt, and C. A. Gilligan, Automatic test image generation using procedural noise, Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, ASE 2016, pp.654-659, 2016.

A. Shrivastava, T. Pfister, O. Tuzel, J. Susskind, W. Wang et al., Learning from simulated and unsupervised images through adversarial training, 2016.

J. Oh, D. S. Batory, M. Myers, and N. Siegmund, Finding near-optimal configurations in product lines by random sampling, Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, ESEC/FSE 2017, pp.61-71, 2017.

K. Czarnecki, S. She, and A. Wasowski, Sample spaces and feature models: There and back again, SPLC, 2008.

A. Sarkar, J. Guo, N. Siegmund, S. Apel, and K. Czarnecki, Cost-efficient sampling for performance prediction of configurable systems (t), 2015.

F. Medeiros, C. Kästner, M. Ribeiro, R. Gheyi, and S. Apel, A comparison of 10 sampling algorithms for configurable systems, ICSE'16, 2016.

J. M. Rojas, M. Vivanti, A. Arcuri, and G. Fraser, A detailed investigation of the effectiveness of whole test suite generation, Empirical Software Engineering, vol.22, issue.2, pp.852-893, 2017.

J. C. Miller and C. J. Maloney, Systematic mistake analysis of digital computer programs, Commun. ACM, vol.6, issue.2, pp.58-63, 1963.

P. Godefroid, N. Klarlund, and K. Sen, Dart: Directed automated random testing, SIGPLAN Not, 2005.

P. Godefroid, M. Y. Levin, and D. Molnar, Sage: Whitebox fuzzing for security testing, Queue, 2012.

H. Malik, H. Hemmati, and A. E. Hassan, Automatic detection of performance deviations in the load testing of large scale systems, Proceedings of the 2013 International Conference on Software Engineering, ser. ICSE '13, pp.1012-1021, 2013.

Z. M. Jiang, Automated analysis of load testing results, Proceedings of the 19th International Symposium on Software Testing and Analysis, ser. ISSTA '10, pp.143-146, 2010.

J. H. Andrews, L. C. Briand, Y. Labiche, and A. S. Namin, Using mutation analysis for assessing and comparing testing coverage criteria, IEEE Transactions on Software Engineering, vol.32, issue.8, pp.608-624, 2006.

M. Gligoric, A. Groce, C. Zhang, R. Sharma, M. A. Alipour et al., Comparing non-adequate test suites using coverage criteria, Proceedings of the 2013 International Symposium on Software Testing and Analysis, ser. ISSTA 2013, pp.302-313, 2013.

M. Papadakis and N. Malevris, Automatic mutation test case generation via dynamic symbolic execution, 2010 IEEE 21st International Symposium on Software Reliability Engineering, pp.121-130, 2010.

S. Segura, G. Fraser, A. B. Sánchez, and A. R. Cortés, A survey on metamorphic testing, IEEE Trans. Software Eng, vol.42, issue.9, pp.805-824, 2016.

E. T. Barr, M. Harman, P. Mcminn, M. Shahbaz, and S. Yoo, The oracle problem in software testing: A survey, IEEE Transactions on Software Engineering, vol.41, issue.5, pp.507-525, 2015.

S. Segura, J. Troya, A. D. Toro, and A. R. Cortés, Performance metamorphic testing: Motivation and challenges, 39th IEEE/ACM International Conference on Software Engineering: New Ideas and Emerging Technologies Results Track, ICSE-NIER 2017, pp.7-10, 2017.

T. Thüm, S. Apel, C. Kästner, I. Schaefer, and G. Saake, A classification and survey of analysis strategies for software product lines, ACM Computing Surveys, 2014.

C. Kim, S. Khurshid, and D. Batory, Shared execution for efficiently testing product lines, Software Reliability Engineering (ISSRE), 2012.

H. V. Nguyen, C. Kästner, and T. N. Nguyen, Exploring variability-aware execution for testing pluginbased web applications, 2014.

X. Devroey, G. Perrouin, M. Papadakis, A. Legay, P. Schobbens et al., Featured modelbased mutation analysis, Proceedings of the 38th International Conference on Software Engineering, ser. ICSE '16, pp.655-666, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01406512

M. F. Johansen, Ø. Haugen, and F. Fleurey, An algorithm for generating t-wise covering arrays from large feature models, 16th International Software Product Line Conference, SPLC '12, pp.46-55, 2012.

C. Henard, M. Papadakis, G. Perrouin, J. Klein, P. Heymans et al., Bypassing the combinatorial explosion: Using similarity to generate and prioritize t-wise test configurations for software product lines, IEEE Trans. Software Eng, 2014.

B. , P. Lamancha, and M. Usaola, Testing Product Generation in Software Product Lines Using Pairwise for Features Coverage, 22nd IFIP WG 6.1 International Conference on Testing Software and Systems (ICTSS), ser. Testing Software and Systems, vol.6435, pp.111-125, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01055240

M. B. Cohen, M. B. Dwyer, and J. Shi, Constructing interaction test suites for highly-configurable systems in the presence of constraints: A greedy approach, IEEE Transactions on Software Engineering, vol.34, issue.5, pp.633-650, 2008.

A. Halin, A. Nuttinck, M. Acher, X. Devroey, G. Perrouin et al., Test them all, is it worth it? assessing configuration sampling on the jhipster web development stack, Empirical Software Engineering, vol.24, issue.2, pp.674-717, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01829928

C. Yilmaz, M. B. Cohen, and A. A. Porter, Covering arrays for efficient fault characterization in complex configuration spaces, IEEE Transactions on Software Engineering, vol.32, issue.1, pp.20-34, 2006.

C. H. Kim, D. Marinov, S. Khurshid, D. Batory, S. Souto et al., Splat: Lightweight dynamic analysis for reducing combinatorics in testing configurable systems, ESEC/FSE 2013, 2013.

J. Guo, E. Zulkoski, R. Olaechea, D. Rayside, K. Czarnecki et al., Scaling exact multiobjective combinatorial optimization by parallelization, ASE, 2014.

C. Henard, M. Papadakis, M. Harman, and Y. L. Traon, Combining multi-objective search and constraint solving for configuring large software product lines, 37th IEEE/ACM International Conference on Software Engineering, ICSE 2015, vol.1, pp.517-528, 2015.

I. Stuermer, M. Conrad, H. Doerr, and P. Pepper, Systematic testing of model-based code generators, IEEE Transactions on Software Engineering, vol.33, issue.9, p.622, 2007.

I. Sturmer and M. Conrad, Test suite design for code generation tools, Proceedings. 18th IEEE International Conference on, pp.286-290, 2003.

W. M. Mckeeman, Differential testing for software, Digital Technical Journal, vol.10, issue.1, pp.100-107, 1998.

M. A. Vouk, Back-to-back testing, Information and software technology, vol.32, issue.1, pp.34-45, 1990.

S. Jörges and B. Steffen, Back-to-back testing of model-based code generators, International Symposium On Leveraging Applications of Formal Methods, Verification and Validation, pp.425-444, 2014.

S. Stepasyuk and Y. Paunov, Evaluating the haxe programming language-performance comparison between haxe and platform-specific languages, 2015.

J. Richard-foy, O. Barais, and J. Jézéquel, Efficient high-level abstractions for web programming, ACM SIGPLAN Notices, vol.49, issue.3, pp.53-60, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00920786

M. Everingham, S. M. Eslami, L. Van-gool, C. K. Williams, J. Winn et al., The pascal visual object classes challenge: A retrospective, International Journal of Computer Vision, vol.111, issue.1, pp.98-136, 2015.

A. Nghiem, F. Bremond, M. Thonnat, and M. Ruihua, A New Evaluation Approach for Video Processing Algorithms, IEEE Workshop on Motion and Video Computing, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00502955

A. T. Nghiem, F. Bremond, M. Thonnat, and V. Valentin, Etiseo, performance evaluation for video surveillance systems, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance, pp.476-481, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00502945

J. Ponce, T. Berg, M. Everingham, D. Forsyth, M. Hebert et al., Dataset issues in object recognition, Towards Category-Level Object Recognition, ser. Lecture Notes in Computer Science (LNCS), vol.4170, pp.29-48, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00548595

O. Zendel, M. Murschitz, M. Humenberger, and W. Herzner, Cv-hazop: Introducing test data validation for computer vision, 2015 IEEE International Conference on Computer Vision (ICCV), pp.2066-2074, 2015.

K. Pei, Y. Cao, J. Yang, and S. Jana, Deepxplore: Automated whitebox testing of deep learning systems, CoRR, 2017.

Y. Tian, K. Pei, S. Jana, and B. Ray, Deeptest: Automated testing of deep-neural-network-driven