. Van-der-aalst-w-m-p, Process mining in the large, pp.33-76, 2013.

S. Arain, A. Arain, and . Pakistan, An Illuminative Study

. Lulu, , 2016.

A. Bouguettaya, Q. Yu, and X. Liu, Efficient agglomerative hierarchical clustering, Expert Systems with Applications, vol.42, issue.5, pp.2785-2797, 2015.
DOI : 10.1016/j.eswa.2014.09.054

W. Dai and J. W. , A mapreduce implementation of C4. 5 decision tree algorithm, vol.7, pp.49-60, 2014.

T. Dietterich, An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization, Machine learning, vol.40, pp.139-157, 2000.

S. García, J. Luengo, and F. Herrera, Data preprocessing in data mining, 2015.

S. Hawkins, J. Korecki, and Y. Balagurunathan, Predicting outcomes of nonsmall cell lung cancer using CT image features, vol.2, pp.1418-1426, 2014.
DOI : 10.1109/access.2014.2373335

URL : https://doi.org/10.1109/access.2014.2373335

G. Kesavaraj and S. Sukumaran, A study on classification techniques in data mining, Communications and Networking Technologies (ICCCNT), 2013 Fourth International Conference on, pp.1-7, 2013.

M. Mundada, B. Gawali, and S. Kayte, Recognition and classification of speech and its related fluency disorders, International Journal of Computer Science and Information Technologies, 2014.
DOI : 10.5120/ijais2016451484

URL : https://doi.org/10.5120/ijais2016451484

T. Rumbell, S. Denham, and T. Wennekers, A spiking self-organizing map combining stdp, oscillations, and continuous learning, IEEE transactions on neural networks and learning systems, vol.25, pp.894-907, 2014.
DOI : 10.1109/tnnls.2013.2283140

URL : https://doi.org/10.1109/tnnls.2013.2283140

K. Satyanarayanan, B. Srikanth, and M. Murugesan, Tree Dataset Extraction Using HAC Based Algorithm, 2016.

S. Suthaharan, Machine Learning Models and Algorithms for Big Data Classification: Thinking with Examples for Effective Learning
DOI : 10.1007/978-1-4899-7641-3

. Springer, , 2015.

S. Basu, A. Banerjee, and R. Mooney, Semi-supervised clustering by seeding, Proceedings of 19th International Conference on Machine Learning (ICML-2002, 2002.

J. Horn, J. Krokstad, and J. Amdahl, Joint probability distribution of environmental conditions for design of offshore wind, ASME 2017 36th International Conference on Ocean, Offshore and Arctic Engineering, pp.10-19, 2017.

O. Kisi and K. S. Parmar, Application of least square support vector machine and multivariate adaptive regression spline models in long term prediction of river water pollution, J]. Journal of Hydrology, vol.534, pp.104-112, 2016.

K. Nazeer and M. P. Sebastian, Improving the Accuracy and Efficiency of the k-means Clustering Algorithm, vol.1, pp.1-3, 2009.