S. Ananiadou and J. Mcnaught, Text Mining for Biology and Biomedicine, 2006.

R. Baeza-yates and B. Ribeiro-neto, Modern Information Retrieval: The Concepts and Technology behind Search, 2011.

B. Chen, R. Harrison, Y. Pan, and P. Tai, Novel Hybrid Hierarchical-K-means Clustering Method (H-K-means) for Microarray Analysis, Proceedings of the 2005 IEEE Computational Systems Bioinformatics Conference -Workshops, pp.105-108, 2005.

A. M. Cohen and W. R. Herch, A survey of current work in biomedical text mining, Briefings in Bioinformatics, vol.6, issue.1, pp.57-71, 2005.
DOI : 10.1093/bib/6.1.57

H. J. Dai, J. Y. Lin, C. H. Huang, P. H. Chou, R. T. Tsai et al., A Survey of State of the Art Biomedical Text Mining Techniques for Semantic Analysis, 2008 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (sutc 2008), pp.410-417, 2008.
DOI : 10.1109/SUTC.2008.86

J. Dean and S. Ghemawat, MapReduce, Proceedings of the 6th Symposium on Operating Systems Design and Implementation, pp.137-150, 2004.
DOI : 10.1145/1327452.1327492

I. S. Dhillon, Y. Guan, and J. Kogan, Iterative clustering of high dimensional text data augmented by local search, 2002 IEEE International Conference on Data Mining, 2002. Proceedings., pp.131-138, 2002.
DOI : 10.1109/ICDM.2002.1183895

M. Georgitsi, E. Viennas, V. Gkantouna, E. Christodoulopoulou, Z. Zagoriti et al., Population-specific documentation of pharmacogenomic markers and their allelic frequencies in FINDbase, Pharmacogenomics, vol.12, issue.1, pp.49-58, 2011.
DOI : 10.2217/pgs.10.169

J. Han and M. Kamber, Data Mining, 2006.
DOI : 10.1007/978-1-4899-7993-3_104-2

M. Ioannou, C. Makris, G. Tzimas, and E. Viennas, A Text Mining Approach for Biomedical Documents, Proceedings of the 6th Conference of the Hellenic Society for Computational Biology and Bioinformatics, 2011.

M. Ioannou, G. Patrinos, and G. Tzimas, Genome-Based Population Clustering: Nuggets of Truth Buried in a Pile of Numbers?, Proceedings of the 1st Workshop on Algorithms for Data and Text Mining in Bioinformatics organized in the 8th Artificial Intelligence Applications and Innovations Conference, 2012.
DOI : 10.1007/978-3-642-33412-2_62

K. Inoue and K. Urahama, Fuzzy Clustering Based on Cooccurence Matrix and Its Application to Data Retrieval, Electron. Comm. Jpn. Pt, vol.2, issue.84, pp.10-19, 2001.

M. Ioannou, C. Makris, G. Patrinos, and G. Tzimas, A set of novel mining tools for efficient biological knowledge discovery, Artificial Intelligence Review, vol.1, issue.2, 2013.
DOI : 10.1007/s10462-013-9413-z

J. Kogan, Introduction to Clustering Large and High-Dimensional Data, pp.51-72, 2007.

Z. Lu, PubMed and beyond: a survey of web tools for searching biomedical literature, Database, vol.2011, issue.0, 2011.
DOI : 10.1093/database/baq036

M. Steinbach, G. Karypis, and V. Kumar, A Comparison of Document Clustering Techniques, Proceedings of the KDD Workshop on Text Mining, 6th ACM SIGKDD International Conference on Data Mining, 2000.

S. Van-baal, P. Kaimakis, M. Phommarinh, D. Koumbi, H. Cuppens et al., FINDbase: a relational database recording frequencies of genetic defects leading to inherited disorders worldwide, Nucleic Acids Research, vol.35, issue.Database, 2007.
DOI : 10.1093/nar/gkl934

E. Viennas, V. Gkantouna, M. Ioannou, M. Georgitsi, M. Rigou et al., Population-ethnic group specific genome variation allele frequency data: A querying and visualization journey, Genomics, vol.100, issue.2, pp.93-101, 2012.
DOI : 10.1016/j.ygeno.2012.05.009

T. White, Hadoop: The Definitive Guide, 2012.

C. Zhang and S. Xia, K-means Clustering Algorithm with Improved Initial Center, Knowledge Discovery and Data Mining, pp.790-792, 2009.

T. Zhang, R. Ramakrishnan, and M. Livny, BIRCH: an Efficient Data Clustering Method for Very Large Databases, Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, pp.103-114, 1996.

T. Zhang, R. Ramakrishnan, and M. Livny, BIRCH: a New Data Clustering Algorithm and its Applications, Data Mining and Knowledge Discovery, vol.1, issue.2, pp.141-182, 1997.
DOI : 10.1023/A:1009783824328