6.2 Feature selection
Classification of antibacterials and non-antibacterials
Feature selection is an important step in KDD, since it reduces the complexity of a dataset.
According to a recent report on antibiotic research released Sept. 17 by the London School of Economics and Political Science (LSE), 175,000 deaths are attributed to hospital-acquired infections each year in Europe alone.
Fast algorithms for mining association rules, Proc. 20th int. conf. very large data bases, VLDB, vol.1215, p.487-499, 1994.
Graph modularity maximization as an effective method for co-clustering text data, Knowledge-Based Systems, vol.109, p.160-173, 2016.
Exploratory knowledge discovery over web of data, Discrete Applied Mathematics, vol.249, p.2-17, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01673439
Latviz: A new practical tool for performing interactive exploration over concept lattices, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01420751
Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling, Nature, vol.403, issue.6769, p.503, 2000.
Multi-manifold matrix decomposition for data coclustering, Pattern Recognition, vol.64, p.386-398, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01408092
In-Close, a fast algorithm for computing formal concepts, International Conference on Conceptual Structures (ICCS), 2009.
In-Close2, a high performance formal concept miner, International Conference on Conceptual Structures, vol.5062, 2011.
A Hybrid Classification Approach based on FCA and Emerging Patterns - An application for the classification of biological inhibitors, Proceedings of CLA, CEUR Workshop Proceedings, vol.972, p.211-222, 2012.
Sequential pattern mining using a bitmap representation, Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, p.429-435, 2002.
Discovering local structure in gene expression data: the order-preserving submatrix problem, Journal of Computational Biology, vol.10, issue.3-4, p.373-384, 2003.
A simple algorithm to generate the minimal separators and the maximal cliques of a chordal graph, Information Processing Letters, vol.111, issue.11, p.508-511, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00678694
Random forests, Machine Learning, vol.45, issue.1, p.5-32, 2001.
Receiver operating characteristics curves and related decision measures: A tutorial, Chemometrics and Intelligent Laboratory Systems, vol.80, issue.1, p.24-38, 2006.
On mining complex sequential data by means of FCA and pattern structures, International Journal of General Systems, vol.45, issue.2, p.135-159, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01186715
Automatic selection of molecular descriptors using random forest: Application to drug discovery, Expert Systems with Applications, vol.72, p.151-159, 2017.
LIBSVM: A library for support vector machines, ACM Transactions on Intelligent Systems and Technology (TIST), vol.2, issue.3, p.27, 2011.
Biclustering of expression data, vol.8, p.93-103, 2000.
A proposition for sequence mining using pattern structures, Proceedings of ICFCA, p.106-121, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01549107
Lattice-based biclustering using partition pattern structures, Proceedings of the Twenty-first European Conference on Artificial Intelligence, pp.213-218, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01095865
Contributions à l'indexation et à la récupération d'information utilisant l'analyse formelle de concepts, 2015.
Support-vector networks, Machine Learning, vol.20, issue.3, p.273-297, 1995.
Elements about exploratory, knowledge-based, hybrid, and explainable knowledge discovery, International Conference on Formal Concept Analysis, p.3-16, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02195480
Dimensionality reduction based on distance preservation to local mean for symmetric positive definite matrices and its application in brain-computer interfaces, Journal of Neural Engineering, vol.14, issue.3, p.36019, 2017.
Mining frequent gradual itemsets from large databases, International Symposium on Intelligent Data Analysis, 2009.
On the equivalence of nonnegative matrix factorization and spectral clustering, Proceedings of the 2005 SIAM International Conference on Data Mining, p.606-610, 2005.
Orthogonal nonnegative matrix t-factorizations for clustering, Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, p.126-135, 2006.
Efficient mining of emerging patterns: Discovering trends and differences, Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, p.43-52, 1999.
On measuring similarity for sequences of itemsets, Data Mining and Knowledge Discovery, vol.29, issue.3, p.732-764, 2015.
URL : https://hal.archives-ouvertes.fr/hal-00740231
A density-based algorithm for discovering clusters in large spatial databases with noise, vol.96, p.226-231, 1996.
Why so many clustering algorithms: A position paper, SIGKDD Explorations, vol.4, issue.1, p.65-75, 2002.
From data mining to knowledge discovery in databases, AI Magazine, vol.17, issue.3, p.37, 1996.
The global challenge of new classes of antibacterial agents: an industry perspective, Current Opinion in Pharmacology, vol.24, p.7-11, 2015.
Advances in evolutionary computing, 2003.
Pattern structures and their projections, International Conference on Conceptual Structures, p.129-142, 2001.
Formal concept analysis: mathematical foundations, Springer Science & Business Media, 2012.
Gene selection for cancer classification using support vector machines, Machine Learning, vol.46, issue.1-3, p.389-422, 2002.
The analysis of frequency data: Statistical research monographs, 1977.
PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth, Proceedings of the 17th International Conference on Data Engineering, p.215-224, 2001.
Direct clustering of a data matrix, Journal of the American Statistical Association, vol.67, issue.337, p.123-129, 1972.
Gene-expression profiles in hereditary breast cancer, New England Journal of Medicine, vol.344, issue.8, p.539-548, 2001.
BicPAMS: software for biological data analysis with pattern-based biclustering, BMC Bioinformatics, vol.18, issue.1, p.82, 2017.
BicPAM: Pattern-based biclustering for biomedical data analysis, Algorithms for Molecular Biology, vol.9, issue.1, p.27, 2014.
BicSPAM: flexible biclustering using sequential patterns, BMC Bioinformatics, vol.15, issue.1, p.130, 2014.
BiC2PAM: constraint-guided biclustering for biological data analysis with domain knowledge, Algorithms for Molecular Biology, vol.11, issue.1, p.23, 2016.
BicNET: Flexible module discovery in large-scale biological networks using biclustering, Algorithms for Molecular Biology, vol.11, issue.1, p.14, 2016.
F2G: Efficient discovery of full-patterns, p.19, 2013.
FABIA: Factor analysis for bicluster acquisition, Bioinformatics, vol.26, issue.12, p.1520-1527, 2010.
An experiment about the classification of antibacterial molecules, Orpailleur team, 2015.
Biclustering of human cancer microarray data using cosimilarity based co-clustering, Expert Systems with Applications, vol.55, p.520-531, 2016.
Concept-based biclustering for internet advertisement, Data Mining Workshops (ICDMW), p.123-130, 2012.
Recommender system based on algorithm of bicluster analysis RecBi, 2012.
Towards a unified taxonomy of biclustering methods, 2017.
Computational mapping tools for drug discovery, Drug Discovery Today, vol.14, p.767-775, 2009.
Irrelevant features and the subset selection problem, Machine Learning Proceedings, p.121-129, 1994.
Hierarchical clustering schemes, Psychometrika, vol.32, issue.3, p.241-254, 1967.
A fast and high quality multilevel scheme for partitioning irregular graphs, SIAM Journal on Scientific Computing, vol.20, issue.1, p.359-392, 1998.
Embedding tolerance relations in formal concept analysis: an application in information fusion, Proceedings of the 19th ACM International Conference on Information and Knowledge Management, p.1689-1692.
URL : https://hal.archives-ouvertes.fr/inria-00600205
Biclustering numerical data in formal concept analysis, International Conference on Formal Concept Analysis, p.135-150, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00600203
Mining gene expression data with pattern structures in formal concept analysis, Information Sciences, vol.181, issue.10, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00541100
Drug/nondrug classification using support vector machines with various feature selection strategies, Computer Methods and Programs in Biomedicine, vol.117, issue.2, p.51-60, 2014.
Analysis and prediction of museum visitors' behavioral pattern types, Ubiquitous Display Environments, p.161-176, 2012.
A fast algorithm for computing all intersections of objects from an arbitrary semilattice, p.17-20, 1993.
Concept stability for constructing taxonomies of web-site users, 2009.
Comparing performance of algorithms for generating concept lattices, Journal of Experimental & Theoretical Artificial Intelligence, vol.14, issue.2-3, p.189-216, 2002.
Hard and fuzzy diagonal co-clustering for document-term partitioning, Neurocomputing, vol.193, p.133-147, 2016.
The influence of a location-aware mobile guide on museum visitors' behavior, Interacting with Computers, vol.25, issue.6, p.443-460, 2013.
Gene-gene interaction analysis for the accelerated failure time model using a unified model-based multifactor dimensionality reduction method, Genomics & Informatics, vol.14, issue.4, p.166, 2016.
Using recursive classification to discover predictive features, Proceedings of the 2005 ACM Symposium on Applied Computing, p.1054-1058, 2005.
Analysis and prediction of drug-drug interaction by minimum redundancy maximum relevance and incremental feature selection, Journal of Biomolecular Structure and Dynamics, vol.35, issue.2, p.312-329, 2017.
A comparative study on feature selection methods for drug discovery, Journal of Chemical Information and Computer Sciences, vol.44, issue.5, p.1823-1828, 2004.
Biclustering algorithms for biological data analysis: a survey, IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), vol.1, issue.1, p.45, 2004.
AddIntent: A new incremental algorithm for constructing concept lattices, International Conference on Formal Concept Analysis, p.372-385, 2004.
A systematic comparative evaluation of biclustering techniques, BMC Bioinformatics, vol.18, issue.1, p.55, 2017.
A statistically efficient and scalable method for log-linear analysis of high-dimensional data, 2014 IEEE International Conference on Data Mining, p.480-489, 2014.
Scaling log-linear analysis to datasets with thousands of variables, Proceedings of the 2015 SIAM International Conference on Data Mining, p.469-477, 2015.
Scaling log-linear analysis to high-dimensional data, 2013 IEEE International Conference on Data Mining, p.597-606, 2013.
A novel biclustering algorithm for the discovery of meaningful biological correlations between microRNAs and their target genes, BMC Bioinformatics, vol.14, issue.7, p.8, 2013.
Hierarchical and overlapping co-clustering of mRNA:miRNA interactions, ECAI, pp.654-659, Citeseer, 2012.
ComiRNet: a web-based system for the analysis of miRNA-gene regulatory networks, BMC Bioinformatics, vol.16, issue.9, p.7, 2015.
Biclustering on expression data: A review, Journal of Biomedical Informatics, vol.57, p.163-180, 2015.
Combining in silico evolution and nonlinear dimensionality reduction to redesign responses of signaling networks, Physical Biology, vol.13, issue.6, p.66015, 2017.
R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, 2014.
Nonlinear dimensionality reduction and mapping of compound libraries for drug discovery, Journal of Molecular Graphics and Modelling, vol.34, p.117, 2012.
Two-mode multi-partitioning, Computational Statistics & Data Analysis, vol.52, issue.4, p.1984-2003, 2008.
Comparison of automatic three-dimensional model builders using 639 x-ray structures, Journal of Chemical Information and Computer Sciences, vol.34, issue.4, p.1000-1008, 1994.
Word co-occurrence regularized non-negative matrix tri-factorization for text data co-clustering, Thirty-Second AAAI Conference on Artificial Intelligence, 2018.
Using Multivariate Statistics, vol.5, 2007.
Simultaneous feature selection and feature weighting using hybrid tabu search/k-nearest neighbor classifier, Pattern Recognition Letters, vol.28, issue.4, p.438-446, 2007.
Discovering statistically significant biclusters in gene expression data, Bioinformatics, vol.18, issue.suppl_1, pp.136-144, 2002.
Prediction of cell-penetrating peptides with feature selection techniques, Biochemical and Biophysical Research Communications, vol.477, issue.1, p.150-154, 2016.
Feature selection for classification: A review, Data Classification: Algorithms and Applications, p.37, 2014.
Molecular Descriptors for Chemoinformatics, vol.41, 2009.
Enumerating all maximal biclusters in numerical datasets, Information Sciences, vol.379, p.288-309, 2017.
Double k-means clustering for simultaneous classification of objects and variables, Advances in Classification and Data Analysis, p.43-52, 2001.
Ethnographie de l'exposition, Bibliothèque Publique d'Information, Centre Georges Pompidou, 1983.
Layered critical values: a powerful direct-adjustment approach to discovering significant patterns, Machine Learning, vol.71, issue.2-3, p.307-323, 2008.
Prediction of human disease-associated phosphorylation sites with combined feature selection approach and support vector machine, IET Systems Biology, vol.9, issue.4, p.155-163, 2015.
Effect of molecular descriptor feature selection in support vector machine classification of pharmacokinetic and toxicological properties of chemical agents, Journal of Chemical Information and Computer Sciences, vol.44, issue.5, p.1630-1638, 2004.
A biclustering algorithm with coherent evolution on the contiguous columns facing time-series gene data, 2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), p.328-333, 2014.
A new approach for the deep order preserving submatrix problem based on sequential pattern mining, International Journal of Machine Learning and Cybernetics, vol.9, issue.2, p.263-279, 2018.
Nonlinear dimensionality reduction methods for synthetic biology biobricks' visualization, BMC Bioinformatics, vol.18, issue.1, p.47, 2017.
Development of in silico models for predicting P-glycoprotein inhibitors based on a two-step approach for feature selection and its application to Chinese herbal medicine screening, Molecular Pharmaceutics, vol.12, issue.10, p.3691-3713, 2015.
Efficient algorithms for mining closed itemsets and their lattice structure, IEEE Transactions on Knowledge and Data Engineering, vol.17, issue.4, p.462-478, 2005.
Analyzing museum visitors' behavior patterns, International Conference on User Modeling, p.238-246, 2007.
Feature selection using tabu search method, Pattern Recognition, vol.35, issue.3, p.701-711, 2002.
Unsupervised 2D dimensionality reduction with adaptive structure learning, Neural Computation, vol.29, issue.5, p.1352-1374, 2017.