Vanessa Aguiar-Pulido, Jose A. Seoane, Marcos Gestal and Julian Dorado Pages 779 - 789 ( 11 )
Data mining, a part of the Knowledge Discovery in Databases process (KDD), is the process of extracting patterns from large data sets by combining methods from statistics and artificial intelligence with database management. Analyses of epigenetic data have evolved towards genome-wide and high-throughput approaches, thus generating great amounts of data for which data mining is essential. Part of these data may contain patterns of epigenetic information which are mitotically and/or meiotically heritable determining gene expression and cellular differentiation, as well as cellular fate. Epigenetic lesions and genetic mutations are acquired by individuals during their life and accumulate with ageing. Both defects, either together or individually, can result in losing control over cell growth and, thus, causing cancer development. Data mining techniques could be then used to extract the previous patterns. This work reviews some of the most important applications of data mining to epigenetics.
Epigenetics, data mining, knowledge discovery, bioinformatics
Department of Information and Communication Technologies, Computer Science Faculty, University of A Coruna, Campus de Elvina, S/N, 15071 A Coruña, Spain.