Models of the Gene Must Inform Data-Mining Strategies in Genomics

Cited by: 4
Authors
Huminiecki, Lukasz [1]
Affiliations
[1] Polish Acad Sci, Inst Genet & Anim Biotechnol, Dept Mol Biol, PL-00901 Warsaw, Poland
Funding
EU Horizon 2020
Keywords
gene concept; scientific method; experimentalism; reductionism; anti-reductionism; data-mining; NETWORK MEDICINE; EXPRESSION
DOI
10.3390/e22090942
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
The gene is a fundamental concept of genetics, which emerged with the Mendelian paradigm of heredity at the beginning of the 20th century. However, the concept has since diversified. Somewhat different narratives and models of the gene developed in several sub-disciplines of genetics, namely classical genetics, population genetics, molecular genetics, genomics, and, more recently, systems genetics. Here, I ask how the diversity of the concept impacts data-integration and data-mining strategies for bioinformatics, genomics, statistical genetics, and data science. I also consider the theoretical background of the concept of the gene in the ideas of empiricism and experimentalism, as well as reductionist and anti-reductionist narratives on the concept. Finally, a few strategies of analysis from published examples of data-mining projects are discussed, and the examples are re-interpreted in the light of the theoretical material. I argue that the choice of an optimal level of abstraction for the gene is vital for a successful genome analysis.
Pages: 16