Models of the Gene Must Inform Data-Mining Strategies in Genomics

被引:4
|
作者
Huminiecki, Lukasz [1 ]
机构
[1] Polish Acad Sci, Inst Genet & Anim Biotechnol, Dept Mol Biol, PL-00901 Warsaw, Poland
基金
欧盟地平线“2020”;
关键词
gene concept; scientific method; experimentalism; reductionism; anti-reductionism; data-mining; NETWORK MEDICINE; EXPRESSION;
D O I
10.3390/e22090942
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The gene is a fundamental concept of genetics, which emerged with the Mendelian paradigm of heredity at the beginning of the 20th century. However, the concept has since diversified. Somewhat different narratives and models of the gene developed in several sub-disciplines of genetics, that is in classical genetics, population genetics, molecular genetics, genomics, and, recently, also, in systems genetics. Here, I ask how the diversity of the concept impacts data-integration and data-mining strategies for bioinformatics, genomics, statistical genetics, and data science. I also consider theoretical background of the concept of the gene in the ideas of empiricism and experimentalism, as well as reductionist and anti-reductionist narratives on the concept. Finally, a few strategies of analysis from published examples of data-mining projects are discussed. Moreover, the examples are re-interpreted in the light of the theoretical material. I argue that the choice of an optimal level of abstraction for the gene is vital for a successful genome analysis.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Qualitative Assessment of Data-Mining Workflows
    Znidarsic, Martin
    Bohanec, Marko
    Trdin, Nejc
    FUSING DECISION SUPPORT SYSTEMS INTO THE FABRIC OF THE CONTEXT, 2012, 238 : 75 - 86
  • [32] Data-Mining Possibilities in Blended Learning
    Baksa-Hasko, Gabriella
    Baranyai, Brigitta
    TEACHING AND LEARNING IN A DIGITAL WORLD, 2018, 716 : 174 - 183
  • [33] Fehlende Daten beim Data-Mining
    Dieter William Joenssen
    Thomas Müllerleile
    HMD Praxis der Wirtschaftsinformatik, 2014, 51 (4) : 458 - 468
  • [34] Data mining in genomics
    Lee, Jae K.
    Williams, Paul D.
    Cheon, Sooyoung
    CLINICS IN LABORATORY MEDICINE, 2008, 28 (01) : 145 - +
  • [35] Big data analyticsA review of data-mining models for small and medium enterprises in the transportation sector
    Selamat, Siti Aishah Mohd
    Prakoonwit, Simant
    Sahandi, Reza
    Khan, Wajid
    Ramachandran, Manoharan
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (03)
  • [36] The Comprehensive Phytopathogen Genomics Resource: a web-based resource for data-mining plant pathogen genomes
    Hamilton, John P.
    Neeno-Eckwall, Eric C.
    Adhikari, Bishwo N.
    Perna, Nicole T.
    Tisserat, Ned
    Leach, Jan E.
    Levesque, C. Andre
    Buell, C. Robin
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2011,
  • [37] redAlert: Data-mining and visualisation for IP data analysis
    Kirkham, EA
    Botham, CP
    SAM '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON SECURITY AND MANAGEMENT, 2005, : 24 - 30
  • [38] Raw Wind Data Preprocessing: A Data-Mining Approach
    Zheng, Le
    Hu, Wei
    Min, Yong
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2015, 6 (01) : 11 - 19
  • [39] Data quality analysis using data-mining methods
    Windheuser, U
    OPERATIONS RESEARCH PROCEEDINGS 1999, 2000, : 304 - 310
  • [40] Urban data-mining: spatiotemporal exploration of multidimensional data
    Behnisch, Martin
    Ultsch, Alfred
    BUILDING RESEARCH AND INFORMATION, 2009, 37 (5-6): : 520 - 532