Maize Feature Store: A centralized resource to manage and analyze curated maize multi-omics features for machine learning applications

Cited by: 3
Authors
Sen, Shatabdi [1]
Woodhouse, Margaret R. [2]
Portwood II, John L. [2]
Andorf, Carson M. [2, 3]
Affiliations
[1] Iowa State Univ, Dept Plant Pathol & Microbiol, 1344 Adv Teaching & Res Bldg,2213 Pammel Dr, Ames, IA 50011 USA
[2] Corn Insects & Crop Genet Res Unit, USDA ARS, 819 Wallace Rd, Ames, IA 50011 USA
[3] Iowa State Univ, Dept Comp Sci, Atanasoff Hall,2434 Osborn Dr, Ames, IA 50011 USA
Keywords
PAN-GENOME ANALYSIS; GENE-EXPRESSION; PREDICTION; ANNOTATION; LANDSCAPE; DIVERSITY; ATLAS; DNA;
DOI
10.1093/database/baad078
Chinese Library Classification number
Q [Biological Sciences];
Subject classification codes
07; 0710; 09;
Abstract
The big-data analysis of complex data associated with maize genomes accelerates genetic research and improves agronomic traits. As a result, efforts have increased to integrate diverse datasets and extract meaning from these measurements. Machine learning models are a powerful tool for gaining knowledge from large and complex datasets. However, these models must be trained on high-quality features to succeed. Currently, there are no solutions to host maize multi-omics datasets with end-to-end solutions for evaluating and linking features to target gene annotations. Our work presents the Maize Feature Store (MFS), a versatile application that combines features built on complex data to facilitate exploration, modeling and analysis. Feature stores allow researchers to rapidly deploy machine learning applications by managing and providing access to frequently used features. We populated the MFS for the maize reference genome with over 14 000 gene-based features based on published genomic, transcriptomic, epigenomic, variomic and proteomics datasets. Using the MFS, we created an accurate pan-genome classification model with an AUC-ROC score of 0.87. The MFS is publicly available through the maize genetics and genomics database.
Database URL: https://mfs.maizegdb.org/
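The AUC-ROC score quoted in the abstract can be read as the probability that a randomly chosen positive example receives a higher model score than a randomly chosen negative one. The following is a minimal sketch of that computation in plain Python. It is not the authors' pipeline: the function name `auc_roc`, the toy labels and the toy scores are all hypothetical and serve only to illustrate how the metric is defined.

```python
def auc_roc(labels, scores):
    """Return the AUC-ROC of binary labels (1 = positive, 0 = negative).

    Computed as the fraction of (positive, negative) pairs in which the
    positive example has the higher score; tied pairs count as half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: six genes with made-up class labels and model scores.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
toy_auc = auc_roc(toy_labels, toy_scores)
print(f"toy AUC-ROC: {toy_auc:.3f}")
```

A score of 0.5 corresponds to random ranking and 1.0 to perfect separation, so a value such as 0.87 means the model ranks a positive gene above a negative one in roughly 87% of such pairs. The pairwise loop is quadratic in the number of examples and is meant for illustration only; a rank-based implementation would be preferable for large gene sets.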
Pages: 12