Dimensionality Reduction for Cluster Identification in Metagenomics using Autoencoders

被引:2
|
作者
Maduranga, Uditha [1 ]
Wijegunarathna, Kalana [1 ]
Weerasinghe, Sadeep [1 ]
Perera, Indika [1 ]
Wickramarachchi, Anuradha [2 ]
机构
[1] Univ Moratuwa, Dept Comp Sci & Engn, Katubedda, Sri Lanka
[2] Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT, Australia
关键词
metagenomics; metagenomic data visualizations; nonlinear dimensionality reduction; autoencoders; clustering;
D O I
10.1109/ICTer51097.2020.9325447
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Metagenomics is the study of the genomic content of the microbial organisms extracted from a sample in their natural habitats. These unknown collections of genomic data are analyzed without any prior lab-based cultivation to avoid amplification bias. One of the vital aspects of metagenomics analysis is the visualization of the information that is derived from the genomic sequences of a microbiome sample. In a successful visualization, the congruent reads of the sequences should appear in clusters depending on the diversity and taxonomy of the microorganisms in the sequenced sample. In converting higher dimensional sequence data into lower dimensional data for visualization purposes, preserving the genomic characteristics is given the highest priority. In this process, the demand for precise and efficient methods of dimensionality reduction is crucial. Currently, Principle Component Analysis (PCA) and t-distributed Stochastic Neighbor Embedding (t-SNE) are used for dimensionality reduction purposes in metagenomics, which are linear and non-linear techniques respectively. Although the above-mentioned techniques are widely used, there are shortcomings in accuracy and efficiency in terms of visualizations. In this paper, we explore the possibility of using autoencoders, a deep learning technique, to get a rich dimensionality reduction, overcoming the prevailing impediments of PCA and t-SNE and outperforming them to achieve better metagenomic visualizations.
引用
收藏
页码:113 / 118
页数:6
相关论文
共 50 条
  • [1] Dimensionality Reduction Using Convolutional Autoencoders
    Mittal, Shweta
    Sangwan, Om Prakash
    [J]. ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY AND COMPUTING, AICTC 2021, 2022, 392 : 507 - 516
  • [2] Dimensionality Reduction for Image Features using Deep Learning and Autoencoders
    Petscharnig, Stefan
    Lux, Mathias
    Chatzichristofis, Savvas
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2017,
  • [3] Dimensionality Reduction of Mass Spectrometry Imaging Data using Autoencoders
    Thomas, Spencer A.
    Race, Alan M.
    Steven, Rory T.
    Gilmore, Ian S.
    Bunch, Josephine
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [4] Dimensionality Reduction Applied to Time Response of Linear Systems Using Autoencoders
    Paniagua, Jose L.
    Lopez, Jesus A.
    [J]. 2019 IEEE COLOMBIAN CONFERENCE ON APPLICATIONS IN COMPUTATIONAL INTELLIGENCE (COLCACI), 2019,
  • [5] Dimensionality Reduction Using Discriminative Autoencoders for Remote Sensing Image Retrieval
    Mohbat
    Mukhtar, Tooba
    Khurshid, Numan
    Taj, Murtaza
    [J]. IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 499 - 508
  • [6] Dimensionality Reduction of SDSS Spectra with Variational Autoencoders
    Portillo, Stephen K. N.
    Parejko, John K.
    Vergara, Jorge R.
    Connolly, Andrew J.
    [J]. ASTRONOMICAL JOURNAL, 2020, 160 (01):
  • [7] Big Data Dimensionality Reduction for Wireless Sensor Networks Using Stacked Autoencoders
    Sirshar, Muneeba
    Saleem, Sajid
    Ilyas, Muhammad U.
    Khan, Muhammad Murtaza
    Alkatheiri, Mohammed Saeed
    Alowibdi, Jalal S.
    [J]. RESEARCH & INNOVATION FORUM 2019: TECHNOLOGY, INNOVATION, EDUCATION, AND THEIR SOCIAL IMPACT, 2019, : 391 - 400
  • [8] Reducing Dimensionality of Data Using Autoencoders
    Janakiramaiah, B.
    Kalyani, G.
    Narayana, S.
    Krishna, T. Bala Murali
    [J]. SMART INTELLIGENT COMPUTING AND APPLICATIONS, VOL 2, 2020, 160 : 51 - 58
  • [9] Extreme Dimensionality Reduction for Network Attack Visualization with Autoencoders
    Ferreira, Daniel C.
    Vazquez, Felix Iglesias
    Zseby, Tanja
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [10] Multi-label Text Classification Using Semantic Features and Dimensionality Reduction with Autoencoders
    Alkhatib, Wael
    Rensing, Christoph
    Silberbauer, Johannes
    [J]. LANGUAGE, DATA, AND KNOWLEDGE, LDK 2017, 2017, 10318 : 380 - 394