GMASS: a novel measure for genome assembly structural similarity

Cited by: 4
Authors
Kwon, Daehong [1]
Lee, Jongin [1]
Kim, Jaebum [1]
Affiliation
[1] Konkuk Univ, Dept Biomed Sci & Engn, Seoul 05029, South Korea
Source
BMC BIOINFORMATICS | 2019 / Volume 20
Keywords
Measure; Genome; Assembly; Structural similarity; SHORT DNA-SEQUENCES; GAUGE; ABYSS;
DOI
10.1186/s12859-019-2710-z
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Thanks to recent advances in next-generation sequencing (NGS) technologies, a large amount of genomic data, in the form of short DNA sequences known as reads, has been accumulating. Diverse assemblers have been developed to generate high-quality de novo assemblies from NGS reads, but their outputs differ considerably because of algorithmic differences. However, there are no well-structured measures for showing the similarity or difference between assemblies. Results: We developed a new measure, called the GMASS score, for comparing two genome assemblies in terms of their structure. The GMASS score is based on the distribution pattern of the number and coverage of similar regions between a pair of assemblies. When evaluated on simulated assembly datasets, the new measure was able to show structural similarity between assemblies. Applying the GMASS score to assemblies in recently published benchmark datasets showed the divergent performance of current assemblers as well as the score's ability to compare assemblies. Conclusion: The GMASS score is a novel measure for representing structural similarity between two assemblies. It will contribute to the understanding of assembly output and to the development of de novo assemblers.
Pages: 9