One novel representation of DNA sequence based on the global and local position information

被引:22
|
作者
Mo, Zhiyi [1 ]
Zhu, Wen [2 ]
Sun, Yi [2 ]
Xiang, Qilin [2 ]
Zheng, Ming [1 ]
Chen, Min [3 ]
Li, Zejun [3 ]
机构
[1] Wuzhou Univ, Sch Informat & Elect Engn, Wuzhu, Peoples R China
[2] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Hunan, Peoples R China
[3] Hunan Inst Technol, Coll Comp & Informat Sci, Hengyang, Peoples R China
来源
SCIENTIFIC REPORTS | 2018年 / 8卷
关键词
2D GRAPHICAL REPRESENTATION; PROTEIN SEQUENCES; SIMILARITY ANALYSIS; SIMILARITY/DISSIMILARITY ANALYSIS; CURVE; VISUALIZATION;
D O I
10.1038/s41598-018-26005-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
One novel representation of DNA sequence combining the global and local position information of the original sequence has been proposed to distinguish the different species. First, for the sufficient exploitation of global information, one graphical representation of DNA sequence has been formulated according to the curve of Fermat spiral. Then, for the consideration of local characteristics of DNA sequence, attaching each point in the curve of Fermat spiral with the related mass has been applied based on the relationships of neighboring four nucleotides. In this paper, the normalized moments of inertia of the curve of Fermat spiral which composed by the points with mass has been calculated as the numerical description of the corresponding DNA sequence on the first exons of beta-global genes. Choosing the Euclidean distance as the measurement of the numerical descriptions, the similarity between species has shown the performance of proposed method.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] One novel representation of DNA sequence based on the global and local position information
    Zhiyi Mo
    Wen Zhu
    Yi Sun
    Qilin Xiang
    Ming Zheng
    Min Chen
    Zejun Li
    Scientific Reports, 8
  • [2] Implant Global and Local Hierarchy Information to Sequence based Code Representation Models
    Zhang, Kechi
    Li, Zhuo
    Jin, Zhi
    Li, Ge
    2023 IEEE/ACM 31ST INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2023, : 157 - 168
  • [3] A novel representation of DNA sequence based on CMI coding
    Hou, Wenbing
    Pan, Qiuhui
    He, Mingfeng
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2014, 409 : 87 - 96
  • [4] Local and global sequence information in a β-clam protein
    Marcelino, Anna Marie
    Gierasch, Lila
    BIOPHYSICAL JOURNAL, 2007, : 403A - 403A
  • [5] PNP-DIPseAAC: Prediction of Nucleosome Position Based on the DNA Sequence Information
    Xiao, Xuan
    Liu, Zi
    Qiu, Wang-Ren
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 8559 - 8562
  • [6] MPSAGA: a matrix-based pair-wise sequence alignment algorithm for global alignment with position based sequence representation
    Lakhani, Jyoti
    Khunteta, Ajay
    Choudhary, Anupama
    Harwani, Dharmesh
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2019, 44 (07):
  • [7] Representation of a DNA Sequence by a Subchain of its Genetic Information
    Saada, Bacem
    Zhang, Jing
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2015, VOL II, 2015, : 536 - 540
  • [8] MPSAGA: a matrix-based pair-wise sequence alignment algorithm for global alignment with position based sequence representation
    Jyoti Lakhani
    Ajay Khunteta
    Anupama Choudhary
    Dharmesh Harwani
    Sādhanā, 2019, 44
  • [9] LOCAL AND GLOBAL SEMANTIC NETWORKS FOR THE REPRESENTATION OF MUSIC INFORMATION
    Barate, Adriano
    Ludovico, Luca A.
    JOURNAL OF E-LEARNING AND KNOWLEDGE SOCIETY, 2016, 12 (04): : 109 - 123
  • [10] A novel representation of sequence data based on structural information for effective music retrieval
    Lee, CH
    Cho, CW
    Wu, YH
    Chen, ALP
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 : 393 - 404