On complexity measures for biological sequences

被引:0
|
作者
Nan, F [1 ]
Adjeroh, D [1 ]
机构
[1] W Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26506 USA
关键词
D O I
暂无
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
In this work, we perform an empirical study of different published measures of complexity for general sequences, to determine their effectiveness An dealing with biological sequences. By effectiveness, we refer to how closely the given complexity measure is able to identify known biologically relevant relationships, such as closeness on a phylogenic tree. In particular, we study three complexity measures, namely, the traditional Shanon's entropy, linguistic complexity, and T-complexity. For each complexity measure, we construct the complexity profile for each sequence An our test set, and based on the profiles we compare the sequences using different performance measures based on: (A) the information theoretic divergence measure of relative entropy; (H) apparent periodicity in the complexity profile; and (AN) correct phylogeny. The preliminary results show that the T-complexity was the Deast effective An identifying previously established known associations between the sequences in our test set. Shannon's entropy and linguistic-complexity provided better results, with Shannon's entropy having an upper hand.
引用
收藏
页码:522 / 526
页数:5
相关论文
共 50 条
  • [1] Artificial sequences and complexity measures
    Baronchelli, A
    Caglioti, E
    Loreto, V
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2005,
  • [2] Complexity measures for assembly sequences
    Goldwasser, MH
    Motwani, R
    INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 1999, 9 (4-5) : 371 - 417
  • [3] Complexity measures for assembly sequences
    Goldwasser, M
    Latombe, JC
    Motwani, R
    1996 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, PROCEEDINGS, VOLS 1-4, 1996, : 1851 - 1857
  • [4] On the complexity measures of genetic sequences
    Gusev, VD
    Nemytikova, LA
    Chuzhanova, NA
    BIOINFORMATICS, 1999, 15 (12) : 994 - 999
  • [5] Linear complexity and related complexity measures for sequences
    Niederreiter, H
    PROGRESS IN CRYPTOLOGY -INDOCRYPT 2003, 2003, 2904 : 1 - 17
  • [6] The performances of the chi-square test and complexity measures for signal recognition in biological sequences
    Pirhaji, Leila
    Kargar, Mehdi
    Sheari, Armita
    Poormohammadi, Hadi
    Sadeghi, Mehdi
    Pezeshk, Hamid
    Eslahchi, Changiz
    JOURNAL OF THEORETICAL BIOLOGY, 2008, 251 (02) : 380 - 387
  • [7] Complexity Measures for EEG Microstate Sequences: Concepts and Algorithms
    Von Wegner, Frederic
    Wiemers, Milena
    Hermann, Gesine
    Toedt, Inken
    Tagliazucchi, Enzo
    Laufs, Helmut
    BRAIN TOPOGRAPHY, 2024, 37 (02) : 296 - 311
  • [8] Complexity Measures for EEG Microstate Sequences: Concepts and Algorithms
    Frederic von Wegner
    Milena Wiemers
    Gesine Hermann
    Inken Tödt
    Enzo Tagliazucchi
    Helmut Laufs
    Brain Topography, 2024, 37 : 296 - 311
  • [9] A Survey on Complexity Measures for Pseudo-Random Sequences
    Li, Chunlei
    CRYPTOGRAPHY, 2024, 8 (02)
  • [10] Distance measures for biological sequences: Some recent approaches
    Mantaci, Sabrina
    Restivo, Antonio
    Sciortino, Marinella
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2008, 47 (01) : 109 - 124