Neighbourhood counting metric for sequences

被引:0
|
作者
Wang, Hui [1 ]
Liu, Chang [1 ]
机构
[1] Univ Ulster, Sch Comp & Math, Jordanstown BT37 0QB, North Ireland
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The longest common subsequence (LCS) is a well known and popular method for measuring similarity between sequences. In this paper we consider all common subsequences (ACS) as a measure of sequence similarity with the view that all common information is captured. ACS is inspired and derived from a generic similarity measure, neighbourhood counting metric (NCM). The close connection of NCM with probability and the Bayes classifier helps gain an insight from the probabilistic perspective into ACS. We also design an algorithm to calculate this measure and carry out an experiment in the framework of k-nearest neighbours on a gene sequence classification task. The experiment shows that ACS and LCS have little difference for small k values, but differ significantly for large k values. ACS's performance remains steady as k gets larger, but LCS's performance drops sharply as k gets larger. Such a property may be useful for some tasks.
引用
收藏
页码:243 / 248
页数:6
相关论文
共 50 条
  • [1] Metric and non-metric distances on Zn by generalized neighbourhood sequences
    Nagy, B
    [J]. ISPA 2005: PROCEEDINGS OF THE 4TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2005, : 215 - 220
  • [2] Geometry of neighbourhood sequences
    Hajdu, A
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (15) : 2597 - 2606
  • [3] COUNTING SEQUENCES
    ROBINSON, JP
    COHN, M
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1981, 30 (01) : 17 - 23
  • [4] Counting metric spaces
    Kuba, Gerald
    [J]. ARCHIV DER MATHEMATIK, 2011, 97 (06) : 569 - 578
  • [5] Counting metric spaces
    Gerald Kuba
    [J]. Archiv der Mathematik, 2011, 97 : 569 - 578
  • [6] Metric Temporal Logic with Counting
    Krishna, Shankara Narayanan
    Madnani, Khushraj
    Pandya, Paritosh K.
    [J]. FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATION STRUCTURES (FOSSACS 2016), 2016, 9634 : 335 - 352
  • [7] MUTUALLY COUNTING SEQUENCES
    KAHAN, S
    [J]. FIBONACCI QUARTERLY, 1980, 18 (01): : 47 - 50
  • [8] COUNTING THE GOOD SEQUENCES
    RAWSTHORNE, DA
    [J]. FIBONACCI QUARTERLY, 1987, 25 (02): : 161 - 162
  • [9] Generating distance maps with neighbourhood sequences
    Strand, Robin
    Nagy, Benedek
    Fouard, Celine
    Borgefors, Gunilla
    [J]. DISCRETE GEOMETRY FOR COMPUTER IMAGERY, PROCEEDINGS, 2006, 4245 : 295 - 307
  • [10] Weighted distances based on neighbourhood sequences
    Strand, Robin
    [J]. PATTERN RECOGNITION LETTERS, 2007, 28 (15) : 2029 - 2036