Unordered tree mining with applications to phylogeny

被引:20
|
作者
Shasha, D [1 ]
Wang, JTL [1 ]
Zhang, S [1 ]
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10012 USA
关键词
D O I
10.1109/ICDE.2004.1320039
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Frequent structure mining (FSM) aims to discover and extract patterns frequently occurring in structural data, such as trees and graphs. FSM finds many applications in bioinformatics, XML processing, Web log analysis, and so on. In this paper we present a new FSM technique for finding patterns in rooted unordered labeled trees. The patterns of interest are cousin pairs in these trees. A cousin pair is a pair of nodes sharing the same parent, the same grandparent, or the same great-grandpa rent, etc. Given a tree T, our algorithm finds all interesting cousin pairs of T in O(\T\(2)) time where \T\ is the number of nodes in T. Experimental results on synthetic data and phylogenies show the scalability and effectiveness of the proposed technique. To demonstrate the usefulness of our approach, we discuss its applications to locating co-occurring patterns in multiple evolutionary trees, evaluating the consensus of equally parsimonious trees, and finding kernel trees of groups of phylogenies. We also describe extensions of our algorithms for undirected acyclic graphs (or free trees).
引用
收藏
页码:708 / 719
页数:12
相关论文
共 50 条
  • [1] Mining maximal embedded unordered tree patterns
    Chehreghani, Mostafa Haghir
    Rahgozar, Masoud
    Lucas, Caro
    Chehreghani, Morteza Haghir
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 437 - 443
  • [2] A Novel Coverage Pattern Mining Method for Unordered Tree
    Xia, Ying
    Li, Hong-Xu
    [J]. 4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12
  • [3] UNORDERED TREE CONTRACTION
    HSU, LH
    WANG, JJJ
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1991, 497 : 347 - 349
  • [4] ORDERED AND UNORDERED TREE INCLUSION
    KILPELAINEN, P
    MANNILA, H
    [J]. SIAM JOURNAL ON COMPUTING, 1995, 24 (02) : 340 - 356
  • [5] Interactive Mining with Ordered and Unordered Attributes
    Wang, Weicheng
    Wong, Raymond Chi-Wing
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (11): : 2504 - 2516
  • [6] On Exact Learning of Unordered Tree Patterns
    Thomas R. Amoth
    Paul Cull
    Prasad Tadepalli
    [J]. Machine Learning, 2001, 44 : 211 - 243
  • [7] On exact learning of unordered tree patterns
    Amoth, TR
    Cull, P
    Tadepalli, P
    [J]. MACHINE LEARNING, 2001, 44 (03) : 211 - 243
  • [8] UNORDERED TREE MATCHING AND TREE PATTERN QUERIES IN XML DATABASES
    Chen, Yangjun
    [J]. ICSOFT 2009: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL 2, 2009, : 191 - 198
  • [9] New and improved algorithms for unordered tree inclusion
    Akutsu, Tatsuya
    Jansson, Jesper
    Li, Ruiming
    Takasu, Atsuhiro
    Tamura, Takeyuki
    [J]. THEORETICAL COMPUTER SCIENCE, 2021, 883 : 83 - 98
  • [10] EXACT AND APPROXIMATE ALGORITHMS FOR UNORDERED TREE MATCHING
    SHASHA, D
    WANG, JTL
    ZHANG, KZ
    SHIH, FY
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (04): : 668 - 678