Mining subtrees with frequent occurrence of similar subtrees

被引:0
|
作者
Tosaka, Hisashi [1 ]
Nakamura, Atsuyoshi [1 ]
Kudo, Mineichi [1 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Kita Ku, Kita 14,Nishi 9, Sapporo, Hokkaido 0600814, Japan
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study a novel problem of mining subtrees with frequent occurrence of similar subtrees, and propose an algorithm for this problem. In our problem setting, frequency of a subtree is counted not only for equivalent subtrees but also for similar subtrees. According to our experiment using tag trees of web pages, this problem can be solved fast enough for practical use. An encouraging result was obtained in a preliminary experiment for data record extraction from web pages using our mining method.
引用
收藏
页码:286 / +
页数:2
相关论文
共 50 条
  • [21] Finding Good Subtrees for Constraint Optimization Problems Using Frequent Pattern Mining
    Li, Hongbo
    Lee, Jimmy
    Mi, He
    Yin, Minghao
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1577 - 1584
  • [22] EFoX: A scalable method for extracting frequent subtrees
    Paik, J
    Shin, DR
    Kim, U
    [J]. COMPUTATIONAL SCIENCE - ICCS 2005, PT 3, 2005, 3516 : 813 - 817
  • [23] A Fast Algorithm of Mining Induced Subtrees
    Li, Yun
    Guo, Xin
    Yuan, Yunhao
    Wu, Jia
    Chen, Ling
    [J]. 2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 195 - 199
  • [24] Clustering XML Documents Using Frequent Subtrees
    Kutty, Sangeetha
    Tran, Tien
    Nayak, Richi
    Li, Yuefeng
    [J]. ADVANCES IN FOCUSED RETRIEVAL, 2009, 5631 : 436 - 445
  • [25] Research on a frequent maximal induced subtrees mining method based on the compression tree sequence
    Wang, Jing
    Liu, Zhaojun
    Li, Wei
    Li, Xiongfei
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (01) : 94 - 100
  • [26] Mining maximal frequent subtrees with lists-based pattern-growth method
    Paik, Juryon
    Nam, Junghyun
    Hwang, Jaegak
    Kim, Ung Mo
    [J]. PROGRESS IN WWW RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 4976 : 93 - +
  • [27] Probabilistic frequent subtrees for efficient graph classification and retrieval
    Welke, Pascal
    Horvath, Tamas
    Wrobel, Stefan
    [J]. MACHINE LEARNING, 2018, 107 (11) : 1847 - 1873
  • [28] Discovering frequent agreement subtrees from phylogenetic data
    Zhang, Sen
    Wang, Jason T. L.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (01) : 68 - 82
  • [29] Efficient Identification of Frequent Family Subtrees in Tree Database
    Lee, Kyung Mi
    Lee, Keon Myung
    [J]. INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 3165 - 3170
  • [30] Extraction of Frequent Tree Patterns without Subtrees Maintenance
    Paik, Juryon
    Lee, Eunjoo
    Choi, Wongil
    Kim, Ung Mo
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING SYMPOSIA, VOLS 1-5, PROCEEDINGS, 2008, : 150 - 155