A Phrase-Based Method for Hierarchical Clustering of Web Snippets

被引:0
|
作者
Li, Zhao [1 ]
Wu, Xindong [1 ]
机构
[1] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document clustering has been applied in web information retrieval, which facilitates users' quick browsing by organizing retrieved results into different groups. Meanwhile, a tree-like hierarchical structure is well-suited for organizing the retrieved results in favor of web users. In this regard, we introduce a new method for hierarchical clustering of web snippets by exploiting a phrase-based document index. In our method, a hierarchy of web snippets is built based on phrases instead of all snippets, and the snippets are then assigned to the corresponding clusters consisting of phrases. We show that, as opposed to the traditional hierarchical clustering, our method not only presents meaningful cluster labels but also improves clustering performance.
引用
收藏
页码:1947 / 1948
页数:2
相关论文
共 50 条
  • [21] Learning Word Reorderings for Hierarchical Phrase-based Statistical Machine Translation
    Zhang, Jingyi
    Utiyama, Masao
    Sumita, Eiichro
    Zhao, Hai
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 542 - 548
  • [22] UCCAApp: Web-application for Syntactic and Semantic Phrase-based Annotation
    Abend, Omri
    Yerushlami, Shai
    Rappoport, Ari
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 109 - 114
  • [23] Statistical phrase-based translation
    Koehn, P
    Och, FJ
    Marcu, D
    [J]. HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 127 - 133
  • [24] On the Cost of Phrase-Based Ranking
    Petri, Matthias
    Moffat, Alistair
    [J]. SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 931 - 934
  • [25] A PHRASE-BASED MATCHING FUNCTION
    GALBIATI, G
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1991, 42 (01): : 36 - 48
  • [26] Phrase-based Image Captioning
    Lebret, Remi
    Pinheiro, Pedro O.
    Collobert, Ronan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2085 - 2094
  • [27] Left-to-Right Target Generation for Hierarchical Phrase-based Translation
    Watanabe, Taro
    Tsukada, Hajime
    Isozaki, Hideki
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 777 - 784
  • [28] ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering
    Ren, Xiang
    El-Kishky, Ahmed
    Wang, Chi
    Tao, Fangbo
    Voss, Clare R.
    Ji, Heng
    Han, Jiawei
    [J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 995 - 1004
  • [29] Integrating Phrase Inseparability in Phrase-Based Model
    Shi, Lixin
    Nie, Jian-Yun
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 708 - 709
  • [30] The anatomy of a hierarchical clustering engine for web-page, news and book snippets
    Ferragina, P
    Gullì, A
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 395 - 398