A Phrase-Based Method for Hierarchical Clustering of Web Snippets

Cited by: 0
Authors
Li, Zhao [1 ]
Wu, Xindong [1 ]
Affiliations
[1] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Document clustering has been applied in web information retrieval to organize retrieved results into groups, which helps users browse them quickly. A tree-like hierarchical structure is well suited to presenting these results to web users. We therefore introduce a new method for hierarchical clustering of web snippets that exploits a phrase-based document index. In our method, the hierarchy is built from phrases rather than from all snippets, and the snippets are then assigned to the corresponding clusters, each of which consists of phrases. We show that, compared with traditional hierarchical clustering, our method not only presents meaningful cluster labels but also improves clustering performance.
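The abstract does not spell out the algorithm, so the following is only a minimal Python sketch of the general idea it describes: index snippets by phrase, build the hierarchy over phrases, and read each cluster's snippets off the index. The function names `phrase_index` and `build_hierarchy`, the use of word n-grams as phrases, the `max_len` and `min_df` parameters, and the subset-based parent rule are all assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def phrase_index(snippets, max_len=2, min_df=2):
    """Map each word n-gram (n <= max_len) to the ids of the snippets containing it.

    Assumption: a "phrase" is a contiguous word n-gram; phrases that occur
    in fewer than min_df snippets are dropped.
    """
    index = defaultdict(set)
    for sid, text in enumerate(snippets):
        words = text.lower().split()
        for n in range(1, max_len + 1):
            for i in range(len(words) - n + 1):
                index[" ".join(words[i:i + n])].add(sid)
    return {p: ids for p, ids in index.items() if len(ids) >= min_df}


def build_hierarchy(index):
    """Give each phrase a parent: the smallest phrase cluster that strictly contains it.

    Assumption: the hierarchy is defined by strict set inclusion of snippet
    sets, which cannot produce cycles. Phrases with no superset are roots
    and get parent None.
    """
    parent = {}
    for phrase, ids in index.items():
        supersets = [q for q, q_ids in index.items() if ids < q_ids]
        parent[phrase] = min(
            supersets, key=lambda q: (len(index[q]), q), default=None
        )
    return parent


# Hypothetical snippets for an ambiguous query.
snippets = [
    "jaguar car dealer",
    "jaguar car review",
    "jaguar cat habitat",
    "jaguar cat diet",
]

index = phrase_index(snippets)   # phrase -> snippet ids (the cluster members)
parent = build_hierarchy(index)  # phrase -> parent phrase, or None for a root

for phrase in sorted(index):
    print(phrase, "<-", parent[phrase], sorted(index[phrase]))
```

In this sketch the phrase itself serves as the cluster label, and the hierarchy is computed over the phrases kept in the index rather than over pairwise snippet comparisons, which is the contrast with traditional hierarchical clustering that the abstract draws.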
Pages: 1947-1948
Number of pages: 2