A Phrase-Based Method for Hierarchical Clustering of Web Snippets

被引:0
|
作者
Li, Zhao [1 ]
Wu, Xindong [1 ]
机构
[1] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document clustering has been applied in web information retrieval, which facilitates users' quick browsing by organizing retrieved results into different groups. Meanwhile, a tree-like hierarchical structure is well-suited for organizing the retrieved results in favor of web users. In this regard, we introduce a new method for hierarchical clustering of web snippets by exploiting a phrase-based document index. In our method, a hierarchy of web snippets is built based on phrases instead of all snippets, and the snippets are then assigned to the corresponding clusters consisting of phrases. We show that, as opposed to the traditional hierarchical clustering, our method not only presents meaningful cluster labels but also improves clustering performance.
引用
收藏
页码:1947 / 1948
页数:2
相关论文
共 50 条
  • [31] Soft syntactic constraints for Arabic-English hierarchical phrase-based translation
    Marton, Yuval
    Chiang, David
    Resnik, Philip
    [J]. MACHINE TRANSLATION, 2012, 26 (1-2) : 137 - 157
  • [32] Learning local word reorderings for hierarchical phrase-based statistical machine translation
    Zhang, Jingyi
    Utiyama, Masao
    Sumita, Eiichro
    Zhao, Hai
    Neubig, Graham
    Nakamura, Satoshi
    [J]. MACHINE TRANSLATION, 2016, 30 (1-2) : 1 - 18
  • [33] phi-LSTM: A Phrase-Based Hierarchical LSTM Model for Image Captioning
    Tan, Ying Hua
    Chan, Chee Seng
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 101 - 117
  • [34] Deriving phrase-based language models
    Heeman, PA
    Damnati, G
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 41 - 48
  • [35] Improved techniques for phrase-based translation
    Ruiz Costa-Jussa, Marta
    Fonollosa, Jose A. R.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 351 - 356
  • [36] Statistical phrase-based speech translation
    Mathias, Lambert
    Byrne, William
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 561 - 564
  • [37] Case frame constraints for hierarchical phrase-based translation: Japanese-chinese as an example
    School of Computer and Information Technology, Beijing Jiaotong University, China
    不详
    [J]. Commun. Comput. Info. Sci., (123-137):
  • [38] An Empirical Study on Improving Hierarchical Phrase-based Translation Using Alignment Features
    Huang, Songfang
    Zhou, Bowen
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2112 - 2115
  • [39] RECURSIVE NEURAL NETWORK BASED WORD TOPOLOGY MODEL FOR HIERARCHICAL PHRASE-BASED SPEECH TRANSLATION
    Lu, Shixiang
    Wei, Wei
    Fu, Xiaoyin
    Xu, Bo
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [40] Syntactically lexicalized phrase-based SMT
    Hassan, Hany
    Sima'an, Khalil
    Way, Andy
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (07): : 1260 - 1273