Automatically Mining Facets for Queries from Their Search Results

被引:13
|
作者
Dou, Zhicheng [1 ,2 ]
Jiang, Zhengbao [1 ,2 ]
Hu, Sha [1 ,2 ]
Wen, Ji-Rong [1 ,2 ]
Song, Ruihua [3 ]
机构
[1] Renmin Univ China, Beijing Key Lab Big Data Management & Anal Method, Sch Informat, Beijing 100872, Peoples R China
[2] Renmin Univ China, DEKE, Beijing 100872, Peoples R China
[3] Microsoft Res, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Query facet; faceted search; summarization; user intent;
D O I
10.1109/TKDE.2015.2475735
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of finding query facets which are multiple groups of words or phrases that explain and summarize the content covered by a query. We assume that the important aspects of a query are usually presented and repeated in the query's top retrieved documents in the style of lists, and query facets can be mined out by aggregating these significant lists. We propose a systematic solution, which we refer to as QDMiner, to automatically mine query facets by extracting and grouping frequent lists from free text, HTML tags, and repeat regions within top search results. Experimental results show that a large number of lists do exist and useful query facets can be mined by QDMiner. We further analyze the problem of list duplication, and find better query facets can be mined by modeling fine-grained similarities between lists and penalizing the duplicated lists.
引用
收藏
页码:385 / 397
页数:13
相关论文
共 50 条
  • [31] Chapter 13: Liquid Queries and Liquid Results in Search Computing
    Bozzon, Alessandro
    Brambilla, Marco
    Ceri, Stefano
    Fraternali, Piero
    Manolescu, Ioana
    SEARCH COMPUTING: CHALLENGES AND DIRECTIONS, 2010, 5950 : 244 - +
  • [32] Mining Historic Query Trails to Label Long and Rare Search Engine Queries
    Bailey, Peter
    White, Ryen W.
    Liu, Han
    Kumaran, Giridhar
    ACM TRANSACTIONS ON THE WEB, 2010, 4 (04)
  • [33] Framework for Web Content Mining Using Semantic Search and Natural Language Queries
    Shaikh, A. J.
    Kolhe, V. L.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2013, : 280 - 284
  • [34] Improving search results with data mining in a thematic search engine
    Caramia, M
    Felici, G
    Pezzoli, A
    COMPUTERS & OPERATIONS RESEARCH, 2004, 31 (14) : 2387 - 2404
  • [35] You Don't Know Search: Helping Users Find Code by Automatically Evaluating Alternative Queries
    van Tonder, Rijnard
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE, ICSE-SEIP, 2023, : 270 - 280
  • [36] Mining Favorable Facets
    Wong, Raymond Chi-Wing
    Pei, Jian
    Fu, Ada Wai-Chee
    Wang, Ke
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 804 - +
  • [37] Algorithms of mining data records from website automatically
    Qiu, Yong
    Lan, Yongjie
    Journal of Southeast University (English Edition), 2006, 22 (03) : 423 - 425
  • [38] Automatically acquired domain knowledge for ad hoc search: Evaluation results
    Kruschwitz, U
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 525 - 532
  • [39] The effectiveness of automatically structured queries in digital libraries
    Gonçalves, MA
    Fox, EA
    Krowne, A
    Calado, P
    Laender, AHF
    da Silva, AS
    Ribeiro-Neto, B
    JCDL 2004: PROCEEDINGS OF THE FOURTH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES: GLOBAL REACH AND DIVERSE IMPACT, 2004, : 98 - 107
  • [40] Annotating images by mining image search results
    Wang, Xin-Jing
    Zhang, Lei
    Li, Xirong
    Ma, Wei-Ying
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (11) : 1919 - 1932