Automatically Mining Facets for Queries from Their Search Results

Cited by: 13
Authors
Dou, Zhicheng [1 ,2 ]
Jiang, Zhengbao [1 ,2 ]
Hu, Sha [1 ,2 ]
Wen, Ji-Rong [1 ,2 ]
Song, Ruihua [3 ]
Affiliations
[1] Renmin Univ China, Beijing Key Lab Big Data Management & Anal Method, Sch Informat, Beijing 100872, Peoples R China
[2] Renmin Univ China, DEKE, Beijing 100872, Peoples R China
[3] Microsoft Res, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Query facet; faceted search; summarization; user intent
DOI
10.1109/TKDE.2015.2475735
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We address the problem of finding query facets, which are multiple groups of words or phrases that explain and summarize the content covered by a query. We assume that the important aspects of a query are usually presented and repeated as lists in the query's top retrieved documents, and that query facets can be mined by aggregating these significant lists. We propose a systematic solution, which we refer to as QDMiner, to automatically mine query facets by extracting and grouping frequent lists from free text, HTML tags, and repeat regions within top search results. Experimental results show that a large number of lists do exist and that useful query facets can be mined by QDMiner. We further analyze the problem of list duplication and find that better query facets can be mined by modeling fine-grained similarities between lists and penalizing duplicated lists.
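The aggregation idea in the abstract can be illustrated with a small sketch. This is not QDMiner itself: it assumes the lists have already been extracted from each document, and it swaps in a simple greedy grouping by Jaccard overlap in place of the paper's extraction, weighting, and clustering steps. The function names, the `min_sim` threshold, and the sample data are all hypothetical. Ranking a group by the number of distinct documents that support it is a crude stand-in for penalizing duplicated lists.

```python
from collections import Counter


def jaccard(a, b):
    """Jaccard similarity between two collections of items."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def mine_facets(doc_lists, min_sim=0.3):
    """Group similar lists from several documents into candidate facets.

    doc_lists maps a document id to the lists (of item strings) found in it.
    min_sim is a hypothetical overlap threshold, not a value from the paper.
    """
    clusters = []  # each cluster: {"items": Counter, "docs": set of doc ids}
    for doc_id, lists in doc_lists.items():
        for lst in lists:
            items = {s.strip().lower() for s in lst if s.strip()}
            if len(items) < 2:
                continue  # a single item is not a useful list
            # Greedy step: find the existing cluster with the highest overlap.
            best, best_sim = None, 0.0
            for cluster in clusters:
                sim = jaccard(items, cluster["items"].keys())
                if sim > best_sim:
                    best, best_sim = cluster, sim
            if best is not None and best_sim >= min_sim:
                best["items"].update(items)
                best["docs"].add(doc_id)
            else:
                clusters.append({"items": Counter(items), "docs": {doc_id}})
    # Rank clusters by how many distinct documents support them.
    clusters.sort(key=lambda c: len(c["docs"]), reverse=True)
    # Within a cluster, order items by frequency, then alphabetically.
    return [
        [item for item, _ in sorted(c["items"].items(),
                                    key=lambda kv: (-kv[1], kv[0]))]
        for c in clusters
    ]


# Hypothetical lists extracted from three top results for a query like "watches".
example = {
    "d1": [["Men", "Women", "Kids"], ["Gold", "Silver"]],
    "d2": [["men", "women"], ["gold", "silver", "steel"]],
    "d3": [["Women", "Kids", "Men"]],
}
facets = mine_facets(example)
for facet in facets:
    print(facet)
```

Each printed line is one candidate facet, with the items that recur across the most lists placed first.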
Pages: 385-397
Page count: 13