Temporal-Textual Retrieval: Time and Keyword Search in Web Documents

被引:0
|
作者
Khodaei, Ali [1 ]
Shahabi, Cyrus [1 ,3 ,4 ]
Khodaei, Amir [2 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
[2] Univ Calif Berkeley, Elect Engn & Comp Sci Dept, Berkeley, CA 94720 USA
[3] Univ Southern Calif, Comp Sci & Elect Engn, Los Angeles, CA USA
[4] Univ Southern Calif, NSFs Integrated Media Syst Ctr IMSC, Los Angeles, CA USA
关键词
Web Search; Time-aware ranking; Indexing; Temporal information retrieval;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As the web ages, many web documents become relevant only to certain time periods, such as web-pages containing news and events or those documenting natural phenomena. Hence, to retrieve the most relevant pages, in addition to providing the relevant keywords, one may desire to identify the relevant time period(s) as well, e.g., "Barack Obama 1980-1985". Unfortunately, not much work has been done by industry or academia to support this type of searches. To the best of our knowledge, the only way that some search engines exploit the time information in the user query is to filter out those resulting web pages whose publication/modification time are not within the queried time interval. In this paper, we propose a new indexing and ranking framework for temporal-textual retrieval. The framework leverages the classical vector space model and provides a complete scheme for indexing, query processing and ranking of the temporal-textual queries. We propose a variety of approaches to exploit popular keyword and temporal index structures. We present a novel hybrid index structure which indexes both the temporal and the textual aspects of the documents in a unified, integrated manner. We also study how to rank documents by seamlessly combining their temporal and textual features. We develop a new scoring schema called temporal tf-idf to compute the temporal relevance of a document to a query, and we combine this score with the textual relevance to compute the overall relevance score of the document to the query. We present both a cost model analysis and an extensive set of experiments over real-world datasets (New York Times Annotated Corpus and Freebase) to evaluate the proposed framework and demonstrate its efficiency and effectiveness.
引用
收藏
页码:288 / +
页数:25
相关论文
共 50 条
  • [41] Automatic refinement of keyword annotations for web image search
    Wang, Bin
    Li, Zhiwei
    Li, Mingjing
    ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 : 259 - 268
  • [42] A keyword based prototype for web search result diversification
    Lin, G.-L., 1600, Institute of Information Science (28):
  • [43] Domain-specific web search with keyword spices
    Oyama, S
    Kokubo, T
    Ishida, T
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (01) : 17 - 27
  • [44] A Keyword Based Prototype for Web Search Result Diversification
    Lin, Gu-Li
    Peng, Hong
    Ma, Qian-Li
    Wei, Jia
    Qin, Jiang-Wei
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2012, 28 (03) : 601 - 615
  • [45] Diversified keyword search based web service composition
    Cheng, Huanyu
    Zhong, Ming
    Wang, Jian
    JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 163
  • [46] Exploiting Temporal Information in Retrieval of Archived Documents
    Kanhabua, Nattiya
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 849 - 849
  • [47] Analyzing Temporal Keyword Queries for Interactive Search over Temporal Databases
    Gao, Qiao
    Lee, Mong Li
    Ling, Tok Wang
    Dobbie, Gillian
    Zeng, Zhong
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2018, PT I, 2018, 11029 : 355 - 371
  • [48] Supporting Keyword Search for Image Retrieval with Integration of Probabilistic Annotation
    Zhou, Tie Hua
    Wang, Ling
    Ryu, Keun Ho
    SUSTAINABILITY, 2015, 7 (05) : 6303 - 6320
  • [49] Audio Retrieval Based on Chinese Keyword Search in Relational Databases
    Zhu, Boyan
    Liu, Guang
    Zhu, Liang
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 634 - 637
  • [50] Integration of keyword and feature based search for image retrieval applications
    Vadivel, A
    Sural, SK
    Majumdar, AK
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 570 - 575