Indexing for linear model-based information retrieval

被引:0
|
作者
Chang, YC [1 ]
Li, CS [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes the Onion technique, a special indexing structure for linear optimization queries. Linear optimization queries ask for top-N records subject to the maximization or minimisation of linearly weighted sum of record attribute values. Such query appears In many applications employing linear models and Is an effective way to summarise representative cases, such as the top-SO ranked colleges. The Onion indexing is based on a geometric property of convex hull, which guarantees that the optimal value can always be found at one or more of its vertices. The Onion indexing makes use of this property to construct convex hulls In layers with outer layers enclosing inner layers geometrically. A data record Is Indexed by its layer number or equivalently Its depth in the layered convex hull. Queries with linear weightings issued at run time are evaluated from the outmost layer inwards. We show experimentally that the Onion indexing achieves orders of magnitude speedup against sequential linear scan when N is small compared to the cardinality of the set. The Onion technique also enables progressive retrieval, which processes and returns ranked results in a progressive manner. Furthermore, the proposed indexing can be extended into a hierarchical organisation of data to accommodate both global and local queries.
引用
收藏
页码:359 / 362
页数:4
相关论文
共 50 条
  • [1] Click Model-Based Information Retrieval Metrics
    Chuklin, Aleksandr
    Serdyukov, Pavel
    de Rijke, Maarten
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 493 - 502
  • [2] Information Filtering and Query Indexing for an Information Retrieval Model
    Tryfonopoulos, Christos
    Koubarakis, Manolis
    Drougas, Yannis
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2009, 27 (02)
  • [3] An information retrieval terminology for model-based risk assessment
    Thunem, APJ
    Fredriksen, R
    Gran, BA
    PROBABILISTIC SAFETY ASSESSMENT AND MANAGEMENT, VOL 1- 6, 2004, : 418 - 423
  • [4] A model-based approach to information retrieval systems development
    Ferreira, Joao
    Silva, Alberto
    Delgado, Jose
    Proceedings of the 10th IASTED International Conference on Software Engineering and Applications, 2006, : 459 - 464
  • [5] An Information Retrieval Model Based on Latent Semantic Indexing with Intelligent Preprocessing
    Kumar, Ch.
    Gupta, Ankush
    Batool, Mahmooda
    Trehan, Shagun
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2005, 4 (04) : 279 - 285
  • [6] A subspace-based model for latent semantic indexing in information retrieval
    Zha, HY
    Simon, H
    DIMENSION REDUCTION, COMPUTATIONAL COMPLEXITY AND INFORMATION, 1998, 30 : 315 - 320
  • [7] A model-based approach to semantic-based retrieval of visual information
    Golshani, F
    Park, Y
    Panchanathan, S
    SOFSEM 2002: THEORY AND PRACTICE OF INFORMATICS, 2002, 2540 : 149 - 167
  • [8] Model-based classification of visual information for content-based retrieval
    Jaimes, A
    Chang, SF
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES VII, 1998, 3656 : 402 - 414
  • [9] An indexing matrix based retrieval model
    Zhao, Xu
    Jiang, Zongli
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 1001 - +
  • [10] Generative Model-Based MetaSearch for Data Fusion in Information Retrieval
    Efron, Miles
    JCDL 09: PROCEEDINGS OF THE 2009 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, 2009, : 153 - 162