Indexing for linear model-based information retrieval

被引:0
|
作者
Chang, YC [1 ]
Li, CS [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes the Onion technique, a special indexing structure for linear optimization queries. Linear optimization queries ask for top-N records subject to the maximization or minimisation of linearly weighted sum of record attribute values. Such query appears In many applications employing linear models and Is an effective way to summarise representative cases, such as the top-SO ranked colleges. The Onion indexing is based on a geometric property of convex hull, which guarantees that the optimal value can always be found at one or more of its vertices. The Onion indexing makes use of this property to construct convex hulls In layers with outer layers enclosing inner layers geometrically. A data record Is Indexed by its layer number or equivalently Its depth in the layered convex hull. Queries with linear weightings issued at run time are evaluated from the outmost layer inwards. We show experimentally that the Onion indexing achieves orders of magnitude speedup against sequential linear scan when N is small compared to the cardinality of the set. The Onion technique also enables progressive retrieval, which processes and returns ranked results in a progressive manner. Furthermore, the proposed indexing can be extended into a hierarchical organisation of data to accommodate both global and local queries.
引用
收藏
页码:359 / 362
页数:4
相关论文
共 50 条
  • [41] The content of model-based information
    Raphael van Riel
    Synthese, 2015, 192 : 3839 - 3858
  • [42] Design of New Indexing Techniques Based on Ontology for Information Retrieval Systems
    Saruladha, K.
    Aghila, G.
    Penchala, Sathish Kumar
    INFORMATION AND COMMUNICATION TECHNOLOGIES, 2010, 101 : 287 - +
  • [43] DL-VSM based document indexing approach for information retrieval
    Boukhari, Kabil
    Omri, Mohamed Nazih
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 14 (5) : 5383 - 5394
  • [44] DL-VSM based document indexing approach for information retrieval
    Kabil Boukhari
    Mohamed Nazih Omri
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 5383 - 5394
  • [45] Using WordNet for Concept-Based Document Indexing in Information Retrieval
    Boubekeur, Fatiha
    Boughanem, Mohand
    Tamine, Lynda
    Daoud, Mariam
    SEMAPRO 2010: THE FOURTH INTERNATIONAL CONFERENCE ON ADVANCES IN SEMANTIC PROCESSING, 2010, : 151 - 157
  • [46] Language model based temporal information indexing
    Bassara, Andrzej
    BUSINESS INFORMATION SYSTEMS, 2008, 7 : 24 - 35
  • [47] Language model-based retrieval for Farsi documents
    Taghva, K
    Coombs, J
    Pareda, R
    Nartker, T
    ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 2, PROCEEDINGS, 2004, : 13 - 17
  • [48] Vector model based indexing and retrieval of handwritten medical forms
    Cao, Huaigu
    Govindaraju, Venu
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 88 - 92
  • [49] Image indexing and similarity retrieval based on spatial relation model
    Wang, YH
    PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 980 - 983