Indexing for linear model-based information retrieval

被引:0
|
作者
Chang, YC [1 ]
Li, CS [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes the Onion technique, a special indexing structure for linear optimization queries. Linear optimization queries ask for top-N records subject to the maximization or minimisation of linearly weighted sum of record attribute values. Such query appears In many applications employing linear models and Is an effective way to summarise representative cases, such as the top-SO ranked colleges. The Onion indexing is based on a geometric property of convex hull, which guarantees that the optimal value can always be found at one or more of its vertices. The Onion indexing makes use of this property to construct convex hulls In layers with outer layers enclosing inner layers geometrically. A data record Is Indexed by its layer number or equivalently Its depth in the layered convex hull. Queries with linear weightings issued at run time are evaluated from the outmost layer inwards. We show experimentally that the Onion indexing achieves orders of magnitude speedup against sequential linear scan when N is small compared to the cardinality of the set. The Onion technique also enables progressive retrieval, which processes and returns ranked results in a progressive manner. Furthermore, the proposed indexing can be extended into a hierarchical organisation of data to accommodate both global and local queries.
引用
收藏
页码:359 / 362
页数:4
相关论文
共 50 条
  • [31] ON RELEVANCE, PROBABILISTIC INDEXING AND INFORMATION RETRIEVAL
    MARON, ME
    KUHNS, JL
    JOURNAL OF THE ACM, 1960, 7 (03) : 216 - 244
  • [32] Term indexing in information retrieval systems
    Dvorsky, J
    Krátky, M
    Skopal, T
    Snásel, V
    CIC'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN COMPUTING, 2003, : 263 - 270
  • [33] A contribution to indexing in legal information retrieval
    Legrand, J
    EIGHTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 1997, : 213 - 218
  • [34] Vector Space Model for Arabic Information Retrieval - Application to "Hadith" Indexing
    Harrag, Fouzi
    Hamdi-Cherif, Aboubekeur
    El-Qawasmeh, Eyas
    2008 FIRST INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES, VOLS 1 AND 2, 2008, : 114 - +
  • [35] Comparison Probabilistic Latent Semantic Indexing Model In Chinese Information Retrieval
    Xie Fang
    Liu Xiaoguang
    Hu Quan
    2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 3, PROCEEDINGS, 2009, : 559 - +
  • [36] A neural network model for information retrieval using latent semantic indexing
    Syu, I
    Lang, SD
    Deo, N
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 1318 - 1323
  • [37] ON THE USE OF THE DEMPSTER-SHAFER MODEL IN INFORMATION INDEXING AND RETRIEVAL APPLICATIONS
    SCHOCKEN, S
    HUMMEL, RA
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1993, 39 (05): : 843 - 879
  • [38] Model-based linear clustering
    Yan, Guohua
    Welch, William J.
    Zamar, Ruben H.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (04): : 716 - 737
  • [39] The content of model-based information
    van Riel, Raphael
    SYNTHESE, 2015, 192 (12) : 3839 - 3858
  • [40] MODEL-BASED INFORMATION ACCESS
    JAGANATHAN, V
    KARINTHI, R
    ALMASI, G
    INTERNATIONAL JOURNAL OF INTELLIGENT & COOPERATIVE INFORMATION SYSTEMS, 1994, 3 (02): : 107 - 127