A High-Performance Index for Real-Time Matrix Retrieval

Authors
Wen, Zeyi [1]
Liang, Mingyu [2]
He, Bingsheng [3]
Xia, Zexin [4]
Affiliations
[1] Univ Western Australia, Crawley, WA 6009, Australia
[2] Cornell Univ, Ithaca, NY 14850 USA
[3] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[4] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
Keywords
Real-time systems; Dictionaries; Streaming media; Search problems; Memory management; Indexing; Search; Matrices
DOI
10.1109/TKDE.2020.3025925
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A fundamental machine learning technique called "embedding" has had a significant impact on data representation. Examples include word embedding, image embedding, and audio embedding. With embedding techniques, many real-world objects can be represented as matrices. For example, a document can be represented by a matrix in which each row represents a word. At the same time, many applications continuously generate new data represented by matrices and require real-time query answering on that data. These continuously generated matrices need to be well managed for efficient retrieval. In this paper, we propose a high-performance index for real-time matrix retrieval. Besides fast query response, the index also supports real-time insertion by exploiting the log-structured merge-tree (LSM-tree). Since the index is built for matrices, it consumes much more memory and requires much more time to search than a traditional index for information retrieval. To tackle these challenges, we propose an index with precise and fuzzy inverted lists, and design a series of novel techniques to reduce the memory consumption and improve the search efficiency of the index. The proposed techniques include vector signature, vector residual sorting, hashing-based lookup, and dictionary initialization to guarantee the index quality. Comprehensive experimental results show that our proposed index can support real-time search on matrices, and is more time and memory efficient than the state-of-the-art method.
Pages: 3044-3056
Number of pages: 13