A High-Performance Index for Real-Time Matrix Retrieval

Cited by: 0
Authors
Wen, Zeyi [1]
Liang, Mingyu [2]
He, Bingsheng [3]
Xia, Zexin [4]
Affiliations
[1] Univ Western Australia, Crawley, WA 6009, Australia
[2] Cornell Univ, Ithaca, NY 14850 USA
[3] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[4] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
Keywords
Real-time systems; Dictionaries; Streaming media; Search problems; Memory management; Indexing; search; matrices
DOI
10.1109/TKDE.2020.3025925
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Embedding, a fundamental technique in machine learning, has had a significant impact on data representation. Examples include word, image, and audio embeddings. With embedding techniques, many real-world objects can be represented as matrices. For example, a document can be represented by a matrix in which each row represents a word. At the same time, many applications continuously generate new data represented by matrices and require real-time query answering over that data. These continuously generated matrices need to be well managed for efficient retrieval. In this paper, we propose a high-performance index for real-time matrix retrieval. Besides fast query response, the index supports real-time insertion by exploiting the log-structured merge-tree (LSM-tree). Because the index is built for matrices, it consumes much more memory and requires much more search time than a traditional index for information retrieval. To tackle these challenges, we propose an index with precise and fuzzy inverted lists, and design a series of novel techniques to reduce the memory consumption and improve the search efficiency of the index. The proposed techniques include vector signatures, vector residual sorting, hashing-based lookup, and dictionary initialization to guarantee index quality. Comprehensive experimental results show that the proposed index supports real-time search on matrices and is more time- and memory-efficient than the state-of-the-art method.
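The general idea in the abstract, inverted lists over matrix rows combined with LSM-style buffered insertion, can be illustrated with a toy sketch. This is not the authors' index: the class name `ToyMatrixIndex`, the helper `nearest_codeword`, the `buffer_limit` parameter, and the scoring rule (counting shared codewords) are all assumptions made for illustration. Vector signatures, residual sorting, fuzzy lists, and segment merging are omitted.

```python
from collections import defaultdict


def nearest_codeword(vec, dictionary):
    """Return the index of the dictionary vector closest to vec (squared L2)."""
    return min(
        range(len(dictionary)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(vec, dictionary[i])),
    )


class ToyMatrixIndex:
    """Hypothetical toy index: codeword id -> set of matrix ids.

    New matrices go into a mutable in-memory buffer; once the buffer holds
    `buffer_limit` matrices it is frozen into an immutable segment, which
    loosely mimics the flush step of an LSM-tree (compaction is omitted).
    """

    def __init__(self, dictionary, buffer_limit=2):
        self.dictionary = dictionary
        self.buffer_limit = buffer_limit
        self.buffer = defaultdict(set)  # mutable inverted lists
        self.buffer_count = 0           # matrices currently in the buffer
        self.segments = []              # flushed, read-only inverted lists

    def insert(self, matrix_id, matrix):
        # Post the matrix id to the inverted list of each row's codeword.
        for row in matrix:
            self.buffer[nearest_codeword(row, self.dictionary)].add(matrix_id)
        self.buffer_count += 1
        if self.buffer_count >= self.buffer_limit:
            self.segments.append(dict(self.buffer))  # flush the buffer
            self.buffer = defaultdict(set)
            self.buffer_count = 0

    def search(self, query, top_k=1):
        # Score each matrix by how many query rows share a codeword with it.
        scores = defaultdict(int)
        for row in query:
            code = nearest_codeword(row, self.dictionary)
            for lists in self.segments + [self.buffer]:
                for matrix_id in lists.get(code, ()):
                    scores[matrix_id] += 1
        ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        return [matrix_id for matrix_id, _ in ranked[:top_k]]
```

In this sketch a newly inserted matrix is searchable immediately because queries consult the buffer as well as the flushed segments, which is the property that makes an LSM-style layout attractive for real-time insertion.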
Pages: 3044-3056
Number of pages: 13