A High-Performance Index for Real-Time Matrix Retrieval

被引:0
|
作者
Wen, Zeyi [1 ]
Liang, Mingyu [2 ]
He, Bingsheng [3 ]
Xia, Zexin [4 ]
机构
[1] Univ Western Australia, Crawley, WA 6009, Australia
[2] Cornell Univ, Ithaca, NY 14850 USA
[3] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[4] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
关键词
Real-time systems; Dictionaries; Streaming media; Search problems; Memory management; Indexing; search; matrices; SEARCH;
D O I
10.1109/TKDE.2020.3025925
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fundamental technique in machine learning called "embedding" has made significant impact on data representation. Some examples of embedding include word embedding, image embedding and audio embedding. With the embedding techniques, many real-world objects can be represented using matrices. For example, a document can be represented by a matrix, where each row of the matrix represents a word. On the other hand, we have witnessed that many applications continuously generate new data represented by matrices and require real-time query answering on the data. These continuously generated matrices need to be well managed for efficient retrieval. In this paper, we propose a high-performance index for real-time matrix retrieval. Besides fast query response, the index also supports real-time insertion by exploiting the log-structured merge-tree (LSM-tree). Since the index is built for matrices, it consumes much more memory and requires much more time to search than the traditional index for information retrieval. To tackle the challenges, we propose an index with precise and fuzzy inverted lists, and design a series of novel techniques to improve the memory consumption and the search efficiency of the index. The proposed techniques include vector signature, vector residual sorting, hashing based lookup, and dictionary initialization to guarantee the index quality. Comprehensive experimental results show that our proposed index can support real-time search on matrices, and is more time and memory efficient than the state-of-the-art method.
引用
收藏
页码:3044 / 3056
页数:13
相关论文
共 50 条
  • [41] A HIGH-PERFORMANCE SYSTEM FOR REAL-TIME VIDEO IMAGE COMPRESSION APPLICATIONS
    WU, CM
    PERNG, DJ
    CHENG, WT
    HO, JS
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1995, 41 (01) : 125 - 131
  • [42] HIGH-PERFORMANCE SBCS DIVERSIFY ALONG UNIX, REAL-TIME LINES
    ANDREWS, W
    COMPUTER DESIGN, 1989, 28 (18): : 17 - 17
  • [43] Policies and patterns for high-performance, real-time object request brokers
    Schmidt, D
    DOA'01: 3RD INTERNATIONAL SYMPOSIUM ON DISTRIBUTED OBJECTS & APPLICATIONS, PROCEEDINGS, 2001, : 357 - 358
  • [44] DESIGN of a spaceborne high-performance and real-time image processing platform
    Pan Zheng
    Feng Xingtai
    Peng Chengxiang
    AOPC 2022: OPTICAL SENSING, IMAGING, AND DISPLAY TECHNOLOGY, 2022, 12557
  • [45] A co-design methodology for high-performance real-time systems
    Badawy, Wael
    Kumar, Ashok
    Bayoumi, Magdy
    Canadian Journal of Electrical and Computer Engineering, 2001, 26 (3-4) : 141 - 146
  • [46] High-Performance and Lightweight Real-Time Deep Face Emotion Recognition
    Schwan, Justus
    Ghaleb, Esam
    Hortal, Enrique
    Asteriadis, Stylianos
    2017 12TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP 2017), 2017, : 76 - 79
  • [47] HIGH-PERFORMANCE I/O PROCESSORS FOR REAL-TIME PULSE HANDLING
    TACHIBANA, M
    KONDO, Y
    YAMADA, Y
    TAKAHASHI, M
    TAGO, H
    PROCEEDINGS - IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN : VLSI IN COMPUTERS & PROCESSORS, 1989, : 415 - 418
  • [48] A protocol and correctness proofs for real-time high-performance broadcast networks
    Hermant, JF
    Le Lann, G
    18TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 1998, : 360 - 369
  • [49] Policies and patterns for high-performance, real-time object request brokers
    Schmidt, D
    LCN 2001: 26TH ANNUAL IEEE CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, 2001, : 2 - 2
  • [50] High-confidence control: Ensuring reliability in high-performance real-time systems
    Samad, T
    2002 FIRST INTERNATIONAL IEEE SYMPOSIUM INTELLIGENT SYSTEMS, VOL 1, PROCEEDINGS, 2002, : 4 - 9