A Hive-Based Retrieval Optimization Scheme for Long-Term Storage of Massive Call Detail Records

被引:1
|
作者
Peng, Xi [1 ,2 ]
Liu, Liang [2 ]
Zhang, Lei [2 ]
机构
[1] China Acad Telecommun Technol, Grad Fac, Beijing 100191, Peoples R China
[2] Sichuan Univ, Coll Cybersecur, Chengdu 610065, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Bucketing; call detail records; hash storage; long-term storage; MOBILITY;
D O I
10.1109/ACCESS.2019.2961692
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the dramatic rise of mobile internet users and the administrative requirements of long-term data retention, telecom providers are facing increasingly challenging storage and retrieval issues of call detail records (CDRs). The existing storage system can only achieve the requirement of online query and offline analysis of the CDRs. However, to the best of our knowledge, few studies have focused on the topic of CDRs retrieval optimization with long-term storage. In order to improve the retrieval speed while ensuring a high compression ratio, in this paper we propose a novel hash storage scheme, termed dual-column bucketing (DCB), based on the Hive platform by making use of its Bucketing nature. Compared to the conventional scheme, the proposed DCB scheme can improve the performance both for CDRs compression and query. Second, similar storage scenarios such as storage of SMS, email and extended detail records (XDRs) are included in the optimization scope of the DCB. Experiments on real-world CDRs show that in contrast to the conventional scheme, the proposed DCB scheme can save the storage space by approximately 400025;, reduces the amount of disk read to 20025;, and improve the retrieval speed of known phone number queries by up to seven times.
引用
收藏
页码:431 / 444
页数:14
相关论文
共 50 条
  • [1] ENCODING AND RETRIEVAL FROM LONG-TERM STORAGE
    WOOD, G
    PENNINGTON, J
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1973, 99 (02): : 243 - 254
  • [2] STORAGE AND RETRIEVAL PROCESSES IN LONG-TERM MEMORY
    SHIFFRIN, RM
    ATKINSON, RC
    [J]. PSYCHOLOGICAL REVIEW, 1969, 76 (02) : 179 - &
  • [3] RETRIEVAL PROCESSES FOR ORGANIZED LONG-TERM STORAGE
    SEAMON, JG
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1973, 97 (02): : 170 - 176
  • [4] MoPS: A Modular Protection Scheme for Long-Term Storage
    Weinert, Christian
    Demirel, Denise
    Vigil, Martin
    Geihs, Matthias
    Buchmann, Johannes
    [J]. PROCEEDINGS OF THE 2017 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIA CCS'17), 2017, : 436 - 448
  • [5] MICROFICHE AS A MEDIUM FOR THE LONG-TERM STORAGE OF LABORATORY COMPUTER RECORDS
    MCVITTIE, JD
    WHITEHOUSE, C
    WILKINSON, RH
    [J]. JOURNAL OF CLINICAL PATHOLOGY, 1981, 34 (01) : 49 - 53
  • [6] Pictorial detail provide conceptual hooks allowing for massive pictorial long-term memory
    Evans, K. K.
    Baddeley, A.
    [J]. PERCEPTION, 2014, 43 (01) : 101 - 101
  • [7] Archive storage system design for long-term storage of massive amounts of data
    Bradshaw, P. L.
    Brannon, K. W.
    Clark, T.
    Dahman, K.
    Doraiswarny, S.
    Duyanovich, L.
    Hillsberg, B. L.
    Hineman, W.
    Kaczmarski, M.
    Klingenberg, B. J.
    Ma, X.
    Rees, R.
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2008, 52 (4-5) : 379 - 388
  • [8] Long-term Optimization for the Extension Scheme of Island Microgrids
    Chen, Yumin
    Dong, Chaoyu
    Koh, Leong Hai
    Wang, Peng
    Xu, Yan
    Xia, Yang
    Jatin, Verma
    [J]. PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 2842 - 2847
  • [9] Optimization Scheme of Massive Meteorological Data Storage Based on OpenStack Swift
    Xue, Shuangqing
    Wen, Chengyu
    Zhang, Xiaoli
    Wang, Zhuo
    [J]. 2020 12TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2020), 2020, : 302 - 306
  • [10] EVALUATION OF PARENTS BASED ON LONG-TERM SELECTION RECORDS
    TAI, GCC
    JUI, PY
    YOUNG, DA
    [J]. ZEITSCHRIFT FUR PFLANZENZUCHTUNG-JOURNAL OF PLANT BREEDING, 1986, 96 (01): : 39 - 46