Constrained Channel Capacity for DNA-Based Data Storage Systems

被引:2
|
作者
Fan, Kaixin [1 ]
Wu, Huaming [1 ]
Yan, Zihui [1 ]
机构
[1] Tianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
基金
国家重点研发计划;
关键词
DNA-based storage systems; constrained channels; channel capacity; CODES;
D O I
10.1109/LCOMM.2022.3212200
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Deoxyribonucleic acid (DNA)-based data storage has grown rapidly due to its advantages with the increase in infrequently large amounts of data. However, when the maximum homopolymer runlength (RLL) of the DNA strand is large and the GC-content is either too high or too low, the DNA synthesis and sequencing processes are prone to substitution, deletion and insertion errors. To reduce errors in DNA synthesis and sequencing, we require that the DNA storage channel satisfies both k-RLL and strong-(l,d)-locally-GC-balanced constraints, where the former refers to the maximum homopolymer runlength in each sequence is at most k, and the latter refers to the number of G and C of every length-(l' >= l) subsequence is bounded between [ (2)/(l') - delta,(2)/(l') + delta]. This constrained channel allows DNA data storage system to be less prone to errors during synthesis and sequencing and improves the success rate of Polymerase Chain Reaction (PCR) amplification. We propose a method to calculate the channel capacity. In particular, we provide a relationship between the 4-ary constrained channel capacity and the 2-ary constrained channel capacity, which makes it simpler to calculate the 4-ary constrained channel capacity.
引用
收藏
页码:70 / 74
页数:5
相关论文
共 50 条
  • [31] FrameD: framework for DNA-based data storage design, verification, and validation
    Volkel, Kevin D.
    Lin, Kevin N.
    Hook, Paul W.
    Timp, Winston
    Keung, Albert J.
    Tuck, James M.
    BIOINFORMATICS, 2023, 39 (10)
  • [32] Promiscuous molecules for smarter file operations in DNA-based data storage
    Kyle J. Tomek
    Kevin Volkel
    Elaine W. Indermaur
    James M. Tuck
    Albert J. Keung
    Nature Communications, 12
  • [33] Efficient DNA-based data storage using shortmer combinatorial encoding
    Preuss I.
    Rosenberg M.
    Yakhini Z.
    Anavy L.
    Scientific Reports, 14 (1)
  • [34] Evolutionary approach to construct robust codes for DNA-based data storage
    Rasool, Abdur
    Jiang, Qingshan
    Wang, Yang
    Huang, Xiaoluo
    Qu, Qiang
    Dai, Junbiao
    FRONTIERS IN GENETICS, 2023, 14
  • [35] Plenty of Room at at Bottom: Ten Years of DNA-Based Data Storage
    Kiah, Han Mao
    Siegel, Paul H.
    Yaakobi, Eitan
    IEEE TRANSACTIONS ON MOLECULAR BIOLOGICAL AND MULTI-SCALE COMMUNICATIONS, 2024, 10 (02): : 249 - 252
  • [36] DNA-Based Storage: Trends and Methods
    Yazdi, S. M. Hossein Tabatabaei
    Kiah, Han Mao
    Garcia-Ruiz, Eva
    Ma, Jian
    Zhao, Huimin
    Milenkovic, Olgica
    IEEE Transactions on Molecular, Biological, and Multi-Scale Communications, 2015, 1 (03): : 230 - 248
  • [37] Efficient DNA-based data storage using shortmer combinatorial encoding
    Preuss, Inbal
    Rosenberg, Michael
    Yakhini, Zohar
    Anavy, Leon
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [38] Promiscuous molecules for smarter file operations in DNA-based data storage
    Tomek, Kyle J.
    Volkel, Kevin
    Indermaur, Elaine W.
    Tuck, James M.
    Keung, Albert J.
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [39] On Single-Error-Detecting Codes for DNA-Based Data Storage
    Weber, Jos H.
    de Groot, Joost A. M.
    van Leeuwen, Charlot J.
    IEEE COMMUNICATIONS LETTERS, 2021, 25 (01) : 41 - 44
  • [40] A DNA-Based Archival Storage System
    Bornhol, James
    Lopez, Randolph
    Carmean, Douglas M.
    Ceze, Luis
    Seelig, Georg
    Strauss, Karin
    ACM SIGPLAN NOTICES, 2016, 51 (04) : 637 - 649