New Hardware Architectures for Montgomery Modular Multiplication Algorithm

被引:68
|
作者
Huang, Miaoqing [1 ]
Gaj, Kris [2 ]
El-Ghazawi, Tarek [3 ]
机构
[1] 1 Univ Arkansas, Dept Comp Sci & Comp Engn, Fayetteville, AR 72701 USA
[2] George Mason Univ, Dept Elect & Comp Engn, Fairfax, VA 22030 USA
[3] George Washington Univ, Dept Elect & Comp Engn, Washington, DC 20052 USA
关键词
Montgomery multiplication; MWR2MM algorithm; hardware optimization; field-programmable gate arrays;
D O I
10.1109/TC.2010.247
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Montgomery modular multiplication is one of the fundamental operations used in cryptographic algorithms, such as RSA and Elliptic Curve Cryptosystems. At CHES 1999, Tenca and Koc proposed the Multiple-Word Radix-2 Montgomery Multiplication (MWR2MM) algorithm and introduced a now-classic architecture for implementing Montgomery multiplication in hardware. With parameters optimized for minimum latency, this architecture performs a single Montgomery multiplication in approximately 2n clock cycles, where n is the size of operands in bits. In this paper, we propose two new hardware architectures that are able to perform the same operation in approximately n clock cycles with almost the same clock period. These two architectures are based on precomputing partial results using two possible assumptions regarding the most significant bit of the previous word. These two architectures outperform the original architecture of Tenca and Koc, in terms of the product latency times area by 23 and 50 percent, respectively, for several most common operand sizes used in cryptography. The architecture in radix-2 can be extended to the case of radix-4, while preserving a factor of two speedup over the corresponding radix-4 design by Tenca, Todorov, and Koc from CHES 2001. Our optimization has been verified by modeling it using Verilog-HDL, implementing it on Xilinx Virtex-II 6000 FPGA, and experimentally testing it using SRC-6 reconfigurable computer.
引用
收藏
页码:923 / 936
页数:14
相关论文
共 50 条
  • [21] Secure Systolic Architecture for Montgomery Modular Multiplication Algorithm
    Yang, Qi
    Hu, Xiaoting
    Qin, Zhongping
    [J]. 2013 IEEE 10TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2013,
  • [22] Optimisation of Montgomery modular multiplication algorithm for systolic arrays
    Shin, JB
    Kim, J
    Hyung, LK
    [J]. ELECTRONICS LETTERS, 1998, 34 (19) : 1830 - 1831
  • [23] New systolic modular multiplication architecture for efficient Montgomery multiplication
    Choi, Se-Hyu
    Lee, Keon-Jik
    [J]. IEICE ELECTRONICS EXPRESS, 2015, 12 (02):
  • [24] A new Montgomery modular multiplication algorithm and its VLSI design for RSA cryptosystem
    Chen, Jun-Hong
    Wu, Haw-Shiuan
    Shieh, Ming-Der
    Lin, Wen-Ching
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 3780 - 3783
  • [25] Reconfigurable hardware implementation of Montgomery modular multiplication and parallel binary exponentiation
    Nedjah, N
    Mourelle, LD
    [J]. EUROMICRO SYMPOSIUM ON DIGITAL SYSTEM DESIGN, PROCEEDINGS: ARCHITECTURES, METHODS AND TOOLS, 2002, : 226 - 233
  • [26] Two hardware implementations for the Montgomery modular multiplication: Sequential versus parallel
    Nedjah, N
    Mourelle, LD
    [J]. 15TH SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, PROCEEDINGS, 2002, : 3 - 8
  • [27] Architectures for Montgomery's multiplication
    Nibouche, O
    Bouridane, A
    Nibouche, M
    [J]. IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 2003, 150 (06): : 361 - 368
  • [28] Early-Word-Based Montgomery Modular Multiplication Algorithm
    Verma, Rupali
    Dutta, Maitreyee
    Vig, Renu
    [J]. 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015, 2015, : 595 - 600
  • [29] Montgomery modular multiplication algorithm on multi-core systems
    Fan, Junfeng
    Sakiyama, Kazuo
    Verbauwhede, Ingrid
    [J]. 2007 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS, VOLS 1 AND 2, 2007, : 261 - 266
  • [30] A scalable architecture for modular multiplication based on Montgomery's algorithm
    Tenca, AF
    Koç, ÇK
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2003, 52 (09) : 1215 - 1221