Non-Speculative Store Coalescing in Total Store Order

Cited by: 8
Authors
Ros, Alberto [1]
Kaxiras, Stefanos [2]
Affiliations
[1] Univ Murcia, Dept Comp Engn, Murcia, Spain
[2] Uppsala Univ, Dept Informat Technol, Uppsala, Sweden
Funding
Swedish Research Council
Keywords
store coalescing; memory consistency; store buffer; deadlock free; lexicographical order
DOI
10.1109/ISCA.2018.00028
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
We present a non-speculative solution for a coalescing store buffer in total store order (TSO) consistency. Coalescing violates TSO with respect to both conflicting loads and conflicting stores, if partial state is exposed to the memory system. Proposed solutions for coalescing in TSO resort to speculation-and-rollback or centralized arbitration to guarantee atomicity for the set of stores whose order is affected by coalescing. These solutions can suffer from scalability, complexity, resource-conflict deadlock, and livelock problems. A non-speculative solution that writes out coalesced cachelines, one at a time, over a typical directory-based MESI coherence layer, has the potential to transcend these problems if it can guarantee absence of deadlock in a practical way. There are two major problems for a non-speculative coalescing store buffer: i) how to present to the memory system a group of coalesced writes as atomic, and ii) how to not deadlock while attempting to do so. For this, we introduce a new lexicographical order. Relying on this order, conflicting atomic groups of coalesced writes can be individually performed per cache block, without speculation, rollback, or replay, and without deadlock or livelock, yet appear atomic to conflicting parties and preserve TSO. One of our major contributions is to show that lexicographical orders based on a small part of the physical address (sub-address order) are deadlock-free throughout the system when taking into account resource-conflict deadlocks. Our approach exceeds the performance and energy benefits of two baseline TSO store buffers and matches the coalescing (and energy savings) of a release-consistency store buffer, at comparable cost.
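The ordering idea in the abstract can be illustrated with a minimal sketch. It shows only the general principle that if every core writes out its coalesced cache blocks in one globally agreed order, two conflicting groups meet their shared blocks in the same relative order. It is not the paper's mechanism: the 64-byte block size, the 4-bit key width, the helper names (`block_of`, `sub_address`, `write_order`), and the tie-break on the full block number are all assumptions made for illustration.

```python
from typing import Iterable, List

BLOCK_BITS = 6  # assumption: 64-byte cache blocks
SUB_BITS = 4    # assumption: width of the hypothetical sub-address key


def block_of(addr: int) -> int:
    """Return the cache-block number that contains a byte address."""
    return addr >> BLOCK_BITS


def sub_address(block: int) -> int:
    """Hypothetical sub-address: the low-order bits of the block number."""
    return block & ((1 << SUB_BITS) - 1)


def write_order(store_addrs: Iterable[int]) -> List[int]:
    """Coalesce stores per block, then order blocks by (sub-address, block).

    The tie-break on the full block number is an assumption of this sketch,
    used only to make the order total.
    """
    blocks = {block_of(a) for a in store_addrs}  # coalescing: one entry per block
    return sorted(blocks, key=lambda b: (sub_address(b), b))
```

For example, under these assumptions the stores to `0x1040`, `0x1008`, `0x1044`, and `0x2000` coalesce into three blocks, and any other core whose group also contains the blocks of `0x2000` and `0x1040` would visit them in the same relative order.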
Pages: 221-234
Number of pages: 14