Testing probabilistic equivalence through Reinforcement Learning

Cited by: 1
Authors
Desharnais, Josee [1]
Laviolette, Francois [1]
Zhioua, Sami [2]
Affiliations
[1] Univ Laval, Quebec City, PQ G1K 7P4, Canada
[2] King Fahd Univ Petr & Minerals, ICS, Dhahran 31261, Saudi Arabia
Keywords
Verification; Stochastic systems; Markov processes; Distance; Divergence; Reinforcement Learning; Testing; Equivalence relations; Bisimulation; Difference
DOI
10.1016/j.ic.2013.02.002
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Checking if a given system implementation respects its specification is often done by proving that the two are "equivalent". The equivalence is chosen, in particular, for its computability and of course for its meaning, that is, for its adequacy with what is observable from the two systems (implementation and specification). Trace equivalence is easily testable (decidable from interaction), but often considered too weak; in contrast, bisimulation is accepted as the canonical equivalence for interaction, but it is not testable. Richer than an equivalence is a form of distance: it is zero between equivalent systems, and it provides an estimation of their difference if the systems are not equivalent. Our main contribution is to define such a distance in a context where (1) the two systems to be compared have a stochastic behavior; (2) the model of one of them (e.g., the implementation) is unknown, hence our only knowledge is obtained by interacting with it; (3) consequently the target equivalence (observed when distance is zero) must be testable. To overcome the problem that the model is unknown, we use a Reinforcement Learning approach that provides powerful stochastic algorithms that only need to interact with the model. Our second main contribution is a new family of testable equivalences, called K-moment. The weakest of them, 1-moment equivalence, is trace equivalence; as K grows, K-moment equivalences become finer, all remaining, as well as their limit, weaker than bisimulation. We propose a framework to define (and test) a bigger class of testable equivalences: Test-Observation-Equivalences (TOEs), and we show how they can be made coarser or not, by tuning some parameters. © 2013 Elsevier Inc. All rights reserved.
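As a rough illustration of what "testable from interaction" can mean for trace equivalence, the sketch below estimates, by sampling alone, how far two black-box stochastic systems differ on a fixed list of traces. It is not the method of the paper: the paper uses Reinforcement Learning to search for distinguishing tests, whereas this sketch simply takes the maximum over a hand-picked list. All names (`make_system`, `run_trace`, `estimate_trace_prob`, `trace_distance`) and the toy two-state system are hypothetical and chosen only for illustration.

```python
import random


def make_system(p):
    """Hypothetical toy system (an assumption, not from the paper).

    From state 0, action "a" is accepted with probability p and moves to
    state 1; from state 1, action "b" is always accepted. Every other
    action is refused (signalled by returning None).
    """
    def step(state, action, rng):
        if state == 0 and action == "a":
            return 1 if rng.random() < p else None
        if state == 1 and action == "b":
            return 1
        return None
    return step


def run_trace(step, actions, rng):
    """Interact once with the system; True if every action is accepted."""
    state = 0
    for action in actions:
        state = step(state, action, rng)
        if state is None:
            return False
    return True


def estimate_trace_prob(step, actions, n, rng):
    """Monte Carlo estimate of the probability that the trace is accepted."""
    hits = sum(run_trace(step, actions, rng) for _ in range(n))
    return hits / n


def trace_distance(spec, impl, traces, n=20000, seed=0):
    """Largest estimated gap in acceptance probability over the given traces.

    A value near zero is consistent with (but does not prove) trace
    equivalence on the traces that were tried.
    """
    rng = random.Random(seed)
    return max(
        abs(estimate_trace_prob(spec, t, n, rng)
            - estimate_trace_prob(impl, t, n, rng))
        for t in traces
    )
```

For example, calling `trace_distance(make_system(0.5), make_system(0.9), [("a",), ("a", "b")])` should give an estimate close to the true gap of 0.4, while comparing two systems built with the same `p` should give a value close to zero, up to sampling noise.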
Pages: 21-57
Page count: 37
Related papers
  • [21] Probabilistic inference for determining options in reinforcement learning
    Daniel, Christian
    van Hoof, Herke
    Peters, Jan
    Neumann, Gerhard
    MACHINE LEARNING, 2016, 104 (2-3) : 337 - 357
  • [22] Probabilistic Policy Reuse for Safe Reinforcement Learning
    Garcia, Javier
    Fernandez, Fernando
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2019, 13 (03)
  • [23] Verified Probabilistic Policies for Deep Reinforcement Learning
    Bacci, Edoardo
    Parker, David
    NASA FORMAL METHODS (NFM 2022), 2022, 13260 : 193 - 212
  • [24] Probabilistic Inference in Reinforcement Learning Done Right
    Tarbouriech, Jean
    Lattimore, Tor
    O'Donoghue, Brendan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [25] PROBABILISTIC REINFORCEMENT LEARNING IN SCHIZOPHRENIA: RELATIONSHIP TO AMOTIVATION
    Dowd, Erin Connor
    Barch, Deanna Marie
    SCHIZOPHRENIA BULLETIN, 2011, 37 : 135 - 135
  • [26] Probabilistic inference for determining options in reinforcement learning
    Christian Daniel
    Herke van Hoof
    Jan Peters
    Gerhard Neumann
    Machine Learning, 2016, 104 : 337 - 357
  • [27] Reinforcement learning in a probabilistic learning task without time constraints
    Jablonska, Judyta
    Szumiec, Lukasz
    Parkitna, Jan Rodriguez
    PHARMACOLOGICAL REPORTS, 2019, 71 (06) : 1310 - 1310
  • [28] Efficient state synchronisation in model-based testing through reinforcement learning
    Turker, Uraz Cengiz
    Hierons, Robert M.
    Mousavi, Mohammad Reza
    Tyukin, Ivan Y.
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 368 - 380
  • [29] Testing shift-equivalence of polynomials by deterministic, probabilistic and quantum machines
    Grigoriev, D
    THEORETICAL COMPUTER SCIENCE, 1997, 180 (1-2) : 217 - 228
  • [30] The role of Reinforcement Learning in software testing
    Abo-eleneen, Amr
    Palliyali, Ahammed
    Catal, Cagatay
    INFORMATION AND SOFTWARE TECHNOLOGY, 2023, 164