Testing probabilistic equivalence through Reinforcement Learning

被引:1
|
作者
Desharnais, Josee [1 ]
Laviolette, Francois [1 ]
Zhioua, Sami [2 ]
机构
[1] Univ Laval, Quebec City, PQ G1K 7P4, Canada
[2] King Fahd Univ Petr & Minerals, ICS, Dhahran 31261, Saudi Arabia
关键词
Verification; Stochastic systems; Markov processes; Distance; Divergence; Reinforcement Learning; Testing; Equivalence relations; BISIMULATION; DIFFERENCE;
D O I
10.1016/j.ic.2013.02.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Checking if a given system implementation respects its specification is often done by proving that the two are "equivalent". The equivalence is chosen, in particular, for its computability and of course for its meaning, that is, for its adequacy with what is observable from the two systems (implementation and specification). Trace equivalence is easily testable (decidable from interaction), but often considered too weak; in contrast, bisimulation is accepted as the canonical equivalence for interaction, but it is not testable. Richer than an equivalence is a form of distance: it is zero between equivalent systems, and it provides an estimation of their difference if the systems are not equivalent. Our main contribution is to define such a distance in a context where (1) the two systems to be compared have a stochastic behavior; (2) the model of one of them (e.g., the implementation) is unknown, hence our only knowledge is obtained by interacting with it; (3) consequently the target equivalence (observed when distance is zero) must be testable. To overcome the problem that the model is unknown, we use a Reinforcement Learning approach that provides powerful stochastic algorithms that only need to interact with the model. Our second main contribution is a new family of testable equivalences, called K-moment. The weakest of them, 1-moment equivalence, is trace equivalence; as K grows, K-moment equivalences become finer, all remaining, as well as their limit, weaker than bisimulation. We propose a framework to define (and test) a bigger class of testable equivalences: Test-Observation-Equivalences (TOEs), and we show how they can be made coarser or not, by tuning some parameters. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:21 / 57
页数:37
相关论文
共 50 条
  • [1] Testing probabilistic equivalence through reinforcement learning
    Desharnais, Josee
    Laviolette, Francois
    Zhioua, Sami
    FSTTCS 2006: FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE, PROCEEDINGS, 2006, 4337 : 236 - +
  • [2] Trace equivalence characterization through reinforcement learning
    Desharnais, Josee
    Laviolette, Francois
    Moturu, Krishna Priya Darsini
    Zhioua, Sami
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 371 - 382
  • [3] Probabilistic bisimilarity as testing equivalence
    Deng, Yuxin
    Feng, Yuan
    INFORMATION AND COMPUTATION, 2017, 257 : 58 - 64
  • [4] Learning domain structure through probabilistic policy reuse in reinforcement learning
    Fernandez, Fernando
    Veloso, Manuela
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2013, 2 (01) : 13 - 27
  • [5] Directing genetic algorithms for probabilistic reasoning through reinforcement learning
    Zhong, XM
    Santos, E
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2000, 8 (02) : 167 - 185
  • [6] IFRIT: Focused Testing through Deep Reinforcement Learning
    Romdhana, Andrea
    Ceccato, Mariano
    Merlo, Alessio
    Tonella, Paolo
    2022 IEEE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST 2022), 2022, : 24 - 34
  • [7] THE EQUIVALENCE AND LEARNING OF PROBABILISTIC-AUTOMATA
    TZENG, WG
    30TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, 1989, : 268 - 273
  • [8] Emergent reinforcement learning behaviors through novel testing conditions
    Zaroukian, Erin G.
    Basak, Anjon
    Sharma, Piyush K.
    Fernandez, Rolando
    Asher, Derrik E.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS III, 2021, 11746
  • [9] Fair testing through probabilistic testing
    Núñez, M
    Rupérez, D
    FORMAL METHODS FOR PROTOCOL ENGINEERING AND DISTRIBUTED SYSTEMS, 1999, 28 : 135 - 150
  • [10] Automated Post-Breach Penetration Testing through Reinforcement Learning
    Chaudhary, Sujita
    O'Brien, Austin
    Xu, Shengjie
    2020 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2020,