On the effectiveness of testing sentiment analysis systems with metamorphic testing

Cited by: 0
Authors
Jiang, Mingyue [1]
Chen, Tsong Yueh [2]
Wang, Shuai [3]
Affiliations
[1] Zhejiang Sci Tech Univ, Sch Informat Sci & Technol, Hangzhou, Peoples R China
[2] Swinburne Univ Technol, Dept Comp Sci & Software Engn, Hawthorn, Vic 3122, Australia
[3] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Keywords
Metamorphic testing; Metamorphic relation; Sentiment analysis; False satisfaction
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Context: Metamorphic testing (MT) has been successfully applied to a wide range of software systems. In these applications, the testing results of MT form the basis for drawing conclusions about the target system's performance. Therefore, the effectiveness of MT is crucial to the trustworthiness of the derived conclusions.
Objective: However, due to the nature of MT, its effectiveness can be affected by various factors. Despite MT's success, it is still important to study its effectiveness under different application contexts.
Method: To investigate the effectiveness of MT, we focus on an important aspect, namely, false satisfactions (which are satisfactions of metamorphic relations that involve at least one failing execution), and revisit the application of MT to sentiment analysis (SA) systems. An in-depth analysis of the essence of false satisfactions reveals the situations in which they occur and how they affect the effectiveness of MT. Furthermore, 20 metamorphic relations (MRs) are identified to support a user-oriented evaluation of SA systems.
Results: The occurrence rates of false satisfactions are reported with respect to four SA systems. For the majority of MRs, false satisfactions account for about 20% to 50% of all MR satisfactions, suggesting that false satisfactions occur quite frequently in the evaluation of SA systems. It is also demonstrated that such high occurrence rates of false satisfactions adversely affect users' selection of SA systems.
Conclusion: Our analysis reveals that, without considering the occurrence of false satisfactions, MT may overestimate the system's conformance to the relevant MR. Furthermore, our experiments empirically show that conclusions derived from MT can be adversely affected when there are many false satisfactions. Our findings will help the MT community to adopt a fairer and more reliable way of using the test outcomes of MT, and can also inspire the development of solid foundations for MT.
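The notion of a false satisfaction can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that a "failing execution" means an output that differs from a known ground-truth label, and it uses a hypothetical keyword-based classifier (`toy_predict`) and a hypothetical synonym-replacement relation (`same_sentiment`) that is not necessarily one of the paper's 20 MRs. All names and test sentences are invented for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class MetamorphicCase:
    """One source/follow-up input pair with assumed ground-truth labels."""
    source_text: str
    followup_text: str
    source_label: str
    followup_label: str


def toy_predict(text: str) -> str:
    """Hypothetical stand-in for an SA system: it only knows the word 'good'."""
    return "positive" if "good" in text.lower() else "negative"


def same_sentiment(source_output: str, followup_output: str) -> bool:
    """Hypothetical MR: a synonym replacement should not change the sentiment."""
    return source_output == followup_output


def classify_outcomes(
    cases: List[MetamorphicCase],
    predict: Callable[[str], str],
    relation: Callable[[str, str], bool],
) -> Dict[str, int]:
    """Count MR violations, true satisfactions, and false satisfactions."""
    counts = {"violation": 0, "true_satisfaction": 0, "false_satisfaction": 0}
    for case in cases:
        source_output = predict(case.source_text)
        followup_output = predict(case.followup_text)
        if not relation(source_output, followup_output):
            counts["violation"] += 1
        elif (source_output == case.source_label
              and followup_output == case.followup_label):
            # The MR holds and both executions are correct.
            counts["true_satisfaction"] += 1
        else:
            # The MR holds although at least one execution is wrong.
            counts["false_satisfaction"] += 1
    return counts


def false_satisfaction_rate(counts: Dict[str, int]) -> float:
    """Share of MR satisfactions that involve at least one failing execution."""
    satisfied = counts["true_satisfaction"] + counts["false_satisfaction"]
    return counts["false_satisfaction"] / satisfied if satisfied else 0.0


cases = [
    MetamorphicCase("The film is good", "The movie is good", "positive", "positive"),
    MetamorphicCase("The film is great", "The movie is great", "positive", "positive"),
    MetamorphicCase("The food is good", "The food is tasty", "positive", "positive"),
    MetamorphicCase("The film is bad", "The movie is bad", "negative", "negative"),
]

counts = classify_outcomes(cases, toy_predict, same_sentiment)
rate = false_satisfaction_rate(counts)
print(counts, rate)
```

In this toy run the second pair satisfies the relation even though both outputs are wrong, so an evaluation that counts only satisfactions and violations would credit the classifier with three satisfied pairs out of four, while only two of them reflect correct behaviour.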
Pages: 11
Related papers
50 in total
  • [41] Metamorphic testing of named entity recognition systems: A case study
    Xu, Yezi
    Zhou, Zhi Quan
    Zhang, Xiaoxia
    Wang, Jing
    Jiang, Mingyue
    [J]. IET SOFTWARE, 2022, 16 (04) : 386 - 404
  • [42] Testing and validating machine learning classifiers by metamorphic testing
    Xie, Xiaoyuan
    Ho, Joshua W. K.
    Murphy, Christian
    Kaiser, Gail
    Xu, Baowen
    Chen, Tsong Yueh
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2011, 84 (04) : 544 - 558
  • [43] Metamorphic Testing for Traffic Light Recognition in Autonomous Driving Systems
    Bai, Tongtong
    Fan, Yong
    Pan, Ya
    Qing, Mingshuang
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 38 - 44
  • [44] Metamorphic Testing: Challenges Ahead
    Segura, Sergio
    [J]. 2018 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON METAMORPHIC TESTING (MET 2018), 2018, : 1 - 1
  • [45] Metamorphic Testing for Block Ciphers
    Zhang, Mingjia
    Towey, Dave
    Chen, Tsong Yueh
    Zhou, Zhi Quan
    [J]. 2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 1315 - 1320
  • [46] Metamorphic Testing of Datalog Engines
    Mansur, Muhammad Numair
    Christakis, Maria
    Wuestholz, Valentin
    [J]. PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), 2021, : 639 - 650
  • [47] Metamorphic testing and test automation
    Hierons, Robert M.
    Xie, Tao
    [J]. Software Testing Verification and Reliability, 2022, 32 (03)
  • [48] Metamorphic Testing with Causal Graphs
    Clark, Andrew G.
    Foster, Michael
    Walkinshaw, Neil
    Hierons, Robert M.
    [J]. 2023 IEEE CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION, ICST, 2023, : 153 - 164
  • [49] Interactive Metamorphic Testing of Debuggers
    Tolksdorf, Sandro
    Lehmann, Daniel
    Pradel, Michael
    [J]. PROCEEDINGS OF THE 28TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS (ISSTA '19), 2019, : 273 - 283
  • [50] Metamorphic Testing of Constraint Solvers
    Akgun, Ozgur
    Gent, Ian P.
    Jefferson, Christopher
    Miguel, Ian
    Nightingale, Peter
    [J]. PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, 2018, 11008 : 727 - 736