On the effectiveness of testing sentiment analysis systems with metamorphic testing

被引:0
|
作者
Jiang, Mingyue [1 ]
Chen, Tsong Yueh [2 ]
Wang, Shuai [3 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Informat Sci & Technol, Hangzhou, Peoples R China
[2] Swinburne Univ Technol, Dept Comp Sci & Software Engn, Hawthorn, Vic 3122, Australia
[3] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
关键词
Metamorphic testing; Metamorphic relation; Sentiment analysis; False satisfaction;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Metamorphic testing (MT) has been successfully applied to a wide scope of software systems. In these applications, the testing results of MT form the basis for drawing conclusions about the target system's performance. Therefore, the effectiveness of MT is crucial to the trustfulness of the derived conclusions. Objective: However, due to the nature of MT, its effectiveness can be affected by various factors. Despite of MT's success, it is still important to study its effectiveness under different application contexts. Method: To investigate the effectiveness of MT, we focus on an important aspect, namely, false satisfactions (which are satisfactions of metamorphic relations that involve at least one failing execution), and revisit the application of MT to sentiment analysis (SA) systems. An in-depth analysis of the essence of false satisfactions reveals the situations where they would occur, and how they would affect the effectiveness of MT. Furthermore, 20 metamorphic relations (MRs) are identified for supporting a user-oriented evaluation of SA systems. Results: The occurrence rates of false satisfactions are reported with respect to four SA systems. For the majority of MRs, false satisfactions account for about 20% to 50% of all MR satisfactions, suggesting that false satisfactions occur quite frequently in the evaluation of SA systems. It is also demonstrated that such high occurrence rates of false satisfactions adversely affect the users' selection of SA systems. Conclusion: Our analysis reveals that without considering the occurrence of false satisfactions, MT may overestimate the system's conformance to the relevant MR. Furthermore, our experiments empirically show that conclusions derived from MT can be adversely affected when there are many false satisfactions. Our findings will help the MT community to adopt a more fair and reliable way of using the test outcomes of MT, and can also inspire the development of solid foundations for MT.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] On the effectiveness of testing sentiment analysis systems with metamorphic testing
    Jiang, Mingyue
    Chen, Tsong Yueh
    Wang, Shuai
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 150
  • [2] On the effectiveness of testing sentiment analysis systems with metamorphic testing
    Jiang, Mingyue
    Chen, Tsong Yueh
    Wang, Shuai
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 150
  • [3] Testing multiple linear regression systems with metamorphic testing
    Luu, Quang-Hung
    Lau, Man F.
    Ng, Sebastian P. H.
    Chen, Tsong Yueh
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2021, 182
  • [4] Teaching Software Testing Skills: Metamorphic Testing as Vehicle for Creativity and Effectiveness in Software Testing
    Towey, Dave
    Chen, Tsong Yueh
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (TALE), 2015, : 161 - 162
  • [5] On the Cost-Effectiveness of Composite Metamorphic Relations for Testing Deep Learning Systems
    Arrieta, Aitor
    [J]. 2022 IEEE/ACM 7TH INTERNATIONAL WORKSHOP ON METAMORPHIC TESTING (MET 2022), 2022, : 42 - 47
  • [6] Metamorphic Testing for Reliability in System of Systems
    Chua, Kiat Kian Anthony
    Bae, Doo-Hwan
    Jee, Eunkyoung
    [J]. 2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2021), 2021, : 390 - 400
  • [7] Metamorphic Security Testing for Web Systems
    Mai, Phu X.
    Pastore, Fabrizio
    Goknil, Arda
    Briand, Lionel
    [J]. 2020 IEEE 13TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VALIDATION AND VERIFICATION (ICST 2020), 2020, : 186 - 197
  • [8] Metamorphic Testing: Testing the Untestable
    Segura, Sergio
    Towey, Dave
    Zhou, Zhi Quan
    Chen, Tsong Yueh
    [J]. IEEE SOFTWARE, 2020, 37 (03) : 46 - 53
  • [9] Integration testing and metamorphic testing
    Le Traon, Yves
    Xie, Tao
    [J]. Software Testing Verification and Reliability, 2022, 32 (04)
  • [10] Integration testing and metamorphic testing
    Le Traon, Yves
    Xie, Tao
    [J]. SOFTWARE TESTING VERIFICATION & RELIABILITY, 2022, 32 (04):