ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset

被引:4
|
作者
Jin, Zhihua [1 ]
Wang, Xingbo [1 ]
Cheng, Furui [1 ,2 ]
Sun, Chunhui [3 ]
Liu, Qun [4 ]
Qu, Huamin [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[3] Peking Univ, Beijing 100871, Peoples R China
[4] Huawei, Noahs Ark Lab, Hong Kong, Peoples R China
关键词
Benchmark testing; Task analysis; Natural language processing; Cognition; Guidelines; Predictive models; Computational modeling; Natural language understanding; shortcut; visual analytics;
D O I
10.1109/TVCG.2023.3236380
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Benchmark datasets play an important role in evaluating Natural Language Understanding (NLU) models. However, shortcuts-unwanted biases in the benchmark datasets-can damage the effectiveness of benchmark datasets in revealing models' real capabilities. Since shortcuts vary in coverage, productivity, and semantic meaning, it is challenging for NLU experts to systematically understand and avoid them when creating benchmark datasets. In this paper, we develop a visual analytics system, ShortcutLens, to help NLU experts explore shortcuts in NLU benchmark datasets. The system allows users to conduct multi-level exploration of shortcuts. Specifically, Statistics View helps users grasp the statistics such as coverage and productivity of shortcuts in the benchmark dataset. Template View employs hierarchical and interpretable templates to summarize different types of shortcuts. Instance View allows users to check the corresponding instances covered by the shortcuts. We conduct case studies and expert interviews to evaluate the effectiveness and usability of the system. The results demonstrate that ShortcutLens supports users in gaining a better understanding of benchmark dataset issues through shortcuts, inspiring them to create challenging and pertinent benchmark datasets.
引用
收藏
页码:3594 / 3608
页数:15
相关论文
共 50 条
  • [1] ExpLIMEable: A Visual Analytics Approach for Exploring LIME
    Laguna, Sonia
    Heidenreich, Julian N.
    Sun, Jiugeng
    Cetin, Nilufer
    Al-Hazwani, Ibrahim
    Schlegel, Udo
    Cheng, Furui
    El-Assady, Mennatallah
    [J]. 2023 WORKSHOP ON VISUAL ANALYTICS IN HEALTHCARE, VAHC, 2023, : 27 - 33
  • [2] Understanding Syndromic Hotspots - A Visual Analytics Approach
    Maciejewski, Ross
    Rudolph, Stephen
    Hafen, Ryan
    Abusalah, Ahmad
    Yakout, Mohamed
    Ouzzani, Mourad
    Cleveland, William S.
    Grannis, Shaun J.
    Wade, Michael
    Ebert, David S.
    [J]. IEEE SYMPOSIUM ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY 2008, PROCEEDINGS, 2008, : 35 - 42
  • [3] Understanding Hotspots: A Topological Visual Analytics Approach
    Lukasczyk, Jonas
    Maciejewski, Ross
    Garth, Christoph
    Hagen, Hans
    [J]. 23RD ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2015), 2015,
  • [4] A visual analytics approach to understanding cycling behaviour
    Beecham, Roger
    Wood, Jo
    Bowerman, Audrey
    [J]. 2012 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY (VAST), 2012, : 207 - 208
  • [5] A Visual Analytics Approach to Understanding Spatiotemporal Hotspots
    Maciejewski, Ross
    Rudolph, Stephen
    Hafen, Ryan
    Abusalah, Ahmad M.
    Yakout, Mohamed
    Ouzzani, Mourad
    Cleveland, William S.
    Grannis, Shaun J.
    Ebert, David S.
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (02) : 205 - 220
  • [6] A Visual Analytics Approach to Exploring Protein Flexibility Subspaces
    Barlowe, Scott
    Yang, Jing
    Jacobs, Donald J.
    Livesay, Dennis R.
    Alsakran, Jamal
    Zhao, Ye
    Verma, Deeptak
    Mottonen, James
    [J]. 2013 IEEE SYMPOSIUM ON PACIFIC VISUALIZATION (PACIFICVIS), 2013, : 193 - 200
  • [7] LitVis: a visual analytics approach for managing and exploring literature
    Tian, Min
    Li, Guozheng
    Yuan, Xiaoru
    [J]. JOURNAL OF VISUALIZATION, 2023, 26 (06) : 1445 - 1458
  • [8] LitVis: a visual analytics approach for managing and exploring literature
    Min Tian
    Guozheng Li
    Xiaoru Yuan
    [J]. Journal of Visualization, 2023, 26 : 1445 - 1458
  • [9] Understanding the Dimensions of Medical Crowdfunding: A Visual Analytics Approach
    Ren, Jie
    Raghupathi, Viju
    Raghupathi, Wullianallur
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (07)
  • [10] Pragmatic approach in natural language understanding
    Mitsuiyoshi, S
    Ren, F
    Lin, Y
    Ogawa, JR
    [J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 40 - 49