Kaleidoscope: Semantically-grounded, context-specific ML model evaluation

被引:2
|
作者
Suresh, Harini [1 ]
Shanmugam, Divya [1 ]
Chen, Tiffany [1 ]
Bryan, Annie [1 ]
D'Amour, Alexander [2 ]
Guttag, John V. [1 ]
Satyanarayan, Arvind [1 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
[2] Google Res, San Francisco, CA USA
关键词
D O I
10.1145/3544548.3581482
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Desired model behavior often difers across contexts (e.g., diferent geographies, communities, or institutions), but there is little infrasnotions tructure to facilitate context-specifc evaluations key to deployment decisions and building trust. Here, we present Kaleidoscope, a sysfor evaluating models in terms of user-driven, domain-relevant concepts. Kaleidoscope's iterative workfow enables generalizing from a few examples into a larger, diverse set representing an imsubreddits portant concept. These example sets can be used to test model or shifts in model behavior in semantically-meaningful ways. For instance, we might construct a "xenophobic comments" and test that its examples are more likely to be fagged by a content moderation model than a "civil discussion" set. To evalumoderators ate Kaleidoscope, we compare it against template- and DSL-based grouping methods, and conduct a usability study with 13 Reddit users testing a content moderation model. We fnd that Kaleidoscope facilitates iterative, exploratory hypothesis testing across conceptually-meaningful example sets.
引用
收藏
页数:13
相关论文
共 43 条
  • [1] A model for context-specific route directions
    Richter, KF
    Klippel, A
    [J]. SPATIAL COGNITION IV, REASONING, ACTION, INTERACTION, 2004, 3343 : 58 - 78
  • [2] Context-specific Quality Evaluation of Test Cases
    Jovanovikj, Ivan
    Narasimhan, Vishwak
    Engels, Gregor
    Sauer, Stefan
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON MODEL-DRIVEN ENGINEERING AND SOFTWARE DEVELOPMENT, 2018, : 594 - 601
  • [3] Context-Specific Multi-Model-Template Retrieval
    Hilbert, Frank
    Scherer, Raimar J.
    [J]. COLLABORATIVE NETWORKS IN THE INTERNET OF SERVICES, 2012, 380 : 234 - 241
  • [4] CODA-ML: context-specific biological knowledge representation for systemic physiology analysis
    Kwon, Mijin
    Yim, Soorin
    Kim, Gwangmin
    Lee, Saehwan
    Jeong, Chungsun
    Lee, Doheon
    [J]. BMC BIOINFORMATICS, 2019, 20 (Suppl 10)
  • [5] CODA-ML: context-specific biological knowledge representation for systemic physiology analysis
    Mijin Kwon
    Soorin Yim
    Gwangmin Kim
    Saehwan Lee
    Chungsun Jeong
    Doheon Lee
    [J]. BMC Bioinformatics, 20
  • [6] Context-specific interface model for architectural design in the virtual environment
    [J]. Architectural Science Review, 1998, 41 (03): : 105 - 111
  • [7] An association matrix model of context-specific vertical vergence adaptation
    McCandless, JW
    Schor, CM
    [J]. NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1997, 8 (03) : 239 - 258
  • [8] Generalized framework for context-specific metabolic model extraction methods
    Estevez, Semidan Robaina
    Nikoloski, Zoran
    [J]. FRONTIERS IN PLANT SCIENCE, 2014, 5
  • [9] Model-based analysis of context-specific cognitive control
    King, Joseph A.
    Donkin, Christopher
    Korb, Franziska M.
    Egner, Tobias
    [J]. FRONTIERS IN PSYCHOLOGY, 2012, 3
  • [10] On the effects of alternative optima in context-specific metabolic model predictions
    Robaina-Estevez, Semidan
    Nikoloski, Zoran
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (05)