Cross-functional Analysis of Generalization in Behavioral Learning

被引:0
|
作者
de Araujo, Pedro Henrique Luz [1 ,2 ]
Roth, Benjamin [1 ,3 ]
机构
[1] Univ Vienna, Fac Comp Sci, Vienna, Austria
[2] UniVie Doctoral Sch Comp Sci, Vienna, Austria
[3] Univ Vienna, Fac Philol & Cultural Studies, Vienna, Austria
关键词
D O I
10.1162/tacl_a_00590
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In behavioral testing, system functionalities underrepresented in the standard evaluation setting (with a held-out test set) are validated through controlled input-output pairs. Optimizing performance on the behavioral tests during training (behavioral learning) would improve coverage of phenomena not sufficiently represented in the i.i.d. data and could lead to seemingly more robust models. However, there is the risk that the model narrowly captures spurious correlations from the behavioral test suite, leading to overestimation and misrepresentation of model performance-one of the original pitfalls of traditional evaluation.In this work, we introduce BeLUGA, an analysis method for evaluating behavioral learning considering generalization across dimensions of different granularity levels. We optimize behavior-specific loss functions and evaluate models on several partitions of the behavioral test suite controlled to leave out specific phenomena. An aggregate score measures generalization to unseen functionalities (or overfitting). We use BeLUGA to examine three representative NLP tasks (sentiment analysis, paraphrase identification, and reading comprehension) and compare the impact of a diverse set of regularization and domain generalization methods on generalization performance.(1)
引用
收藏
页码:1066 / 1081
页数:16
相关论文
共 50 条
  • [1] Situated learning in cross-functional virtual teams
    Robey, D
    Khoo, HM
    Powers, C
    TECHNICAL COMMUNICATION, 2000, 47 (01) : 51 - 66
  • [2] A social learning theory of cross-functional case education
    Crittenden, WF
    JOURNAL OF BUSINESS RESEARCH, 2005, 58 (07) : 960 - 966
  • [3] Team composition and learning behaviors in cross-functional teams
    Yeh, YJ
    Chou, HW
    SOCIAL BEHAVIOR AND PERSONALITY, 2005, 33 (04): : 391 - 402
  • [4] CROSS-FUNCTIONAL BARRIERS
    MORGAN, BW
    SLOAN MANAGEMENT REVIEW, 1995, 36 (04): : 7 - 7
  • [5] CROSS-FUNCTIONAL COLLABORATION
    PARKER, GM
    TRAINING & DEVELOPMENT, 1994, 48 (10): : 49 - &
  • [6] Cross-functional individuals
    H Craig Mak
    Nature Biotechnology, 2011, 29 (1) : 49 - 49
  • [7] Cross-functional analysis of the Microviridae internal scaffolding protein
    Burch, AD
    Ta, J
    Fane, BA
    JOURNAL OF MOLECULAR BIOLOGY, 1999, 286 (01) : 95 - 104
  • [8] Cross-functional knowledge sharing, coordination and firm performance: The role of cross-functional competition
    Nguyen Phong Nguyen
    Liem Viet Ngo
    Bucic, Tania
    Nguyen Dong Phong
    INDUSTRIAL MARKETING MANAGEMENT, 2018, 71 : 123 - 134
  • [9] The basics of cross-functional teams
    Anwar, S
    QUALITY PROGRESS, 1998, 31 (08) : 145 - 145
  • [10] Cross-Functional Inventory Research
    Shen, Wenjing
    INTERFACES, 2017, 47 (04) : 366 - 368