Cross-functional Analysis of Generalization in Behavioral Learning

被引:0
|
作者
de Araujo, Pedro Henrique Luz [1 ,2 ]
Roth, Benjamin [1 ,3 ]
机构
[1] Univ Vienna, Fac Comp Sci, Vienna, Austria
[2] UniVie Doctoral Sch Comp Sci, Vienna, Austria
[3] Univ Vienna, Fac Philol & Cultural Studies, Vienna, Austria
关键词
D O I
10.1162/tacl_a_00590
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In behavioral testing, system functionalities underrepresented in the standard evaluation setting (with a held-out test set) are validated through controlled input-output pairs. Optimizing performance on the behavioral tests during training (behavioral learning) would improve coverage of phenomena not sufficiently represented in the i.i.d. data and could lead to seemingly more robust models. However, there is the risk that the model narrowly captures spurious correlations from the behavioral test suite, leading to overestimation and misrepresentation of model performance-one of the original pitfalls of traditional evaluation.In this work, we introduce BeLUGA, an analysis method for evaluating behavioral learning considering generalization across dimensions of different granularity levels. We optimize behavior-specific loss functions and evaluate models on several partitions of the behavioral test suite controlled to leave out specific phenomena. An aggregate score measures generalization to unseen functionalities (or overfitting). We use BeLUGA to examine three representative NLP tasks (sentiment analysis, paraphrase identification, and reading comprehension) and compare the impact of a diverse set of regularization and domain generalization methods on generalization performance.(1)
引用
收藏
页码:1066 / 1081
页数:16
相关论文
共 50 条
  • [41] If Cross-functional Teams are the Answer, What is the Question?
    Pun, W. K. Daniel
    Santa, Ricardo
    NSS: 2009 3RD INTERNATIONAL CONFERENCE ON NETWORK AND SYSTEM SECURITY, 2009, : 501 - +
  • [42] Dynamics of Decision Making in Cross-Functional Teams
    Anibaba, Yetunde
    Akaighe, Godbless
    CONTEMPORARY ECONOMICS, 2018, 12 (04) : 485 - 496
  • [43] Transcending Knowledge Differences in Cross-Functional Teams
    Majchrzak, Ann
    More, Philip H. B.
    Faraj, Samer
    ORGANIZATION SCIENCE, 2012, 23 (04) : 951 - 970
  • [44] Development of successful cross-functional project teams
    Auchterlonie, C
    AQP'S 18TH ANNUAL SPRING CONFERENCE AND RESOURCE MART - THE SPIRIT OF WORKING TOGETHER, 1996 PROCEEDINGS, 1996, : 457 - 461
  • [45] SM323: The cross-functional core
    Arnold, P
    Khurana, A
    DECISION SCIENCES INSTITUTE, 1997 ANNUAL MEETING, PROCEEDINGS, VOLS 1-3, 1997, : 73 - 74
  • [46] End user computing: A cross-functional approach
    Gillard, S
    Rhim, JC
    Kim, K
    ASSOCIATION FOR INFORMATION SYSTEMS PROCEEDINGS OF THE AMERICAS CONFERENCE ON INFORMATION SYSTEMS, 1998, : 725 - 727
  • [47] Improving hospital flow with cross-functional teams
    Yong, HG
    ASQC'S 51ST ANNUAL QUALITY CONGRESS PROCEEDINGS, 1997, : 264 - 273
  • [48] Managerial mental models and cross-functional coordination: Clues to the link between individual learning and organizational learning
    Bharadwaj, N
    1996 AMA EDUCATORS' PROCEEDINGS, VOL 7 - ENHANCING KNOWLEDGE DEVELOPMENT IN MARKETING, 1996, 7 : 12 - 22
  • [49] CROSS-FUNCTIONAL TEAMS AND ORGANIZATIONAL LEARNING: A MODEL AND CASES FROM TELECOMMUNICATIONS OPERATING COMPANIES
    Slepian, Joan
    INTERNATIONAL JOURNAL OF INNOVATION AND TECHNOLOGY MANAGEMENT, 2013, 10 (01)
  • [50] THE IMPACT OF CROSS-FUNCTIONAL TEAMWORK ON WORKFORCE INTEGRATION
    IRVINE, D
    BAKER, GR
    INTERNATIONAL JOURNAL OF CONFLICT MANAGEMENT, 1995, 6 (02) : 171 - 191