Cross-functional Analysis of Generalization in Behavioral Learning

被引：0

作者：

de Araujo, Pedro Henrique Luz ^{[1
,2
]}

Roth, Benjamin ^{[1
,3
]}

机构：

[1] Univ Vienna, Fac Comp Sci, Vienna, Austria

[2] UniVie Doctoral Sch Comp Sci, Vienna, Austria

[3] Univ Vienna, Fac Philol & Cultural Studies, Vienna, Austria

来源：

TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS | 2023年 / 11卷

关键词：

D O I：

10.1162/tacl_a_00590

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In behavioral testing, system functionalities underrepresented in the standard evaluation setting (with a held-out test set) are validated through controlled input-output pairs. Optimizing performance on the behavioral tests during training (behavioral learning) would improve coverage of phenomena not sufficiently represented in the i.i.d. data and could lead to seemingly more robust models. However, there is the risk that the model narrowly captures spurious correlations from the behavioral test suite, leading to overestimation and misrepresentation of model performance-one of the original pitfalls of traditional evaluation.In this work, we introduce BeLUGA, an analysis method for evaluating behavioral learning considering generalization across dimensions of different granularity levels. We optimize behavior-specific loss functions and evaluate models on several partitions of the behavioral test suite controlled to leave out specific phenomena. An aggregate score measures generalization to unseen functionalities (or overfitting). We use BeLUGA to examine three representative NLP tasks (sentiment analysis, paraphrase identification, and reading comprehension) and compare the impact of a diverse set of regularization and domain generalization methods on generalization performance.(1)

引用

页码：1066 / 1081

页数：16

共 50 条

[21] VARIOUS APPROACHES ON CROSS-FUNCTIONAL MANAGEMENT
Dinca, Laura
POLITICAL SCIENCES, LAW, FINANCE, ECONOMICS AND TOURISM, VOL III, 2014, : 781 - 786
[22] Creating cross-functional Web teams
Guenther, K
ONLINE, 2001, 25 (03): : 79 - 81
[23] A CROSS-FUNCTIONAL STRATEGY FOR PRODUCT DEVELOPMENT
HNAT, DL
FOOD TECHNOLOGY, 1994, 48 (08) : 62 - &
[24] Organizing a cross-functional integration team
Wilson, RC
POLLUTION ENGINEERING, 1999, 31 (04) : 31 - 31
[25] VITAL CROSS-FUNCTIONAL LINKAGES WITH MARKETING
LIM, JS
REID, DA
INDUSTRIAL MARKETING MANAGEMENT, 1992, 21 (02) : 159 - 165
[26] Cross-functional elements in undergraduate curriculum
Arnold, P
Kannan, V
McQuaid, PA
DECISION SCIENCES INSTITUTE 1998 PROCEEDINGS, VOLS 1-3, 1998, : 339 - 339
[27] Cross-functional cases in management education
Crittenden, VL
Dickson, P
JOURNAL OF BUSINESS RESEARCH, 2005, 58 (07) : 944 - 945
[28] Compensating Nondedicated Cross-Functional Teams
Wang, Sijun
He, Yuanjie
ORGANIZATION SCIENCE, 2008, 19 (05) : 753 - 765
[29] Cross-functional team processes and patient functional improvement
Alexander, JA
Lichtenstein, R
Jinnett, K
Wells, R
Zazzali, J
Liu, DW
HEALTH SERVICES RESEARCH, 2005, 40 (05) : 1335 - 1355
[30] FUNCTIONAL, MULTIFUNCTIONAL, AND CROSS-FUNCTIONAL: CONSIDERATIONS FOR MARKETING MANAGEMENT
Kahn, Kenneth
JOURNAL OF MARKETING THEORY AND PRACTICE, 2009, 17 (01) : 75 - 84

← 1 2 3 4 5 →