Comparing Text Representations: A Theory-Driven Approach

Cited by: 0
Authors
Yauney, Gregory [1]
Mimno, David [1]
Affiliation
[1] Cornell Univ, Ithaca, NY 14853 USA
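Funding
US National Science Foundation
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract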
Much of the progress in contemporary NLP has come from learning representations, such as masked language model (MLM) contextual embeddings, that turn challenging problems into simple classification tasks. But how do we quantify and explain this effect? We adapt general tools from computational learning theory to fit the specific characteristics of text datasets and present a method to evaluate the compatibility between representations and tasks. Even though many tasks can be easily solved with simple bag-of-words (BOW) representations, BOW does poorly on hard natural language inference tasks. For one such task we find that BOW cannot distinguish between real and randomized labelings, while pre-trained MLM representations show 72x greater distinction between real and random labelings than BOW. This method provides a calibrated, quantitative measure of the difficulty of a classification-based NLP task, enabling comparisons between representations without requiring empirical evaluations that may be sensitive to initializations and hyperparameters. The method provides a fresh perspective on the patterns in a dataset and the alignment of those patterns with specific labels.
Pages: 5527-5539
Page count: 13