Comparing Text Representations: A Theory-Driven Approach

Authors
Yauney, Gregory [1]
Mimno, David [1]
Affiliations
[1] Cornell Univ, Ithaca, NY 14853 USA
Funding
U.S. National Science Foundation
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Much of the progress in contemporary NLP has come from learning representations, such as masked language model (MLM) contextual embeddings, that turn challenging problems into simple classification tasks. But how do we quantify and explain this effect? We adapt general tools from computational learning theory to fit the specific characteristics of text datasets and present a method to evaluate the compatibility between representations and tasks. Even though many tasks can be easily solved with simple bag-of-words (BOW) representations, BOW does poorly on hard natural language inference tasks. For one such task we find that BOW cannot distinguish between real and randomized labelings, while pre-trained MLM representations show 72x greater distinction between real and random labelings than BOW. This method provides a calibrated, quantitative measure of the difficulty of a classification-based NLP task, enabling comparisons between representations without requiring empirical evaluations that may be sensitive to initializations and hyperparameters. The method provides a fresh perspective on the patterns in a dataset and the alignment of those patterns with specific labels.
Pages: 5527-5539
Page count: 13