CHAPTERBREAK: A Challenge Dataset for Long-Range Language Models

被引:0
|
作者
Sun, Simeng [1 ]
Thai, Katherine [1 ]
Iyyer, Mohit [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While numerous architectures for long-range language models (LRLMs) have recently been proposed, a meaningful evaluation of their discourse-level language understanding capabilities has not yet followed. To this end, we introduce CHAPTERBREAK, a challenge dataset that provides an LRLM with a long segment from a narrative that ends at a chapter boundary and asks it to distinguish the beginning of the ground-truth next chapter from a set of negative segments sampled from the same narrative. A fine-grained human annotation reveals that our dataset contains many complex types of chapter transitions (e.g., parallel narratives, cliffhanger endings) that require processing global context to comprehend. Experiments on CHAPTERBREAK show that existing LRLMs fail to effectively leverage long-range context, substantially underperforming a segment-level model trained directly for this task. We publicly release our CHAPTERBREAK dataset to spur more principled future research into LRLMs.(1)
引用
收藏
页码:3704 / 3714
页数:11
相关论文
共 50 条
  • [1] Do Long-Range Language Models Actually Use Long-Range Context?
    Sun, Simeng
    Krishna, Kalpesh
    Mattarella-Micke, Andrew
    Iyyer, Mohit
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 807 - 822
  • [2] Verifying the Long-range Dependency of RNN Language Models
    Tseng, Tzu-Hsuan
    Yang, Tzu-Hsuan
    Chen, Chia-Ping
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 75 - 78
  • [3] Chinese WPLC: A Chinese Dataset for Evaluating Pretrained Language Models on Word Prediction Given Long-Range Context
    Ge, Huibin
    Sun, Chenxi
    Xiong, Deyi
    Liu, Qun
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3770 - 3778
  • [4] Long-Range Correlation Underlying Childhood Language and Generative Models
    Tanaka-Ishii, Kumiko
    [J]. FRONTIERS IN PSYCHOLOGY, 2018, 9
  • [5] Cirrus: A Long-range B -pattern LiDAR Dataset
    Wang, Ze
    Ding, Sihao
    Li, Ying
    Fenn, Jonas
    Roychowdhury, Sohini
    Wallin, Andreas
    Martin, Lane
    Ryvola, Scott
    Sapiro, Guillermo
    Qiu, Qiang
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 5744 - 5750
  • [7] LONG-RANGE PLANNING - CHALLENGE TO MANAGEMENT SCIENCE
    DRUCKER, PF
    [J]. MANAGEMENT SCIENCE, 1959, 5 (03) : 238 - 249
  • [8] MULTIPERIPHERAL MODELS WITH LONG-RANGE CORRELATIONS
    STEINHOFF, J
    [J]. NUCLEAR PHYSICS B, 1974, B 82 (03) : 461 - 476
  • [9] Correlation models with long-range dependence
    Ma, CS
    [J]. JOURNAL OF APPLIED PROBABILITY, 2002, 39 (02) : 370 - 382
  • [10] Energy bonds as correlators for long-range symmetry-protected topological models and models with long-range topological order
    Yu, Wing Chi
    Cheng, Chen
    Sacramento, P. D.
    [J]. PHYSICAL REVIEW B, 2020, 101 (24)