NarrativeXL: a Large-scale Dataset for Long-Term Memory Models

Citations: 0
Authors
Moskvichev, Arseny [1]
Mai, Ky-Vinh [2]
Affiliations
[1] Santa Fe Inst, Santa Fe, NM 87501 USA
[2] Univ Calif Irvine, Irvine, CA USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a new large-scale (nearly a million questions), ultra-long-context (more than 50,000 words average document length) reading comprehension dataset. Using GPT 3.5, we summarized each scene in 1,500 hand-curated fiction books from Project Gutenberg, which resulted in approximately 150 scene-level summaries per book. We then created a number of reading comprehension questions based on these summaries, including three types of multiple-choice scene recognition questions as well as free-form narrative reconstruction questions. With 990,595 total questions, our dataset is an order of magnitude larger than the closest alternatives. Crucially, most questions have a known "retention demand", indicating how long-term a memory is needed to answer them, which should aid long-term memory performance evaluation. We validate our data in four small-scale experiments: one with human labelers and three with existing language models. We show that our questions (1) adequately represent the source material, (2) can be used to diagnose a model's memory capacity, and (3) are not trivial for modern language models even when the memory demand does not exceed those models' context lengths. Lastly, we provide our code, which can be used to further expand the dataset with minimal human labor.
Pages: 15058-15072
Page count: 15