Change Summarization of Diachronic Scholarly Paper Collections by Semantic Evolution Analysis

被引:0
|
作者
Paharia, Naman [1 ]
Pozi, Muhammad Syafiq Mohd [2 ]
Jatowt, Adam [3 ]
机构
[1] IIT Kharagpur, Kharagpur, W Bengal, India
[2] Univ Utara Malaysia, SOC, Sintok, Kedah, Malaysia
[3] Univ Innsbruck, Innsbruck, Tirol, Austria
关键词
temporal mining; summarization; visualization;
D O I
10.1109/JCDL52503.2021.00067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The amount of scholarly data has been increasing dramatically over the last years. For newcomers to a particular science domain (e.g., IR, physics, NLP) it is often difficult to spot larger trends and to position the latest research in the context of prior scientific achievements and breakthroughs. Similarly, researchers in the history of science are interested in tools that allow them to analyze and visualize changes in particular scientific domains. Temporal summarization and related methods should be then useful for making sense of large volumes of scientific discourse data aggregated over time. We demonstrate a novel approach to analyze the collections of research papers published over longer time periods to provide a high level overview of important semantic changes that occurred over the progress of time. Our approach is based on comparing word semantic representations over time and aims to support users in better understanding of large domain-focused archives of scholarly publications. As an example dataset we use the ACL Anthology Reference Corpus that spans from 1979 to 2015 and contains 22,878 scholarly articles.
引用
收藏
页码:234 / 237
页数:4
相关论文
共 50 条
  • [41] A Comprehensive Method for Text Summarization Based on Latent Semantic Analysis
    Wang, Yingjie
    Ma, Jun
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 394 - 401
  • [42] Semantic analysis and retrieval in personal and social photo collections
    Philipp Sandhaus
    Susanne Boll
    Multimedia Tools and Applications, 2011, 51 : 5 - 33
  • [43] Semantic analysis and retrieval in personal and social photo collections
    Sandhaus, Philipp
    Boll, Susanne
    MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 51 (01) : 5 - 33
  • [44] An approach to software evolution based on semantic change
    Robbes, Romain
    Lanza, Michele
    Lungu, Mircea
    FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING, PROCEEDINGS, 2007, 4422 : 27 - +
  • [45] Unpacking research lock-in through a diachronic analysis of topic cluster trajectories in scholarly publications
    Lascialfari, Matteo
    Magrini, Marie-Benoit
    Cabanac, Guillaume
    SCIENTOMETRICS, 2022, 127 (11) : 6165 - 6189
  • [46] Unpacking research lock-in through a diachronic analysis of topic cluster trajectories in scholarly publications
    Matteo Lascialfari
    Marie-Benoît Magrini
    Guillaume Cabanac
    Scientometrics, 2022, 127 : 6165 - 6189
  • [47] Automatic Text Summarization of Konkani Texts Using Latent Semantic Analysis
    D'Silva, Jovi
    Sharma, Uzzal
    More, Chaitali
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 425 - 437
  • [48] Chinese text summarization using a trainable summarizer and latent semantic analysis
    Yeh, JY
    Ke, HR
    Yang, WP
    DIGITAL LIBRARIES: PEOPLE, KNOWLEDGE, AND TECHNOLOGY, PROCEEDINGS, 2002, 2555 : 76 - 87
  • [49] Latent Semantic Analysis Approach for Document Summarization Based on Word Embeddings
    Al-Sabahi, Kamal
    Zhang Zuping
    Kang, Yang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (01): : 254 - 276
  • [50] A diachronic corpus-assisted semantic domain analysis of US presidential debates
    Hayes, Nicholas
    Poole, Robert
    CORPORA, 2022, 17 (03) : 449 - 469