Change-Oriented Summarization of Temporal Scholarly Document Collections by Semantic Evolution Analysis

被引:0
|
作者
Paharia, Naman [1 ]
Pozi, Muhammad Syafiq Mohd [2 ]
Jatowt, Adam [3 ]
机构
[1] Indian Inst Technol IIT Kharagpur, Dept Elect Engn, Kharagpur 721302, W Bengal, India
[2] Univ Utara Malaysia, Sch Comp, Sintok 06010, Kedah, Malaysia
[3] Univ Innsbruck, Dept Comp Sci, Innsbruck, Tirol, Austria
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Semantics; Task analysis; Context modeling; Machine learning; Linguistics; Analytical models; Syntactics; Temporal summarization; ACL; clustering; semantic changes; cluster analysis; TEXT;
D O I
10.1109/ACCESS.2021.3135051
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The number of scholarly publications has dramatically increased over the last decades. For anyone new to a particular science domain it is not easy to understand the major trends and significant changes that the domain has undergone over time. Temporal summarization and related approaches should be then useful to make sense of scholarly temporal collections. In this paper we demonstrate an approach to analyze the dataset of research papers by providing a high level overview of important changes that occurred over time in this dataset. The novelty of our approach lies in the adaptation of methods used for semantic term evolution analysis. However, we analyze not just semantic evolution of single words independently, but we estimate common semantic drifts shared by groups of semantically converging words. As an example dataset we study the ACL Anthology Reference Corpus that spans from 1974 to 2015 and contains 22,878 scholarly articles.
引用
下载
收藏
页码:76401 / 76415
页数:15
相关论文
共 41 条
  • [1] Change Summarization of Diachronic Scholarly Paper Collections by Semantic Evolution Analysis
    Paharia, Naman
    Pozi, Muhammad Syafiq Mohd
    Jatowt, Adam
    2021 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL 2021), 2021, : 234 - 237
  • [2] Fuzzy clustering for topic analysis and summarization of document collections
    Witte, Rene
    Bergler, Sabine
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2007, 4509 : 476 - +
  • [3] Change-oriented requirements traceability. Support for evolution of embedded systems
    von Knethen, A
    INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, PROCEEDINGS, 2002, : 482 - 485
  • [4] Temporal Analysis of Document Collections: Framework and Applications
    Alonso, Omar
    Gertz, Michael
    Baeza-Yates, Ricardo
    STRING PROCESSING AND INFORMATION RETRIEVAL, 2010, 6393 : 290 - +
  • [5] An Enhanced Latent Semantic Analysis Approach for Arabic Document Summarization
    Al-Sabahi, Kamal
    Zhang, Zuping
    Long, Jun
    Alwesabi, Khaled
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 8079 - 8094
  • [6] An Enhanced Latent Semantic Analysis Approach for Arabic Document Summarization
    Kamal Al-Sabahi
    Zuping Zhang
    Jun Long
    Khaled Alwesabi
    Arabian Journal for Science and Engineering, 2018, 43 : 8079 - 8094
  • [7] Fostering change-oriented OCBS: an analysis of India's IT talent
    Kataria, Aakanksha
    Rashmi, Kumari
    Rastogi, Mansi
    JOURNAL OF ASIA BUSINESS STUDIES, 2022, : 57 - 78
  • [8] Topic Oriented Multi-document Summarization Using LSA, Syntactic and Semantic Features
    Anjaneyulu, M.
    Sarma, S. S. V. N.
    Reddy, P. Vijaya Pal
    Chander, K. Prem
    Nagaprasad, S.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 487 - 502
  • [9] Latent Semantic Analysis Approach for Document Summarization Based on Word Embeddings
    Al-Sabahi, Kamal
    Zhang Zuping
    Kang, Yang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (01): : 254 - 276
  • [10] Semantic Analysis for Focused Multi-Document Summarization (fMDS) of Text
    Israel, Quinsulon
    Han, Hyoil
    Song, Il-Yeol
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 339 - 344