Revealing the technology development of natural language processing: A Scientific entity-centric perspective

被引:5
|
作者
Zhang, Heng [1 ]
Zhang, Chengzhi [1 ]
Wang, Yuzhuo [2 ]
机构
[1] Nanjing Univ Sci & Technol, Dept Informat Management, Nanjing 210094, Peoples R China
[2] Anhui Univ, Sch Management, Dept Management Sci & Engn, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
Technology development; Automatic scientific entity identification; Entity co -occurrence network; z -score metric; INFORMATION-SCIENCE; EVOLUTION; COOCCURRENCE; EXTRACTION; NETWORK; LIBRARY;
D O I
10.1016/j.ipm.2023.103574
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most studies on technology development have been conducted from a thematic perspective, but the topics are coarse-grained and insufficient to accurately represent technology. The development of automatic entity recognition techniques makes it possible to extract technology-related entities on a large scale. Thus, we perform a more accurate analysis of technology development from an entity-centric perspective. To begin with, we extract technology-related entities such as methods, datasets, metrics, and tools in articles on Natural Language Processing (NLP), and we apply a semi-automatic approach to normalize the entities. Subsequently, we calculate the z-scores of entities based on their co-occurrence networks to measure their impact. We then analyze the development trends of new technologies in the NLP domain since the beginning of the 21st century. The findings of this paper include three aspects: Firstly, the continued increase in the average number of entities per paper implies a growing burden on researchers to acquire relevant technical background knowledge. However, the emergence of pre-trained language models has injected new vitality into the technological innovation of the NLP domain. Secondly, Methods dominate among the 179 high-impact entities. An analysis of the z-score trend about the top 10 entities reveals that pre-trained language models, exemplified by BERT and Transformer, have become mainstream in recent years. Unlike the trend of the other eight method entities, the impact of Wikipedia dataset and BLEU metric has continued to rise in the long term. Thirdly, in recent years, there has been a remarkable surge in popularity for new high-impact technologies than ever before, and their acceptance by researchers has accelerated at an unprecedented speed. Our study provides a new perspective on analyzing technology development in a specific domain. This work can help researchers grasp the current hot technologies in the field of NLP and contribute to the future development of NLP technologies. The source code and dataset for this paper can be accessed at https://github.com/ZH-heng/technology_development.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] An entity-centric approach to manage court judgments based on Natural Language Processing
    Bellandi, Valerio
    Bernasconi, Christian
    Lodi, Fausto
    Palmonari, Matteo
    Pozzi, Riccardo
    Ripamonti, Marco
    Siccardi, Stefano
    [J]. COMPUTER LAW & SECURITY REVIEW, 2024, 52
  • [2] On the Development of an Entity-centric Timeline Extraction Tool
    Piskorski, Jakub
    Zavarella, Vanni
    Atkinson, Martin
    [J]. 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 821 - 824
  • [3] Space-Efficient Representation of Entity-centric Query Language Models
    Van Gysel, Christophe
    Hannemann, Mirko
    Pusateri, Ernest
    Oualil, Youssef
    Oparin, Ilya
    [J]. INTERSPEECH 2022, 2022, : 679 - 683
  • [4] Somun: entity-centric summarization incorporating pre-trained language models
    Inan, Emrah
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10): : 5301 - 5311
  • [5] Somun: entity-centric summarization incorporating pre-trained language models
    Emrah Inan
    [J]. Neural Computing and Applications, 2021, 33 : 5301 - 5311
  • [6] FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments
    Papadopoulos, Dimitris
    Metropoulou, Katerina
    Matsatsinis, Nikolaos
    Papadakis, Nikolaos
    [J]. PROCEEDINGS OF THE 12TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE, SETN 2022, 2022,
  • [7] ON THE USE OF NATURAL-LANGUAGE PROCESSING TECHNOLOGY IN SCIENTIFIC-RESEARCH
    HENDRIX, GG
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1983, 185 (MAR): : 3 - CINF
  • [8] Event-Centric Natural Language Processing
    Chen, Muhao
    Zhang, Hongming
    Ning, Qiang
    Li, Manling
    Ji, Heng
    McKeown, Kathleen
    Roth, Dan
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: TUTORIAL ABSTRACTS, 2021, : 6 - 14
  • [9] Natural language processing: a prolog perspective
    Bitter, Christian
    Elizondo, David A.
    Yang, Yingjie
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2010, 33 (1-2) : 151 - 173
  • [10] Natural language processing: a prolog perspective
    Christian Bitter
    David A. Elizondo
    Yingjie Yang
    [J]. Artificial Intelligence Review, 2010, 33 : 151 - 173