Extracting Methodology Components from AI Research Papers: A Data-driven Factored Sequence Labeling Approach

被引:1
|
作者
Ghosh, Madhusudan [1 ]
Ganguly, Debasis [2 ]
Basuchowdhuri, Partha [1 ]
Naskar, Sudip Kumar [3 ]
机构
[1] Indian Assoc Cultivat Sci, Kolkata, India
[2] Univ Glasgow, Glasgow, Scotland
[3] Jadavpur Univ, Kolkata, India
关键词
Information Extraction; Factored Model; Clustering; Scientific Literature;
D O I
10.1145/3583780.3615258
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extraction of methodology component names from scientific articles is a challenging task due to the diversified contexts around the occurrences of these entities, and the different levels of granularity and containment relationships exhibited by these entities. We hypothesize that standard sequence labeling approaches may not adequately model the dependence of methodology name mentions with their contexts, due to the problems of their large, fast evolving, and domain-specific vocabulary. As a solution, we propose a factored approach, where the mention-context dependencies are represented in a more fine-grained manner, thus allowing the model parameters to better adjust to the different characteristic patterns inherent within the data. In particular, we experiment with two variants of this factored approach - one that uses the per-entity category information derived from an ontology, and the other that makes use of the topology of the sentence embedding space to infer a category for each entity constituting that sentence. We demonstrate that both these factored variants of SciBERT outperform their non-factored counterpart, a state-of-the-art model for scientific concept extraction.
引用
收藏
页码:3897 / 3901
页数:5
相关论文
共 50 条
  • [41] A Policy-Driven Approach to Secure Extraction of COVID-19 Data From Research Papers
    Elluri, Lavanya
    Piplai, Aritran
    Kotal, Anantaa
    Joshi, Anupam
    Joshi, Karuna Pande
    FRONTIERS IN BIG DATA, 2021, 4
  • [42] Predicting odor from vibrational spectra: a data-driven approach
    Ameta, Durgesh
    Behera, Laxmidhar
    Chakraborty, Aniruddha
    Sandhan, Tushar
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [43] Assessment of Smart Transformation in the Manufacturing Process of Aerospace Components Through a Data-Driven Approach
    Bernabei M.
    Eugeni M.
    Gaudenzi P.
    Costantino F.
    Global Journal of Flexible Systems Management, 2023, 24 (1) : 67 - 86
  • [44] Combined Data-driven and Knowledge-driven Methodology Research Advances and Its Applied Prospect in Power Systems
    Li F.
    Wang Q.
    Hu J.
    Tang Y.
    Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2021, 41 (13): : 4377 - 4389
  • [45] An innovative data-driven AI approach for detecting and isolating faults in gas turbines at power plants
    Amiri, Mohammad Hussein
    Hashjin, Nastaran Mehrabi
    Najafabadi, Maryam Khanian
    Beheshti, Amin
    Khodadadi, Nima
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
  • [46] A Radar Data-Driven AI Approach for Rainfall Nowcasting: Towards Flood Preparedness in Urban Regions
    Dandekar, Sharvil
    Limbashia, Taksha
    Parab, Om
    Kotecha, Radhika
    Chakravarty, Kaustav
    Ukarande, Suresh
    Hosalikar, Krishnanand
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2025,
  • [47] Research on Breast Lesion Localization and Diagnosis based on Knowledge-driven and Data-driven Approach
    Song, Lintao
    Li, Jianqiang
    Liu, Xiaoling
    Liu, Yiming
    Ma, Tianbao
    Bai, Jun
    Zhao, Linna
    Zhao, Qing
    Xu, Xi
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 2147 - 2152
  • [48] Data-driven approach for AI-based crack detection: techniques, challenges, and future scope
    Chakurkar, Priti S.
    Vora, Deepali
    Patil, Shruti
    Mishra, Sashikala
    Kotecha, Ketan
    FRONTIERS IN SUSTAINABLE CITIES, 2023, 5
  • [49] Characterizing parking systems from sensor data through a data-driven approach
    Arjona Martinez, Jamie
    Paz Linares, Maria
    Casanovas, Josep
    TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2021, 13 (03): : 183 - 192
  • [50] A novel fMRI group data analysis method based on data-driven reference extracting from group subjects
    Shi, Yuhu
    Zeng, Weiming
    Wang, Nizhuan
    Chen, Dongtailang
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2015, 122 (03) : 362 - 371