Extracting Methodology Components from AI Research Papers: A Data-driven Factored Sequence Labeling Approach

被引:1
|
作者
Ghosh, Madhusudan [1 ]
Ganguly, Debasis [2 ]
Basuchowdhuri, Partha [1 ]
Naskar, Sudip Kumar [3 ]
机构
[1] Indian Assoc Cultivat Sci, Kolkata, India
[2] Univ Glasgow, Glasgow, Scotland
[3] Jadavpur Univ, Kolkata, India
关键词
Information Extraction; Factored Model; Clustering; Scientific Literature;
D O I
10.1145/3583780.3615258
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extraction of methodology component names from scientific articles is a challenging task due to the diversified contexts around the occurrences of these entities, and the different levels of granularity and containment relationships exhibited by these entities. We hypothesize that standard sequence labeling approaches may not adequately model the dependence of methodology name mentions with their contexts, due to the problems of their large, fast evolving, and domain-specific vocabulary. As a solution, we propose a factored approach, where the mention-context dependencies are represented in a more fine-grained manner, thus allowing the model parameters to better adjust to the different characteristic patterns inherent within the data. In particular, we experiment with two variants of this factored approach - one that uses the per-entity category information derived from an ontology, and the other that makes use of the topology of the sentence embedding space to infer a category for each entity constituting that sentence. We demonstrate that both these factored variants of SciBERT outperform their non-factored counterpart, a state-of-the-art model for scientific concept extraction.
引用
收藏
页码:3897 / 3901
页数:5
相关论文
共 50 条
  • [11] EXPLAINING DATA-DRIVEN DECISIONS MADE BY AI SYSTEMS: THE COUNTERFACTUAL APPROACH
    Fernandez-Loria, Carlos
    Provost, Foster
    Han, Xintian
    MIS QUARTERLY, 2022, 46 (03) : 1635 - 1660
  • [12] Measuring for Knowledge: A Data-Driven Research Approach for eGovernment
    Verdegem, Pieter
    Stragier, Jeroen
    Verleye, Gino
    PROCEEDINGS OF THE 10TH EUROPEAN CONFERENCE ON EGOVERNMENT, 2010, : 417 - 424
  • [13] Extracting Prominent Aspects of Online Customer Reviews: A Data-Driven Approach to Big Data Analytics
    Ali, Noaman M.
    Alshahrani, Abdullah
    Alghamdi, Ahmed M.
    Novikov, Boris
    ELECTRONICS, 2022, 11 (13)
  • [14] From bottleneck to enabler: a new approach to regulating data-driven medical research
    Stamhuis, Evert
    Evangelista, Laura
    van der Voort, Sebastian
    Mele, Anastasia M.
    Spiller, Elisa
    Demir, Esra
    Pin, Andrea
    Heine, Klaus
    CLINICAL AND TRANSLATIONAL IMAGING, 2023, 11 (04) : 311 - 313
  • [15] From bottleneck to enabler: a new approach to regulating data-driven medical research
    Evert Stamhuis
    Laura Evangelista
    Sebastian van der Voort
    Anastasia M. Mele
    Elisa Spiller
    Esra Demir
    Andrea Pin
    Klaus Heine
    Clinical and Translational Imaging, 2023, 11 : 311 - 313
  • [16] Data-driven methodology for signal timing plan development: A computational approach
    Smith, BL
    Scherer, WT
    Hauser, TA
    Park, BB
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2002, 17 (06) : 387 - 395
  • [17] Exploring Data-Driven Components of Socially Intelligent AI through Cooperative Game Paradigms
    Bennett, Casey
    Weiss, Benjamin
    Suh, Jaeyoung
    Yoon, Eunseo
    Jeong, Jihong
    Chae, Yejin
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2022, 6 (02)
  • [18] Research on a Data-Driven Modeling Method for Precast Concrete Balcony Components
    Cai, Jie
    Wang, Xin
    Shi, Junfeng
    Xie, Xingxing
    Feng, Yu
    Wu, Yingjun
    BUILDINGS, 2024, 14 (01)
  • [19] From IMU Measurement Sequence to Velocity Estimate Sequence: An Effective and Efficient Data-Driven Inertial Odometry Approach
    Wang, Yingying
    Cheng, Hu
    Zhang, Ang
    Meng, Max Q. -H.
    IEEE SENSORS JOURNAL, 2023, 23 (15) : 17117 - 17126
  • [20] Failure prognosis of the components with unlike degradation trends: A data-driven approach
    Dash, Balyogi Mohan
    Prakash, Om
    Samantaray, Arun Kumar
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART O-JOURNAL OF RISK AND RELIABILITY, 2023, 237 (06) : 1132 - 1149