ARElight: Context Sampling of Large Texts for Deep Learning Relation Extraction

Authors
Rusnachenko, Nicolay [1]
Liang, Huizhi [1]
Kalameyets, Maksim [1]
Shi, Lei [1]
Affiliations
[1] Newcastle Univ, Sch Comp, Newcastle Upon Tyne, Tyne & Wear, England
Funding
UK Research and Innovation (UKRI)
Keywords
Data Processing Pipeline; Information Retrieval; Visualisation
DOI
10.1007/978-3-031-56069-9_23
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The escalating volume of textual data necessitates adept and scalable Information Extraction (IE) systems in the field of Natural Language Processing (NLP) to analyse massive text collections in detail. While most deep learning systems are designed to handle textual information as it is, the interface between a document and the annotation of its parts remains poorly covered. In addition, one of the major limitations of most deep learning models is a constrained input size, caused by architectural and computational specifics. To address this, we introduce ARElight (https://github.com/nicolay-r/ARElight), a system designed to efficiently manage and extract information from sequences of large documents by dividing them into segments with mentioned object pairs. Through a pipeline comprising modules for text sampling, inference, optional graph operations, and visualisation, the proposed system transforms large volumes of text in a structured manner. Practical applications of ARElight are demonstrated across diverse use cases, including literature processing and social network analysis.
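The idea of dividing a large document into segments with mentioned object pairs can be illustrated with a minimal sketch. This is not ARElight's API: the function name `sample_contexts`, the naive sentence splitter, the substring-based entity matching, and the `max_chars` limit are all assumptions made for illustration only.

```python
import re
from itertools import combinations


def sample_contexts(text, entities, max_chars=200):
    """Yield (segment, (entity_a, entity_b)) for each entity pair
    that is co-mentioned within one segment of the text.

    Hypothetical helper: a real system would use a proper sentence
    splitter and a named-entity recogniser instead of substring matching.
    """
    # Assumption: sentences end with '.', '!' or '?' followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    for sentence in sentences:
        # Truncate each segment to mimic a model's constrained input size.
        segment = sentence[:max_chars]
        # Keep entities in the order given by the caller.
        mentioned = [e for e in entities if e in segment]
        # Every unordered pair of co-mentioned entities is one sample.
        for pair in combinations(mentioned, 2):
            yield segment, pair


if __name__ == "__main__":
    doc = "Alice met Bob in Paris. The weather was fine. Bob later called Carol."
    for segment, pair in sample_contexts(doc, ["Alice", "Bob", "Carol", "Paris"]):
        print(pair, "<-", segment)
```

Segments without at least two mentioned entities produce no samples, so only the parts of a long document that could express a relation would be passed on to an inference step.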
Pages: 229-235
Number of pages: 7