ARElight: Context Sampling of Large Texts for Deep Learning Relation Extraction

被引:0
|
作者
Rusnachenko, Nicolay [1 ]
Liang, Huizhi [1 ]
Kalameyets, Maksim [1 ]
Shi, Lei [1 ]
机构
[1] Newcastle Univ, Sch Comp, Newcastle Upon Tyne, Tyne & Wear, England
基金
英国科研创新办公室;
关键词
Data Processing Pipeline; Information Retrieval; Visualisation;
D O I
10.1007/978-3-031-56069-9_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The escalating volume of textual data necessitates adept and scalable Information Extraction (IE) systems in the field of Natural Language Processing (NLP) to analyse massive text collections in a detailed manner. While most deep learning systems are designed to handle textual information as it is, the gap in the existence of the interface between a document and the annotation of its parts is still poorly covered. Concurrently, one of the major limitations of most deep-learning models is a constrained input size caused by architectural and computational specifics. To address this, we introduce ARElight(1), a system designed to efficiently manage and extract information from sequences of large documents by dividing them into segments with mentioned object pairs. Through a pipeline comprising modules for text sampling, inference, optional graph operations, and visualisation, the proposed system transforms large volumes of text in a structured manner. Practical applications of ARElight are demonstrated across diverse use cases, including literature processing and social network analysis.((1)https://github.com/nicolay-r/ARElight)
引用
收藏
页码:229 / 235
页数:7
相关论文
共 50 条
  • [21] Relation Extraction Between Medical Entities Using Deep Learning Approach
    Patel, Ruchi
    Tanwani, Sanjay
    Patidar, Chhaya
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (03): : 359 - 366
  • [22] A Survey on Deep Learning Techniques for Joint Named Entities and Relation Extraction
    Kambar, Mina Esmail Zadeh Nojoo
    Esmaeilzadeh, Armin
    Heidari, Maryam
    2022 IEEE WORLD AI IOT CONGRESS (AIIOT), 2022, : 218 - 224
  • [23] Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning
    Qin, Pengda
    Xu, Weiran
    Wang, William Yang
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2137 - 2147
  • [24] Relation extraction between medical entities using deep learning approach
    Patel R.
    Tanwani S.
    Patidar C.
    Informatica (Slovenia), 2021, 45 (03): : 359 - 366
  • [25] Context-aware Sampling of Large Networks via Graph Representation Learning
    Zhou, Zhiguang
    Shi, Chen
    Shen, Xilong
    Cai, Lihong
    Wang, Haoxuan
    Liu, Yuhua
    Zhao, Ying
    Chen, Wei
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 1709 - 1719
  • [26] Freepal: A Large Collection of Deep Lexico-Syntactic Patterns for Relation Extraction
    Kirschnick, Johannes
    Akbik, Alan
    Hemsen, Holmer
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2071 - 2075
  • [27] Keyword extraction from news corpus by deep learning in the context of internet of things
    Yan, Xiao
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2023, 14 (2-3) : 75 - 93
  • [28] Deep Learning Multimodal Fusion for Road Network Extraction: Context and Contour Improvement
    Filho Antonio
    Shimabukuro, Milton
    Poz, Aluir Dal
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [29] Down-Sampling of Large LiDAR Dataset in the Context of Off-Road Objects Extraction
    Blaszczak-Bak, Wioleta
    Janicka, Joanna
    Suchocki, Czeslaw
    Masiero, Andrea
    Sobieraj-Zlobinska, Anna
    GEOSCIENCES, 2020, 10 (06)
  • [30] A large-scale dataset for korean document-level relation extraction from encyclopedia texts
    Son, Suhyune
    Lim, Jungwoo
    Koo, Seonmin
    Kim, Jinsung
    Kim, Younghoon
    Lim, Youngsik
    Hyun, Dongseok
    Lim, Heuiseok
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8681 - 8701