Systematic Literature Review of Information Extraction From Textual Data: Recent Methods, Applications, Trends, and Challenges

被引:6
|
作者
Abdullah, Mohd Hafizul Afifi [1 ]
Aziz, Norshakirah [1 ]
Abdulkadir, Said Jadid [1 ]
Alhussian, Hitham Seddig Alhassan [1 ]
Talpur, Noureen [1 ]
机构
[1] Univ Teknol PETRONAS, Ctr Res Data Sci CeRDaS, Comp Informat Sci Dept, Seri Iskandar 32610, Malaysia
关键词
Data mining; Hidden Markov models; Analytical models; Systematics; Market research; Task analysis; Feature extraction; Information extraction; text extraction; named entity; named entity recognition; relation extraction; event extraction; deep learning; NAMED-ENTITY RECOGNITION; CHINESE RELATION EXTRACTION; NEURAL-NETWORKS; MODEL; CONSTRUCTION; FRAMEWORK; DOCUMENTS; SUPPORT;
D O I
10.1109/ACCESS.2023.3240898
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information extraction (IE) is a challenging task, particularly when dealing with highly heterogeneous data. State-of-the-art data mining technologies struggle to process information from textual data. Therefore, various IE techniques have been developed to enable the use of IE for textual data. However, each technique differs from one another because it is designed for different data types and has different target information to be extracted. This study investigated and described the most contemporary methods for extracting information from textual data, emphasizing their benefits and shortcomings. To provide a holistic view of the domain, this comprehensive systematic literature review employed a systematic mapping process to summarize studies published in the last six years (from 2017 to 2022). It covers fundamental concepts, recent approaches, applications, and trends, in addition to challenges and future research prospects in this domain area. Based on an analysis of 161 selected studies, we found that the state-of-the-art models employ deep learning to extract information from textual data. Finally, this study aimed to guide novice and experienced researchers in future research and serve as a foundation for this research area.
引用
收藏
页码:10535 / 10562
页数:28
相关论文
共 50 条
  • [31] A Systematic Literature Review of Data Mining Applications in Healthcare
    Niaksu, Olegas
    Skinulyte, Jolita
    Duhaze, Hermine Grubinger
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013 WORKSHOPS, 2014, 8182 : 313 - 324
  • [32] Fabric Waste Recycling: a Systematic Review of Methods, Applications, and Challenges
    D. G. K. Dissanayake
    D.U. Weerasinghe
    [J]. Materials Circular Economy, 2021, 3 (1):
  • [33] A Systematic Literature Review of Personality Trait Classification from Textual Content
    Ahmad, Hussain
    Asghar, Muhammad Zubair
    Khan, Alam Sher
    Habib, Anam
    [J]. OPEN COMPUTER SCIENCE, 2020, 10 (01) : 175 - 193
  • [34] NILM applications: Literature review of learning approaches, recent developments and challenges
    Angelis, Georgios-Fotios
    Timplalexis, Christos
    Krinidis, Stelios
    Ioannidis, Dimosthenis
    Tzovaras, Dimitrios
    [J]. ENERGY AND BUILDINGS, 2022, 261
  • [35] Big Data: Opportunities and Challenges in Libraries, a Systematic Literature Review
    Garoufallou, Emmanouel
    Gaitanou, Panorea
    [J]. COLLEGE & RESEARCH LIBRARIES, 2021, 82 (03): : 410 - 435
  • [36] Challenges and Issues in Unstructured Big Data: A Systematic Literature Review
    Nafis, Nur Syafiqah Mohd
    Awang, Suryanti
    [J]. ADVANCED SCIENCE LETTERS, 2018, 24 (10) : 7716 - 7722
  • [37] Trends Information Technology in E-Agriculture : A Systematic Literature Review
    Fernando, Erick
    Assegaff, Setiawan
    Rohayani, Hetty A. H.
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, COMPUTER, AND ELECTRICAL ENGINEERING (ICITACEE), 2016, : 351 - 355
  • [38] A Systematic Literature Review of Question Answering: Research Trends, Datasets, Methods
    Bakir, Dilan
    Aktas, Mehmet S.
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2022 WORKSHOPS, PT I, 2022, 13377 : 47 - 62
  • [39] A Literature Review on Methods for the Extraction of Usage Statements of Software and Data
    Krueger, Frank
    Schindler, David
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2020, 22 (01) : 26 - 38
  • [40] Power Line Extraction and Reconstruction Methods from Laser Scanning Data: A Literature Review
    Munir, Nosheen
    Awrangjeb, Mohammad
    Stantic, Bela
    [J]. REMOTE SENSING, 2023, 15 (04)