Systematic Literature Review of Information Extraction From Textual Data: Recent Methods, Applications, Trends, and Challenges

被引:6
|
作者
Abdullah, Mohd Hafizul Afifi [1 ]
Aziz, Norshakirah [1 ]
Abdulkadir, Said Jadid [1 ]
Alhussian, Hitham Seddig Alhassan [1 ]
Talpur, Noureen [1 ]
机构
[1] Univ Teknol PETRONAS, Ctr Res Data Sci CeRDaS, Comp Informat Sci Dept, Seri Iskandar 32610, Malaysia
关键词
Data mining; Hidden Markov models; Analytical models; Systematics; Market research; Task analysis; Feature extraction; Information extraction; text extraction; named entity; named entity recognition; relation extraction; event extraction; deep learning; NAMED-ENTITY RECOGNITION; CHINESE RELATION EXTRACTION; NEURAL-NETWORKS; MODEL; CONSTRUCTION; FRAMEWORK; DOCUMENTS; SUPPORT;
D O I
10.1109/ACCESS.2023.3240898
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information extraction (IE) is a challenging task, particularly when dealing with highly heterogeneous data. State-of-the-art data mining technologies struggle to process information from textual data. Therefore, various IE techniques have been developed to enable the use of IE for textual data. However, each technique differs from one another because it is designed for different data types and has different target information to be extracted. This study investigated and described the most contemporary methods for extracting information from textual data, emphasizing their benefits and shortcomings. To provide a holistic view of the domain, this comprehensive systematic literature review employed a systematic mapping process to summarize studies published in the last six years (from 2017 to 2022). It covers fundamental concepts, recent approaches, applications, and trends, in addition to challenges and future research prospects in this domain area. Based on an analysis of 161 selected studies, we found that the state-of-the-art models employ deep learning to extract information from textual data. Finally, this study aimed to guide novice and experienced researchers in future research and serve as a foundation for this research area.
引用
收藏
页码:10535 / 10562
页数:28
相关论文
共 50 条
  • [1] Sentiment analysis methods, applications, and challenges: A systematic literature review
    Mao, Yanying
    Liu, Qun
    Zhang, Yu
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (04)
  • [2] Technical Trends and Challenges in Mobile Health A Systematic Review of Recent Available Literature
    Callegari, Daniel Antonio
    Jersak, Luis Carlos
    da Costa, Adriana Cassia
    [J]. ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 2, 2013, : 519 - 525
  • [3] Challenges and Advances in Information Extraction from Scientific Literature: a Review
    Hong, Zhi
    Ward, Logan
    Chard, Kyle
    Blaiszik, Ben
    Foster, Ian
    [J]. JOM, 2021, 73 (11) : 3383 - 3400
  • [4] Challenges and Advances in Information Extraction from Scientific Literature: a Review
    Zhi Hong
    Logan Ward
    Kyle Chard
    Ben Blaiszik
    Ian Foster
    [J]. JOM, 2021, 73 : 3383 - 3400
  • [5] A systematic literature review on recent trends of machine learning applications in additive manufacturing
    Xames, Md Doulotuzzaman
    Torsha, Fariha Kabir
    Sarwar, Ferdous
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2023, 34 (06) : 2529 - 2555
  • [6] A systematic literature review on recent trends of machine learning applications in additive manufacturing
    Md Doulotuzzaman Xames
    Fariha Kabir Torsha
    Ferdous Sarwar
    [J]. Journal of Intelligent Manufacturing, 2023, 34 : 2529 - 2555
  • [7] Clinical information extraction applications: A literature review
    Wang, Yanshan
    Wang, Liwei
    Rastegar-Mojarad, Majid
    Moon, Sungrim
    Shen, Feichen
    Afzal, Naveed
    Liu, Sijia
    Zeng, Yuqun
    Mehrabi, Saeed
    Sohn, Sunghwan
    Liu, Hongfang
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 : 34 - 49
  • [8] Recent Trends in Sedentary Time: A Systematic Literature Review
    Fang, Hui
    Jing, Yuan
    Chen, Jie
    Wu, Yanqi
    Wan, Yuehua
    [J]. HEALTHCARE, 2021, 9 (08)
  • [9] Trends and challenges in operations strategy research: Findings from a systematic literature review
    Vivares, Jorge A.
    Avella, Lucia
    Sarache, William
    [J]. CUADERNOS DE GESTION, 2022, 22 (02): : 81 - 96
  • [10] Information Extraction from the Text Data on Traditional Chinese Medicine: A Review on Tasks, Challenges, and Methods from 2010 to 2021
    Zhang, Tingting
    Huang, Zonghai
    Wang, Yaqiang
    Wen, Chuanbiao
    Peng, Yangzhi
    Ye, Ying
    [J]. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE, 2022, 2022