A general framework for subjective information extraction from unstructured English text

被引:10
|
作者
Mangassarian, Hratch [1 ]
Artail, Hassan [1 ]
机构
[1] Amer Univ Beirut, Dept Elect & Comp Engn, Beirut, Lebanon
关键词
information extraction; natural language processing; text evaluation; intelligent systems; financial analysis;
D O I
10.1016/j.datak.2006.10.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present an information extraction (IE) strategy for handling subjective information from unstructured text. The presented methodology is general: it can be useful in many real-life applications that could potentially benefit from an automatic IE system that makes human-like decisions. We test our methodology in the sphere of company news evaluation with respect to the potential effect of the news on the company's stock prices. The described general framework comprises four sequential processing steps: part-of-speech tagging, syntactic parsing, relation generation, and criteria evaluation. The first two steps perform generic NLP tasks, while the last two phases are application-specific and require a thorough understanding of the application domain. We describe each stage and illustrate the flow of the modus operandi. We keep up with the company news evaluation example throughout the paper. Due to the inherent subjectivity of the envisaged problem, results cannot be categorically justified. However, comparing the system's evaluation of company news to our own, the results were very encouraging. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:352 / 367
页数:16
相关论文
共 50 条
  • [1] Extraction of Formal Manufacturing Rules from Unstructured English Text
    Kang, SungKu
    Patil, Lalit
    Rangarajan, Arvind
    Moitra, Abha
    Jia, Tao
    Robinson, Dean
    Ameri, Farhad
    Dutta, Debasish
    [J]. COMPUTER-AIDED DESIGN, 2021, 134
  • [2] A System for Medical Information Extraction and Verification from Unstructured Text
    Juric, Damir
    Stoilos, Giorgos
    Melo, Andre
    Moore, Jonathan
    Khodadadi, Mohammad
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13314 - 13319
  • [3] EXTRACTION OF MANUFACTURING RULES FROM UNSTRUCTURED TEXT USING A SEMANTIC FRAMEWORK
    Kang, SungKu
    Patil, Lalit
    Rangarajan, Arvind
    Moitra, Abha
    Jia, Tao
    Robinson, Dean
    Dutta, Debasish
    [J]. INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2015, VOL 1B, 2016,
  • [4] A Framework for Relationship Extraction from Unstructured Text via Link Grammar Parsing
    Samuel, Kenneth
    Savas, Onur
    Manikonda, Vikram
    [J]. NEXT-GENERATION ANALYST VI, 2018, 10653
  • [5] CyNER: Information Extraction from Unstructured Text of CTI Sources with Noncontextual IOCs
    Fujii, Shota
    Kawaguchi, Nobutaka
    Shigemoto, Tomohiro
    Yamauchi, Toshihiro
    [J]. ADVANCES IN INFORMATION AND COMPUTER SECURITY, IWSEC 2022, 2022, 13504 : 85 - 104
  • [6] Event Extraction from Unstructured Amharic Text
    Tadesse, Ephrem
    Aga, Rosa Tsegaye
    Qaqqabaa, Kuulaa
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2103 - 2109
  • [7] GeoXTag: Relative Spatial Information Extraction and Tagging of Unstructured Text
    Syed, Mehtab Alam
    Arsevska, Elena
    Roche, Mathieu
    Teisseire, Maguelonne
    [J]. 25TH AGILE CONFERENCE ON GEOGRAPHIC INFORMATION SCIENCE ARTIFICIAL INTELLIGENCE IN THE SERVICE OF GEOSPATIAL TECHNOLOGIES, 2022, 3
  • [8] An Application of Intuitionistic Fuzzy Sets to Improve Information Extraction from Thai Unstructured Text
    Intarapaiboon, Peerasak
    Theeramunkong, Thanaruk
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (09): : 2334 - 2345
  • [9] Extraction of protein interaction information from unstructured text using a link grammar parser
    Seoud, Rania A. Abul
    Youssef, Abou-Bakr M.
    Kadah, Yasser M.
    [J]. 2007 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS: ICCES '07, 2007, : 70 - +
  • [10] A distributed event extraction framework for large-scale unstructured text
    Kan, Zhigang
    Mi, Haibo
    Yang, Sen
    Qiao, Linbo
    Feng, Dawei
    Li, Dongsheng
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING (JCC 2020), 2020, : 102 - 108