A machine learning approach to information extraction

被引:0
|
作者
Téllez-Valero, A
Montes-y-Gómez, M
Villaseñor-Pineda, L
机构
[1] INAOE, Dept Comp Sci, Language Technol Grp, Mexico City, DF, Mexico
[2] Univ Polytecn Valencia, Dept Informat Syst & Computat, Valencia, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information extraction is concerned with applying natural language processing to automatically extract the essential details from text documents. A great disadvantage of current approaches is their intrinsic dependence to the application domain and the target language. Several machine learning techniques have been applied in order to facilitate the portability of the information extraction systems. This paper describes a general method for building an information extraction system using regular expressions along with supervised learning algorithms. In this method, the extraction decisions are lead by a set of classifiers instead of sophisticated linguistic analyses. The paper also shows a system called TOPO that allows to extract the information related with natural disasters from newspaper articles in Spanish language. Experimental results of this system indicate that the proposed method can be a practical solution for building information extraction systems reaching an F-measure as high as 72%.
引用
收藏
页码:539 / 547
页数:9
相关论文
共 50 条
  • [21] Answer extraction for definition questions using information gain and machine learning
    Martinez-Gil, Carmen
    Lopez-Lopez, A.
    [J]. ARTIFICIAL INTELLIGENCE IN THEORY AND PRACTICE II, 2008, 276 : 141 - 150
  • [22] Maritime Incident Information Extraction using Machine and Deep Learning Techniques
    Mackenzie, Andrew
    Teske, Alexander
    Abielmona, Rami
    Petriu, Emil
    [J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [23] Automatic Information Extraction from Electronic Documents Using Machine Learning
    Kamaleson, Nishanthan
    Chu, Dominique
    Otero, Fernando E. B.
    [J]. ARTIFICIAL INTELLIGENCE XXXVIII, 2021, 13101 : 183 - 194
  • [24] Learning information extraction rules: An Inductive Logic Programming approach
    Aitken, JS
    [J]. ECAI 2002: 15TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 77 : 355 - 359
  • [25] An information-theoretic approach to feature extraction in competitive learning
    Kamimura, Ryotaro
    [J]. NEUROCOMPUTING, 2009, 72 (10-12) : 2693 - 2704
  • [26] A machine learning approach for forecasting and visualising flood inundation information
    Kabir, Syed
    Patidar, Sandhya
    Pender, Gareth
    [J]. PROCEEDINGS OF THE INSTITUTION OF CIVIL ENGINEERS-WATER MANAGEMENT, 2021, 174 (01) : 27 - 41
  • [27] An information-based machine learning approach to elasticity imaging
    Cameron Hoerig
    Jamshid Ghaboussi
    Michael F. Insana
    [J]. Biomechanics and Modeling in Mechanobiology, 2017, 16 : 805 - 822
  • [28] An information-based machine learning approach to elasticity imaging
    Hoerig, Cameron
    Ghaboussi, Jamshid
    Insana, Michael F.
    [J]. BIOMECHANICS AND MODELING IN MECHANOBIOLOGY, 2017, 16 (03) : 805 - 822
  • [29] Readability of Arabic Medicine Information Leaflets: A Machine Learning Approach
    Alotaibi, Sihaam
    Alyahya, Maha
    Al-Khalifa, Hend
    Alageel, Sinaa
    Abanmy, Nora
    [J]. 4TH SYMPOSIUM ON DATA MINING APPLICATIONS (SDMA2016), 2016, 82 : 122 - 126
  • [30] Bridging Algorithmic Information Theory and Machine Learning: A new approach to kernel learning
    Hamzi, Boumediene
    Hutter, Marcus
    Owhadi, Houman
    [J]. PHYSICA D-NONLINEAR PHENOMENA, 2024, 464