Segment-based Hidden Markov Models for Information Extraction

被引:0
|
作者
Gu, Zhenmei [1 ]
Cercone, Nick [1 ]
机构
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2I 3G1, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hidden Markov models (HMMs) are powerful statistical models that have found successful applications in Information Extraction (IE). In current approaches to applying HMMs to IE, an HMM is used to model text at the document level. This modelling might cause undesired redundancy in extraction in the sense that more than one filler is identified and extracted. We propose to use HMMs to model text at the segment level, in which the extraction process consists of two steps: a segment retrieval step followed by an extraction step. In order to retrieve extraction-relevant segments from documents, we introduce a method to use HMMs to model and retrieve segments. Our experimental results show that the resulting segment HMM IE system not only achieves near zero extraction redundancy, but also has better overall extraction performance than traditional document HMM IE systems.
引用
收藏
页码:481 / 488
页数:8
相关论文
共 50 条
  • [1] Thai syllable-based information extraction using hidden Markov models
    Narupiyakul, L
    Thomas, C
    Cercone, N
    Sirinaovakul, B
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2004, 2945 : 537 - 546
  • [2] COMBINING TEXT CLASSIFIERS AND HIDDEN MARKOV MODELS FOR INFORMATION EXTRACTION
    Barros, Flavia A.
    Silva, Eduardo F. A.
    Prudencio, Ricardo B. C.
    Filho, Valmir M.
    Nascimento, Andre C. A.
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2009, 18 (02) : 311 - 329
  • [3] Information Extraction System Based on Hidden Markov Model
    Park, Dong-Chul
    Huong, Vu Thi Lan
    Woo, Dong-Min
    Hieu, Duong Ngoc
    Ninh, Sai Thi Hien
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 1, PROCEEDINGS, 2009, 5551 : 52 - +
  • [4] KXtractor: An effective biomedical information extraction technique based on mixture hidden Markov models
    Song, M
    Song, IY
    Hu, XH
    Allen, RB
    TRANSACTIONS ON COMPUTATIONAL SYSTEMS BIOLOGY II, 2005, 3680 : 68 - 81
  • [5] Web information extraction based on a Generalized Hidden Markov Model
    Yao, Yong
    Wang, Jing
    Liu, Zhijing
    Journal of Computational Information Systems, 2007, 3 (05): : 1847 - 1854
  • [6] Medical Risk Information Extraction Based on Hidden Markov Model
    Yu, Xin
    Zhang, Ju
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 778 - 782
  • [7] Segment-based Models for Event Detection and Recounting
    Kovvuri, Rama
    Nevatia, Ram
    Snoek, Cees G. M.
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3868 - 3873
  • [8] IMPROVEMENT OF SEGMENT-BASED DEPTH ESTIMATION USING A NOVEL SEGMENT EXTRACTION
    Um, Gi-Mun
    Bang, Gun
    Cheong, Won-Sik
    Hur, Namho
    Lee, Soo In
    2010 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON 2010), 2010,
  • [9] Web object information extraction based on generalized hidden Markov model
    Wang, Jing
    Yao, Yong
    Liu, ZhiJing
    2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1520 - 1523
  • [10] Text Information Extraction based on Genetic Algorithm and Hidden Markov Model
    Li, Rong
    Zheng, Jia-heng
    Pei, Chun-qin
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL I, 2009, : 334 - +