Personal Information Extraction of the Teaching Staff Based on CRFs

被引:2
|
作者
Dong, Fang [1 ]
Wang, Junao [1 ]
机构
[1] Wuhan Univ, Sch Comp, Wuhan 430072, Peoples R China
关键词
Data extraction; CRFs; Personal Information;
D O I
10.1109/ICNISC.2015.124
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As the attribute information of the profile stored in a web page is usually in the form of natural language, it is very difficult to use the HTML structure to extract the target information. In this paper Conditional Random Fields is adopted to extract the personal attribute information of the personal detail in web pages. Via segmentation system the HTML document could be divided into the sequence of words, and then to establish the appropriate template of characteristics and train the sample sequences, at last using the characteristics function model generated by CRFs to mark the test sequences and identify the information which need to be extracted. The experimental results show that annotation and reasoning function of the CRFs in the text sequence can be used to extract the specific attributes information in the personal home page very well.
引用
收藏
页码:615 / 617
页数:3
相关论文
共 50 条
  • [1] An enhanced CRFs-based system for information extraction from radiology reports
    Esuli, Andrea
    Marcheggiani, Diego
    Sebastiani, Fabrizio
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (03) : 425 - 435
  • [2] Emotional Element Extraction Based on CRFs
    Wang, Yashen
    Liu, Quanchao
    Huang, Heyan
    [J]. PRACTICAL APPLICATIONS OF INTELLIGENT SYSTEMS, ISKE 2013, 2014, 279 : 507 - 517
  • [3] The CRFs-Based Chinese Open Entity Relation Extraction
    Wu, Xiaoyang
    Wu, Bin
    [J]. 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 405 - 411
  • [4] A Bank Information Extraction System Based on Named Entity Recognition with CRFs from Noisy Customer Order Texts in Turkish
    Emekligil, Erdem
    Arslan, Secil
    Agin, Onur
    [J]. KNOWLEDGE ENGINEERING AND SEMANTIC WEB, KESW 2016, 2016, 649 : 93 - 102
  • [5] A Fast Events Relationship Extraction Method Based on semi-CRFs
    Gao, Ce
    Song, Yixu
    Jia, Peifa
    [J]. 2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 3, 2009, : 217 - 220
  • [6] Sentiment Target Extraction Based on CRFs with Multi-features for Chinese Microblog
    Chen, Bingfeng
    Hao, Zhifeng
    Cai, Ruichu
    Wen, Wen
    Du, Shenzhi
    [J]. WEB TECHNOLOGIES AND APPLICATIONS: APWEB 2016 WORKSHOPS, WDMA, GAP, AND SDMA, 2016, 9865 : 29 - 41
  • [7] Prescription extraction using CRFs and word embeddings
    Tao, Carson
    Filannino, Michele
    Uzuner, Ozlem
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 72 : 60 - 66
  • [8] Personal Information Extraction from Korean Obituaries
    Han, Kyoung-Soo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (12): : 2873 - 2876
  • [9] Scholarly Document Information Extraction using Extensible Features for Efficient Higher Order Semi-CRFs
    Nguyen Viet Cuong
    Chandrasekaran, Muthu Kumar
    Kan, Min-Yen
    Lee, Wee Sun
    [J]. PROCEEDINGS OF THE 15TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL'15), 2015, : 61 - 64
  • [10] Pedestrian Route Guidance System Using Moving Information Based on Personal Feature Extraction
    Narumi, Takuji
    Hada, Yasushi
    Asama, Hajime
    Tsuji, Kunihiro
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 653 - +