A Two-Step Resume Information Extraction Algorithm

Cited by: 14
Authors
Chen, Jie [1]
Zhang, Chunxia [2]
Niu, Zhendong [1, 3, 4]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Software, Beijing 100081, Peoples R China
[3] Beijing Inst Technol, Beijing Engn Res Ctr Mass Language Informat Proc, Beijing 100081, Peoples R China
[4] Univ Pittsburgh, Sch Comp & Informat, Pittsburgh, PA 15260 USA
Funding
National Natural Science Foundation of China
DOI
10.1155/2018/5761287
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
With the rapid growth of Internet-based recruiting, recruiting systems now hold a great number of personal resumes. To gain more attention from recruiters, most resumes are written in diverse formats, with varying font sizes, font colours, and table cells. However, this diversity of format hinders data mining tasks such as resume information extraction, automatic job matching, and candidate ranking. Supervised methods and rule-based methods have been proposed to extract facts from resumes, but they rely strongly on hierarchical structure information and large amounts of labelled data, which are hard to collect in practice. In this paper, we propose a two-step resume information extraction approach. In the first step, the raw text of a resume is segmented into different resume blocks. To achieve this, we design a novel feature, Writing Style, to model sentence syntax information. Besides word index and punctuation index, Writing Style includes word lexical attributes and the prediction results of classifiers. In the second step, multiple classifiers are employed to identify different attributes of fact information in resumes. Experimental results on a real-world dataset show that the algorithm is feasible and effective.
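The two-step structure described in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation: the abstract does not specify the block types, the exact definition of Writing Style, or the classifiers used. Here, the heading keywords in BLOCK_HEADINGS, the coarse token-type signature in writing_style, and the date-range rule in extract_facts are all hypothetical stand-ins chosen only to show how block identification (step one) can feed attribute extraction (step two).

```python
import re
from typing import Dict, List

# Hypothetical heading keywords; the abstract does not list the block types.
BLOCK_HEADINGS = {
    "education": "education",
    "work experience": "work",
    "experience": "work",
}

# Assumed date-range pattern such as "2010 - 2014"; not taken from the paper.
DATE_RANGE = re.compile(r"(\d{4})\s*-\s*(\d{4})")


def writing_style(line: str) -> str:
    """Toy stand-in for the Writing Style feature: a token-type signature.

    Words become "W", numbers become "N", punctuation is kept as-is.
    """
    tokens = re.findall(r"[A-Za-z]+|\d+|[^\sA-Za-z\d]", line)
    parts = []
    for tok in tokens:
        if tok.isdigit():
            parts.append("N")
        elif tok.isalpha():
            parts.append("W")
        else:
            parts.append(tok)
    return " ".join(parts)


def split_blocks(lines: List[str]) -> Dict[str, List[str]]:
    """Step one: assign each non-empty line to a resume block."""
    blocks: Dict[str, List[str]] = {}
    current = "basic"  # lines before the first heading go here
    for line in lines:
        text = line.strip()
        if not text:
            continue
        label = BLOCK_HEADINGS.get(text.lower().rstrip(":"))
        if label is not None:
            current = label  # a heading line switches the active block
            continue
        blocks.setdefault(current, []).append(text)
    return blocks


def extract_facts(blocks: Dict[str, List[str]]) -> List[Dict[str, str]]:
    """Step two: a rule standing in for the per-attribute classifiers."""
    facts: List[Dict[str, str]] = []
    for label, block_lines in blocks.items():
        for text in block_lines:
            match = DATE_RANGE.search(text)
            # Only lines whose signature starts like "N - N" are treated as dated facts.
            if match and writing_style(text).startswith("N - N"):
                facts.append({
                    "block": label,
                    "start": match.group(1),
                    "end": match.group(2),
                    "detail": text[match.end():].strip(" ,"),
                })
    return facts


resume = [
    "Jane Doe",
    "Education",
    "2010 - 2014, Example University",
    "Work Experience:",
    "2014-2017, Software Engineer",
]
blocks = split_blocks(resume)
facts = extract_facts(blocks)
```

In this sketch, lines with the same signature (for example "N - N , W W") are handled by the same rule regardless of which block they appear in, which is the role a learned classifier over richer features would play in a real system.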
Pages: 8