Skills prediction based on multi-label resume classification using CNN with model predictions explanation

Cited by: 0
Authors
Kameni Florentin Flambeau Jiechieu
Norbert Tsopze
Affiliations
[1] University of Yaounde I,Department of Computer Science
[2] UMMISCO,IRD
Source
Neural Computing and Applications | 2021 / Vol. 33
Keywords
Skill-gap; Resume; Skills extraction; Multi-label classification; Convolutional neural network; Model explanation
DOI
Not available
Abstract
Skills extraction is a critical task when creating job recommender systems. It is also useful for building skills profiles and skills knowledge bases for organizations. The aim of skills extraction is to identify the skills expressed in documents such as resumes or job postings. Several methods have been proposed to tackle this problem, and they already perform well at extracting skills that are explicitly mentioned in resumes. However, skills have different levels of abstraction: high-level skills can be determined by low-level ones. Instead of just extracting skill-related terms, we propose a multi-label classification architecture based on convolutional neural networks to predict high-level skills from resumes even if they are not explicitly mentioned in these resumes. Experiments carried out on a set of anonymous IT resumes collected from the Internet have shown the effectiveness of our method, which reaches 98.79% recall and 91.34% precision. In addition, the features (terms) detected by the convolutional filters are projected onto the input resumes to show the user the terms that contributed to the model's decision.
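The idea described in the abstract (convolutional filters over resume terms, one independent output per high-level skill, and filter activations traced back to the terms that triggered them) can be illustrated with a minimal sketch. This is not the authors' architecture: the two-dimensional embeddings, the two width-1 filters, the output weights, the label names, and the function names `conv_max_pool` and `predict_with_explanation` are all hypothetical, hand-set values chosen only to make the mechanism visible. A real model would learn these parameters from labeled resumes.

```python
import math


def conv_max_pool(embedded, filt):
    """Slide one filter over the token vectors and apply max-over-time pooling.

    Returns (max activation, start index of the window that produced it).
    Assumes the sequence is at least as long as the filter.
    """
    width = len(filt)
    best_val, best_pos = float("-inf"), 0
    for start in range(len(embedded) - width + 1):
        window = embedded[start:start + width]
        # Dot product between the filter and the window, followed by ReLU.
        act = sum(w * x for row, tok in zip(filt, window) for w, x in zip(row, tok))
        act = max(0.0, act)
        if act > best_val:
            best_val, best_pos = act, start
    return best_val, best_pos


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict_with_explanation(tokens, embeddings, filters, out_weights, out_bias,
                             threshold=0.5):
    """Return {label: (score, [contributing term spans])} for predicted labels."""
    dim = len(next(iter(embeddings.values())))
    # Unknown tokens map to a zero vector (an assumption of this sketch).
    embedded = [embeddings.get(t, [0.0] * dim) for t in tokens]
    pooled, spans = [], []
    for filt in filters:
        val, pos = conv_max_pool(embedded, filt)
        pooled.append(val)
        # Project the pooled feature back onto the input terms.
        spans.append(tokens[pos:pos + len(filt)])
    result = {}
    for label, weights in out_weights.items():
        # One sigmoid per label: labels are scored independently (multi-label).
        score = sigmoid(sum(w * p for w, p in zip(weights, pooled)) + out_bias[label])
        if score >= threshold:
            contrib = [(weights[i] * pooled[i], i) for i in range(len(filters))]
            evidence = [spans[i] for c, i in sorted(contrib, reverse=True) if c > 0]
            result[label] = (score, evidence)
    return result


# Hypothetical toy parameters, hand-set for illustration only.
embeddings = {"django": [1.0, 0.0], "postgresql": [0.0, 1.0]}
filters = [[[1.0, 0.0]], [[0.0, 1.0]]]  # two filters of width 1
out_weights = {"software_development": [2.0, 0.0],
               "database_administration": [0.0, 2.0]}
out_bias = {"software_development": -1.0, "database_administration": -1.0}

tokens = ["built", "apis", "with", "django", "and", "postgresql"]
prediction = predict_with_explanation(tokens, embeddings, filters, out_weights, out_bias)
for label, (score, evidence) in sorted(prediction.items()):
    print(label, round(score, 3), evidence)
```

In this toy run neither high-level label appears in the text, yet both are predicted from low-level terms, and each prediction is accompanied by the term whose filter activation contributed to it. Using a separate sigmoid per label, rather than a softmax over labels, is what allows several skills to be predicted for the same resume.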
Pages: 5069-5087
Number of pages: 18