DeepCarotene - Job Title Classification with Multi-stream Convolutional Neural Network

Authors
Wang, Jingya [1, 2]
Abdelfatah, Kareem [1, 3]
Korayem, Mohammed [1]
Balaji, Janani [1]
Affiliations
[1] CareerBuilder, Norcross, GA USA
[2] Indiana Univ Bloomington, Bloomington, IN USA
[3] Univ South Carolina, Columbia, SC 29208 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In online recruitment, job title classification is a fundamental task that enables several downstream applications, such as job recommendation and ranking for job search. As a special case of multi-class text classification, the job title classification problem takes two components of a job posting as input, a short job title and a lengthier job description, and normalizes the raw job title to its closest match in a given taxonomy. Typically, the job title, though shorter, contains more targeted signals than the job description, which can contain additional information irrelevant to the context. On the other hand, the job description often provides valuable information that helps steer the classification model toward the best match. Balancing the two components is not a trivial task. In this paper, we propose a multi-stream CNN-based model for job title classification that learns semantic features at both the character and word levels. We collected about 15 million data points from one of the largest online job boards, CareerBuilder, to train the model. Because obtaining massive labeled data is a universal problem, we adopt a weakly supervised method to efficiently generate noisy labels for this large data set. Compared with current state-of-the-art job title classification systems, the proposed model, DeepCarotene, shows a significant improvement in performance. This model provides a new direction for CNN-based end-to-end approaches to job title classification.
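The abstract does not say how the input streams are built, so the following is only a minimal sketch of one way a job posting could be turned into separate character-level and word-level index sequences before they reach parallel CNN streams. The choice of three streams, the function names (`build_vocab`, `encode`, `make_streams`), the sequence lengths, and the lowercasing and whitespace tokenization are all assumptions made for illustration, not details taken from the paper.

```python
from typing import Dict, List

PAD, UNK = 0, 1  # reserved indices: padding and unknown symbol (assumed convention)


def build_vocab(symbols: List[str]) -> Dict[str, int]:
    """Map each distinct symbol to an index, starting after PAD and UNK."""
    vocab: Dict[str, int] = {}
    for symbol in symbols:
        if symbol not in vocab:
            vocab[symbol] = len(vocab) + 2
    return vocab


def encode(symbols: List[str], vocab: Dict[str, int], max_len: int) -> List[int]:
    """Index a symbol sequence, truncate it to max_len, and right-pad with PAD."""
    ids = [vocab.get(symbol, UNK) for symbol in symbols[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


def make_streams(
    title: str,
    description: str,
    char_vocab: Dict[str, int],
    word_vocab: Dict[str, int],
    char_len: int = 12,
    word_len: int = 6,
) -> Dict[str, List[int]]:
    """Build one fixed-length index sequence per (hypothetical) input stream."""
    title = title.lower()
    description = description.lower()
    return {
        # Character-level view of the short job title.
        "title_chars": encode(list(title), char_vocab, char_len),
        # Word-level view of the job title.
        "title_words": encode(title.split(), word_vocab, word_len),
        # Word-level view of the longer job description.
        "description_words": encode(description.split(), word_vocab, word_len),
    }


# Toy vocabularies built from a single made-up posting.
char_vocab = build_vocab(list("sr java dev"))
word_vocab = build_vocab("sr java dev builds backend services".split())

streams = make_streams(
    "Sr Java Dev", "Builds backend services in Go", char_vocab, word_vocab
)
for name, ids in streams.items():
    print(name, ids)
```

In a setup like this, each sequence would feed its own embedding and convolution stack, and the pooled features would be combined before the final classification layer; how DeepCarotene actually merges its streams is not described in this record.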
Pages: 1953-1961
Number of pages: 9