A NEW INPUT REPRESENTATION FOR MULTI-LABEL TEXT CLASSIFICATION

被引:0
|
作者
Alfaro, Rodrigo [1 ]
Allende, Hector [1 ]
机构
[1] Univ Tacn Federico Santa Maria, Dept Informat, Valparaiso, Chile
关键词
Multi-label; Text classification; Text representation; Machine learning;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Automatic text classification is the task of assigning unseen documents to a predefined set of classes or categories. Text Representation for classification has been traditionally approached with tf.idf due to its simplicity and good performance. Multi-label automatic text classification has been traditionally tackled in the literature either by transforming the problem to apply binary techniques or by adapting binary algorithms to work with multiple labels. We present tf.rfl, a novel text representation for the multi-label classification approach. Our proposal focuses on modifying the data set input to the algorithm, differentiating the input by the label to evaluate. Performance of tf.rfl was tested with a known benchmark and compared to alternative techniques. The results show improvement compared to alternative approaches in terms of Hamming Loss.
引用
收藏
页码:207 / 210
页数:4
相关论文
共 50 条
  • [31] Academic Resource Text Hierarchical Multi-Label Classification
    Wang, Yue
    Li, Yawen
    Li, Ang
    [J]. Computer Engineering and Applications, 2023, 59 (13): : 92 - 98
  • [32] A novel reasoning mechanism for multi-label text classification
    Wang, Ran
    Ridley, Robert
    Su, Xi'ao
    Qu, Weiguang
    Dai, Xinyu
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (02)
  • [33] Multi-label legal text classification with BiLSTM and attention
    Enamoto, Liriam
    Santos, Andre R. A. S.
    Maia, Ricardo
    Weigang, Li
    Rocha Filho, Geraldo P.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2022, 68 (04) : 369 - 378
  • [34] Multi-label Classification of Cybersecurity Text with Distant Supervision
    Ishii, Masahiro
    Mori, Kento
    Kuwana, Ryoichi
    Matsuura, Satoshi
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, ARES 2022, 2022,
  • [35] Multi-label Text Classification with Deep Neural Networks
    Chen, Yun
    Xiao, Bo
    Lin, Zhiqing
    Dai, Cheng
    Li, Zuochao
    Yang, Liping
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 409 - 413
  • [36] Multi-label Text Classification for Public Procurement in Spanish
    Navas-Loro, Maria
    Garijo, Daniel
    Corcho, Oscar
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2022, (69): : 73 - 82
  • [37] Review and Prospect of Multi-Label Text Classification Research
    Zhang, Wenfeng
    Xi, Xuefeng
    Cui, Zhiming
    Zou, Yichen
    Luan, Jinquan
    [J]. Computer Engineering and Applications, 2023, 59 (18) : 28 - 48
  • [38] Contrastive Enhanced Learning for Multi-Label Text Classification
    Wu, Tianxiang
    Yang, Shuqun
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [39] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [40] On the Value of Head Labels in Multi-Label Text Classification
    Wang, Haobo
    Peng, Cheng
    Dong, Hede
    Feng, Lei
    Liu, Weiwei
    Hu, Tianlei
    Chen, Ke
    Chen, Gang
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (05)