Labeled Bilingual Topic Model for Cross-Lingual Text Classification and Label Recommendation

被引:1
|
作者
Tian, Ming-Jie [1 ]
Huang, Zheng-Hao [1 ]
Cui, Rong-Yi [1 ]
机构
[1] Yanbian Univ, Intelligent Informat Proc Lab, Yanji, Jilin, Peoples R China
关键词
topic model; label; cross-lingual text classification; label recommendation; latent topic;
D O I
10.1109/ICISCE.2018.00067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aiming at the increasingly rich multi language information resources and multi-label data in news reports and scientific literatures, in order to mining the relevance between languages and the correlation between data, this paper proposed labeled bilingual topic model, applied on cross-lingual text classification and label recommendation. First of all, it could assume that the keywords in the scientific literature are relevant to the abstract in same article, then extracted the keywords and regarded it as labels, and aligned the labels with topics in topic model, instantiated the "latent" topic. Secondly, trained the abstracts in article through the topic model proposed by this paper. Finally, classified the new documents by cross-lingual text classifier, also recommended the labels. The experiment result show that Micro-F1 measure reaches 94.81% in cross-lingual text classification task, and the recommended labels also reflects the sematic relevance with documents.
引用
收藏
页码:285 / 289
页数:5
相关论文
共 50 条
  • [41] Ontology-supported text classification based on cross-lingual word sense disambiguation
    Tufis, Dan
    Koeva, Svetla
    [J]. APPLICATIONS OF FUZZY SETS THEORY, 2007, 4578 : 447 - +
  • [42] A Comparative Study of Cross-Lingual Sentiment Classification
    Wan, Xiaojun
    [J]. 2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 24 - 31
  • [43] Generalized Funnelling: Ensemble Learning and Heterogeneous Document Embeddings for Cross-Lingual Text Classification
    Moreo, Alejandro
    Pedrotti, Andrea
    Sebastiani, Fabrizio
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (02)
  • [44] Cross-lingual sentiment classification with stacked autoencoders
    Guangyou Zhou
    Zhiyuan Zhu
    Tingting He
    Xiaohua Tony Hu
    [J]. Knowledge and Information Systems, 2016, 47 : 27 - 44
  • [45] A cross-lingual video classification using subtitles
    El Kah, Anoual
    Zeroual, Imad
    [J]. 2022 2ND INTERNATIONAL CONFERENCE ON INNOVATIVE RESEARCH IN APPLIED SCIENCE, ENGINEERING AND TECHNOLOGY (IRASET'2022), 2022, : 703 - 707
  • [46] Czech Dataset for Cross-lingual Subjectivity Classification
    Priban, Pavel
    Steinberger, Josef
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1381 - 1391
  • [47] Cross-lingual sentiment classification with stacked autoencoders
    Zhou, Guangyou
    Zhu, Zhiyuan
    He, Tingting
    Hu, Xiaohua Tony
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 47 (01) : 27 - 44
  • [48] CLTC: A Chinese-English Cross-lingual Topic Corpus
    Xia, Yunqing
    Tang, Guoyu
    Jin, Peng
    Yang, Xia
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 532 - 537
  • [49] Active Learning for Cross-Lingual Sentiment Classification
    Li, Shoushan
    Wang, Rong
    Liu, Huanhuan
    Huang, Chu-Ren
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 236 - 246
  • [50] Microblogging Reviews Based Cross-Lingual Sentimental Classification for Cold-Start Product Recommendation
    Kumbhar, Nilesh
    Belerao, Krushnadeo
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,