A self-attention hybrid emoji prediction model for code-mixed language: (Hinglish)

Cited by: 0
Authors
Gadde Satya Sai Naga Himabindu
Rajat Rao
Divyashikha Sethia
Affiliations
[1] Delhi Technological University, Department of Computer Engineering
Keywords
Emoji prediction; Hinglish; Code mixed; Deep learning; Hybrid model
DOI
Not available
Abstract
Emojis are an essential tool for communication, and emoji prediction systems exist for various resource-rich languages such as English. However, there is limited research on emoji prediction for resource-poor and code-mixed languages such as Hinglish (Hindi + English), the fourth most used code-mixed language globally. This paper proposes a novel Hinglish Emoji Prediction (HEP) dataset, created using Twitter as a corpus, and a hybrid emoji prediction model, BiLSTM attention random forest (BARF), for the code-mixed Hinglish language. The proposed BARF model combines deep learning features with machine learning classification. It begins with a BiLSTM to capture context, then applies self-attention to extract the significant parts of the text. Finally, it uses a random forest to classify the resulting features and predict an emoji. The self-attention mechanism aids learning because Hinglish, as a code-mixed language, lacks proper grammatical rules. The combination of deep learning, machine learning algorithms, and attention is novel to emoji prediction for the code-mixed Hinglish language. Results on the HEP dataset indicate that the BARF model outperformed previous multilingual and baseline emoji prediction models, achieving an accuracy of 61.14%, precision of 0.66, recall of 0.59, and F1 score of 0.59.
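The middle stage of the pipeline described in the abstract (attention over BiLSTM outputs, producing a feature vector for a downstream classifier) can be illustrated with a minimal sketch. This is a simplified dot-product attention-pooling step in plain Python, not necessarily the attention form used in the paper. The names `attention_pool` and `query`, and the toy hidden states, are hypothetical and chosen only for illustration. In a real system the hidden states would come from a trained BiLSTM and the pooled vector would be passed to a random forest.

```python
import math
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(
    hidden_states: Sequence[Sequence[float]],
    query: Sequence[float],
) -> Tuple[List[float], List[float]]:
    """Pool per-token hidden states into one feature vector.

    Each token is scored by its dot product with `query` (a hypothetical
    learned vector), the scores are normalised with softmax, and the
    hidden states are averaged using those weights.
    """
    scores = [sum(h_d * q_d for h_d, q_d in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(hidden_states[0])
    pooled = [
        sum(w * h[d] for w, h in zip(weights, hidden_states)) for d in range(dim)
    ]
    return weights, pooled


# Toy stand-ins for BiLSTM outputs of a three-token tweet (assumed values).
hidden_states = [
    [0.1, 0.0, 0.2, 0.1],
    [0.9, 0.8, 0.7, 0.9],
    [0.0, 0.1, 0.0, 0.2],
]
query = [1.0, 1.0, 1.0, 1.0]  # assumed; would be learned during training

weights, features = attention_pool(hidden_states, query)
# `features` is the fixed-length vector a classifier such as a random
# forest would consume; `weights` shows which tokens dominated it.
```

Because the weights sum to one, the pooled vector has the same length regardless of tweet length, which is what allows a fixed-input classifier to follow the attention stage.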
Related papers
50 in total
  • [21] HARSAM: A Hybrid Model for Recommendation Supported by Self-Attention Mechanism
    Peng, Dunlu
    Yuan, Weiwei
    Liu, Cong
    IEEE ACCESS, 2019, 7 : 12620 - 12629
  • [22] Language Detection in Sinhala-English Code-mixed Data
    Smith, Ian
    Thayasivam, Uthayasanker
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 228 - 233
  • [23] A self-attention model for viewport prediction based on distance constraint
    Lan, ChengDong
    Qiu, Xu
    Miao, Chenqi
    Zheng, MengTing
    VISUAL COMPUTER, 2024, 40 (09): : 5997 - 6014
  • [24] Self-Attention ConvLSTM for Spatiotemporal Prediction
    Lin, Zhihui
    Li, Maomao
    Zheng, Zhuobin
    Cheng, Yangyang
    Yuan, Chun
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11531 - 11538
  • [25] HiFun: homology independent protein function prediction by a novel protein-language self-attention model
    Wu, Jun
    Qing, Haipeng
    Ouyang, Jian
    Zhou, Jiajia
    Gao, Zihao
    Mason, Christopher E.
    Liu, Zhichao
    Shi, Tieliu
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [26] Session interest model for CTR prediction based on self-attention mechanism
    Wang, Qianqian
    Liu, Fang'ai
    Zhao, Xiaohui
    Tan, Qiaoqiao
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [27] Multicolumn Self-Attention GRU Model for Intersection Vehicle Trajectory Prediction
    Liu, Yue
    Liang, Guohua
    Chen, Yixin
    Yang, Xiaoyao
    Chen, Ziyu
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2024, 150 (12)
  • [28] Tweet Emoji Prediction Using Hierarchical Model with Attention
    Wu, Chuhan
    Wu, Fangzhao
    Wu, Sixing
    Huang, Yongfeng
    Xie, Xing
    PROCEEDINGS OF THE 2018 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC'18 ADJUNCT), 2018, : 1337 - 1344
  • [29] Solar irradiance prediction based on self-attention recursive model network
    Kang, Ting
    Wang, Huaizhi
    Wu, Ting
    Peng, Jianchun
    Jiang, Hui
    Frontiers in Energy Research, 2022, 10
  • [30] Analysis of Part of Speech Tags in Language Identification of Code-Mixed Text
    Ansari, Mohd Zeeshan
    Khan, Shazia
    Amani, Tamsil
    Hamid, Aman
    Rizvi, Syed
    ADVANCES IN COMPUTING AND INTELLIGENT SYSTEMS, ICACM 2019, 2020, : 417 - 425