Thai Sentiment Lexicon Construction

被引:0
|
作者
Intasorn, Jeerawat [1 ]
Gertphol, Sethavidh [1 ]
Sammapun, Usa [1 ]
机构
[1] Kasetsart Univ, Fac Sci, Dept Comp Sci, Bangkok, Thailand
关键词
sentiment lexicon; sentiment analysis; Thai;
D O I
10.1109/KST51265.2021.9415804
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online platforms have become increasingly integrated into daily lives of Thai citizens. Besides being used to query information, online platforms are the medium to exchange opinions and discuss various topics via posts or comments. Since the number of posts and comments can be very large, understanding which topics users are talking about and grasping how users feel about those topics can be difficult and time consuming. Researchers therefore have studied how to automatically analyze posts and comments to extract topics and their sentiments using techniques such as topic modeling and sentiment analysis. There are various approaches to analyze sentiments, and many of them utilize sentiment lexicons, a set of words annotated with sentiment, to help with the analysis. Several researches have constructed sentiment lexicons for English language. This paper aims to construct sentiment lexicons for Thai language using a dictionary-based approach in combination with a crowdsourcing approach. First, words are chosen from a Thai dictionary, mostly adjectives since they usually express sentiment. Next, a web application is developed so that users can help vote whether a particular word is positive, negative, or neutral. After a number of votes have been collected, each word is scored to determine its sentiment. To validate and ensure users are voting truthfully, users are authenticated via Facebook, and simple checks are carried out against answers of each user. This process results in a sentiment lexicons for Thai words and should help form the basis for sentiment analysis of Thai text.
引用
收藏
页码:123 / 128
页数:6
相关论文
共 50 条
  • [1] SenseTag: A Tagging Tool for Constructing Thai Sentiment Lexicon
    Trakultaweekoon, Kanokorn
    Klaithin, Supon
    [J]. 2016 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2016, : 179 - 182
  • [2] Topic-Adaptive Sentiment Lexicon Construction
    Deng, Dong
    Jing, Liping
    Yu, Jian
    Ng, Michael K.
    [J]. 2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
  • [3] Sentiment Lexicon Construction Using SentiWordNet 3.0
    Medagoda, Nishantha
    Shanmuganathan, Subana
    Whalley, Jacqueline
    [J]. 2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, : 802 - 807
  • [4] Automatic Lexicon Construction for Arabic Sentiment Analysis
    Abdulla, Nawaf
    Majdalawi, Roa'a
    Mohammed, Salwa
    Al-Ayyoub, Mahmoud
    Al-Kabi, Mohammed
    [J]. 2014 INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD), 2014, : 547 - 552
  • [5] An Approach to Cross-lingual Sentiment Lexicon Construction
    Chang, Chia-Hsuan
    Wu, Ming-Lun
    Hwang, San-Yih
    [J]. 2019 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS 2019), 2019, : 129 - 131
  • [6] Sentiment Lexicon Construction With Hierarchical Supervision Topic Model
    Deng, Dong
    Jing, Liping
    Yu, Jian
    Sun, Shaolong
    Ng, Michael K.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (04) : 704 - 718
  • [7] Automatic construction of target-specific sentiment lexicon
    Wu, Sixing
    Wu, Fangzhao
    Chang, Yue
    Wu, Chuhan
    Huang, Yongfeng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 116 : 285 - 298
  • [8] MELex: The Construction of Malay-English Sentiment Lexicon
    Mahadzir, Nurul Husna
    Omar, Mohd Faizal
    Nawi, Mohd Nasrun Mohd
    Salameh, Anas A.
    Hussin, Kasmaruddin Che
    Sohail, Abid
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 1789 - 1805
  • [9] Automatic construction of domain sentiment lexicon for semantic disambiguation
    Yanyan Wang
    Fulian Yin
    Jianbo Liu
    Marco Tosato
    [J]. Multimedia Tools and Applications, 2020, 79 : 22355 - 22373
  • [10] Automatic Construction of Sentiment Lexicon by Analyzing SMS Bigdata
    Kang, Seung-Shik
    Lee, Minhaeng
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 5348 - 5350