Thai Sentiment Lexicon Construction

被引:0
|
作者
Intasorn, Jeerawat [1 ]
Gertphol, Sethavidh [1 ]
Sammapun, Usa [1 ]
机构
[1] Kasetsart Univ, Fac Sci, Dept Comp Sci, Bangkok, Thailand
关键词
sentiment lexicon; sentiment analysis; Thai;
D O I
10.1109/KST51265.2021.9415804
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online platforms have become increasingly integrated into daily lives of Thai citizens. Besides being used to query information, online platforms are the medium to exchange opinions and discuss various topics via posts or comments. Since the number of posts and comments can be very large, understanding which topics users are talking about and grasping how users feel about those topics can be difficult and time consuming. Researchers therefore have studied how to automatically analyze posts and comments to extract topics and their sentiments using techniques such as topic modeling and sentiment analysis. There are various approaches to analyze sentiments, and many of them utilize sentiment lexicons, a set of words annotated with sentiment, to help with the analysis. Several researches have constructed sentiment lexicons for English language. This paper aims to construct sentiment lexicons for Thai language using a dictionary-based approach in combination with a crowdsourcing approach. First, words are chosen from a Thai dictionary, mostly adjectives since they usually express sentiment. Next, a web application is developed so that users can help vote whether a particular word is positive, negative, or neutral. After a number of votes have been collected, each word is scored to determine its sentiment. To validate and ensure users are voting truthfully, users are authenticated via Facebook, and simple checks are carried out against answers of each user. This process results in a sentiment lexicons for Thai words and should help form the basis for sentiment analysis of Thai text.
引用
收藏
页码:123 / 128
页数:6
相关论文
共 50 条
  • [31] Sentiment lexicon for sentiment analysis of Saudi dialect tweets
    Al-Thubaity, Abdulmohsen
    Alqahtani, Qubayl
    Aljandal, Abdulaziz
    [J]. ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 301 - 307
  • [32] Combine Sentiment Lexicon and Dependency Parsing for Sentiment Classification
    Quan, Changqin
    Wei, Xiquan
    Ren, Fuji
    [J]. 2013 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2013, : 100 - 104
  • [33] Arabic Domain-Oriented Sentiment Lexicon Construction Using Latent Dirichlet Allocation
    Alshahrani, Hasan A.
    Fong, Alvis C.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 174 - 180
  • [34] Developing Lexicon-based Algorithms and Sentiment Lexicon for Sentiment Analysis of Saudi Dialect Tweets
    Al-Ghaith, Waleed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (11) : 83 - 88
  • [35] Automatic construction of domain-specific sentiment lexicon based on constrained label propagation
    Huang, Sheng
    Niu, Zhendong
    Shi, Chongyang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2014, 56 : 191 - 200
  • [36] Sinhala Sentiment Analysis using Corpus based Sentiment Lexicon
    Chathuranga, P. D. T.
    Lorensuhewa, S. A. S.
    Kalyani, M. A. L.
    [J]. 2019 19TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER - 2019), 2019,
  • [37] Extending persian sentiment lexicon with idiomatic expressions for sentiment analysis
    Dashtipour, Kia
    Gogate, Mandar
    Gelbukh, Alexander
    Hussain, Amir
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [38] Extending persian sentiment lexicon with idiomatic expressions for sentiment analysis
    Kia Dashtipour
    Mandar Gogate
    Alexander Gelbukh
    Amir Hussain
    [J]. Social Network Analysis and Mining, 2022, 12
  • [39] Automatic construction of direction-aware sentiment lexicon using direction-dependent words
    Park, Jihye
    Lee, Hye Jin
    Cho, Sungzoon
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [40] Creating a General Russian Sentiment Lexicon
    Loukachevitch, Natalia
    Levchik, Anatoly
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1171 - 1176