A Lexicon-Based Approach for Detecting Hedges in Informal Text

被引:0
|
作者
Islam, Jumayel [1 ]
Xiao, Lu [2 ]
Mercer, Robert E. [1 ]
机构
[1] Univ Western Ontario, Dept Comp Sci, London, ON, Canada
[2] Syracuse Univ, Sch Informat Studies, Syracuse, NY 13244 USA
关键词
Hedging; Informal conversation; Discourse Markers;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Hedging is a commonly used strategy in conversational management to show the speaker's lack of commitment to what they communicate, which may signal problems between the speakers. Our project is interested in examining the presence of hedging words and phrases in identifying the tension between an interviewer and interviewee during a survivor interview. While there have been studies on hedging detection in the natural language processing literature, all existing work has focused on structured texts and formal communications. Our project thus investigated a corpus of eight unstructured conversational interviews about the Rwanda Genocide and identified hedging patterns in the interviewees' responses. Our work produced three manually constructed lists of hedge words, booster words, and hedging phrases. Leveraging these lexicons, we developed a rule-based algorithm that detects sentence-level hedges in informal conversations such as survivor interviews. Our work also produced a dataset of 3000 sentences having the categories Hedge and Non-hedge annotated by three researchers. With experiments on this annotated dataset, we verify the efficacy of our proposed algorithm. Our work contributes to the further development of tools that identify hedges from informal conversations and discussions.
引用
收藏
页码:3109 / 3113
页数:5
相关论文
共 50 条
  • [1] A lexicon-based approach to detecting suicide-related messages on Twitter
    Sarsam, Samer Muthana
    Al-Samarraie, Hosam
    Alzahrani, Ahmed Ibrahim
    Alnumay, Waleed
    Smith, Andrew Paul
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 65
  • [2] Lexicon-Based Text Analysis for Twitter and Quora
    Nishant, Potnuru Sai
    Mohan, Bhaskaruni Gopesh Krishna
    Chandra, Balina Surya
    Lokesh, Yangalasetty
    Devaraju, Gantakora
    Revanth, Madamala
    [J]. INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, 2020, 46 : 276 - 283
  • [3] Detecting sentiment embedded in Arabic social media - A lexicon-based approach
    Duwairi, R. M.
    Ahmed, Nizar A.
    Al-Rifai, Saleh Y.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 29 (01) : 107 - 117
  • [4] Lexicon-based probabilistic indexing of handwritten text images
    Vidal, Enrique
    Toselli, Alejandro H.
    Puigcerver, Joan
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24): : 17501 - 17520
  • [5] A lexicon-based method for detecting eye diseases on microblogs
    Sarsam, Samer Muthana
    Al-Samarraie, Hosam
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [6] Lexicon-based probabilistic indexing of handwritten text images
    Enrique Vidal
    Alejandro H. Toselli
    Joan Puigcerver
    [J]. Neural Computing and Applications, 2023, 35 : 17501 - 17520
  • [7] A lexicon-based approach for hate speech detection
    School of Information Science and Engineering, Central South University, Changsha, China
    不详
    [J]. Int. J. Multimedia Ubiquitous Eng., 4 (215-230):
  • [8] Effective lexicon-based approach for Urdu sentiment analysis
    Neelam Mukhtar
    Mohammad Abid Khan
    [J]. Artificial Intelligence Review, 2020, 53 : 2521 - 2548
  • [9] Mining Comparative Opinions in Portuguese: A Lexicon-based Approach
    Kansaon, Daniel
    Brandão, Michele A.
    Reis, Julio C. S.
    Benevenuto, Fabrício
    [J]. Journal of the Brazilian Computer Society, 2024, 30 (01) : 347 - 362
  • [10] A Lexicon-based Collaborative Filtering Approach for Recommendation Systems
    Deac-Petrusel, Mara
    [J]. ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 203 - 210