A Lexicon-Based Approach for Detecting Hedges in Informal Text

Cited by: 0

Authors:

Islam, Jumayel [1]

Xiao, Lu [2]

Mercer, Robert E. [1]

Affiliations:

[1] Univ Western Ontario, Dept Comp Sci, London, ON, Canada

[2] Syracuse Univ, Sch Informat Studies, Syracuse, NY 13244 USA

Source:

PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020

Keywords:

Hedging; Informal conversation; Discourse Markers;

DOI:

Not available

Chinese Library Classification number:

TP39 [Computer applications];

Discipline classification codes:

081203 ; 0835 ;

Abstract:

Hedging is a commonly used strategy in conversational management to show the speaker's lack of commitment to what they communicate, which may signal problems between the speakers. Our project is interested in examining the presence of hedging words and phrases in identifying the tension between an interviewer and interviewee during a survivor interview. While there have been studies on hedging detection in the natural language processing literature, all existing work has focused on structured texts and formal communications. Our project thus investigated a corpus of eight unstructured conversational interviews about the Rwanda Genocide and identified hedging patterns in the interviewees' responses. Our work produced three manually constructed lists of hedge words, booster words, and hedging phrases. Leveraging these lexicons, we developed a rule-based algorithm that detects sentence-level hedges in informal conversations such as survivor interviews. Our work also produced a dataset of 3000 sentences having the categories Hedge and Non-hedge annotated by three researchers. With experiments on this annotated dataset, we verify the efficacy of our proposed algorithm. Our work contributes to the further development of tools that identify hedges from informal conversations and discussions.


Pages: 3109-3113

Number of pages: 5
