A Lexicon-Based Approach for Detecting Hedges in Informal Text

被引:0
|
作者
Islam, Jumayel [1 ]
Xiao, Lu [2 ]
Mercer, Robert E. [1 ]
机构
[1] Univ Western Ontario, Dept Comp Sci, London, ON, Canada
[2] Syracuse Univ, Sch Informat Studies, Syracuse, NY 13244 USA
关键词
Hedging; Informal conversation; Discourse Markers;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Hedging is a commonly used strategy in conversational management to show the speaker's lack of commitment to what they communicate, which may signal problems between the speakers. Our project is interested in examining the presence of hedging words and phrases in identifying the tension between an interviewer and interviewee during a survivor interview. While there have been studies on hedging detection in the natural language processing literature, all existing work has focused on structured texts and formal communications. Our project thus investigated a corpus of eight unstructured conversational interviews about the Rwanda Genocide and identified hedging patterns in the interviewees' responses. Our work produced three manually constructed lists of hedge words, booster words, and hedging phrases. Leveraging these lexicons, we developed a rule-based algorithm that detects sentence-level hedges in informal conversations such as survivor interviews. Our work also produced a dataset of 3000 sentences having the categories Hedge and Non-hedge annotated by three researchers. With experiments on this annotated dataset, we verify the efficacy of our proposed algorithm. Our work contributes to the further development of tools that identify hedges from informal conversations and discussions.
引用
收藏
页码:3109 / 3113
页数:5
相关论文
共 50 条
  • [31] Impact for whom? Mapping the users of public research with lexicon-based text mining
    Andrea Bonaccorsi
    Filippo Chiarello
    Gualtiero Fantoni
    [J]. Scientometrics, 2021, 126 : 1745 - 1774
  • [32] Lexicon-based corpus processing with LexWare
    Dura, E
    [J]. PALC'99: PRACTICAL APPLICATIONS IN LANGUAGE CORPORA, 2000, 1 : 69 - 76
  • [33] Lexicon-Based Methods for Sentiment Analysis
    Taboada, Maite
    Brooke, Julian
    Tofiloski, Milan
    Voll, Kimberly
    Stede, Manfred
    [J]. COMPUTATIONAL LINGUISTICS, 2011, 37 (02) : 267 - 307
  • [34] TOWARDS A LEXICON-BASED THEORY OF AGREEMENT
    WUNDERLICH, D
    [J]. THEORETICAL LINGUISTICS, 1994, 20 (01) : 1 - 35
  • [35] The implementation and evaluation of a lexicon-based stemmer
    Silva, G
    Oliveira, C
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2003, 2857 : 266 - 276
  • [36] Towards enhancement of a lexicon-based approach for Saudi dialect sentiment analysis
    Assiri, Adel
    Emam, Ahmed
    Al-Dossari, Hmood
    [J]. JOURNAL OF INFORMATION SCIENCE, 2018, 44 (02) : 184 - 202
  • [37] Unsupervised Learning for Lexicon-Based Classification
    Eisenstein, Jacob
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3188 - 3194
  • [38] Lexicon-based emotion analysis in Turkish
    Tocoglu, Mansur Alp
    Alpkocak, Adil
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1213 - 1227
  • [39] A Semantic Lexicon-Based Approach for Sense Disambiguation and Its WWW Application
    Di Lecce, Vincenzo
    Calabrese, Marco
    Soldo, Domenico
    [J]. EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2009, 5755 : 468 - +
  • [40] LMS Content Evaluation System with Sentiment Analysis Using Lexicon-Based Approach
    Tan, Riegie D.
    Piad, Keno
    Lagman, Ace
    Victoriano, Jayson
    Tano, Isagani
    San Gabriel, Nicanor, Jr.
    Espino, Joseph
    [J]. 2022 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND EDUCATION TECHNOLOGY (ICIET 2022), 2022, : 93 - 98