Annotating opinion-evaluation of blogs: the Blogoscopy corpus

Cited by: 3
Authors
Daille, Beatrice [1]
Dubreil, Estelle [1]
Monceaux, Laura [1]
Vernier, Matthieu [1]
Affiliations
[1] Univ Nantes, LINA, F-44322 Nantes 3, France
Keywords
Blogs; Sentiment analysis; Corpus annotation; Evaluation; Polarity; French language
DOI
10.1007/s10579-011-9154-z
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
The blog phenomenon is universal. Blogs are characterized by their evaluative use, in that they enable Internet users to express their opinions on a given subject. From this point of view, they are an ideal resource for building an annotated sentiment analysis corpus that cross-references each subject with the opinion expressed about it. This paper presents the Blogoscopy corpus for the French language, which was built from personal thematic blogs. The annotation was governed by three principles: theoretical, as opinion is grounded in a linguistic theory of evaluation; practical, as every opinion is linked to an object; and methodological, as annotation rules and successive phases are defined to ensure quality and thoroughness.
Pages: 409-437
Page count: 29
Related papers
50 in total
  • [1] Annotating opinion—evaluation of blogs: the Blogoscopy corpus
    Béatrice Daille
    Estelle Dubreil
    Laura Monceaux
    Matthieu Vernier
    Language Resources and Evaluation, 2011, 45: 409-437
  • [2] Annotating Arguments in a Corpus of Opinion Articles
    Rocha, Gil
    Trigo, Luis
    Cardoso, Henrique Lopes
    Sousa-Silva, Rui
    Carvalho, Paula
    Martins, Bruno
    Won, Miguel
    LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022: 1890-1899
  • [3] Automatically annotating a five-billion-word corpus of Japanese blogs for sentiment and affect analysis
    Ptaszynski, Michal
    Rzepka, Rafal
    Araki, Kenji
    Momouchi, Yoshio
    COMPUTER SPEECH AND LANGUAGE, 2014, 28(1): 38-55
  • [4] Annotating the MASC Corpus with BabelNet
    Moro, Andrea
    Navigli, Roberto
    Tucci, Francesco Maria
    Passonneau, Rebecca J.
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014: 4214-4219
  • [5] Annotating Events in an Emotion Corpus
    Lee, Sophia Yat Mei
    Li, Shoushan
    Huang, Chu-Ren
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014: 3511-3516
  • [6] Annotating an Arabic Learner Corpus for Error
    Abuhakema, Ghazi
    Faraj, Reem
    Feldman, Anna
    Fitzpatrick, Eileen
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008: 1347-1350
  • [7] FrSemCor: Annotating a French corpus with supersenses
    Barque, L.
    Haas, P.
    Huyghe, R.
    Tribout, D.
    Candito, M.
    Crabbe, B.
    Segonne, V.
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020: 5904-5910
  • [8] Annotating Arguments in a Parliamentary Corpus: An Experience
    Koit, Mare
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KEOD), VOL 2, 2020: 213-218
  • [9] Annotating Errors in a Hungarian Learner Corpus
    Dickinson, Markus
    Ledbetter, Scott
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012: 1659-1664
  • [10] A Framework for Opinion Mining in Blogs for Agriculture
    Valsamidis, Stavros
    Theodosiou, Theodosios
    Kazanidis, Ioannis
    Nikolaidis, Michael
    6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES IN AGRICULTURE, FOOD AND ENVIRONMENT (HAICTA 2013), 2013, 8: 264-274