Annotating opinion-evaluation of blogs: the Blogoscopy corpus

被引:3
|
作者
Daille, Beatrice [1 ]
Dubreil, Estelle [1 ]
Monceaux, Laura [1 ]
Vernier, Matthieu [1 ]
机构
[1] Univ Nantes, LINA, F-44322 Nantes 3, France
关键词
Blogs; Sentiment analysis; Corpus annotation; Evaluation; Polarity; French language;
D O I
10.1007/s10579-011-9154-z
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The blog phenomenon is universal. Blogs are characterized by their evaluative use, in that they enable Internet users to express their opinion on a given subject. From this point of view, they are an ideal resource for the constitution of an annotated sentiment analysis corpus, crossing the subject and the opinion expressed on this subject. This paper presents the Blogoscopy corpus for the French language which was built up with personal thematic blogs. The annotation was governed by three principles: theoretical, as opinion is grounded in a linguistic theory of evaluation, practical, as every opinion is linked to an object, and methodological as annotation rules and successive phases are defined to ensure quality and thoroughness.
引用
收藏
页码:409 / 437
页数:29
相关论文
共 50 条
  • [31] John of Scythopolis and the Dionysian corpus. Annotating the Areopagite
    Williams, JP
    JOURNAL OF THEOLOGICAL STUDIES, 1999, 50 : 784 - 788
  • [32] Sentence-Level Opinion-Topic Association for Opinion Detection in Blogs
    Missen, Malik Muhammad Saad
    Boughanem, Mohand
    2009 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS: WAINA, VOLS 1 AND 2, 2009, : 733 - 737
  • [33] Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus
    Rodriguez, Kepa J.
    Delogu, Francesca
    Versley, Yannick
    Stemle, Egon W.
    Poesio, Massimo
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 157 - 163
  • [34] Pragmatic labeling of a corpus of economic and financial blogs in Spanish
    Egido, Jose Joaquin Martinez
    CIRCULO DE LINGUISTICA APLICADA A LA COMUNICACION, 2023, (96): : 101 - 112
  • [35] Facet-based opinion retrieval from blogs
    Vechtomova, Olga
    INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (01) : 71 - 88
  • [36] Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data
    Hwang, Jena D.
    Zaenen, Annie
    Palmer, Martha
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1297 - 1304
  • [37] Ontology Based Approach for Annotating a Corpus of Computer Science Abstracts
    Almugbel, Zainab
    2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 81 - 86
  • [38] OCA: Opinion Corpus for Arabic
    Rushdi-Saleh, Mohammed
    Teresa Martin-Valdivia, M.
    Alfonso Urena-Lopez, L.
    Perea-Ortega, Jose M.
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2011, 62 (10): : 2045 - 2054
  • [39] IARG-AnCora: Annotating AnCora corpus with implicit arguments
    Taule, Mariona
    Antonia Marti, M.
    Penis, Aina
    Rodriguez, Horacio
    Moreno, Lidia
    Moreda, Paloma
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 181 - 184
  • [40] A Transfer Learning Framework For Annotating Implementation-Specific Corpus
    Ponniah, Anbumunee
    Agarwal, Swati
    Ranka, Sharanya Milind
    Madhusudhan, Shashank
    2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2022, : 503 - 512