COMFO: Multilingual Corpus for Opinion Mining

被引:0
|
作者
Faty, Lamine [1 ]
Drame, Khadim [1 ]
Sarr, Edouard Ngor [1 ]
Ndiaye, Marie [1 ]
Diop, Ibrahima [1 ]
Dia, Yoro [2 ]
Sall, Ousmane [3 ]
机构
[1] Univ Assane Seck Ziguinchor, Ziguinchor, Senegal
[2] Univ Iba Thiam, Ziguinchor, Senegal
[3] Univ Virtuelle Senegal, Ziguinchor, Senegal
来源
关键词
Opinion mining; Online comment; Corpus building; COMFO;
D O I
10.1007/978-3-031-19907-3_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of Machine Learning (ML) algorithms in opinion mining, particularly supervised learning algorithms, requires an annotated corpus to train the classification model in order to predict results that are close to reality. Unfortunately, there are still no resources for the automatic processing of textual data expressed in the Senegalese urban language. The objective of this paper is to build a multilingual corpus for opinion mining (COMFO). The process of building theCOMFOcorpus is composed of three steps: presentation of the data source, data collection and preparation, and annotation by lexical approach. The particularity of COMFO lies in the integration of foreign languages (French and English) and local languages, particularly urbanWolof, in order to reflect the collective opinion of Senegalese readers.
引用
收藏
页码:14 / 19
页数:6
相关论文
共 50 条
  • [1] Multilingual Corpus Development for Opinion Mining
    Schulz, Julia Maria
    Womser-Hacker, Christa
    Mandl, Thomas
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3409 - 3412
  • [2] Opinion mining in a telephone survey corpus
    Camelin, Nathalie
    Damnati, Geraldine
    Bechet, Frederic
    De Mori, Renato
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1041 - +
  • [3] Refine Crude Corpus for Opinion Mining
    Bhattacharyya, Debnath
    Das, Poulami
    Mitra, Kheyali
    Mukherjee, Swarnendu
    Ganguly, Debashis
    Bandyopadhyay, Samir Kumar
    Kim, Tai-hoon
    [J]. 2009 1ST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS(CICSYN 2009), 2009, : 17 - +
  • [4] Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining
    Kvapilikova, Ivana
    Artetxe, Mikel
    Labaka, Gorka
    Agirre, Eneko
    Bojar, Ondrej
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): STUDENT RESEARCH WORKSHOP, 2020, : 255 - 262
  • [5] Twitter as a Corpus for Sentiment Analysis and Opinion Mining
    Pak, Alexander
    Paroubek, Patrick
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [6] Sentiment Analysis and Opinion Mining: The EmotiBlog Corpus
    Fernandez, Javi
    Boldrini, Ester
    Manuel Gomez, Jose
    Martinez-Barco, Patricio
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 179 - 187
  • [7] Inference Annotation of a Chinese Corpus for Opinion Mining
    Yan, Liyun
    Danni, E.
    Gan, Mei
    Grouin, Cyril
    Valette, Mathieu
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4991 - 4999
  • [8] ARAACOM: ARAbic Algerian Corpus for Opinion Mining
    Rahab, Hichem
    Zitouni, Abdelhafid
    Djoudi, Mahieddine
    [J]. ACM PROCEEDINGS OF INTERNATIONAL CONFERENCE OF COMPUTING FOR ENGINEERING AND SCIENCE (ICCES'17), 2017, : 35 - 39
  • [9] Opinion Mining on a German Corpus of a Media Response Analysis
    Scholz, Thomas
    Conrad, Stefan
    Hillekamps, Lutz
    [J]. TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 39 - 46
  • [10] A Cleaning Algorithm for Noiseless Opinion Mining Corpus Construction
    Manad, Otman
    Pappa, Anna
    Bernard, Gilles
    [J]. 2018 IEEE/ACS 15TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2018,