Multilingual Image Corpus - Towards a Multimodal and Multilingual Dataset

被引:0
|
作者
Koeva, Svetla [1 ]
Stoyanova, Ivelina [1 ]
Kralev, Jordan [1 ,2 ]
机构
[1] Bulgarian Acad Sci, Inst Bulgarian Language, Sofia, Bulgaria
[2] Tech Univ, Sofia, Bulgaria
基金
欧盟地平线“2020”;
关键词
multilingual image corpus; multilingual dataset; multimodal dataset;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
One of the processing tasks for large multimodal data streams is the automatic image description (image classification, object segmentation and classification). Although the number and the diversity of image datasets is constantly expanding, there is still a huge demand for more datasets in terms of variety of domains and object classes covered. The goal of the project Multilingual Image Corpus (MIC21) is to provide a large image dataset with annotated objects and object descriptions in 25 languages. The Multilingual Image Corpus relies on an Ontology of Visual Objects (based on WordNet) and comprises a collection of thematically related images whose objects are annotated with segmentation masks and labels linked to the ontology classes. The dataset is designed both for image classification and object detection and for semantic segmentation. The main contributions of our work are: a) the provision of a large collection of high-quality images licensed for commercial and non-commercial use; b) the compilation of the Ontology of Visual Objects based on WordNet noun hierarchies; c) the automatic object segmentation within the images followed by precise manual editing and the annotation of object classes; and d) the mapping of objects and images to extended multilingual descriptions based onWordNet inner- and interlingual relations. The dataset can be used also for multilingual image caption generation, image-to-text alignment and automatic question answering for multimedia content.
引用
收藏
页码:1509 / 1518
页数:10
相关论文
共 50 条
  • [31] Media Identities - multimodal and multilingual
    Schwegler, Carolin
    Steen, Pamela
    [J]. LILI-ZEITSCHRIFT FUR LITERATURWISSENSCHAFT UND LINGUISTIK, 2024, : 383 - 391
  • [32] Multilingual and Multimodal Abuse Detection
    Sharon, Rini
    Shah, Heet
    Mukherjee, Debdoot
    Gupta, Vikram
    [J]. INTERSPEECH 2022, 2022, : 4631 - 4635
  • [33] Towards the linguistic approach to ideasthesia (case study of the multilingual parallel corpus)
    Iaroshenko, Polina, V
    [J]. VESTNIK SANKT-PETERBURGSKOGO UNIVERSITETA-YAZYK I LITERATURA, 2023, 20 (01): : 156 - 169
  • [34] Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
    Gao, Haoyuan
    Mao, Junhua
    Zhou, Jie
    Huang, Zhiheng
    Wang, Lei
    Xu, Wei
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [35] GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
    Hagiwara, Masato
    Mita, Masato
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6761 - 6768
  • [36] A Dataset and Baselines for Multilingual Reply Suggestion
    Zhang, Mozhi
    Wang, Wei
    Deb, Budhaditya
    Zheng, Guoqing
    Shokouhi, Milad
    Awadallah, Ahmed Hassan
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1207 - 1220
  • [37] Leyzer: A Dataset for Multilingual Virtual Assistants
    Sowanski, Marcin
    Janicki, Artur
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 477 - 486
  • [38] CMU WILDERNESS MULTILINGUAL SPEECH DATASET
    Black, Alan W.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5971 - 5975
  • [39] Euronews: a multilingual speech corpus for ASR
    Gretter, Roberto
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2635 - 2638
  • [40] A Dataset for Multilingual Epidemiological Event Extraction
    Mutuvi, Stephen
    Doucet, Antoine
    Lejeune, Gael
    Odeo, Moses
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4139 - 4144