Multilingual Image Corpus - Towards a Multimodal and Multilingual Dataset

被引：0

作者：

Koeva, Svetla ^{[1
]}

Stoyanova, Ivelina ^{[1
]}

Kralev, Jordan ^{[1
,2
]}

机构：

[1] Bulgarian Acad Sci, Inst Bulgarian Language, Sofia, Bulgaria

[2] Tech Univ, Sofia, Bulgaria

来源：

LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年

基金：

欧盟地平线“2020”;

关键词：

multilingual image corpus; multilingual dataset; multimodal dataset;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

One of the processing tasks for large multimodal data streams is the automatic image description (image classification, object segmentation and classification). Although the number and the diversity of image datasets is constantly expanding, there is still a huge demand for more datasets in terms of variety of domains and object classes covered. The goal of the project Multilingual Image Corpus (MIC21) is to provide a large image dataset with annotated objects and object descriptions in 25 languages. The Multilingual Image Corpus relies on an Ontology of Visual Objects (based on WordNet) and comprises a collection of thematically related images whose objects are annotated with segmentation masks and labels linked to the ontology classes. The dataset is designed both for image classification and object detection and for semantic segmentation. The main contributions of our work are: a) the provision of a large collection of high-quality images licensed for commercial and non-commercial use; b) the compilation of the Ontology of Visual Objects based on WordNet noun hierarchies; c) the automatic object segmentation within the images followed by precise manual editing and the annotation of object classes; and d) the mapping of objects and images to extended multilingual descriptions based onWordNet inner- and interlingual relations. The dataset can be used also for multilingual image caption generation, image-to-text alignment and automatic question answering for multimedia content.

引用

页码：1509 / 1518

页数：10

共 50 条

[31] Media Identities - multimodal and multilingual
Schwegler, Carolin
Steen, Pamela
[J]. LILI-ZEITSCHRIFT FUR LITERATURWISSENSCHAFT UND LINGUISTIK, 2024, : 383 - 391
[32] Multilingual and Multimodal Abuse Detection
Sharon, Rini
Shah, Heet
Mukherjee, Debdoot
Gupta, Vikram
[J]. INTERSPEECH 2022, 2022, : 4631 - 4635
[33] Towards the linguistic approach to ideasthesia (case study of the multilingual parallel corpus)
Iaroshenko, Polina, V
[J]. VESTNIK SANKT-PETERBURGSKOGO UNIVERSITETA-YAZYK I LITERATURA, 2023, 20 (01): : 156 - 169
[34] Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Gao, Haoyuan
Mao, Junhua
Zhou, Jie
Huang, Zhiheng
Wang, Lei
Xu, Wei
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
[35] GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
Hagiwara, Masato
Mita, Masato
[J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6761 - 6768
[36] A Dataset and Baselines for Multilingual Reply Suggestion
Zhang, Mozhi
Wang, Wei
Deb, Budhaditya
Zheng, Guoqing
Shokouhi, Milad
Awadallah, Ahmed Hassan
[J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1207 - 1220
[37] Leyzer: A Dataset for Multilingual Virtual Assistants
Sowanski, Marcin
Janicki, Artur
[J]. TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 477 - 486
[38] CMU WILDERNESS MULTILINGUAL SPEECH DATASET
Black, Alan W.
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5971 - 5975
[39] Euronews: a multilingual speech corpus for ASR
Gretter, Roberto
[J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2635 - 2638
[40] A Dataset for Multilingual Epidemiological Event Extraction
Mutuvi, Stephen
Doucet, Antoine
Lejeune, Gael
Odeo, Moses
[J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4139 - 4144

← 1 2 3 4 5 →