An Annotated Multilingual Dataset to Study Modality in the Gospels

被引:0
|
作者
Bermudez-Sabel, Helena [1 ]
Dell'Oro, Francesca [1 ,2 ]
机构
[1] Univ Neuchatel, Neuchatel, Switzerland
[2] Swiss Natl Sci Fdn, Bern, Switzerland
来源
DIGITAL HUMANITIES QUARTERLY | 2024年 / 18卷 / 01期
基金
瑞士国家科学基金会;
关键词
D O I
暂无
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
This paper presents a number of resources for examining the expression of modality in the Gospels. The main resource is an XML-TEI dataset that contains the linguistic annotation of a predefined list of potentially modal markers in both Ancient Greek and Latin. When one of these markers conveys a modal meaning, each constituent of the modal passage (i.e., the marker, its scope, and the modal relation between them) is annotated with a great level of detail through several linguistic features. One of the original features of our dataset is the implementation of a cross-referencing system that enables the alignment of the potentially modal markers of both languages. To facilitate the exploitation of our data by those unfamiliar with XML technologies, we also provide summary tables with the most relevant features of the annotation. In addition, a program written in Apache Ant allows any user to generate the summary sheets and to align modal passages in both Ancient Greek and Latin with any other language available in the Multilingual Bible Parallel Corpus [Christodouloupoulos and Steedman 2015]. This contribution presents the details of the semantic annotation and its formalization, and how our resources may be exploited within semantics and translation studies. In addition, the encoding strategies implemented are relevant for other projects dealing with the combination of multiple layers of (linguistic) annotation and/or tackling the development of parallel corpora.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [31] An annotated video dataset for computing video memorability
    Kiziltepe, Rukiye Savran
    Sweeney, Lorin
    Constantin, Mihai Gabriel
    Doctor, Faiyaz
    de Herrera, Alba Garcia Seco
    Demarty, Claire-Helene
    Healy, Graham
    Ionescu, Bogdan
    Smeaton, Alan F.
    [J]. DATA IN BRIEF, 2021, 39
  • [32] An Expert Annotated Dataset for the Detection of Online Misogyny
    Guest, Ella
    Vidgen, Bertie
    Mittos, Alexandros
    Sastry, Nishanth
    Tyson, Gareth
    Margetts, Helen
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1336 - 1350
  • [33] A Dataset of Annotated Omnidirectional Videos for Distancing Applications
    Mazzola, Giuseppe
    Lo Presti, Liliana
    Ardizzone, Edoardo
    La Cascia, Marco
    [J]. JOURNAL OF IMAGING, 2021, 7 (08)
  • [34] An annotated dataset of bioacoustic sensing and features of mosquitoes
    Dinarte Vasconcelos
    Nuno Jardim Nunes
    João Gomes
    [J]. Scientific Data, 7
  • [35] TFW: Annotated Thermal Faces in the Wild Dataset
    Kuzdeuov, Askat
    Aubakirova, Dana
    Koishigarina, Darina
    Varol, Huseyin Atakan
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 2084 - 2094
  • [36] Annotated dataset of history-related tweets
    Sumikawa, Yasunobu
    Jatowt, Adam
    [J]. DATA IN BRIEF, 2021, 38
  • [37] Dataset: Annotated soybean market news articles
    Reis Filho, Ivan José dos
    Coleti, Jamille de Campos
    Marcacini, Ricardo Marcondes
    Rezende, Solange Oliveira
    [J]. Data in Brief, 55
  • [38] MultiHumES: Multilingual Humanitarian Response Dataset for Extractive Summarization
    Yela-Bello, Jenny Paola
    Oglethorpe, Ewan
    Rekabsaz, Navid
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1713 - 1717
  • [39] MultiLegalSBD: A Multilingual Legal Sentence Boundary Detection Dataset
    Brugger, Tobias
    Sturmer, Matthias
    Niklaus, Joel
    [J]. PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2023, 2023, : 42 - 51
  • [40] MultiSubs: A Large-scale Multimodal and Multilingual Dataset
    Wang, Josiah
    Figueiredo, Josiel
    Specia, Lucia
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6776 - 6785