The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic

被引:0
|
作者
Abdelali, Ahmed [1 ]
Temnikova, Irina
Hedaya, Samy
Vogel, Stephan
机构
[1] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Doha, Qatar
关键词
corpus; interpreting strategies; annotation; translation; Arabic;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This article presents the WAW Corpus, an interpreting corpus for English/Arabic, which can be used for teaching interpreters, studying the characteristics of interpreters' work, as well as to train machine translation systems. The corpus contains recordings of lectures and speeches from international conferences, their interpretations, the transcripts of the original speeches and of their interpretations, as well as human translations of both kinds of transcripts into the opposite language of the language pair. The article presents the corpus curation, statistics, assessment, as well as a case study of the corpus use.
引用
收藏
页码:2135 / 2140
页数:6
相关论文
共 50 条
  • [41] Revealing the translator's style: A corpus-based study of english translations of Mencius
    Gao, Yaoyao
    Zhou, Guijun
    PLOS ONE, 2024, 19 (07):
  • [42] Machine Translation and Linguistic Use: An Analysis of English-French Translations Reunited in Corpus
    Loock, Rudy
    META, 2018, 63 (03) : 786 - 806
  • [43] BAAC: Bangor Arabic Annotated Corpus
    Alkhazi, Ibrahim S.
    Teahan, William J.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (11) : 131 - 140
  • [44] Exploring and exploiting a historical corpus for Arabic
    Bassam Hammo
    Sane Yagi
    Omaima Ismail
    Mohammad AbuShariah
    Language Resources and Evaluation, 2016, 50 : 839 - 861
  • [45] An Enhanced Corpus for Arabic Newspaper Comments
    Rahab, Hichem
    Zitouni, Abdelhafid
    Djoudi, Mahieddine
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (05) : 789 - 798
  • [46] Compilation of an Arabic Children's Corpus
    Al-Sulaiti, Latifa
    Abbas, Noorhan
    Brierley, Claire
    Atwell, Eric
    Alghamdi, Ayman
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1808 - 1812
  • [47] Developing an Arabic Corpus for Event Mining
    Alasfour, Abdel Alnasser A.
    Trausan-Matu, Stefan
    2013 17TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2013, : 21 - 28
  • [48] Exploring and exploiting a historical corpus for Arabic
    Hammo, Bassam
    Yagi, Sane
    Ismail, Omaima
    AbuShariah, Mohammad
    LANGUAGE RESOURCES AND EVALUATION, 2016, 50 (04) : 839 - 861
  • [49] MASC: MASSIVE ARABIC SPEECH CORPUS
    Al-Fetyani, Mohammad
    Al-Barham, Muhammad
    Abandah, Gheith
    Alsharkawi, Adham
    Dawas, Maha
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 1006 - 1013
  • [50] Phonetic Inventory for an Arabic Speech Corpus
    Halabi, Nawar
    Wald, Mike
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 734 - 738