Using Annotation Projection for Semantic Role Labeling of Low-Resourced Language: Sinhala

Cited by: 0
Authors
Gunasekara, Sandun [1]
Chathura, Dulanjaya [1]
Jeewantha, Chamoda [1]
Dias, Gihan [1]
Affiliations
[1] Univ Moratuwa, Dept Comp Sci & Engn, Moratuwa, Sri Lanka
Keywords
SRL; Semantics; Semantic Role Labeling; Sinhala; Annotation; Projection; Labeller; Roles
DOI
10.1109/ialp51396.2020.9310468
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present SinSRL, the first semantic role labeller (SRL) for Sinhala, an Indo-European language spoken mainly in Sri Lanka. SinSRL takes parallel text in English (or any other language for which a suitable SRL exists) and Sinhala, and outputs semantically annotated Sinhala text. We have enhanced existing tools to address several issues specific to the target language; these enhancements should also be useful for labeling other Indic languages. In addition, we have manually labeled a small Sinhala-English parallel dataset with semantic roles. The accuracy of our system is similar to that of manually labeled data. Our implementation can be used to generate an SRL dataset, which may in turn be used to train a direct semantic role labeller. SinSRL may be easily modified to annotate other low-resource languages for which parallel corpora are available.
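The general idea of annotation projection described in the abstract can be illustrated with a minimal sketch. This is not the SinSRL implementation: the function name `project_roles`, the PropBank-style labels, the romanized Sinhala tokens, and the hand-written word alignment are all assumptions made for illustration. In practice the source labels would come from an existing English SRL and the alignment from a word aligner.

```python
from typing import List, Tuple


def project_roles(
    source_labels: List[str],
    alignment: List[Tuple[int, int]],
    target_length: int,
) -> List[str]:
    """Copy role labels from source tokens to aligned target tokens.

    source_labels: one label per source token ("O" means no role).
    alignment: (source_index, target_index) pairs from a word aligner.
    target_length: number of tokens in the target sentence.
    """
    target_labels = ["O"] * target_length
    for src_idx, tgt_idx in alignment:
        label = source_labels[src_idx]
        # Keep the first projected label if several source tokens
        # align to the same target token.
        if label != "O" and target_labels[tgt_idx] == "O":
            target_labels[tgt_idx] = label
    return target_labels


# Hypothetical example: English (SVO) projected onto Sinhala (SOV).
english_tokens = ["The", "boy", "ate", "rice"]
english_labels = ["ARG0", "ARG0", "V", "ARG1"]
sinhala_tokens = ["lamaya", "bath", "kaewa"]  # romanized, illustrative only
alignment = [(1, 0), (3, 1), (2, 2)]  # "The" has no aligned target token

projected = project_roles(english_labels, alignment, len(sinhala_tokens))
for token, label in zip(sinhala_tokens, projected):
    print(token, label)
```

Unaligned source tokens are simply dropped in this sketch, and target tokens with no aligned source keep the label "O"; a real system would need additional handling for such cases.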
Pages: 98-103
Page count: 6