Word Segmentation of Informal Arabic with Domain Adaptation

被引:0
|
作者
Monroe, Will [1 ]
Green, Spence [1 ]
Manning, Christoper D. [1 ]
机构
[1] Stanford Univ, Comp Sci Dept, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Segmentation of clitics has been shown to improve accuracy on a variety of Arabic NLP tasks. However, state-of-the-art Arabic word segmenters are either limited to formal Modern Standard Arabic, performing poorly on Arabic text featuring dialectal vocabulary and grammar, or rely on linguistic knowledge that is hand-tuned for each dialect. We extend an existing MSA segmenter with a simple domain adaptation technique and new features in order to segment informal and dialectal Arabic text. Experiments show that our system outperforms existing systems on newswire, broadcast news and Egyptian dialect, improving segmentation F-1 score on a recently released Egyptian Arabic corpus to 95.1%, compared to 90.8% for another segmenter designed specifically for Egyptian Arabic.
引用
收藏
页码:206 / 211
页数:6
相关论文
共 50 条
  • [41] Rethinking unsupervised domain adaptation for semantic segmentation
    Wang, Zhijie
    Suganuma, Masanori
    Okatani, Takayuki
    [J]. Pattern Recognition Letters, 2024, 186 : 119 - 125
  • [42] SEAUNet:Domain Adaptation for Biomedical Image Segmentation
    Chen, Hang
    Chen, Leiting
    Chen, Yuchu
    Fan, Minghao
    Zhou, Chuan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1914 - 1919
  • [43] Automatic Brain Tumor Segmentation with Domain Adaptation
    Dai, Lutao
    Li, Tengfei
    Shu, Hai
    Zhong, Liming
    Shen, Haipeng
    Zhu, Hongtu
    [J]. BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2018, PT II, 2019, 11384 : 380 - 392
  • [44] Multichannel Semantic Segmentation with Unsupervised Domain Adaptation
    Watanabe, Kohei
    Saito, Kuniaki
    Ushiku, Yoshitaka
    Harada, Tatsuya
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT V, 2019, 11133 : 600 - 616
  • [45] Pulmonary nodules segmentation based on domain adaptation
    Sui, Guozheng
    Liu, Xuefeng
    Chen, Shuang
    Liu, Shunli
    Zhang, Zaixian
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (15):
  • [46] Unsupervised Camouflaged Object Segmentation as Domain Adaptation
    Zhang, Yi
    Wu, Chengyi
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4336 - 4346
  • [47] Unsupervised Domain Adaptation in Semantic Segmentation: A Review
    Toldo, Marco
    Maracani, Andrea
    Michieli, Umberto
    Zanuttigh, Pietro
    [J]. TECHNOLOGIES, 2020, 8 (02)
  • [48] Domain Adaptation for CNN Based Iris Segmentation
    Jalilian, Ehsaneddin
    Uhl, Andreas
    Kwitt, Roland
    [J]. 2017 INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG), 2017,
  • [49] FDA: Fourier Domain Adaptation for Semantic Segmentation
    Yang, Yanchao
    Soatto, Stefano
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4084 - 4094
  • [50] Unknown Object Segmentation through Domain Adaptation
    Chen, Yiting
    Li, Miao
    Yang, Chenguang
    [J]. 2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021), 2021, : 72 - 77