An Arabic WordNet Enrichment Approach using Machine Translation and External Linguistic Resources

Authors
Lachichi, Cilia [1]
Bendiaf, Chahrazad [1]
Berkani, Lamia [2]
Guessoum, Ahmed [2]
Affiliations
[1] Univ Algiers, Dept Comp Sc, USTHB, Algiers, Algeria
[2] Univ Algiers, Dept Comp Sc, LRIA Lab, USTHB, Algiers, Algeria
Keywords
Arabic WordNet; Princeton WordNet; AWN enrichment; WordNet concepts alignment; machine translation; Wikipedia; ranking metric; dictionaries
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The development of Arabic WordNet (AWN) has provided the Arabic Natural Language Processing (NLP) community with a lexical resource. However, because AWN contains considerably fewer synsets than other WordNets, it is not yet a viable option for Arabic NLP applications. In this paper, we propose an automatic approach to AWN enrichment based on three components: (1) the translation of Princeton WordNet (PWN) words and their direct hypernyms; (2) the automatic validation, using Wikipedia articles, of the hypernymy relation suggested between the translated words; and (3) the integration of the validated synsets into AWN, together with their definitions, examples, and synonyms, which are extracted from dictionaries and other resources. The proposed approach has been implemented, and a preliminary evaluation shows promising results.
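The three components named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the lookup tables, the transliterated placeholder strings, and every function name below are hypothetical, and a plain substring test stands in for the Wikipedia-based validation and ranking metric, whose details the abstract does not give.

```python
from dataclasses import dataclass

# Hypothetical toy resources (assumptions for illustration only).
# Transliterated placeholders stand in for Arabic strings.
TOY_TRANSLATIONS = {"dog": "kalb", "animal": "hayawan", "car": "sayyara"}
TOY_ARTICLES = {"kalb": "al-kalb hayawan min al-thadiyyat"}  # stand-in for Wikipedia text
TOY_DICTIONARY = {"kalb": "hayawan alif"}  # stand-in for a dictionary definition


@dataclass
class Candidate:
    word: str
    hypernym: str
    gloss: str = ""


def translate_pair(en_word, en_hypernym):
    """Component 1: translate a PWN word and its direct hypernym (toy lookup)."""
    ar_word = TOY_TRANSLATIONS.get(en_word)
    ar_hypernym = TOY_TRANSLATIONS.get(en_hypernym)
    if ar_word is None or ar_hypernym is None:
        return None  # no translation available, so no candidate
    return Candidate(ar_word, ar_hypernym)


def is_validated(candidate):
    """Component 2 (assumed heuristic): the hypernym must occur in the word's article."""
    return candidate.hypernym in TOY_ARTICLES.get(candidate.word, "")


def enrich(pairs):
    """Component 3: keep validated candidates and attach a definition."""
    accepted = []
    for en_word, en_hypernym in pairs:
        candidate = translate_pair(en_word, en_hypernym)
        if candidate is not None and is_validated(candidate):
            candidate.gloss = TOY_DICTIONARY.get(candidate.word, "")
            accepted.append(candidate)
    return accepted


result = enrich([("dog", "animal"), ("car", "animal"), ("car", "vehicle")])
```

In this toy run only the ("dog", "animal") pair survives: ("car", "vehicle") has no translation for the hypernym, and ("car", "animal") fails validation because no article text is available for the translated word.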
Pages: 107-112
Page count: 6