Using Graph Mining Method in Analyzing Turkish Loanwords Derived from Arabic Language

被引:0
|
作者
Jassim, Abbood Kirebut [1 ]
Hamzah, Muneam Jabbar [1 ]
Aliwy, Ahmed Hussein [2 ]
机构
[1] Univ Baghdad, Coll Sci Women, Dept Comp Sci, Baghdad, Iraq
[2] Univ Kufa, Fac Comp Sci & Math, Depnt Comp Sci, Najaf, Iraq
关键词
Arabic language; Data mining; Graph mining; Loanwords; Turkish language;
D O I
10.21123/bsj.2022.6008
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Loanwords are the words transferred from one language to another, which become essential part of the borrowing language. The loanwords have come from the source language to the recipient language because of many reasons. Detecting these loanwords is complicated task due to that there are no standard specifications for transferring words between languages and hence low accuracy. This work tries to enhance this accuracy of detecting loanwords between Turkish and Arabic language as a case study. In this paper, the proposed system contributes to find all possible loanwords using any set of characters either alphabetically or randomly arranged. Then, it processes the distortion in the pronunciation, and solves the problem of the missing letters in Turkish language relative to Arabic language. A graph mining technique was introduced, for identifying the Turkish loanwords from Arabic language, which is used for the first time for this purpose. Also, the problem of letters differences, in the two languages, is solved by using a reference language (English) to unify the style of writing. The proposed system was tested using 1256 words that manually annotated. The obtained results showed that the f-measure is 0.99 which is high value for such system. Also, all these contributions lead to decrease time and effort to identify the loanwords in efficient and accurate way. Moreover, researchers do not need to have knowledge in the recipient and the source languages. In addition, this method can be generalized to any two languages using the same steps followed in obtaining Turkish loanwords from Arabic.
引用
收藏
页码:1369 / 1377
页数:9
相关论文
共 50 条
  • [1] Analyzing Web Layout Structures using Graph Mining
    Lam, Winnie W. M.
    Chan, Keith C. C.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 361 - 366
  • [2] Using Natural Language Processing For Analyzing Arabic Poetry Rhythm
    Ahmed, Munef Abdullah
    Trausan-Matu, Stefan
    [J]. 2017 16TH ROEDUNET CONFERENCE: NETWORKING IN EDUCATION AND RESEARCH (ROEDUNET), 2017,
  • [3] An approach to Mining Information from Telephone Graph Using Graph Mining Techniques
    Rao, Bapuji
    Mishra, S. N.
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2015, : 424 - 429
  • [4] Text Mining Analysis in Turkish Language Using Big Data Tools
    Cakir, Mehmet Ulas
    Guldamlasioglu, Seren
    [J]. PROCEEDINGS 2016 IEEE 40TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS, VOL 1, 2016, : 614 - 618
  • [5] Transportation data analyzing by using data mining method
    Luo, Qi
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 766 - 767
  • [6] Arabic sign language continuous sentences recognition using PCNN and graph matching
    M. F. Tolba
    Ahmed Samir
    Magdy Aboul-Ela
    [J]. Neural Computing and Applications, 2013, 23 : 999 - 1010
  • [7] Arabic sign language continuous sentences recognition using PCNN and graph matching
    Tolba, M. F.
    Samir, Ahmed
    Aboul-Ela, Magdy
    [J]. NEURAL COMPUTING & APPLICATIONS, 2013, 23 (3-4): : 999 - 1010
  • [8] Morphology and Adaptation of Three Loanwords from Arabic as Idiomatic Words in Idioms in the Spanish Language: balde, (h)erre and guajete
    Ruiz, Manuel Jose Aguilar
    [J]. RILCE-REVISTA DE FILOLOGIA HISPANICA, 2023, 39 (02): : 581 - 603
  • [9] Data Mining Regarding Cyberbullying in the Arabic Language on Instagram Using KNIME and Orange Tools
    Alzahrani, Shumaa Saeed
    [J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2022, 12 (05) : 9364 - 9371
  • [10] An empirical method using features combination for Arabic native language identification
    Mechti, Seifeddine
    Abbassi, Ayoub
    Belguith, Lamia Hadrich
    Faiz, Rim
    [J]. 2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,