A Comparative Study of Different Dimensionality Reduction Techniques for Arabic Machine Translation

Cited by: 1
Authors
Bensalah, Nouhaila [1]
Ayad, Habib [1]
Adib, Abdellah [1]
El Farouk, Abdelhamid Ibn [2]
Affiliations
[1] Univ Hassan 2, Data Sci & Artificial Intelligence, Casablanca 20000, Morocco
[2] Languages & Cultures Lab, Mohammadia, Morocco
Keywords
dimensionality reduction techniques; post-processing algorithm; Arabic machine translation; Transformer
DOI
10.1145/3634681
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Word embeddings are widely used across fundamental natural language processing applications and are also useful for generating representations of sentences, paragraphs, and documents. Because they are a core component of many natural language processing tasks, reducing their size can be beneficial in memory-constrained settings. Reducing the dimensionality of word embeddings makes them considerably more usable on memory-limited devices, which benefits many real-world applications. This article presents a comparative study of dimensionality reduction techniques for generating efficient lower-dimensional word vectors. In empirical experiments on the Arabic machine translation task, the post-processing algorithm combined with independent component analysis performed best among the dimensionality reduction techniques considered. We therefore propose a combination of the post-processing algorithm and independent component analysis that has not been investigated before. This combination was applied to both contextual and non-contextual word embeddings, reducing the size of the vectors while achieving better translation quality than the original embeddings.
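The combination described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the post-processing algorithm, so this sketch assumes the commonly used variant that centers the vectors and projects away their top principal directions; the function names, the number of removed directions (`n_remove`), the target dimensionality, and the synthetic data are all hypothetical and are not taken from the article.

```python
import numpy as np
from sklearn.decomposition import PCA, FastICA


def post_process(embeddings, n_remove=3):
    """Assumed post-processing step: center the vectors, then project
    away the top `n_remove` principal directions."""
    centered = embeddings - embeddings.mean(axis=0)
    pca = PCA(n_components=n_remove)
    pca.fit(centered)
    top = pca.components_  # shape (n_remove, dim), orthonormal rows
    return centered - centered @ top.T @ top


def reduce_embeddings(embeddings, n_components=20, n_remove=3, seed=0):
    """Post-process the vectors, then reduce them with independent
    component analysis (scikit-learn's FastICA)."""
    cleaned = post_process(embeddings, n_remove)
    ica = FastICA(n_components=n_components, random_state=seed, max_iter=1000)
    return ica.fit_transform(cleaned)


# Synthetic stand-in for an embedding matrix: 500 "words", 100 dimensions.
# Real experiments would load pretrained word vectors instead.
rng = np.random.default_rng(0)
sources = rng.laplace(size=(500, 100))
toy = sources @ rng.normal(size=(100, 100))

reduced = reduce_embeddings(toy, n_components=20)
print(reduced.shape)
```

In practice the target dimensionality and the number of removed directions would be tuned on the downstream translation task; the values here are placeholders.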
Pages: 17