Identification of Translationese: A Machine Learning Approach

被引:0
|
作者
Ilisei, Iustina [1 ]
Inkpen, Diana [2 ]
Pastor, Gloria Corpas [3 ]
Mitkov, Ruslan [1 ]
机构
[1] Wolverhampton Univ, Res Inst Informat & Language Proc, Wolverhampton WV1 1DJ, W Midlands, England
[2] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada
[3] Univ Malaga, Dept Translat & Interpreting, E-29071 Malaga, Spain
来源
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING | 2010年 / 6008卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a machine learning approach to the study of translationese. The goal is to train a computer system to distinguish between translated and non-translated text, in order to determine the characteristic features that influence the classifiers. Several algorithms reach up to 97.62% success rate on a technical dataset. Moreover, the SVM classifier consistently reports a statistically significant improved accuracy when the learning system benefits from the addition of simplification features to the basic translational classifier system. Therefore, these findings may be considered an argument for the existence of the Simplification Universal.
引用
收藏
页码:503 / +
页数:3
相关论文
共 50 条
  • [21] Identification of Spoken Language using Machine Learning Approach
    Shahriar, Md Asif
    Aziz, Iftekhar
    Banik, Shovan
    Sattar, Abdus
    2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
  • [22] Machine Translationese: Effects of Algorithmic Bias on Linguistic Complexity in Machine Translation
    Vanmassenhove, Eva
    Shterionov, Dimitar
    Gwilliam, Matthew
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2203 - 2213
  • [23] No more rage against the machine: how the corpus-based identification of machine-translationese can lead to student empowerment
    Loock, Rudy
    JOURNAL OF SPECIALISED TRANSLATION, 2020, (34): : 150 - 170
  • [24] Identification of Respiratory Phases Using Seismocardiogram: A Machine Learning Approach
    Zakeri, Vahid
    Tavakolian, Kouhyar
    2015 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), 2015, 42 : 305 - 308
  • [25] A Machine Learning Approach for Identification of Low-Head Dams
    Vinay, Salvador
    Hotchkiss, Rollin H.
    Ramirez, Saul
    WATER, 2023, 15 (04)
  • [26] Identification Of Physical Fatigue In Interval Training: A Machine Learning Approach
    Marotta, Luca
    Buurke, Jaap H.
    van Beijnum, Bert-Jan F.
    Stoel, Marleen
    Reenalda, Jasper
    MEDICINE & SCIENCE IN SPORTS & EXERCISE, 2022, 54 (09) : 6 - 6
  • [27] A Machine Learning Approach to Identification and Resolution of One-Anaphora
    Ng, Hwee Tou
    Zhou, Yu
    Dale, Robert
    Gardiner, Mary
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1105 - 1110
  • [28] Similarity Ranking as an Attribute for Machine Learning Approach to Authorship Identification
    Rygl, Jan
    Horak, Ales
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 726 - 729
  • [29] A Machine Learning Approach to Objective Identification of Dust in Satellite Imagery
    Berndt, E. B.
    Elmer, N. J.
    Junod, R. A.
    Fuell, K. K.
    Harkema, S. S.
    Burke, A. R.
    Feemster, C. M.
    EARTH AND SPACE SCIENCE, 2021, 8 (06)
  • [30] Identification of high-frequency trading: A machine learning approach
    Goudarzi, Mostafa
    Bazzana, Flavio
    RESEARCH IN INTERNATIONAL BUSINESS AND FINANCE, 2023, 66