English-Vietnamese cross-language paraphrase identification using hybrid feature classes

Cited by: 3
Authors
Dien Dinh [1]
Nguyen Le Thanh [1]
Affiliations
[1] VNU HCM, Univ Sci, Ho Chi Minh City, Vietnam
Keywords
Paraphrase identification; Semantic similarity; Cross-language; BabelNet; Vietnamese;
DOI
10.1007/s10732-019-09411-2
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Paraphrase identification plays an important role in many natural language processing tasks, such as machine translation, bilingual information retrieval, and plagiarism detection. With the development of information technology and the Internet, text comparison is required not only within a single language but also across many language pairs. In Vietnam especially, detecting paraphrases in English-Vietnamese sentence pairs is in high demand because English is one of the most popular foreign languages in the country. However, in-depth studies on cross-language paraphrase identification between English and Vietnamese are still limited. Therefore, in this paper, we propose a method to identify English-Vietnamese cross-language paraphrases using hybrid feature classes. These classes are calculated using a fuzzy-based method as well as a Siamese recurrent model, and are then combined with a mathematical formula to obtain the final result. The experimental results show that our model achieves an F-measure of 87.4%.
Pages: 193 - 209
Number of pages: 17
Related papers
50 in total
  • [1] English–Vietnamese cross-language paraphrase identification using hybrid feature classes
    Dien Dinh
    Nguyen Le Thanh
    Journal of Heuristics, 2022, 28 : 193 - 209
  • [2] English-Vietnamese Cross-Lingual Paraphrase Identification Using MT-DNN
    Hung Vo Tran Chi
    Duy Lu Anh
    Nguyen Le Thanh
    Dien Dinh
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2021, 11 (05) : 7598 - 7604
  • [3] Cross-Language Identification of Spanish Consonants in English
    Rose, Marda
    FOREIGN LANGUAGE ANNALS, 2012, 45 (03) : 415 - 429
  • [4] Vietnamese-English Cross-Lingual Paraphrase Identification Using Siamese Recurrent Architectures
    Le Thanh Nguyen
    Dinh Dien
    ISCIT 2019: PROCEEDINGS OF 2019 19TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2019, : 70 - 75
  • [5] A Hybrid Method for Word Segmentation with English-Vietnamese Bilingual Text
    Quoc Hung Ngo
    Dinh Dien
    Winiwarter, Werner
    2013 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2013,
  • [6] An English-Vietnamese Translation System Using Artificial Intelligence Approach
    Nguyen Van Binh
    Huynh Cong Phap
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2018, PT I, 2018, 10751 : 211 - 220
  • [7] Cross-language identification of consonants .1. Korean perception of English
    Schmidt, AM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (05) : 3201 - 3211
  • [8] Cross-Language Information Retrieval using Japanese and English WordNets
    Ueno, Ryo
    Klyuev, Vitaly
    2012 INTERNATIONAL CONFERENCE ON APPLIED INFORMATICS AND COMMUNICATION (ICAIC 2012), 2013, : 198 - 203
  • [9] CROSS-LANGUAGE PHONETIC INTERFERENCE - ARABIC TO ENGLISH
    FLEGE, JE
    PORT, R
    LANGUAGE AND SPEECH, 1981, 24 (APR-) : 125 - 146
  • [10] Cross-language identification of consonants. Part 1. Korean perception of English
    Schmidt, Anna Marie
    Journal of the Acoustical Society of America, 1996, 99 (05):