Korean-Chinese Person Name Translation for Cross Language Information Retrieval

被引:0
|
作者
Wang, Yu-Chun [1 ,2 ]
Lee, Yi-Hsun [1 ]
Lin, Chu-Cheng [1 ,3 ]
Tsai, Richard Tzong-Han [4 ]
Hsu, Wen-Lian [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan
[3] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[4] Yuan Ze Univ, Dept Comp Sci & Engn, Taoyuan, Taiwan
关键词
Person Name Translation; Korean-Chinese Cross Language Information Retrieval;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named entity translation plays an important role in many applications, such as information retrieval and machine translation. In this paper, we focus on translating person names, the most common type of name entity in Korean-Chinese cross language information retrieval (KCIR). Unlike other languages, Chinese uses characters (ideographs), which makes person name translation difficult because one syllable may map to several Chinese characters. We propose an effective hybrid person name translation method to improve the performance of KCIR. First, we use Wikipedia as a translation tool based on the inter-language links between the Korean edition and the Chinese or English editions. Second, we adopt the Naver people search engine to find the query name's Chinese or English translation. Third, we extract Korean-English transliteration pairs from Google snippets, and then search for the English-Chinese transliteration in the database of Taiwan's Central News Agency or in Google. The performance of KCIR using our method is over five times better than that of a dictionary-based system. The mean average precision is 0.3490 and the average recall is 0.7534. The method can deal with Chinese, Japanese, Korean, as well as non-CJK person name translation from Korean to Chinese. Hence, it substantially improves the performance of KCIR.
引用
收藏
页码:489 / +
页数:2
相关论文
共 50 条
  • [21] A Comparative Study on Translation Disambiguation for Cross-Language Information Retrieval
    Sadat, Fatiha
    KNOWLEDGE MANAGEMENT AND INNOVATION IN ADVANCING ECONOMIES-ANALYSES & SOLUTIONS, VOLS 1-3, 2009, : 1326 - 1337
  • [22] Statistical query translation models for cross-language information retrieval
    Microsoft Research
    不详
    不详
    不详
    不详
    ACM Trans. Asian Lang. Inf. Process., 2006, 4 (323-359): : 323 - 359
  • [23] Exploration on Korean-Chinese collaborative translation method based on recursive recurrent neural network
    Zhu, Lin
    PERSONAL AND UBIQUITOUS COMPUTING, 2020, 24 (02) : 309 - 318
  • [24] Ethnicity is in the blood, not in the language: exploring Korean-Chinese bilingual families' multilingual planning
    Cui, Huiling
    Zheng, Yongyan
    JOURNAL OF MULTILINGUAL AND MULTICULTURAL DEVELOPMENT, 2024, 45 (02) : 129 - 146
  • [25] An analysis of errors in Korean-Chinese public sector translation-A case study on COVID-19 information materials
    Li, Xian
    JOURNAL OF THE INTERNATIONAL NETWORK FOR KOREAN LANGUAGE AND CULTURE, 2022, 19 (02): : 203 - 228
  • [26] Cross Language Information Retrieval
    Poibeau, Thierry
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2010, 51 (03): : 157 - 162
  • [27] Korean-Chinese Machine Translation Using Three-Stage Verb Pattern Matching
    Yin, Chang Hao
    Seo, Young Ae
    Kim, Young-Gil
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES: LANGUAGE TECHNOLOGY FOR THE KNOWLEDGE-BASED ECONOMY, 2009, 5459 : 313 - 320
  • [28] Exploration on Korean-Chinese collaborative translation method based on recursive recurrent neural network
    Lin Zhu
    Personal and Ubiquitous Computing, 2020, 24 : 309 - 318
  • [29] Research on Chinese-English cross-language information retrieval
    Zhang, Tao
    Zhang, Yue-Jie
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2591 - +
  • [30] Research on English-Chinese cross-language information retrieval
    Zhang, Yue-Jie
    Zhang, Tao
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3448 - +