A Comparative Study on English-Chinese Machine Transliteration

被引:0
|
作者
Gao E. [1 ]
Duan X. [2 ]
机构
[1] School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou
[2] School of Computer Science and Technology, Soochow University, Suzhou
关键词
Deep neural network method; Machine transliteration; Statistical method; Transliteration alignment;
D O I
10.13209/j.0479-8023.2017.039
中图分类号
学科分类号
摘要
With the aim to study the two main methods on machine transliteration: traditional statistical method and the current prevalent deep neural network method, the authors carry out the comparative study on them with two typical systems per method The experiments show that traditional statistical method and deep neural network method perform comparatively regarding evaluation metrics, while manifest difference on individual transliteration result. A system combination method is proposed to balance the strengths of all systems. Experimental results show that system combination significantly improves the transliteration quality over single system. © 2017 Peking University.
引用
下载
收藏
页码:287 / 294
页数:7
相关论文
共 17 条
  • [1] Li H., Zhang M., Su J., A joint source-channel model for machine transliteration, Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, pp. 159-166, (2004)
  • [2] Pervouchine V., Li H., Lin B., Transliteration alignment, Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp. 136-144, (2009)
  • [3] Finch A., Liu L., Wang X., Et al., Neural network transduction models in transliteration generation, Proceedings of the Fifth Named Entity Workshop, pp. 61-66, (2015)
  • [4] Koehn P., Och F.J., Marcu D., Statistical phrase-based translation, Proceedings of the HLT/NAACL, pp. 127-133, (2003)
  • [5] Stolcke A., SRILM - an extensible language mode-ling toolkit, Proceedings of International Conference on Spoken Language Processing, pp. 901-904, (2002)
  • [6] Chen S.F., Goodman J.T., An Empirical Study of Smoothing Techniques for Language Modeling, (1998)
  • [7] Och F.J., Ney H., A systematic comparison of various statistical alignment models, Computational Linguistics, 29, 1, pp. 19-51, (2003)
  • [8] Koehn P., Hoang H., Birch A., Et al., Moses: open source toolkit for statistical machine translation, Proceedings of the Annual Meeting of the Association for Computational Linguistics, Demonstration Session, pp. 177-180, (2007)
  • [9] Li H.Z., Ma B., Lee C.H., A vector space modeling approach to spoken language identification, IEEE Transaction on Acoustic, Speech, Signal Processing, 15, 1, pp. 271-284, (2007)
  • [10] Liu L., Finch A., Utiyama M., Et al., Agreement on targetbidirectional LSTMs for sequence to sequence learning, Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, pp. 2630-2637, (2016)