From Feature to Paradigm: Deep Learning in Machine Translation (Extended Abstract)

被引:0
|
作者
Costa-jussa, Marta R. [1 ]
机构
[1] Univ Politecn Cataluna, TALP Res Ctr, Barcelona, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last years, deep learning algorithms have highly revolutionized several areas including speech, image and natural language processing. The specific field of Machine Translation (MT) has not remained invariant. Integration of deep learning in MT varies from re-modeling existing features into standard statistical systems to the development of a new architecture. Among the different neural networks, research works use feed-forward neural networks, recurrent neural networks and the encoder-decoder schema. These architectures are able to tackle challenges as having low-resources or morphology variations. This extended abstract focuses on describing the foundational works on the neural MT approach; mentioning its strengths and weaknesses; and including an analysis of the corresponding challenges and future work. The full manuscript [Costa-jussa, 2018] describes, in addition, how these neural networks have been integrated to enhance different aspects and models from statistical MT, including language modeling, word alignment, translation, reordering, and rescoring; and on describing the new neural MT approach together with recent approaches on using subword, characters and training with multilingual languages, among others.
引用
收藏
页码:5583 / 5587
页数:5
相关论文
共 50 条
  • [1] From feature to paradigm: Deep learning in machine translation
    [J]. Marta, Costa-JussàR. (marta.ruiz@upc.edu), 1600, AI Access Foundation (61):
  • [2] From Feature to Paradigm: Deep Learning in Machine Translation
    Costa-Jussa, Marta R.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 61 : 947 - 974
  • [3] Some elements of machine learning (Extended abstract)
    Quinlan, JR
    [J]. MACHINE LEARNING, PROCEEDINGS, 1999, : 523 - 525
  • [4] Some elements of machine learning - (Extended abstract)
    Quinlan, JR
    [J]. INDUCTIVE LOGIC PROGRAMMING, 1999, 1634 : 15 - 18
  • [5] Statistical Feature Aided Intelligent Deep Learning Machine Translation in Internet of Things
    Zhang, Yidian
    Zhang, Lin
    Lan, Ping
    Li, Wenyong
    Yang, Dan
    Wu, Zhiqiang
    [J]. MOBILE NETWORKS & APPLICATIONS, 2023, 28 (01): : 325 - 333
  • [6] Statistical Feature Aided Intelligent Deep Learning Machine Translation in Internet of Things
    Yidian Zhang
    Lin Zhang
    Ping Lan
    Wenyong Li
    Dan Yang
    Zhiqiang Wu
    [J]. Mobile Networks and Applications, 2023, 28 : 325 - 333
  • [7] Deep Residual Reinforcement Learning (Extended Abstract)
    Zhang, Shangtong
    Boehmer, Wendelin
    Whiteson, Shimon
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4869 - 4873
  • [8] Learning from Failure [Extended Abstract]
    Grollman, Daniel H.
    Billard, Aude G.
    [J]. PROCEEDINGS OF THE 6TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTIONS (HRI 2011), 2011, : 145 - 146
  • [9] A Visual Programming Paradigm for Abstract Deep Learning Model Development
    Tamilselvam, Srikanth G.
    Panwar, Naveen
    Khare, Shreya
    Aralikatte, Rahul
    Sankaran, Anush
    Mani, Senthil
    [J]. PROCEEDINGS OF THE 10TH INDIAN CONFERENCE ON HUMAN-COMPUTER INTERACTION (INDIA HCI 2019), 2019, : 133 - 143
  • [10] Cryptocurrencies asset pricing via machine learning: Extended abstract
    Wang, Qiyu
    [J]. 2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 789 - 790