Multilingual Grammar Induction with Continuous Language Identification

被引:0
|
作者
Han, Wenjuan [1 ]
Wang, Ge [1 ]
Jiang, Yong [2 ]
Tu, Kewei [1 ]
机构
[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
[2] Alibaba Grp, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The key to multilingual grammar induction is to couple grammar parameters of different languages together by exploiting the similarity between languages. Previous work relies on linguistic phylogenetic knowledge to specify similarity between languages. In this work, we propose a novel universal grammar induction approach that represents language identities with continuous vectors and employs a neural network to predict grammar parameters based on the representation. Without any prior linguistic phylogenetic knowledge, we automatically capture similarity between languages with the vector representations and softly tie the grammar parameters of different languages. In our experiments, we apply our approach to 15 languages across 8 language families and subfamilies in the Universal Dependency Treebank dataset, and we observe substantial performance gain on average over monolingual and multilingual baselines.
引用
收藏
页码:5728 / 5733
页数:6
相关论文
共 50 条
  • [21] A unified system for multilingual speech recognition and language identification
    Liu, Danyang
    Xu, Ji
    Zhang, Pengyuan
    Yan, Yonghong
    SPEECH COMMUNICATION, 2021, 127 : 17 - 28
  • [22] Enhancing multilingual recognition of emotion in speech by language identification
    Sagha, Hesam
    Matejka, Pavel
    Gavryukova, Maryna
    Povolny, Filip
    Marchi, Erik
    Schuller, Bjoern
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2949 - 2953
  • [23] GF: A Multilingual Grammar Formalism
    Ranta, Aarne
    LANGUAGE AND LINGUISTICS COMPASS, 2009, 3 (05):
  • [24] Collaborative Multilingual Continuous Sign Language Recognition: A Unified Framework
    Hu, Hezhen
    Pu, Junfu
    Zhou, Wengang
    Li, Houqiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7559 - 7570
  • [25] English as a foreign language teacher beliefs about reflective and interlinguistic grammar instruction regarding didactic multilingual grammar sequences
    Garcia-pastor, Maria Dolores
    REVISTA INTERUNIVERSITARIA DE FORMACION DEL PROFESORADO-RIFOP, 2024, (99): : 189 - 208
  • [26] Language Identification: A New Fast Algorithm to Identify the Language of a Text in a Multilingual Corpus
    Gadri, Said
    Moussaoui, Abdelouahab
    Belabdelouahab-Fernini, Linda
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 321 - 326
  • [27] Learning classifier system approach to natural language grammar induction
    Unold, Olgierd
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 2, PROCEEDINGS, 2007, 4488 : 1210 - 1213
  • [28] Language identification of multilingual posts from Twitter: a case study
    Ferran Pla
    Lluís-F. Hurtado
    Knowledge and Information Systems, 2017, 51 : 965 - 989
  • [29] An Evaluation of Multilingual Offensive Language Identification Methods for the Languages of India
    Ranasinghe, Tharindu
    Zampieri, Marcos
    INFORMATION, 2021, 12 (08)
  • [30] Language Identification oriented to Multilingual Speech Recognition in the Basque context
    Barroso, Nora
    Lopez de Ipina, Karmele
    Barroso, Odei
    Ezeiza, Aitzol
    Susperregi, Unai
    2010 IEEE CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2010,