FedKC: Federated Knowledge Composition for Multilingual Natural Language Understanding

被引：9

作者：

Wang, Haoyu ^{[1
]}

Zhao, Handong ^{[2
]}

Wang, Yaqing ^{[1
]}

Yu, Tong ^{[2
]}

Gu, Jiuxiang ^{[2
]}

Gao, Jing ^{[1
]}

机构：

[1] Purdue Univ, W Lafayette, IN 47907 USA

[2] Adobe Res, San Jose, CA USA

来源：

PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22) | 2022年

基金：

美国国家科学基金会;

关键词：

Federated learning; Multilingual natural language understanding;

D O I：

10.1145/3485447.3511988

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multilingual natural language understanding, which aims to comprehend multilingual documents, is an important task. Existing efforts have been focusing on the analysis of centrally stored text data, but in real practice, multilingual data is usually distributed. Federated learning is a promising paradigm to solve this problem, which trains local models with decentralized data on local clients and aggregates local models on the central server to achieve a good global model. However, existing federated learning methods assume that data are independent and identically distributed (IID), and cannot handle multilingual data, that are usually non-IID with severely skewed distributions: First, multilingual data is stored on local client devices such that there are only monolingual or bilingual data stored on each client. This makes it difficult for local models to know the information of documents in other languages. Second, the distribution over different languages could be skewed. High resource language data is much more abundant than low resource language data. The model trained on such skewed data may focus more on high resource languages but fail to consider the key information of low resource languages. To solve the aforementioned challenges of multilingual federated NLU, we propose a plug-and-play knowledge composition (KC) module, called FedKC, which exchanges knowledge among clients without sharing raw data. Specifically, we propose an effective way to calculate a consistency loss defined based on the shared knowledge across clients, which enables models trained on different clients achieve similar predictions on similar data. Leveraging this consistency loss, joint training is thus conducted on distributed data respecting the privacy constraints. We also analyze the potential risk of FedKC and provide theoretical bound to show that it is difficult to recover data from the corrupted data. We conduct extensive experiments on three public multilingual datasets for three typical NLU tasks, including paraphrase identification, question answering matching, and news classification. The experiment results show that the proposed FedKC can outperform state-of-the-art baselines on the three datasets significantly.

引用

页码：1839 / 1850

页数：12

共 50 条

[31] Combining large language models with enterprise knowledge graphs: a perspective on enhanced natural language understanding
Mariotti, Luca
Guidetti, Veronica
Mandreoli, Federica
Belli, Andrea
Lombardi, Paolo
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
[32] The language of thought and natural language understanding
Knowles, J
ANALYSIS, 1998, 58 (04) : 264 - 272
[33] Language clustering and knowledge sharing in multilingual organizations: A social perspective on language
Ahmad, Farhan
Widen, Gunilla
JOURNAL OF INFORMATION SCIENCE, 2015, 41 (04) : 430 - 443
[34] Analysis of knowledge data discovery and mining by construction of natural language understanding system
Wang, QiuFen
Guo, HuiLing
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMMERCE AND SOCIETY, 2015, 17 : 640 - 644
[35] Generating Grammars for Natural Language Understanding from Knowledge about Actions and Objects
Perzylo, Alexander
Griffiths, Sascha
Lafrenz, Reinhard
Knoll, Alois
2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 2008 - 2013
[36] Knowledge-driven Natural Language Understanding of English Text and its Applications
Basu, Kinjal
Varanasi, Sarat Chandra
Shakerin, Farhad
Arias, Joaquin
Gupta, Gopal
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12554 - 12563
[37] Prior Knowledge Driven Label Embedding for Slot Filling in Natural Language Understanding
Zhu, Su
Zhao, Zijian
Ma, Rao
Yu, Kai
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1440 - 1451
[38] Multilingual spatial domain natural language interface to databases
Wang, Wenlu
Li, Jingjing
Ku, Wei-Shinn
Wang, Haixun
GEOINFORMATICA, 2024, 28 (01) : 29 - 52
[39] Multilingual spatial domain natural language interface to databases
Wenlu Wang
Jingjing Li
Wei-Shinn Ku
Haixun Wang
GeoInformatica, 2024, 28 : 29 - 52
[40] LANGUAGE AND UNDERSTANDING IN SCIENTIFIC KNOWLEDGE
Stefanov, Angel
FILOSOFIYA-PHILOSOPHY, 2014, 23 (02): : 111 - 116

← 1 2 3 4 5 →