SeaLLMs - Large Language Models for Southeast Asia

被引：0

作者：

Xuan-Phi Nguyen ^{[1
]}

Zhang, Wenxuan ^{[1
]}

Li, Xin ^{[1
]}

Aljunied, Mahani ^{[1
]}

Hu, Zhiqiang ^{[1
]}

Shen, Chenhui ^{[1
]}

Chia, Yew Ken ^{[1
]}

Li, Xingxuan ^{[1
]}

Wang, Jianyu ^{[1
]}

Tan, Qingyu ^{[1
]}

Cheng, Liying ^{[1
]}

Chen, Guanzheng ^{[1
]}

Deng, Yue ^{[1
]}

Yang, Sen ^{[1
]}

Liu, Chaoqun ^{[1
]}

Zhang, Hang ^{[1
]}

Bing, Lidong ^{[1
]}

机构：

[1] Alibaba Grp, DAMO Acad, Hangzhou, Zhejiang, Peoples R China

来源：

PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 3: SYSTEM DEMONSTRATIONS | 2024年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Despite the remarkable achievements of large language models (LLMs) in various tasks, there remains a linguistic bias that favors high-resource languages, such as English, often at the expense of low-resource and regional languages. To address this imbalance, we introduce SeaLLMs, an innovative series of language models that specifically focuses on Southeast Asian (SEA) languages. SeaLLMs are built upon popular English-centric models through continued pre-training with an extended vocabulary, specialized instruction and alignment tuning to better capture the intricacies of regional languages. This allows them to respect and reflect local cultural norms, customs, stylistic preferences, and legal considerations. Our comprehensive evaluation demonstrates that SeaLLM models exhibit superior performance across a wide spectrum of linguistic tasks and assistant-style instruction-following capabilities relative to comparable open-source models. Moreover, they outperform ChatGPT-3.5 in non-Latin languages, such as Thai, Khmer, Lao, and Burmese, by large margins while remaining lightweight and cost-effective to operate.

引用

页码：294 / 304

页数：11

共 50 条

[21] CLT using CEFR and EIL in Southeast Asia and East Asia in the English language classroom
Foley, Joseph
RELC JOURNAL, 2022, 53 (01) : 240 - 252
[22] Large independents find opportunities in Southeast Asia deep water
Knight, R
Wright, H
OIL & GAS JOURNAL, 2004, 102 (44) : 41 - +
[23] Large Language Models in der WissenschaftLarge language models in science
Karl-Friedrich Kowalewski
Severin Rodler
Die Urologie, 2024, 63 (9) : 860 - 866
[24] Community forestry models in southeast Asia and Cambodia: A comparative study
Sokh, H
Iida, S
JOURNAL OF THE FACULTY OF AGRICULTURE KYUSHU UNIVERSITY, 2001, 46 (01): : 113 - 121
[25] Language, Education and Nation-Building: Assimilation and Shift in Southeast Asia
Lo Bianco, Joseph
JOURNAL OF SOCIOLINGUISTICS, 2015, 19 (03) : 404 - 409
[26] LANGUAGE POLICY AND MODERNITY IN SOUTHEAST ASIA (MALAYSIA, THE PHILIPPINES, SINGAPORE AND THAILAND)
Ridge, Brian
AUSTRALIAN REVIEW OF APPLIED LINGUISTICS, 2007, 30 (01)
[27] Plurilingual Teaching of a Second Foreign Language for Students from Southeast Asia
Ragozina, Alena, V
Obdalova, Olga A.
TOMSK STATE UNIVERSITY JOURNAL, 2021, (465): : 164 - 171
[28] Southeast Asia
Cressey, George B.
ANNALS OF THE ASSOCIATION OF AMERICAN GEOGRAPHERS, 1952, 42 (01) : 105 - 107
[29] Southeast Asia
Rosinger, Lawrence K.
ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 1952, 279 : 245 - 246
[30] Southeast Asia
Quijon, Carlos, Jr.
ARTFORUM INTERNATIONAL, 2020, 58 (07): : 234 - 234

← 1 2 3 4 5 →