Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models

Cited by: 0
Authors
Kim, Seungduk [1]
Choi, Seungtaek [1]
Jeong, Myeongho [1]
Affiliations
[1] Yanolja, Republic of Korea
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Related papers
50 in total
  • [1] Towards efficient and effective unlearning of large language models for recommendation
    Wang, Hangyu
    Lin, Jianghao
    Chen, Bo
    Yang, Yang
    Tang, Ruiming
    Zhang, Weinan
    Yu, Yong
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (03)
  • [2] Efficient handling of multilingual language models
    Fügen, C
    Stüker, S
    Soltau, H
    Metze, F
    Schultz, T
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 441 - 446
  • [3] Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
    Khassanov, Yerbolat
    Chng, Eng Siong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3343 - 3347
  • [4] A survey of multilingual large language models
    Qin, Libo
    Chen, Qiguang
    Zhou, Yuhang
    Chen, Zhi
    Li, Yinghui
    Liao, Lizi
    Li, Min
    Che, Wanxiang
    Yu, Philip S.
    PATTERNS, 2025, 6 (01)
  • [5] Efficient Multilingual Language Model Compression through Vocabulary Trimming
    Ushio, Asahi
    Zhou, Yi
    Camacho-Collados, Jose
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14725 - 14739
  • [6] ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
    Lai, Viet Dac
    Nguyen, Nghia Trung
    Ben Veyseh, Amir Pouran
    Man, Hieu
    Dernoncourt, Franck
    Bui, Trung
    Nguyen, Thien Huu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 13171 - 13189
  • [7] Multilingual Jailbreak Challenges in Large Language Models
    Deng, Yue
    Zhang, Wenxuan
    Pan, Sinno Jialin
    Bing, Lidong
    arXiv, 2023
  • [8] On Leveraging Large Language Models for Multilingual Intent Discovery
    Chow, Rudolf
    Suen, King Yiu
    Lam, Albert Y. S.
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2025, 16 (01)
  • [9] Directions Towards Efficient and Automated Data Wrangling with Large Language Models
    Zhang, Zeyu
    Groth, Paul
    Calixto, Iacer
    Schelter, Sebastian
    2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 301 - 304
  • [10] XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models
    Liang, Davis
    Gonen, Hila
    Mao, Yuning
    Hou, Rui
    Goyal, Naman
    Ghazvininejad, Marjan
    Zettlemoyer, Luke
    Khabsa, Madian
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13142 - 13152