Extending Context Window of Large Language Models via Semantic Compression

Cited by: 0
Authors
Fei, Weizhi [1, 2]
Niu, Xueyan [1, 2]
Zhou, Pingyi [3]
Hou, Lu [3]
Bai, Bo [2]
Deng, Lei [2]
Han, Wei [2]
Affiliations
[1] Tsinghua Univ, Dept Math Sci, Beijing, Peoples R China
[2] Huawei Technol Co Ltd, Theory Lab, 2012 Labs, Shenzhen, Peoples R China
[3] Huawei Technol Co Ltd, Noahs Ark Lab, 2012 Labs, Shenzhen, Peoples R China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Because of their quadratic complexity, Transformer-based Large Language Models (LLMs) often limit the length of the text input to ensure fluent and relevant responses. This limit restricts their applicability in long-text scenarios. In this paper, we propose a novel semantic compression method that enables generalization to texts that are 6-8 times longer without incurring significant computational costs or requiring fine-tuning. Our proposed framework draws inspiration from source coding in information theory and employs a pre-trained model to reduce the semantic redundancy of long inputs before passing them to the LLMs for downstream tasks. Experimental results demonstrate that our method effectively extends the context window of LLMs across a range of tasks including question answering, summarization, few-shot learning, and information retrieval. Furthermore, the proposed semantic compression method exhibits consistent fluency in text generation while reducing the associated computational overhead.
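The compress-then-prompt idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`split_into_chunks`, `toy_compressor`, `compress_for_llm`) are hypothetical, the word-count chunking is an assumption, and the frequency-based sentence picker is only a stand-in for the pre-trained model the abstract mentions.

```python
import re
from collections import Counter
from typing import Callable, List

SENTENCE_BOUNDARY = r"(?<=[.!?])\s+"


def split_into_chunks(text: str, max_words: int) -> List[str]:
    """Greedily pack whole sentences into chunks of at most max_words words."""
    sentences = [s for s in re.split(SENTENCE_BOUNDARY, text.strip()) if s]
    chunks, current, count = [], [], 0
    for sentence in sentences:
        n = len(sentence.split())
        # Start a new chunk when adding this sentence would exceed the budget.
        if current and count + n > max_words:
            chunks.append(" ".join(current))
            current, count = [], 0
        current.append(sentence)
        count += n
    if current:
        chunks.append(" ".join(current))
    return chunks


def toy_compressor(chunk: str, keep: int = 1) -> str:
    """Hypothetical stand-in for a pre-trained summarizer.

    Keeps the `keep` sentences whose words are, on average, most frequent
    within the chunk, preserving their original order.
    """
    sentences = [s for s in re.split(SENTENCE_BOUNDARY, chunk) if s]
    freq = Counter(w.lower() for w in re.findall(r"\w+", chunk))

    def score(sentence: str) -> float:
        words = re.findall(r"\w+", sentence)
        return sum(freq[w.lower()] for w in words) / max(len(words), 1)

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    return " ".join(sentences[i] for i in sorted(ranked[:keep]))


def compress_for_llm(text: str,
                     compressor: Callable[[str], str] = toy_compressor,
                     chunk_words: int = 50) -> str:
    """Compress each chunk independently, then join the results in order."""
    return " ".join(compressor(c) for c in split_into_chunks(text, chunk_words))
```

The shortened text returned by `compress_for_llm` would then be placed in the LLM prompt in place of the original long input; swapping `toy_compressor` for a real summarization model is the part this sketch leaves open.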
Pages: 5169-5181
Number of pages: 13
Related papers
50 in total
  • [21] Bootstrapping Multilingual Semantic Parsers using Large Language Models
    Awasthi, Abhijeet
    Gupta, Nitish
    Samanta, Bidisha
    Dave, Shachi
    Sarawagi, Sunita
    Talukdar, Partha
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 2455 - 2467
  • [22] Detecting hallucinations in large language models using semantic entropy
    Farquhar, Sebastian
    Kossen, Jannik
    Kuhn, Lorenz
    Gal, Yarin
    NATURE, 2024, 630 (8017) : 625 - +
  • [23] Semantic Understanding of Traffic Scenes with Large Vision Language Models
    Jain, Sandesh
    Thapa, Surendrabikram
    Chen, Kuan-Ting
    Abbott, A. Lynn
    Sarkar, Abhijit
    IEEE Intelligent Vehicles Symposium, Proceedings, 2024, : 1580 - 1587
  • [24] Semantic Understanding of Traffic Scenes with Large Vision Language Models
    Jain, Sandesh
    Thapa, Surendrabikram
    Chen, Kuan-Ting
    Abbott, A. Lynn
    Sarkar, Abhijit
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1580 - 1587
  • [25] Extending Building Information Models Semiautomatically Using Semantic Natural Language Processing Techniques
    Zhang, Jiansong
    El-Gohary, Nora M.
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2016, 30 (05)
  • [26] Game Generation via Large Language Models
    Hu, Chengpeng
    Zhao, Yunlong
    Liu, Jialin
    2024 IEEE CONFERENCE ON GAMES, COG 2024, 2024,
  • [27] Text Classification via Large Language Models
    Sun, Xiaofei
    Li, Xiaoya
    Li, Jiwei
    Wu, Fei
    Guo, Shangwei
    Zhang, Tianwei
    Wang, Guoyin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8990 - 9005
  • [28] Concise and Precise Context Compression for Tool-Using Language Models
    Xu, Yang
    Feng, Yunlong
    Mu, Honglin
    Hon, Yutai
    Li, Yitong
    Wang, Xinghao
    Zhong, Wanjun
    Li, Zhongyang
    Tu, Dandan
    Zhu, Qingfu
    Zhang, Min
    Che, Wanxiang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 16430 - 16441
  • [29] Studying large language models as compression algorithms for human culture
    Buttrick, Nicholas
    TRENDS IN COGNITIVE SCIENCES, 2024, 28 (03) : 187 - 189
  • [30] Unveiling the potential of large language models in generating semantic and cross-language clones
    Roy, Palash R.
    Alam, Ajmain I.
    Al-omari, Farouq
    Roy, Banani
    Roy, Chanchal K.
    Schneider, Kevin A.
    2023 IEEE 17TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES, IWSC 2023, 2023, : 22 - 28