Artificial Intelligence Language Model Based on Long-term Memory Enhancement Technology

被引:0
|
作者
Zhang, Yi [1 ]
机构
[1] Intellifusion Pty Ltd, Melbourne, Vic, Australia
关键词
artificial intelligence language model; long-term memory enhancement; attention mechanism;
D O I
10.1145/3677779.3677817
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study explores the application of long-term memory enhancement technology in artificial intelligence language models. By integrating attention mechanisms, hierarchical network design, and dynamic pruning techniques, this research significantly enhances the model's ability to process long sequence texts. The new model demonstrates lower perplexity on multiple standard test sets, effectively proving the practical benefits of the adopted technologies. While there is room for improvement in handling unstructured text and high-noise data, the achieved results clearly showcase the immense potential of long-term memory enhancement technology in improving language models. Future work will focus on improving the model's computational efficiency and enhancing its adaptability in more complex application scenarios.
引用
收藏
页码:232 / 237
页数:6
相关论文
共 50 条
  • [1] Intelligence, learning and long-term memory
    Alexander, JRM
    Smales, S
    PERSONALITY AND INDIVIDUAL DIFFERENCES, 1997, 23 (05) : 815 - 825
  • [2] CREB and the enhancement of long-term memory
    Nguyen, PV
    TRENDS IN NEUROSCIENCES, 2001, 24 (06) : 314 - 314
  • [3] Simulating the Coevolution of Language and Long-Term Memory
    Shuai, Lan
    Wang, Zhen
    Gong, Tao
    2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 1374 - 1381
  • [4] Artificial Intelligence for Long-Term Robot Autonomy: A Survey
    Kunze, Lars
    Hawes, Nick
    Duckett, Tom
    Hanheide, Marc
    Krajnik, Tomas
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 4023 - 4030
  • [5] A Collaborative Framework With Artificial Intelligence for Long-Term Care
    Chou, Hsien-Ming
    IEEE ACCESS, 2020, 8 : 43657 - 43664
  • [6] Long-Term Trends in the Public Perception of Artificial Intelligence
    Fast, Ethan
    Horvitz, Eric
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 963 - 969
  • [7] SHORT AND LONG-TERM PREDICTION BASED ON COMPLEX STATISTICS AND ARTIFICIAL INTELLIGENCE TECHNIQUES
    El Biely, Mona Mustafa
    ADVANCES AND APPLICATIONS IN STATISTICS, 2018, 52 (04) : 267 - 280
  • [8] Energy Usage Forecasting Model Based on Long Short-Term Memory (LSTM) and eXplainable Artificial Intelligence (XAI)
    Maarif, Muhammad Rifqi
    Saleh, Arif Rahman
    Habibi, Muhammad
    Fitriyani, Norma Latif
    Syafrudin, Muhammad
    INFORMATION, 2023, 14 (05)
  • [9] LONG-TERM BACKGROUND MEMORY BASED ON GAUSSIAN MIXTURE MODEL
    Zhao, W.
    Zhao, X. D.
    Liu, W. M.
    Tang, X. L.
    2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
  • [10] A Memory Model Based on the Siamese Network for Long-Term Tracking
    Lee, Hankyeol
    Choi, Seokeon
    Kim, Changick
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 100 - 115