Large language model for patent concept generation

被引:0
|
作者
Ren, Runtao [1 ]
Ma, Jian [1 ]
Luo, Jianxi [2 ]
机构
[1] City Univ Hong Kong, Dept Informat Syst, Kowloon Tong, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Syst Engn, Kowloon Tong, Hong Kong, Peoples R China
关键词
Generative AI; Large language model; Finetuning; Patent;
D O I
10.1016/j.aei.2025.103301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In traditional innovation practices, concept and IP generation are often iteratively integrated. Both processes demand an intricate understanding of advanced technical domain knowledge. Existing large language models (LLMs), while possessing massive pre-trained knowledge, often fall short in the innovative concept generation due to a lack of specialized knowledge necessary for the generation. To bridge this critical gap, we propose a novel knowledge finetuning (KFT) framework to endow LLM-based AI with the ability to autonomously mine, understand, and apply domain-specific knowledge and concepts for invention generation, i.e., concept and patent generation together. Our proposed PatentGPT integrates knowledge injection pre-training (KPT), domainspecific supervised finetuning (SFT), and reinforcement learning from human feedback (RLHF). Extensive evaluation shows that PatentGPT significantly outperforms the state-of-the-art models on patent-related benchmark tests. Our method not only provides new insights into data-driven innovation but also paves a new path to fine-tune LLMs for applications in the context of technology. We also discuss the managerial and policy implications of AI-generating inventions in the future.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] LogExpert: Log-based Recommended Resolutions Generation using Large Language Model
    Wang, Jiabo
    Chu, Guojun
    Wang, Jingyu
    Sun, Haifeng
    Qi, Qi
    Wang, Yuanyi
    Qi, Ji
    Liao, Jianxin
    2024 IEEE/ACM 46TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: NEW IDEAS AND EMERGING RESULTS, ICSE-NIER 2024, 2024, : 42 - 46
  • [32] Chinese Text Open Domain Tag Generation Method via Large Language Model
    He, Chunhui
    Ge, Bin
    Zhang, Chong
    2024 10TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS, BIGDIA 2024, 2024, : 183 - 188
  • [33] Multi-Intent Inline Code Comment Generation via Large Language Model
    Zhang, Xiaowei
    Chen, Zhifei
    Cao, Yulu
    Chen, Lin
    Zhou, Yuming
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2024, 34 (06) : 845 - 868
  • [34] RTLLM: An Open-Source Benchmark for Design RTL Generation with Large Language Model
    Lu, Yao
    Liu, Shang
    Zhang, Qijun
    Xie, Zhiyao
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 722 - 727
  • [35] Fine-Tuning a Large Language Model with Reinforcement Learning for Educational Question Generation
    Lamsiyah, Salima
    El Mahdaouy, Abdelkader
    Nourbakhsh, Aria
    Schommer, Christoph
    ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, AIED 2024, 2024, 14829 : 424 - 438
  • [36] ChatPCG: Large Language Model-Driven Reward Design for Procedural Content Generation
    Baek, In-Chang
    Park, Tae-Hwa
    Noh, Jin-Ha
    Bae, Cheong-Mok
    Kim, Kyung-Joong
    2024 IEEE CONFERENCE ON GAMES, COG 2024, 2024,
  • [37] Automatic item generation in various STEM subjects using large language model prompting
    Park, Joonhyeong (joonhyeong.park@nie.edu.sg), 2025, 8
  • [38] Performance of a Large Language Model in the Generation of Clinical Guidelines for Antibiotic Prophylaxis in Spine Surgery
    Zaidat, Bashar
    Shrestha, Nancy
    Rosenberg, Ashley M.
    Ahmed, Wasil
    Rajjoub, Rami
    Hoang, Timothy
    Mejia, Mateo Restrepo
    Duey, Akiro H.
    Tang, Justin E.
    Kim, Jun S.
    Cho, Samuel K.
    NEUROSPINE, 2024, 21 (01) : 128 - 146
  • [39] Continually Tuning a Large Language Model for Multi-domain Radiology Report Generation
    Sun, Yihua
    Khor, Hee Guan
    Wang, Yuanzheng
    Wang, Zhuhao
    Zhao, Hongliang
    Zhang, Yu
    Ma, Longfei
    Zheng, Zhuozhao
    Liao, Hongen
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 177 - 187
  • [40] Synthetic Skeleton Data Generation using Large Language Model for Nurse Activity Recognition
    Dobhal, Umang
    Garcia, Christina
    Inoue, Sozo
    COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 493 - 499