Large language model for patent concept generation

被引:0
|
作者
Ren, Runtao [1 ]
Ma, Jian [1 ]
Luo, Jianxi [2 ]
机构
[1] City Univ Hong Kong, Dept Informat Syst, Kowloon Tong, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Syst Engn, Kowloon Tong, Hong Kong, Peoples R China
关键词
Generative AI; Large language model; Finetuning; Patent;
D O I
10.1016/j.aei.2025.103301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In traditional innovation practices, concept and IP generation are often iteratively integrated. Both processes demand an intricate understanding of advanced technical domain knowledge. Existing large language models (LLMs), while possessing massive pre-trained knowledge, often fall short in the innovative concept generation due to a lack of specialized knowledge necessary for the generation. To bridge this critical gap, we propose a novel knowledge finetuning (KFT) framework to endow LLM-based AI with the ability to autonomously mine, understand, and apply domain-specific knowledge and concepts for invention generation, i.e., concept and patent generation together. Our proposed PatentGPT integrates knowledge injection pre-training (KPT), domainspecific supervised finetuning (SFT), and reinforcement learning from human feedback (RLHF). Extensive evaluation shows that PatentGPT significantly outperforms the state-of-the-art models on patent-related benchmark tests. Our method not only provides new insights into data-driven innovation but also paves a new path to fine-tune LLMs for applications in the context of technology. We also discuss the managerial and policy implications of AI-generating inventions in the future.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Patent data-assisted concept generation for new product development
    Yang W.
    Cao G.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (03): : 992 - 1010
  • [22] Language model for multilingual natural language generation
    Zhang, Dongmo
    Ge, Yong
    Yao, Tianfang
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2000, 34 (07): : 944 - 947
  • [23] Validation of concept representation using natural language generation
    Baud, RH
    Rodrigues, JM
    Wagner, JC
    Rassinoux, AM
    Lovis, C
    Rush, P
    Trombert-Paviot, B
    Scherrer, JR
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1997, : 841 - 841
  • [24] Retrieval augmentation of large language models for lay language generation
    Guo, Yue
    Qiu, Wei
    Leroy, Gondy
    Wang, Sheng
    Cohen, Trevor
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 149
  • [25] Medical concept systems, lexicons and natural language generation
    Wagner, JC
    Lovis, C
    Baud, RH
    Scherrer, JR
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 1997, 1211 : 398 - 401
  • [26] Retrieval augmentation of large language models for lay language generation
    Guo, Yue
    Qiu, Wei
    Leroy, Gondy
    Wang, Sheng
    Cohen, Trevor
    Journal of Biomedical Informatics, 2024, 149
  • [27] A Determination Method of Robot Motion from Language Instruction with Error Correction of Motion Generation by Large Language Model
    Suzuki, Takahiro
    Hashimoto, Manabu
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2024, 90 (11): : 859 - 866
  • [28] Data extraction for evidence synthesis using a large language model: A proof-of-concept study
    Gartlehner, Gerald
    Kahwati, Leila
    Hilscher, Rainer
    Thomas, Ian
    Kugley, Shannon
    Crotty, Karen
    Viswanathan, Meera
    Nussbaumer-Streit, Barbara
    Booth, Graham
    Erskine, Nathaniel
    Konet, Amanda
    Chew, Robert
    RESEARCH SYNTHESIS METHODS, 2024, 15 (04) : 576 - 589
  • [29] Model tuning or prompt Tuning? a study of large language models for clinical concept and relation extraction
    Peng, Cheng
    Yang, Xi
    Smith, Kaleb E.
    Yu, Zehao
    Chen, Aokun
    Bian, Jiang
    Wu, Yonghui
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 153
  • [30] Natural Language Generation System for Knowledge Acquisition Based on Patent Database
    Rene, Antonio Oliveira Nzinga
    Okuhara, Koji
    Matsui, Takeshi
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2022, 26 (02) : 160 - 168