Patent classification by fine-tuning BERT language model

被引:68
|
作者
Lee, Jieh-Sheng [1 ]
Hsiang, Jieh [1 ]
机构
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
关键词
D O I
10.1016/j.wpi.2020.101965
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
In this work we focus on fine-tuning a pre-trained BERT model and applying it to patent classification. When applied to large datasets of over two million patents, our approach outperforms the state of the art by an approach using CNN with word embeddings. Besides, we focus on patent claims without other parts in patent documents. Our contributions include: (1) a new state-of-the-art result based on pre-trained BERT model and fine-tuning for patent classification, (2) a large dataset USPTO-3M at the CPC subclass level with SQL statements that can be used by future researchers, (3) showing that patent claims alone are sufficient to achieve state-of-the-art results for classification task, in contrast to conventional wisdom.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Patent classification by fine-tuning BERT language model (vol 61, 101965, 2020)
    Lee, Jieh-Sheng
    [J]. WORLD PATENT INFORMATION, 2022, 71
  • [2] Universal Language Model Fine-tuning for Text Classification
    Howard, Jeremy
    Ruder, Sebastian
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 328 - 339
  • [3] BERT MODEL FINE-TUNING FOR TEXT CLASSIFICATION IN KNEE OA RADIOLOGY REPORTS
    Chen, L.
    Shah, R.
    Link, T.
    Bucknor, M.
    Majumdar, S.
    Pedoia, V.
    [J]. OSTEOARTHRITIS AND CARTILAGE, 2020, 28 : S315 - S316
  • [4] Hierarchical BERT with an adaptive fine-tuning strategy for document classification
    Kong, Jun
    Wang, Jin
    Zhang, Xuejie
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 238
  • [5] An Application of Transfer Learning: Fine-Tuning BERT for Spam Email Classification
    Bhopale, Amol P.
    Tiwari, Ashish
    [J]. MACHINE LEARNING AND BIG DATA ANALYTICS (PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND BIG DATA ANALYTICS (ICMLBDA) 2021), 2022, 256 : 67 - 77
  • [6] Fine-Tuning BERT Model for Materials Named Entity Recognition
    Zhao, Xintong
    Greenberg, Jane
    An, Yuan
    Hu, Xiaohua Tony
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3717 - 3720
  • [7] LAMBERT: Leveraging Attention Mechanisms to Improve the BERT Fine-Tuning Model for Encrypted Traffic Classification
    Liu, Tao
    Ma, Xiting
    Liu, Ling
    Liu, Xin
    Zhao, Yue
    Hu, Ning
    Ghafoor, Kayhan Zrar
    [J]. MATHEMATICS, 2024, 12 (11)
  • [8] Transfer fine-tuning of BERT with phrasal paraphrases
    Arase, Yuki
    Tsujii, Junichi
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 66
  • [9] Research Paper Classification and Recommendation System based-on Fine-Tuning BERT
    Biswas, Dipto
    Gil, Joon-Min
    [J]. 2023 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE, IRI, 2023, : 295 - 296
  • [10] Research Paper Classification and Recommendation System based-on Fine-Tuning BERT
    Biswas, Dipto
    Gil, Joon-Min
    [J]. Proceedings - 2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science, IRI 2023, 2023, : 295 - 296