Automatic Patents Classification Using Supervised Machine Learning

被引:1
|
作者
Shahid, Muhammad [1 ]
Ahmed, Adeel [2 ]
Mushtaq, Muhammad Faheem [3 ]
Ullah, Saleem [3 ]
Matiullah [3 ]
Akram, Urooj [3 ]
机构
[1] Govt Sadiq Egerton SE Coll, Dept Phys, Bahawalpur, Pakistan
[2] Quaid I Azam Univ, Dept Comp Sci, Islamabad, Pakistan
[3] Khwaja Fareed Univ Engn & Informat Technol, Fac Comp Sci & Informat Technol, Rahim Yar Khan, Pakistan
关键词
Classification; Supervised learning; Unigram; BM25; TF-IDF; SMART notations; TEXT;
D O I
10.1007/978-3-030-36056-6_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Every year, approximately one million patent documents are issued with unique patent number or symbol. In order to find the relevant patent document, several users query the IPC documents using IPC symbols. So, there is a need of automatic classification and ranking of patent documents w.r.t. user query. Automatic classification is only possible through supervised machine learning techniques. In this paper, we classified patent documents using common classifiers. We collected 1625 patent documents related to eight different classes taken from IPC website using web crawler in an unstructured text. We considered 90% of training and 10% of test samples of the total patents. We built a feature matrix using tf-idf, smart notations and BM25 weighting schemes. This feature matrix is given to each classifier as input and output of each classifier consists of correctly classified and incorrectly classified instances. Finally, we evaluated the accuracy of each classifier using precision, recall and F-measure. We performed comparative analysis of classifiers and observed that by adding more features to each classifier, accuracy of classifier can be improved.
引用
收藏
页码:297 / 307
页数:11
相关论文
共 50 条
  • [2] Survey on supervised machine learning techniques for automatic text classification
    Kadhim, Ammar Ismael
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (01) : 273 - 292
  • [3] Survey on supervised machine learning techniques for automatic text classification
    Ammar Ismael Kadhim
    Artificial Intelligence Review, 2019, 52 : 273 - 292
  • [4] Automatic classification of white regions in liver biopsies by supervised machine Learning
    Vanderbeck, Scott
    Bockhorst, Joseph
    Komorowski, Richard
    Kleiner, David E.
    Gawrieh, Samer
    HUMAN PATHOLOGY, 2014, 45 (04) : 785 - 792
  • [5] Protostellar classification using supervised machine learning algorithms
    Miettinen, O.
    ASTROPHYSICS AND SPACE SCIENCE, 2018, 363 (09)
  • [6] Automatic flow classification using machine learning
    Anantavrasilp, Isara
    Schoeler, Thorsten
    SOFTCOM 2007: 15TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS, 2007, : 390 - +
  • [7] Automatic Vulnerability Classification Using Machine Learning
    Gawron, Marian
    Cheng, Feng
    Meinel, Christoph
    RISKS AND SECURITY OF INTERNET AND SYSTEMS, CRISIS 2017, 2018, 10694 : 3 - 17
  • [8] Protostellar classification using supervised machine learning algorithms
    O. Miettinen
    Astrophysics and Space Science, 2018, 363
  • [9] Classification of Migraine Disease using Supervised Machine Learning
    Gulati, Seema
    Guleria, Kalpna
    Goyal, Nitin
    2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions), ICRITO 2022, 2022,
  • [10] Using supervised machine learning for large-scale classification in management research: The case for identifying artificial intelligence patents
    Miric, Milan
    Jia, Nan
    Huang, Kenneth G.
    STRATEGIC MANAGEMENT JOURNAL, 2023, 44 (02) : 491 - 519