Automatic Patents Classification Using Supervised Machine Learning

被引:1
|
作者
Shahid, Muhammad [1 ]
Ahmed, Adeel [2 ]
Mushtaq, Muhammad Faheem [3 ]
Ullah, Saleem [3 ]
Matiullah [3 ]
Akram, Urooj [3 ]
机构
[1] Govt Sadiq Egerton SE Coll, Dept Phys, Bahawalpur, Pakistan
[2] Quaid I Azam Univ, Dept Comp Sci, Islamabad, Pakistan
[3] Khwaja Fareed Univ Engn & Informat Technol, Fac Comp Sci & Informat Technol, Rahim Yar Khan, Pakistan
关键词
Classification; Supervised learning; Unigram; BM25; TF-IDF; SMART notations; TEXT;
D O I
10.1007/978-3-030-36056-6_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Every year, approximately one million patent documents are issued with unique patent number or symbol. In order to find the relevant patent document, several users query the IPC documents using IPC symbols. So, there is a need of automatic classification and ranking of patent documents w.r.t. user query. Automatic classification is only possible through supervised machine learning techniques. In this paper, we classified patent documents using common classifiers. We collected 1625 patent documents related to eight different classes taken from IPC website using web crawler in an unstructured text. We considered 90% of training and 10% of test samples of the total patents. We built a feature matrix using tf-idf, smart notations and BM25 weighting schemes. This feature matrix is given to each classifier as input and output of each classifier consists of correctly classified and incorrectly classified instances. Finally, we evaluated the accuracy of each classifier using precision, recall and F-measure. We performed comparative analysis of classifiers and observed that by adding more features to each classifier, accuracy of classifier can be improved.
引用
收藏
页码:297 / 307
页数:11
相关论文
共 50 条
  • [21] Automatic spontaneous pain recognition using supervised classification learning algorithms
    Rupenga, Moses
    Vadapalli, Hima B.
    2016 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2016,
  • [22] Automatic Classification of Vulnerabilities using Deep Learning and Machine Learning Algorithms
    Ramesh, Vishnu
    Abraham, Sara
    Vinod, P.
    Mohamed, Isham
    Visaggio, Corrado A.
    Laudanna, Sonia
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [23] Classification of Patents according to Industry 4.0 Pillars using Machine Learning Algorithms
    Jafery, Wan Ain Zubaidah Wan Chek
    Ahmad, Noor Azurati
    Omar, Mohd Shahir Shamsir
    Ithnin, Hafizah
    2019 6TH INTERNATIONAL CONFERENCE ON RESEARCH AND INNOVATION IN INFORMATION SYSTEMS: EMPOWERING DIGITAL INNOVATION (ICRIIS 2019), 2019,
  • [24] Classification of lidar measurements using supervised and unsupervised machine learning methods
    Farhani, Ghazal
    Sica, Robert J.
    Daley, Mark Joseph
    ATMOSPHERIC MEASUREMENT TECHNIQUES, 2021, 14 (01) : 391 - 402
  • [25] Classification of Space Particle Events using Supervised Machine Learning Algorithms
    Saric, Rijad
    Chen, Junchao
    Krstic, Milos
    Custovic, Edhem
    Panic, Goran
    Kevric, Jasmin
    Jokic, Dejan
    2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2021,
  • [26] Classification of Potentially Hazardous Asteroids Using Supervised Quantum Machine Learning
    Bhavsar, Rushir
    Jadav, Nilesh Kumar
    Bodkhe, Umesh
    Gupta, Rajesh
    Tanwar, Sudeep
    Sharma, Gulshan
    Bokoro, Pitshou N.
    Sharma, Ravi
    IEEE ACCESS, 2023, 11 : 75829 - 75848
  • [27] Research on internet traffic classification techniques using supervised machine learning
    Information Networking Institute, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
    不详
    High Technol Letters, 2009, 4 (369-377):
  • [28] On Fault Classification in Connected Autonomous Vehicles Using Supervised Machine Learning
    Khalil, Abdelrahman
    Al Janaideh, Mohammad
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1198 - 1204
  • [29] Research on internet traffic classification techniques using supervised machine learning
    李君
    High Technology Letters, 2009, 15 (04) : 369 - 377