Automatic Patents Classification Using Supervised Machine Learning

被引:1
|
作者
Shahid, Muhammad [1 ]
Ahmed, Adeel [2 ]
Mushtaq, Muhammad Faheem [3 ]
Ullah, Saleem [3 ]
Matiullah [3 ]
Akram, Urooj [3 ]
机构
[1] Govt Sadiq Egerton SE Coll, Dept Phys, Bahawalpur, Pakistan
[2] Quaid I Azam Univ, Dept Comp Sci, Islamabad, Pakistan
[3] Khwaja Fareed Univ Engn & Informat Technol, Fac Comp Sci & Informat Technol, Rahim Yar Khan, Pakistan
关键词
Classification; Supervised learning; Unigram; BM25; TF-IDF; SMART notations; TEXT;
D O I
10.1007/978-3-030-36056-6_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Every year, approximately one million patent documents are issued with unique patent number or symbol. In order to find the relevant patent document, several users query the IPC documents using IPC symbols. So, there is a need of automatic classification and ranking of patent documents w.r.t. user query. Automatic classification is only possible through supervised machine learning techniques. In this paper, we classified patent documents using common classifiers. We collected 1625 patent documents related to eight different classes taken from IPC website using web crawler in an unstructured text. We considered 90% of training and 10% of test samples of the total patents. We built a feature matrix using tf-idf, smart notations and BM25 weighting schemes. This feature matrix is given to each classifier as input and output of each classifier consists of correctly classified and incorrectly classified instances. Finally, we evaluated the accuracy of each classifier using precision, recall and F-measure. We performed comparative analysis of classifiers and observed that by adding more features to each classifier, accuracy of classifier can be improved.
引用
收藏
页码:297 / 307
页数:11
相关论文
共 50 条
  • [31] LEACH Based WSN Classification Using Supervised Machine Learning Algorithm
    Mustary, Shabnom
    Abul Kashem, Mohammod
    Khan, Nurul Islam
    Jewel, Faruq Ahmed
    Islam, Monirul
    Islam, Saiful
    2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [32] Hindi Poetry Classification using Eager Supervised Machine Learning Algorithms
    Bafna, Prafulla
    Saini, Jatinderkumar R.
    2020 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2020, : 175 - 178
  • [33] Automatic text classification using machine learning and optimization algorithms
    Janani, R.
    Vijayarani, S.
    SOFT COMPUTING, 2021, 25 (02) : 1129 - 1145
  • [34] An Automatic Flower Classification Approach Using Machine Learning Algorithms
    Zawbaa, Hossam M.
    Abbass, Mona
    Basha, Sameh H.
    Hazman, Maryam
    Hassenian, Abul Ella
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 895 - 901
  • [35] Automatic Classification of Lung Sounds Using Machine Learning Algorithms
    Ullah, Ahmad
    Khan, Misha Urooj
    Mujahid, Farrukh
    Khan, Muhammad Salman
    2021 INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT 2021), 2021, : 131 - 136
  • [36] Automatic Electronic Invoice Classification Using Machine Learning Models
    Bardelli, Chiara
    Rondinelli, Alessandro
    Vecchio, Ruggero
    Figini, Silvia
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2020, 2 (04): : 617 - 629
  • [37] Automatic medical protocol classification using machine learning approaches
    Lopez-Ubeda, Pilar
    Diaz-Galiano, Manuel Carlos
    Martin-Noguerol, Teodoro
    Luna, Antonio
    Urena-Lopez, L. Alfonso
    Martin-Valdivia, M. Teresa
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 200
  • [38] Automatic Classification of Foot Thermograms Using Machine Learning Techniques
    Filipe, Vitor
    Teixeira, Pedro
    Teixeira, Ana
    ALGORITHMS, 2022, 15 (07)
  • [39] Aquatic weed automatic classification using machine learning techniques
    Pereira, Luis A. M.
    Nakamura, Rodrigo Y. M.
    de Souza, Guilherme F. S.
    Martins, Dagoberto
    Papa, Joao P.
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2012, 87 : 56 - 63
  • [40] A methodology for part classification with supervised machine learning
    Rucco, Matteo
    Giannini, Franca
    Lupinetti, Katia
    Monti, Marina
    AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 2019, 33 (01): : 100 - 113