A Study of a Patent Documents Classification System Using Rough Set Theory and Machine Translation

被引:1
|
作者
Kurematsu, M. [1 ]
机构
[1] Iwate Prefectural Univ, Takizawa Sugo 152-52, Takizawa, Iwate, Japan
关键词
Rough Set Theory; Machine Translation; Document Classification; Patent Documents; Mahalanobis Distance;
D O I
10.3233/FAIA200583
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We should check whether there are any existing patent documents whose claims fall foul of our idea or innovation before we submit own idea or innovation as a patent. We need a lot of resource to do it, because there are a lot of existing patent documents. These days, we can submit the patent documents by a computer and the number of patent documents is increasing quickly. Therefore, people need a system to support this task. In order to meet this demand, I propose a framework of a system to classify patent documents in this paper. This system uses machine translation to deal with synonym and Rough Set theory to classify patent documents. First, it extracts decision rules by Rough Set Theory from labeled patent documents translated by machine translation. Then, it classifies unlabeled patent documents by estimating labels based on the weight of the matched rules. In this approach, the satisfactory index (SI), the coverage index (CI) and the Lift value are used as the weight of rules and they are compared with the maximum number, the total number and the Mahalanobis distance. I evaluated this idea by classifying Japanese patent documents using a prototype system based on this idea. In the evaluation, the accuracy was about 0.40 and the accuracy has not reached the practical level. Therefore I will apply this approach to other document classification task and improve it based on the analysis result of them.
引用
收藏
页码:389 / 398
页数:10
相关论文
共 50 条
  • [1] Classification and rule induction using rough set theory
    Beynon, M
    Curry, B
    Morgan, P
    [J]. EXPERT SYSTEMS, 2000, 17 (03) : 136 - 148
  • [2] Uncertain Data Classification Using Rough Set Theory
    Suresh, G. Vijay
    Reddy, E. Venkateswara
    Reddy, E. Srinivasa
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 869 - +
  • [3] Anonymizing classification data using rough set theory
    Ye, Mingquan
    Wu, Xindong
    Hu, Xuegang
    Hu, Donghui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 43 : 82 - 94
  • [4] Rough set machine translation using deep structure and transfer learning
    Hassan, Yasser F.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (06) : 4149 - 4159
  • [5] HVAC Fault Diagnosis System Using Rough Set Theory and Support Vector Machine
    Li Xuemei
    Shao Ming
    Ding Lixing
    [J]. WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 895 - +
  • [6] Sentiment Analysis System for Classification of Patient-Generated Health Reviews Using Rough Set Theory and Machine Learning Technique
    Almagrabi, Alaa Omran
    Ahmad, Shakeel
    [J]. JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2020, 10 (10) : 2361 - 2368
  • [7] Rough Set Theory in the Classification of Diagnoses
    Rakus-Andersson, Elisabeth
    [J]. COMPUTERS IN MEDICAL ACTIVITY, 2009, 65 : 41 - 51
  • [8] Recognizing Coordinate Structures for Machine Translation of English Patent Documents
    Roh, Yoon-Hyung
    Lee, Ki-Young
    Choi, Sung-Kwon
    Kwon, Oh-Woog
    Kim, Young-Gil
    [J]. PACLIC 22: PROCEEDINGS OF THE 22ND PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2008, : 460 - 466
  • [9] A clinical decision support system using rough set theory and machine learning for disease prediction
    Singh, Kamakhya Narain
    Mantri, Jibendu Kumar
    [J]. INTELLIGENT MEDICINE, 2024, 4 (03): : 200 - 208
  • [10] Study on CAPP System based on Rough Set Theory
    Li, Xiaobin
    Jiang, Derong
    [J]. ADVANCED RESEARCH ON INFORMATION SCIENCE, AUTOMATION AND MATERIAL SYSTEM, PTS 1-6, 2011, 219-220 : 1060 - +