Automatically Classifying Chinese Judgment Documents Using Character-Level Convolutional Neural Networks

Cited by: 0
Authors
Zhou, Xiaosong [1, 2]
Li, Chuanyi [1, 2]
Ge, Jidong [1, 2]
Li, Zhongjin [3]
Zhou, Xiaoyu [1, 2]
Luo, Bin [1, 2]
Affiliations
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Univ, Software Inst, Nanjing, Jiangsu, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
Chinese judgment documents; Text classification; Character-level convolutional neural networks; Overfitting
DOI
10.1007/978-3-319-97310-4_49
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A judgment is a decision by a court or other tribunal that resolves a controversy and determines the rights and obligations of the parties. Since the establishment of the China Judgments Online System, more and more judgment documents have been stored online. With the explosive growth in the number of Chinese judgment documents, the need for automated classification methods is becoming increasingly urgent. For Chinese data sets, traditional word-level methods often introduce extra errors during word segmentation. In this paper, we propose an approach based on character-level convolutional neural networks to automatically classify Chinese judgment documents. Unlike traditional machine learning methods, we hand over the work of feature detection to the model. Throughout the process, the only part that requires human labor is labeling the category of each original document. To prevent overfitting when the amount of training data is not very large, we use a shallow model that has only one convolution layer. The proposed approach achieves high classification accuracy on a data set of 7923 Chinese judgment documents. The effectiveness of our model is also satisfactory.
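The character-level idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the sample sentences, the helper names (`build_char_vocab`, `encode`, `conv1d_max_pool`), the random weights, and all sizes (`max_len`, `dim`, `n_filters`, `width`) are assumptions chosen for illustration. It shows only how a document can be turned into character ids without word segmentation and passed through a single convolution layer with max-over-time pooling; training and the final classification layer are omitted.

```python
import random


def build_char_vocab(docs):
    """Map each distinct character to an integer id; id 0 is reserved for padding/unknown."""
    chars = sorted({ch for doc in docs for ch in doc})
    return {ch: i + 1 for i, ch in enumerate(chars)}


def encode(doc, vocab, max_len):
    """Truncate or zero-pad a document to max_len character ids (no word segmentation)."""
    ids = [vocab.get(ch, 0) for ch in doc[:max_len]]
    return ids + [0] * (max_len - len(ids))


def conv1d_max_pool(ids, embed, filters, width):
    """Apply one convolution layer, ReLU, and max-over-time pooling to a sequence of ids."""
    x = [embed[i] for i in ids]          # seq_len x dim character embeddings
    dim = len(x[0])
    features = []
    for f in filters:                    # each filter holds width x dim weights
        best = 0.0                       # starting at 0.0 applies the ReLU floor
        for start in range(len(x) - width + 1):
            s = sum(f[k][d] * x[start + k][d]
                    for k in range(width) for d in range(dim))
            best = max(best, s)
        features.append(best)
    return features


# Hypothetical sample texts, not taken from the paper's data set.
docs = ["原告诉称被告拖欠货款", "被告人犯盗窃罪判处有期徒刑"]
vocab = build_char_vocab(docs)

# Illustrative sizes and untrained random weights (assumptions).
max_len, dim, n_filters, width = 16, 4, 3, 3
rng = random.Random(0)
embed = [[0.0] * dim] + [[rng.uniform(-1, 1) for _ in range(dim)] for _ in vocab]
filters = [[[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(width)]
           for _ in range(n_filters)]

encoded = [encode(d, vocab, max_len) for d in docs]
features = [conv1d_max_pool(ids, embed, filters, width) for ids in encoded]
```

Each document ends up as a fixed-length feature vector with one value per filter; in a full model these vectors would feed a classifier over the document categories.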
Pages: 430 - 437
Number of pages: 8
Related papers
50 items in total
  • [41] Character-Level Neural Language Modelling in the Clinical Domain
    Kreuzthaler, Markus
    Oleynik, Michel
    Schulz, Stefan
    [J]. DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 83 - 87
  • [42] Construction of consistency judgment system of diploma policy and curriculum policy using character-level CNN
    Miyazaki, Kazuteru
    Ida, Masaaki
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2019, 102 (12) : 30 - 39
  • [43] Classifying patient portal messages using Convolutional Neural Networks
    Sulieman, Lina
    Gilmore, David
    French, Christi
    Cronin, Robert M.
    Jackson, Gretchen Purcell
    Russell, Matthew
    Fabbri, Daniel
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 74 : 59 - 70
  • [44] A Complaint Text Classification Model Based on Character-level Convolutional Network
    Tong, Xuesong
    Wu, Bin
    Wang, Shuyang
    Lv, Jinna
    [J]. PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 507 - 511
  • [45] The Handwritten Chinese Character Recognition use Convolutional neural networks with the GoogLenet
    Chen, Jiahao
    Bi, Bing
    Yang, Kang
    Tan, Jun
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 2 - 7
  • [46] The Handwritten Chinese Character Recognition Uses Convolutional Neural Networks with the GoogLeNet
    Bi, Ning
    Chen, Jiahao
    Tan, Jun
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (11)
  • [47] CHARM: An Improved Method for Chinese Precoding and Character-Level Embedding
    Fan, Xiaoming
    Shi, Tuo
    Cai, Jiayan
    Wang, Binjun
    [J]. IEEE ACCESS, 2021, 9 : 129539 - 129551
  • [48] Chinese Sentiment Analysis Based on Lightweight Character-Level BERT
    Tang, Fuhong
    Nongpong, Kwankamol
    [J]. 2021 13TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST-2021), 2021, : 27 - 32
  • [49] Chinese text classification based on character-level CNN and SVM
    Wu, Huaiguang
    Li, Daiyi
    Cheng, Ming
    [J]. International Journal of Intelligent Information and Database Systems, 2019, 12 (03) : 212 - 228
  • [50] Automatically Classify Chinese Judgment Documents Utilizing Machine Learning Algorithms
    Lei, Miaomiao
    Ge, Jidong
    Li, Zhongjin
    Li, Chuanyi
    Zhou, Yemao
    Zhou, Xiaoyu
    Luo, Bin
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), 2017, 10179 : 3 - 17