Using the MF/MD method for automatic text classification

被引:0
|
作者
de Mönnink, I [1 ]
Brom, N [1 ]
Oostdijk, N [1 ]
机构
[1] Catholic Univ Nijmegen, NL-6500 HC Nijmegen, Netherlands
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
In corpus linguistics, but also in computational linguistics and information retrieval, there is an increasing demand for the automatic classification of large amounts of text(s). In his research, Biber uses the Multi-Feature/Multi-Dimension (MF/MD) method to obtain a classification of English texts. A major disadvantage of his approach is the heavy reliance on the frequency count of complex grammatical features which are hard to retrieve automatically. In this paper, we investigate whether Biber's MF/MD method can be used for automatic text classification. For this purpose, the MF/MD method is applied to the ICE-GB corpus, using three different sets of linguistic features. The results indicate that automatic text classification is indeed feasible using word class tags as input for the MR/MD method(1).
引用
收藏
页码:15 / 25
页数:11
相关论文
共 50 条
  • [1] A New Method of Automatic Text Document Classification
    Yatsko, V. A.
    [J]. AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2021, 55 (03) : 122 - 133
  • [2] A New Method of Automatic Text Document Classification
    V. A. Yatsko
    [J]. Automatic Documentation and Mathematical Linguistics, 2021, 55 : 122 - 133
  • [3] Automatic text classification using words networks
    Pablo Cardenas, Juan
    Olivares, Gaston
    Alfaro, Rodrigo
    [J]. REVISTA SIGNOS, 2014, 47 (86): : 346 - 364
  • [4] A combined weight method in automatic classification of Chinese text
    Liao, SS
    Jiang, MH
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 625 - 630
  • [5] Automatic Text Classification using Modified Centroid Classifier
    Elmarhumy, Mahmoud
    Fattah, Mohamed Abdel
    Ren, Fuji
    [J]. IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 282 - +
  • [6] Automatic text classification using an artificial neural network
    de Mello, RF
    Senger, LJ
    Yang, LT
    [J]. HIGH PERFORMANCE COMPUTATIONAL SCIENCE AND ENGINEERING, 2004, 172 : 215 - +
  • [7] Research on Feature Selection Method in Chinese Text Automatic Classification
    Hong, Ying
    Shao, Xiwen
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 1759 - 1763
  • [8] The Automatic Text Classification Method Based on BERT and Feature Union
    Li, Wenting
    Gao, Shangbing
    Zhou, Hong
    Huang, Zihe
    Zhang, Kewen
    Li, Wei
    [J]. 2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 774 - 777
  • [9] Automatic Text Classification Method Based on Zipf's Law
    Yatsko, V. A.
    [J]. AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2015, 49 (03) : 83 - 88
  • [10] Research on feature selection method in Chinese text automatic classification
    Hong, Ying
    Geng, Zengmin
    [J]. ENERGY SCIENCE AND APPLIED TECHNOLOGY, 2016, : 359 - 361