Automatic classification of accounting literature

Cited by: 17
Authors
Chakraborty, Vasundhara [1]
Chiu, Victoria [2]
Vasarhelyi, Miklos [3]
Affiliations
[1] Ramapo Coll, Mahwah, NJ 07430 USA
[2] SUNY Coll New Paltz, New Paltz, NY 12561 USA
[3] Rutgers State Univ, Piscataway, NJ 08855 USA
Keywords
Accounting literature; Automatic classification; Taxonomy; Attributes; Semantic parsing; Data mining; INFORMATION; CONSTRUCTION; SYSTEM; STATE; ART
DOI
10.1016/j.accinf.2014.01.001
Chinese Library Classification number
F [Economics]
Discipline classification code
02
Abstract
This paper explores the possibility of using semantic parsing, information retrieval, and data mining techniques to classify accounting research automatically. Literature taxonomization plays a critical role in understanding a discipline's knowledge attributes and structure. Traditional research classification is a manual process that is time-consuming and can yield inconsistent classifications across experts. To address this problem, we conducted three studies to find the most effective and accurate method for classifying the attributes of accounting publications. The third study was the most rewarding: classification accuracy reached 87.27% when decision trees and rule-based algorithms were applied. The first and second studies also yielded valuable implications for automatic literature classification, e.g., abstracts are better inputs than keywords, and balancing under-represented subclasses does not make classifications more accurate. The results of all three studies also suggest that expanding the article sample is key to strengthening automatic classification accuracy. Overall, this line of research appears promising and could have several collateral benefits and applications. (C) 2014 Elsevier Inc. All rights reserved.
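To make the idea of rule-based classification of abstracts concrete, here is a minimal sketch in Python. It is not the authors' pipeline: it swaps in a one-word decision stump (a depth-one decision tree) trained on presence or absence of a word, and the function names, the four training abstracts, and the labels "auditing" and "tax" are all invented for illustration.

```python
import re
from collections import Counter


def tokens(text):
    """Return the set of lowercase alphabetic words in a text."""
    return set(re.findall(r"[a-z]+", text.lower()))


def fit_stump(abstracts, labels):
    """Pick the single word whose presence/absence best predicts the label.

    Returns (word, label_if_present, label_if_absent). This is a toy
    depth-one decision tree, assumed here only for illustration.
    """
    docs = [tokens(a) for a in abstracts]
    vocab = sorted(set().union(*docs))
    best = None
    for word in vocab:
        present = Counter(l for d, l in zip(docs, labels) if word in d)
        absent = Counter(l for d, l in zip(docs, labels) if word not in d)
        if not present or not absent:
            continue  # the word does not split the training set
        # Training examples classified correctly by majority vote per branch.
        correct = max(present.values()) + max(absent.values())
        if best is None or correct > best[0]:
            best = (correct, word,
                    present.most_common(1)[0][0],
                    absent.most_common(1)[0][0])
    if best is None:
        raise ValueError("no word splits the training abstracts")
    return best[1:]


def predict(stump, abstract):
    """Apply the learned one-word rule to a new abstract."""
    word, if_present, if_absent = stump
    return if_present if word in tokens(abstract) else if_absent


# Hypothetical training data; not taken from the paper.
train_abstracts = [
    "We survey auditors about internal control judgments",
    "An experiment on auditors and fraud risk assessment",
    "Tax avoidance and corporate tax rates across firms",
    "Estimating tax compliance with archival data",
]
train_labels = ["auditing", "auditing", "tax", "tax"]

stump = fit_stump(train_abstracts, train_labels)
new_label = predict(stump, "How auditors evaluate going-concern evidence")
```

On this toy data the learned rule keys on the word "auditors". A realistic study would need many more labeled abstracts, which is in line with the abstract's remark that sample size drives accuracy.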
Pages: 122-148
Number of pages: 27