An Ontology-based Semantic Clustering Algorithm for Accounting Text

Cited by: 0
Authors
Jiang, Yanhui [1]
Li, Mo [1]
Yao, Kaohua [1]
Affiliations
[1] Hunan Univ, Business Sch, Changsha 410006, Hunan, Peoples R China
Keywords
text mining; similarity; clustering; semantics; accounting text
DOI
Not available
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Feature selection and the computation of semantic similarity between texts are essential components of accounting text clustering. Several approaches to generic text feature selection and similarity computation have been proposed, exploiting different measures (vector space model, word frequency, thesauri, domain corpora, etc.). However, the accounting field differs from general domains: it has its own concepts and rules, so these generic methods are not well suited to accounting text clustering. In this paper, a novel accounting-ontology-based feature selection and similarity computation algorithm for accounting text is proposed. First, the accounting texts are characterized to obtain a term vector. Second, the term vector is mapped onto concepts of the accounting ontology and converted into a concept vector. Based on the structure of the concepts, the semantic similarity between texts is computed. Then, through an improved clustering method, the accounting texts are clustered effectively. The experimental results imply that our proposal outperforms most of the previous measures and eliminates some of their limitations.
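The pipeline outlined in the abstract (term vector, concept vector, structure-based similarity, clustering) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' method: the toy ontology `PARENT`, the `TERM_TO_CONCEPT` mapping, the sample `TEXTS`, and the threshold are hypothetical; Wu-Palmer similarity stands in for the paper's unspecified structure-based measure; and a greedy leader clustering stands in for the "improved clustering method".

```python
from collections import Counter

# Hypothetical toy accounting ontology: child concept -> parent concept.
PARENT = {
    "Asset": "AccountingConcept",
    "Liability": "AccountingConcept",
    "CurrentAsset": "Asset",
    "Cash": "CurrentAsset",
    "Receivable": "CurrentAsset",
    "Loan": "Liability",
    "Payable": "Liability",
}

# Hypothetical mapping from surface terms to ontology concepts.
TERM_TO_CONCEPT = {
    "cash": "Cash", "deposit": "Cash",
    "receivable": "Receivable", "invoice": "Receivable",
    "loan": "Loan", "debt": "Loan",
    "payable": "Payable",
}

# Hypothetical sample texts.
TEXTS = [
    "cash deposit received",
    "invoice receivable outstanding",
    "bank loan debt",
    "payable loan due",
]


def path_to_root(concept):
    """Return the concept followed by all of its ancestors up to the root."""
    path = [concept]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def depth(concept):
    """Depth in the ontology tree, counting the root as depth 1."""
    return len(path_to_root(concept))


def concept_similarity(a, b):
    """Wu-Palmer similarity: 2 * depth(lcs) / (depth(a) + depth(b))."""
    ancestors_a = set(path_to_root(a))
    # First ancestor of b that is also an ancestor of a is the deepest common one.
    lcs = next(c for c in path_to_root(b) if c in ancestors_a)
    return 2 * depth(lcs) / (depth(a) + depth(b))


def concept_vector(text):
    """Map whitespace-separated terms to concepts and count them."""
    tokens = text.lower().split()
    return Counter(TERM_TO_CONCEPT[t] for t in tokens if t in TERM_TO_CONCEPT)


def directed_similarity(u, v):
    """Weighted average, over concepts in u, of the best match found in v."""
    total = sum(u.values())
    return sum(w * max(concept_similarity(c, d) for d in v)
               for c, w in u.items()) / total


def text_similarity(u, v):
    """Symmetric similarity between two concept vectors."""
    if not u or not v:
        return 0.0
    return (directed_similarity(u, v) + directed_similarity(v, u)) / 2


def cluster(texts, threshold=0.6):
    """Greedy leader clustering: join the first cluster whose leader is similar enough."""
    vectors = [concept_vector(t) for t in texts]
    clusters = []  # each cluster is a list of text indices; the first is the leader
    for i, vec in enumerate(vectors):
        for members in clusters:
            if text_similarity(vectors[members[0]], vec) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


if __name__ == "__main__":
    print(cluster(TEXTS))
```

In this sketch, texts about cash and receivables end up together because both concepts sit under the same parent in the toy ontology, even though the texts share no surface terms; a purely term-based vector space model would treat them as unrelated.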
Pages: 59-67
Number of pages: 9