Big data-assisted urban governance: A comprehensive system for business documents classification of the government hotline

被引:1
|
作者
Zhang, Zicheng [1 ,2 ,3 ,6 ]
Li, Anguo [4 ]
Wang, Li [5 ]
Cao, Wei [6 ]
Yang, Jianlin [2 ,3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Modern Posts, Nanjing 210003, Peoples R China
[2] Nanjing Univ, Sch Informat Management, Nanjing 210023, Peoples R China
[3] Knowledge Serv, Jiangsu Key Lab Data Engn, Nanjing 210023, Peoples R China
[4] Beihang Univ, Sino French Engineer Sch, Beijing, Peoples R China
[5] Nanjing Univ, Sch Business, Nanjing 210093, Peoples R China
[6] Nanjing Huiningjie Informat Technol Co Ltd, Nanjing 210023, Peoples R China
关键词
Government hotline; Text classification; New words; TF-IDF; Information entropy; Nested balanced binary tree;
D O I
10.1016/j.engappai.2024.107997
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The government service platform, exemplified by the government hotline, has to handle extensive volumes of business documents that contain rich and timely public opinion information and citizens' demands. However, manual processing struggles to process large-scale text data, adversely impacting operating costs and the quality of government services. This study proposes a comprehensive system for business document classification of the government hotline (BDCGHS) in China to address these challenges. BDCGHS leverages information entropy fused with term frequency-inverse document frequency (TF-IDF) weight to mine new words from business documents of the government hotline, and store them in a new word repository. These new words optimize Chinese word segmentation and text representation for text classification. We introduce a novel data structure called nested balanced binary tree to expedite new word mining, yielding a computational speed of almost five times than the Trie trees. Comparative experiments on the THUNews and government hotline datasets validate our proposed improvement BDCGHS algorithm's superior performance 3 % over text classification algorithms. Compared to the latest bidirectional encoder representations from the transformers (BERT) model, BDCGHS enhances the accuracy of order dispatch based on business documents by almost 3 %. It has also demonstrated stable operations in two Chinese cities for over a year, yielding favorable results.
引用
收藏
页数:17
相关论文
共 6 条
  • [1] Big data-assisted urban governance: An intelligent real-time monitoring and early warning system for public opinion in government hotline
    Zhang, Zicheng
    Lin, Xinyue
    Shan, Shaonan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 144 : 90 - 104
  • [2] Big data-assisted social media analytics for business model for business decision making system competitive analysis
    Zhang, Honglei
    Zang, Zhenbo
    Zhu, Hongjun
    Uddin, M. Irfan
    Amin, M. Asim
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (01)
  • [3] Big data-assisted urban governance: forecasting social events with a periodicity by employing different time series algorithms
    Zhang, Zicheng
    Lin, Xinyue
    Shan, Shaonan
    Yin, Zhaokai
    LIBRARY HI TECH, 2023,
  • [4] Big data-Assisted student's English learning ability appraisal model using fuzzy logic system
    Fan L.
    Wang W.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 10621 - 10636
  • [5] Vaite: a Visualization-Assisted Interactive Big Urban Trajectory Data Exploration System
    Yang, Chuang
    Zhang, Yilan
    Tang, Bo
    Zhu, Min
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 2036 - 2039
  • [6] Government responsiveness and public acceptance of big-data technology in urban governance: Evidence from China during the COVID-19 pandemic
    Guo, Yue
    Chen, Jidong
    Liu, Zhilin
    CITIES, 2022, 122