A Novel Model Based on Big Data Environment for Text Content Security Recognition

被引:1
|
作者
Su, Peng [1 ]
Zhao, Hui [2 ]
Wang, Ying [3 ]
机构
[1] Henan Univ, Henan Prov Engn Res Ctr Intelligent Data Proc, Kaifeng 475004, Peoples R China
[2] Henan Univ, Educ Informat Technol Lab, Kaifeng 475000, Peoples R China
[3] Henan Univ, Henan Int Joint Lab Theories & Key Technol Intelli, Kaifeng, Peoples R China
关键词
Big data; Text recognition; Text vector extraction; Improved TF-IDF algorithm; TF-IDF;
D O I
10.1007/s11265-023-01860-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the big data environment, text content security recognition is one of the main ways to intelligently manage the Internet and maintain privacy. However, traditional text content security recognition methods lack semantic understanding and ignore scenarios where keywords are evenly distributed, resulting in high false positive rate and low accuracy. To address this problem, we propose a novel model based on big data environment for text content security recognition. In the scenario where keywords are evenly distributed, we design the TFC-BPLW-AM algorithm to extract text vectors. The TFC BPLW-AM algorithm considers the problem of uniform distribution of keywords, the problem of calculating weights in a single form, and the time-consuming problem caused by too large weight matrix. Thus, the weight integrity is enhanced, the recognition accuracy is improved, and the running time is shortened. Under the 20 newgroups and Fudan University Chinese text datasets, we conduct experimental comparisons with existing models and results show that our model achieves 96.7% F1 score, with a maximum increase of 30.7% and a minimum increase of 2.7%.
引用
收藏
页码:99 / 112
页数:14
相关论文
共 50 条
  • [1] A Novel Model Based on Big Data Environment for Text Content Security Recognition
    Peng Su
    Hui Zhao
    Ying Wang
    Journal of Signal Processing Systems, 2024, 96 : 99 - 112
  • [2] Network security situation assessment based on text SimHash in big data environment
    Lin, Pengwen
    Chen, Yonghong
    International Journal of Network Security, 2019, 21 (04) : 699 - 708
  • [3] Text sentiment analysis based on CBOW model and deep learning in big data environment
    Liu, Bing
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (02) : 451 - 458
  • [4] Text sentiment analysis based on CBOW model and deep learning in big data environment
    Bing Liu
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 451 - 458
  • [5] A Novel SDN-Based IOT Security Architecture Model for Big Data
    Bhimineni, Ojaswi
    Abhijith, Geda Sai Venkata
    Prabhu, Srikanth
    APPLICATIONS AND TECHNIQUES IN INFORMATION SECURITY (ATIS 2021), 2022, 1554 : 141 - 148
  • [6] Research on the Model of Big Data Serve Security in Cloud Environment
    Cui, Hai-ting
    2016 FIRST IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND THE INTERNET (ICCCI 2016), 2016, : 514 - 517
  • [7] Sentiment Classification of Social Network Text Based on AT-BiLSTM Model in a Big Data Environment
    Liu, Jinjun
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2023, 16 (02)
  • [8] Big Data Security in Cloud Environment
    Reddy, Yenumula B.
    2018 IEEE 4TH INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY), 4THIEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, (HPSC) AND 3RD IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2018, : 100 - 106
  • [9] Network security Mode analysis based on big data environment
    Xu, Shuning
    2020 INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2020), 2020, : 50 - 53
  • [10] Research on information security and privacy protection model based on consumer behavior in big data environment
    Li, Yuxue
    Song, Lijun
    Zeng, Yucheng
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (10):