Research on the Automatic Extraction Method of Web Data Objects Based on Deep Learning

Cited by: 14
Authors
Peng, Hao [1]
Li, Qiao [1]
Affiliation
[1] Hunan Int Econ Univ, Sch Informat Sci & Engn, High Tech Ind Dev Zone, Changsha 410205, Hunan, Peoples R China
Source
Keywords
Automatic extraction; deep learning; neural network; Web data
DOI
10.32604/iasc.2020.013939
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper presents a neural network model for Web page information extraction based on deep learning and implements the model using TensorFlow. We carry out a detailed experimental analysis of information extraction from Web pages within a single website, report the accuracy of the extraction, and tune several model parameters according to the experimental results. Once the model achieves satisfactory results, we propose an algorithm for migrating it to pages of the same type on other websites and analyze the corresponding experimental results. Although the migrated model does not perform as well as extraction on the original website, it is far more effective than applying the model directly to new websites. The paper thus proposes a new method for improving the portability of information extraction systems based on machine learning. In addition, the deep nonlinear learning of the deep learning model can capture deeper features, describe abstract language more fundamentally, and better represent and understand sentences at the syntactic and semantic levels.
Pages: 609 - 616
Page count: 8
Related papers
50 in total
  • [41] Research on Household Appliances Recognition Method Based on Data Screening of Deep Learning
    Yu Zhibin
    Chen Hong
    [J]. IFAC PAPERSONLINE, 2019, 52 (24) : 140 - 144
  • [42] Research on the Method of Data Extraction Based on Category
    Chen, Zhongyu
    Guo, Ting
    Qian, Zhongsheng
    Xiao, Chunshui
    [J]. 2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013 : 365 - 369
  • [43] Research on automatic picking method of microseismic signal P wave based on deep learning mode
    Zhao, Hongbao
    Liu, Rui
    Gu, Tao
    Liu, Yihong
    Jiang, Dongmei
    [J]. Yanshilixue Yu Gongcheng Xuebao/Chinese Journal of Rock Mechanics and Engineering, 2021, 40 : 3084 - 3097
  • [44] A Novel Automatic Ontology Construction Method Based on Web Data
    Song, Qiuxia
    Liu, Jin
    Wang, Xiaofeng
    Wang, Jin
    [J]. 2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014 : 762 - 765
  • [45] A Research of the Internet Based on Web Information Extraction and Data Fusion
    Jiang, Yajun
    Wu, Zaoliang
    Zhan, Zengrong
    Xu, Lingyu
    [J]. NEW HORIZONS IN WEB-BASED LEARNING: ICWL 2010 WORKSHOPS, 2011, 6537 : 195 - 206
  • [46] Research on Automatic Dance Generation System Based on Deep Learning
    Lan, Jia
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [47] Research on automatic target detection and recognition based on deep learning
    Wang, Jia
    Liu, Chen
    Fu, Tian
    Zheng, Lili
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 : 44 - 50
  • [48] Research on Automatic Microalgae Detection System Based on Deep Learning
    Xiang, Rui-Jie
    Liu, Hao
    Lu, Zhen
    Xiao, Ze-Yu
    Liu, Hai-Peng
    Wang, Yin-Chu
    Peng, Xiao
    Yan, Wei
    [J]. PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2024, 51 (01) : 177 - 189
  • [49] Research on Automatic Recognition of Casting Defects Based on Deep Learning
    Duan, Liming
    Yang, Ke
    Ruan, Lang
    [J]. IEEE ACCESS, 2021, 9 : 12209 - 12216
  • [50] Research of Address Information Automatic Annotation Based on Deep Learning
    Ling, Guang-Ming
    Xu, Ai-Ping
    Wang, Wei
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (11) : 2081 - 2091