Extraction Research about Parallelization of Named Entity Based on Hadoop Platform

Cited by: 0
Authors
Shi, Quan [1]
Yang, Zhendong [1]
Xu, Lu [1]
Affiliations
[1] Nantong Univ, Nantong, Peoples R China
Keywords
Hadoop; Chinese word segmentation; Named entity recognition; Named entity extraction
DOI
10.4028/www.scientific.net/AMM.397-400.2309
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
With the era of big data approaching, data is becoming increasingly important. Faced with such massive amounts of data, how to quickly identify the content in a given field that users are interested in, and then extract it, is an urgent problem. To identify that content, we use the NLPIR Chinese word segmentation framework to segment the text, and we identify named entities according to the part-of-speech tags. For extraction, we use Hadoop, a parallel cluster platform built on the MapReduce big data framework: the Hadoop Distributed File System (HDFS) provides efficient data access, and Map and Reduce tasks extract the named entity information. The task extracts the required information from the interactive encyclopedia and stores it in a knowledge base. This implements parallelized extraction of named entity information on the Hadoop platform.
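The abstract does not give the implementation, so the following is only a minimal sketch of the map and reduce steps it describes, run locally in plain Python rather than on Hadoop. It assumes the text has already been segmented and tagged into space-separated `word/tag` tokens, and that the tags `nr`, `ns`, and `nt` mark person, place, and organization names; that tag set, the `ENTITY_TAGS` mapping, and the function names `map_entities`, `reduce_entities`, and `run_job` are all illustrative assumptions, not taken from the paper.

```python
from collections import defaultdict

# Assumed part-of-speech tags for named entities (illustrative, not from the paper).
ENTITY_TAGS = {"nr": "PERSON", "ns": "LOCATION", "nt": "ORGANIZATION"}


def map_entities(tagged_line):
    """Map step: emit ((entity, type), 1) for every token carrying an entity tag."""
    for token in tagged_line.split():
        # Split on the last "/" so that words containing "/" keep their text.
        word, sep, tag = token.rpartition("/")
        if sep and word and tag in ENTITY_TAGS:
            yield (word, ENTITY_TAGS[tag]), 1


def reduce_entities(pairs):
    """Reduce step: sum the counts emitted for each (entity, type) key."""
    counts = defaultdict(int)
    for key, value in pairs:
        counts[key] += value
    return dict(counts)


def run_job(tagged_lines):
    """Local stand-in for a MapReduce job: map every line, then reduce all pairs."""
    pairs = (pair for line in tagged_lines for pair in map_entities(line))
    return reduce_entities(pairs)


if __name__ == "__main__":
    sample = ["张三/nr 在/p 南通/ns 工作/v", "南通/ns 大学/n"]
    for (entity, kind), count in run_job(sample).items():
        print(entity, kind, count)
```

On a real cluster the map and reduce functions would run as separate tasks over HDFS input splits, with the framework grouping keys between the two phases; here a single dictionary plays that role.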
Pages: 2309-2312
Number of pages: 4