Extraction Research about Parallelization of Named Entity Based on Hadoop Platform

被引:0
|
作者
Shi, Quan [1 ]
Yang, Zhendong [1 ]
Xu, Lu [1 ]
机构
[1] Nantong Univ, Nantong, Peoples R China
关键词
Hadoop; Chinese word segmentation; Named entity recognition; Named entity extraction;
D O I
10.4028/www.scientific.net/AMM.397-400.2309
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the era of big data approaching, data becomes more and more important. Faced with such massive amounts of data space, how to quickly identify the contents of a field that the users are interest in and extract them out, is an urgent problem to be solved. To identify the content that users are interested in, we can use NLPIR. Chinese word segmentation framework for speech segmentation, and identify named entity according to part of speech tagging. For extraction, using Hadoop, parallel cluster platform based on a big data MapReduce framework, using the Hadoop Distributed File System (HDFS) for efficient data access and starting Map and Reduce tasks to extract the information of named entity. This task extracts the required information from the interactive encyclopedia and then stores them in the knowledge base. It implements the task of extracting the information data of parallelization of named entity based on Hadoop platform.
引用
收藏
页码:2309 / 2312
页数:4
相关论文
共 50 条
  • [21] Hadoop Recognition of Biomedical Named Entity Using Conditional Random Fields
    Li, Kenli
    Ai, Wei
    Tang, Zhuo
    Zhang, Fan
    Jiang, Lingang
    Li, Keqin
    Hwang, Kai
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (11) : 3040 - 3051
  • [22] Optimization and Research of Hadoop Platform Based on FIFO Scheduler
    Pei Shu-jun
    Zheng Xi-min
    Hu Da-ming
    Lou Shu-hui
    Zhang Yuan-xu
    2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 727 - 730
  • [23] A Military Named Entity Relation Extraction Approach Based on Deep Learning
    Wang, Xuefeng
    Yang, Ruopeng
    Feng, Yulong
    Li, Dongsheng
    Hou, Jianfeng
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [24] Research and Application of DBSCAN Algorithm Based on Hadoop Platform
    Fu, Xiufen
    Wang, Yaguang
    Ge, Yanna
    Chen, Peiwen
    Teng, Shaohua
    PERVASIVE COMPUTING AND THE NETWORKED WORLD, 2014, 8351 : 73 - 87
  • [25] HMM-based Korean named entity recognition for information extraction
    Yun, Bo-Hyun
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2007, 4798 : 526 - 531
  • [26] Chinese Named Entity Implicit Relation Extraction Based on Company Verbs
    Wan C.-X.
    Gan L.-X.
    Jiang T.-J.
    Liu D.-X.
    Liu X.-P.
    Liu Y.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (12): : 2795 - 2820
  • [27] The Research of Recommendation System Based on Hadoop Cloud Platform
    Wang, Chunzhi
    Zheng, Zhou
    Yang, Zhuang
    2014 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2014), 2014, : 193 - 196
  • [28] Research on Named Entity Recognition Based on Gated Interaction Mechanisms
    Liu, Bin
    Chen, Wanyuan
    Tao, Jialing
    He, Lei
    Tang, Dan
    APPLIED SCIENCES-BASEL, 2024, 14 (15):
  • [29] Named entity extraction based on a maximum entropy model and transformation rules
    Uchimoto, K
    Ma, Q
    Murata, M
    Ozaku, H
    Isahara, H
    38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 326 - 335
  • [30] Research on Named Entity Recognition Method Based on BERT Model
    Xie, Shaopeng
    2024 IEEE 10TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND MACHINE LEARNING APPLICATIONS, BIGDATASERVICE 2024, 2024, : 92 - 96