Extraction Research about Parallelization of Named Entity Based on Hadoop Platform

被引:0
|
作者
Shi, Quan [1 ]
Yang, Zhendong [1 ]
Xu, Lu [1 ]
机构
[1] Nantong Univ, Nantong, Peoples R China
关键词
Hadoop; Chinese word segmentation; Named entity recognition; Named entity extraction;
D O I
10.4028/www.scientific.net/AMM.397-400.2309
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the era of big data approaching, data becomes more and more important. Faced with such massive amounts of data space, how to quickly identify the contents of a field that the users are interest in and extract them out, is an urgent problem to be solved. To identify the content that users are interested in, we can use NLPIR. Chinese word segmentation framework for speech segmentation, and identify named entity according to part of speech tagging. For extraction, using Hadoop, parallel cluster platform based on a big data MapReduce framework, using the Hadoop Distributed File System (HDFS) for efficient data access and starting Map and Reduce tasks to extract the information of named entity. This task extracts the required information from the interactive encyclopedia and then stores them in the knowledge base. It implements the task of extracting the information data of parallelization of named entity based on Hadoop platform.
引用
收藏
页码:2309 / 2312
页数:4
相关论文
共 50 条
  • [41] Research on parallel algorithm based on hadoop distributed computing platform
    Heilongjiang University of Technology, Jixi, China
    Int. J. Grid Distrib. Comput., 4 (163-170):
  • [42] Research on Distributed Data Mining System Based on Hadoop Platform
    Guo, Jianwei
    Li, Ying
    Du, Liping
    Zhao, Guifen
    Jiang, Jiya
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSAIT 2013), 2014, 255 : 629 - 636
  • [43] Spectral clustering algorithm based on Hadoop cloud platform research
    Zhang, LiSheng
    Hou, Ling
    Lei, DaJiang
    PROCEEDINGS OF THE 2016 5TH INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS AND COMPUTER SCIENCE, 2016, 80 : 495 - 498
  • [44] A Research Toward Chinese Named Entity Recognition Based on Transfer Learning
    Hui Kang
    Jingwu Xiao
    Yunpeng Zhang
    Lei Zhang
    Xu Zhao
    Tie Feng
    International Journal of Computational Intelligence Systems, 16
  • [45] A Research Toward Chinese Named Entity Recognition Based on Transfer Learning
    Kang, Hui
    Xiao, Jingwu
    Zhang, Yunpeng
    Zhang, Lei
    Zhao, Xu
    Feng, Tie
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [46] Bootstrapping Named Entity Extraction for the Creation of Mobile Services
    Polifroni, Joseph
    Kiss, Imre
    Adler, Mark
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1515 - 1520
  • [47] Named Entity Extraction for Knowledge Graphs: A Literature Overview
    Al-Moslmi, Tareq
    Ocana, Marc Gallofre
    Opdahl, Andreas L.
    Veres, Csaba
    IEEE ACCESS, 2020, 8 : 32862 - 32881
  • [48] Active Learning Technique for Biomedical Named Entity Extraction
    Saha, Sriparna
    Ekbal, Asif
    Verma, Mridula
    Sikdar, Utpal
    Poesio, Massimo
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 835 - 841
  • [49] Japanese Named Entity extraction with redundant morphological analysis
    Asahara, M
    Matsumoto, Y
    HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 8 - 15
  • [50] Learning pattern rules for Chinese named entity extraction
    Chua, TS
    Liu, JM
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 411 - 418