Tagging The Web: Building A Robust Web Tagger with Neural Network

被引:0
|
作者
Ma, Ji [1 ]
Zhang, Yue [2 ]
Zhu, Jingbo [1 ]
机构
[1] Northeastern Univ, Shenyang, Liaoning, Peoples R China
[2] Singapore Univ Technol & Design, Singapore, Singapore
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we address the problem of web-domain POS tagging using a two-phase approach. The first phase learns representations that capture regularities underlying web text. The representation is integrated as features into a neural network that serves as a scorer for an easy-first POS tagger. Parameters of the neural network are trained using guided learning in the second phase. Experiment on the SANCL 2012 shared task show that our approach achieves 93.15% average tagging accuracy, which is the best accuracy reported so far on this data set, higher than those given by ensembled syntactic parsers.
引用
收藏
页码:144 / 154
页数:11
相关论文
共 50 条
  • [1] Exploiting the Social Tagging Network for Web Clustering
    Lu, Caimei
    Hu, Xiaohua
    Park, Jung-ran
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2011, 41 (05): : 840 - 852
  • [2] Neural Tagger for Czech Language: Capturing Linguistic Phenomena in Web Corpora
    Neverilova, Zuzana
    Stara, Marie
    [J]. RASLAN 2019: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2019, : 23 - 32
  • [3] Building GIS Web Services on JXTA Network
    WANG Leichun GUAN Jihong ZHOU Shuigeng WANG Leichun
    Department of Computer
    [J]. Geo-spatial Information Science, 2004, (04) : 268 - 273
  • [4] TAGGER- A TOOL FOR VISUALIZING DATABASE CONTENTS ON THE WEB
    Schmidt, Andreas
    Trixner, Alexandra
    Kimmig, Daniel
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON INTERNET TECHNOLOGIES AND APPLICATIONS (ITA 11), 2011, : 603 - 604
  • [5] Adaptive neural network clustering of web users
    Rangarajan, SK
    Phoha, VV
    Balagani, KS
    Selmic, RR
    Iyengar, SS
    [J]. COMPUTER, 2004, 37 (04) : 34 - +
  • [6] A web oriented recurrent neural network simulator
    Boné, R
    Crucianu, M
    Makris, P
    de Beauville, JPA
    [J]. ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 97 - 100
  • [7] NEURAL NETWORK FRAMEWORK FOR MULTILINGUAL WEB DOCUMENTS
    Prakash, Kolla Bhanu
    Ananthan, T. V.
    Rajavarman, V. N.
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 392 - 397
  • [8] A hybrid neural network for web page classification
    Cao, YK
    Li, YF
    Yu, ZZ
    [J]. DIGITAL LIBRARIES: INTERNATIONAL COLLABORATION AND CROSS-FERTILIZATION, PROCEEDINGS, 2004, 3334 : 641 - 641
  • [9] A novel competitive neural network for web mining
    Dong, YH
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 3417 - 3422
  • [10] A neural network approach to web graph processing
    Tsoi, AC
    Scarselli, F
    Gori, M
    Hagenbuchner, M
    Yong, SL
    [J]. WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 27 - 38