Mining on Terms Extraction from Web News

被引:0
|
作者
Hsu, Li-Fu [1 ]
机构
[1] Hwh Hsia Inst Technol, Dept Informat Management, Taipei 235, Taiwan
关键词
web news; information technology; phrase extraction; pre-process texts;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Thousand of news stories are reported each day. How to extract the useful information from the large web news is the important technology today. However, information technology advances have partially automated to processing documents, reducing the amount of text which must be read. In this paper we present a Web News Search System, called WNSS. WNSS can discover automatically phrase extraction from large corpora of web news stories. In addition, we give concrete examples of how to preprocess texts based on the intended use of the discovered results. We also evaluate the extracted phrases can be used for important tasks.
引用
收藏
页码:188 / 194
页数:7
相关论文
共 50 条
  • [1] News item extraction for text mining in web newspapers
    Norvåg, K
    Oyri, R
    International Workshop on Challenges in Web Information Retrieval and Integration, Proceedings, 2005, : 195 - 204
  • [2] Terms Extraction from Clustered Web Search Results
    Bourahla, Chouaib
    Maamri, Ramdane
    Sahnoun, Zaidi
    Bouchemal, Nardjes
    MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 364 - 373
  • [3] Argument Extraction from News, Blogs, and the Social Web
    Goudas, Theodosis
    Louizos, Christos
    Petasis, Georgios
    Karkaletsis, Vangelis
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2015, 24 (05)
  • [4] A Survey on Web News Retrieval and Mining
    Hassanian-esfahani, Roya
    Kargar, Mohammad-javad
    2016 SECOND INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2016, : 90 - 101
  • [5] Extraction techniques for mining services from web sources
    Davulcu, H
    Mukherjee, S
    Ramakrishnan, IV
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 601 - 604
  • [6] Web news mining in an evolving framework
    Antonio Iglesias, Jose
    Tiemblo, Alexandra
    Ledezma, Agapito
    Sanchis, Araceli
    INFORMATION FUSION, 2016, 28 : 90 - 98
  • [7] Extraction of web news from web pages using a ternary tree approach
    Laishram, Debina
    Sebastian, Merin
    2015 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATION ENGINEERING ICACCE 2015, 2015, : 628 - 633
  • [8] Hybrid method for automated news content extraction from the web
    Li, Yu
    Meng, Xiaofeng
    Li, Qing
    Wang, Liping
    WEB INFORMATION SYSTEMS - WISE 2006, PROCEEDINGS, 2006, 4255 : 327 - 338
  • [9] Automatic Extraction of Textual Elements from News Web Pages
    Ibrahim, Hossam
    Darwish, Kareem
    Abdel-sabor, Abdel-Rahim
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1600 - 1603
  • [10] Automated metadata and instance extraction from news Web sites
    Vadrevu, S
    Nagarajan, S
    Gelgi, F
    Davulcu, H
    2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2005, : 38 - 41