TKES: A Novel System for Extracting Trendy Keywords from Online News Sites

被引:0
|
作者
Vo, Tham [1 ,2 ]
Do, Phuc [3 ]
机构
[1] Lac Hong Univ, Dong Nai 71000, Vietnam
[2] Thu Dau Mot Univ, Binh Duong 72000, Vietnam
[3] Univ Informat Technol, VNU HCM, Ho Chi Minh 7000, Vietnam
关键词
Event detection; Burst detection; Keyword extraction; Kleinberg; Burst ranking; TKES; Text stream;
D O I
10.1007/s40305-020-00327-4
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
As the Smart city trend especially artificial intelligence, data science, and the internet of things has attracted lots of attention, many researchers have created various smart applications for improving people's life quality. As it is very essential to automatically collect and exploit information in the era of industry 4.0, a variety of models have been proposed for storage problem solving and efficient data mining. In this paper, we present our proposed system, Trendy Keyword Extraction System (TKES), which is designed for extracting trendy keywords from text streams. The system also supports storing, analyzing, and visualizing documents coming from text streams. The system first automatically collects daily articles, then it ranks the importance of keywords by calculating keywords' frequency of existence in order to find trendy keywords by using the Burst Detection Algorithm which is proposed in this paper based on the idea of Kleinberg. This method is used for detecting bursts. A burst is defined as a period of time when a keyword is continuously and unusually popular over the text stream and the identification of bursts is known as burst detection procedure. The results from user requests could be displayed visually. Furthermore, we create a method in order to find a trendy keyword set which is defined as a set of keywords that belong to the same burst. This work also describes the datasets used for our experiments, processing speed tests of our two proposed algorithms.
引用
收藏
页码:801 / 816
页数:16
相关论文
共 19 条
  • [1] TKES: A Novel System for Extracting Trendy Keywords from Online News Sites
    Tham Vo
    Phuc Do
    [J]. Journal of the Operations Research Society of China, 2022, 10 : 801 - 816
  • [2] SCISOR - EXTRACTING INFORMATION FROM ONLINE NEWS
    JACOBS, PS
    RAU, LF
    [J]. COMMUNICATIONS OF THE ACM, 1990, 33 (11) : 88 - 97
  • [3] Towards the detection of breaking news from online Web search keywords
    Murata, Tsuyoshi
    [J]. 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, Workshops Proceedings, 2006, : 401 - 404
  • [4] A novel method of extracting and rendering news web sites on mobile devices
    Kumar, Harshit
    Park, Sungjoon
    Kang, Sanggil
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT I, PROCEEDINGS, 2007, 4692 : 588 - +
  • [5] Classification Of Breaking News Taken from the Online News Sites
    Kilic, Erdal
    Tavus, Mustafa Resit
    Karhan, Zehra
    [J]. 2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 363 - 366
  • [6] Extracting Key Entities and Significant Events from Online Daily News
    Liu, Mingrong
    Liu, Yicen
    Xiang, Liang
    Chen, Xing
    Yang, Qing
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2008, 2008, 5326 : 201 - 209
  • [7] RnR: A System for Extracting Rationale from Online Reviews and Ratings
    Rahayu, Dwi A. P.
    Krishnaswamy, Shonali
    Labbe, Cyril
    Alhakoon, Oshadi
    [J]. SERVICE-ORIENTED COMPUTING - ICSOC 2010, PROCEEDINGS, 2010, 6470 : 717 - +
  • [8] DataRover: An automated system for extracting product information from online catalogs
    Ahmed, Syed Toufeeq
    Vadrevu, Srinivas
    Davulcu, Hasan
    [J]. ADVANCES IN WEB INTELLIGENCE AND DATA MINING, 2006, 23 : 1 - +
  • [9] A Novel Approach of Extracting Opinion Targets and Opinion Words from Online Review
    Tang, Mingshuang
    Cheng, Hongrong
    Guo, Yanwei
    Luo, Jiaqing
    Zhou, Shijie
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON EDUCATION, SPORTS, ARTS AND MANAGEMENT ENGINEERING, 2016, 54 : 118 - 121
  • [10] Novel Biologically Inspired Approaches to Extracting Online Information from Temporal Data
    Malik, Zeeshan Khawar
    Hussain, Amir
    Wu, Jonathan
    [J]. COGNITIVE COMPUTATION, 2014, 6 (03) : 595 - 607