Automatic Summarization and Keyword Extraction from Web Page or Text File

Times cited: 0
Authors
You, Xiangdong [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Key Lab Universal Wireless Commun, Minist Educ, Beijing 100876, Peoples R China
Keywords
automatic summarization; keyword extraction; Readability; TextRank
DOI
10.1109/ccet48361.2019.8989315
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
In this paper, we study automatic summarization and keyword extraction techniques for web pages and text files. We first use the Readability algorithm to extract the main text of a web page. We then study the PageRank and TextRank algorithms and use TextRank to extract keywords, key sentences, and an abstract. We also develop a web application that accepts a URL, a text file, or a text paragraph as input and extracts the main content, abstract, keywords, and key sentences.
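The keyword step mentioned in the abstract can be illustrated with a minimal TextRank sketch. This is not the paper's implementation: the function name `textrank_keywords`, the small `STOPWORDS` set, the window size, the damping factor of 0.85, and the fixed iteration count are all assumptions chosen for illustration. Words that co-occur within a sliding window are linked in an undirected graph, and a PageRank-style iteration scores each word.

```python
import re
from collections import defaultdict

# Hypothetical, deliberately tiny stopword list for the illustration.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "we", "or"}


def textrank_keywords(text, top_k=5, window=3, damping=0.85, iterations=50):
    """Rank words with a TextRank-style score on a co-occurrence graph."""
    words = [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]

    # Link distinct words that fall inside the same sliding window of
    # `window` consecutive tokens (undirected, unweighted edges).
    neighbors = defaultdict(set)
    for i, w in enumerate(words):
        for other in words[i + 1 : i + window]:
            if other != w:
                neighbors[w].add(other)
                neighbors[other].add(w)

    # PageRank-style iteration: every word passes its score to its
    # neighbors in equal shares, damped by `damping`.
    scores = {w: 1.0 for w in neighbors}
    for _ in range(iterations):
        scores = {
            w: (1 - damping)
            + damping * sum(scores[n] / len(neighbors[n]) for n in neighbors[w])
            for w in neighbors
        }

    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]


if __name__ == "__main__":
    sample = (
        "TextRank extracts keywords. TextRank ranks sentences. "
        "TextRank builds a graph."
    )
    print(textrank_keywords(sample, top_k=3))
```

Key-sentence extraction follows the same pattern with sentences as graph nodes and a sentence-similarity measure as edge weights; that variant is omitted here.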
Pages: 154-158
Number of pages: 5