Automatic Summarization and Keyword Extraction from Web Page or Text File

被引:0
|
作者
You, Xiangdong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Key Lab Universal Wireless Commun, Minist Educ, Beijing 100876, Peoples R China
关键词
automatic summarization; keyword extraction; readability; textrank;
D O I
10.1109/ccet48361.2019.8989315
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we study the automatic summarization and keyword extraction techniques for web page and text file. First, we use the Readability algorithm to extract the text of the web page, and study the PageRank algorithm and TextRank algorithm, and then use the TextRank algorithm to extract keywords, key sentences and abstracts. We also develop the web application that processes web page and text file. The application can input URL, text file, or text paragraph, then application can complete the extraction of main content, abstract, keywords and key sentences.
引用
收藏
页码:154 / 158
页数:5
相关论文
共 50 条
  • [11] Speech-to-Text Summarization Using Automatic Phrase Extraction from Recognized Text
    Rott, Michal
    Cerva, Petr
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 101 - 108
  • [12] EFFICIENT KEYWORD EXTRACTION AND TEXT SUMMARIZATION FOR READING ARTICLES ON SMART PHONE
    Jeong, Hyoungil
    Ko, Youngjoong
    Seo, Jungyun
    COMPUTING AND INFORMATICS, 2015, 34 (04) : 779 - 794
  • [13] Performance Analysis of Keyword Extraction Algorithms Assessing Extractive Text Summarization
    Kumar, Akshi
    Sharma, Aditi
    Sharma, Sidhant
    Kashyap, Shashwat
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS AND ELECTRONICS (COMPTELIX), 2017, : 408 - 414
  • [14] Automatic Extraction of Web Page Text Information Based on Network Topology Coincidence Degree
    Shu, Zhinian
    Li, Xiaorong
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [15] Supporting Data Driven Access through Automatic Keyword Extraction and Summarization
    Xu, Weijia
    Luo, Wei
    Woodward, Nicholas
    Zhang, Yan
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 704 - 707
  • [16] Automatic text summarization based on sentences clustering and extraction
    Zhang Pei-ying
    Li Cun-he
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 1, 2009, : 167 - 170
  • [17] Automatic Keyword and Sentence-Based Text Summarization for Software Bug Reports
    Jindal, Shubhra Goyal
    Kaur, Arvinder
    IEEE ACCESS, 2020, 8 : 65352 - 65370
  • [18] Automatic Thai Text Summarization Using Keyword-Based Abstractive Method
    Ngamcharoen, Parun
    Sanglerdsinlapachai, Nuttapong
    Vejjanugraha, Pikul
    2022 17TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2022) / 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INTERNET OF THINGS (AIOT 2022), 2022,
  • [19] Automatic Keyword Extraction from Bengali Text using Improved RAKE Approach
    Haque, Mozammel
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [20] Building a Dataset for Summarization and Keyword Extraction from Emails
    Loza, Vanessa
    Lahiri, Shibamouli
    Mihalcea, Rada
    Lai, Po-Hsiang
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2441 - 2446