Automatic Summarization and Keyword Extraction from Web Page or Text File

被引:0
|
作者
You, Xiangdong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Key Lab Universal Wireless Commun, Minist Educ, Beijing 100876, Peoples R China
关键词
automatic summarization; keyword extraction; readability; textrank;
D O I
10.1109/ccet48361.2019.8989315
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we study the automatic summarization and keyword extraction techniques for web page and text file. First, we use the Readability algorithm to extract the text of the web page, and study the PageRank algorithm and TextRank algorithm, and then use the TextRank algorithm to extract keywords, key sentences and abstracts. We also develop the web application that processes web page and text file. The application can input URL, text file, or text paragraph, then application can complete the extraction of main content, abstract, keywords and key sentences.
引用
收藏
页码:154 / 158
页数:5
相关论文
共 50 条
  • [31] Automatic Arabic Text Summarization Using Clustering and Keyphrase Extraction
    Fejer, Hamzah Noori
    Omar, Nazlia
    PROCEEDINGS OF THE 2014 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MULTIMEDIA (ICIM), 2014, : 293 - 298
  • [32] Automatic extraction and verification of page transitions in a Web application
    Kubo, Atsuto
    Washizaki, Hironori
    Fukazawa, Yoshiaki
    14TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS, 2007, : 350 - +
  • [33] Automatic Data Records Extraction from List Page in Deep Web Sources
    Chen Hong-ping
    Fang Wei
    Yang Zhou
    Zhuo Lin
    Cui Zhi-Ming
    2009 ASIA-PACIFIC CONFERENCE ON INFORMATION PROCESSING (APCIP 2009), VOL 1, PROCEEDINGS, 2009, : 370 - 373
  • [34] A Text Feature Based Automatic Keyword Extraction Method for Single Documents
    Campos, Ricardo
    Mangaravite, Vitor
    Pasquali, Arian
    Jorge, Alipio Mario
    Nunes, Celia
    Jatowt, Adam
    ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 684 - 691
  • [35] Automatic text summarization using two-step sentence extraction
    Jung, WC
    Ko, YJ
    Seo, JY
    INFORMATION RETRIEVAL TECHNOLOGY, 2005, 3411 : 71 - 81
  • [36] A Modification to Graph Based Approach for Extraction Based Automatic Text Summarization
    Sehgal, Sunchit
    Kumar, Badal
    Maheshwar
    Rampal, Lakshay
    Chaliya, Ankit
    PROGRESS IN ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, VOL 2, 2018, 564 : 373 - 378
  • [37] CRF Based Feature Extraction Applied for Supervised Automatic Text Summarization
    Batcha, Nowshath K.
    Aziz, Normaziah A.
    Shafie, Sharil I.
    4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI 2013), 2013, 11 : 426 - 436
  • [38] Automatic Text Summarization and Classification
    Simske, Steven J.
    Lins, Rafael
    PROCEEDINGS OF THE ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 2018), 2018,
  • [39] Advances in automatic text summarization
    Sanderson, M
    COMPUTATIONAL LINGUISTICS, 2000, 26 (02) : 280 - 281
  • [40] Advances in Automatic Text Summarization
    Elizabeth Liddy
    Information Retrieval, 2001, 4 (1): : 82 - 83