A Novel Automatic Text Summarization System with Feature Terms Identification

被引:0
|
作者
Manne, Suneetha [1 ]
Pervez, Shaik Mohammed Zaheer [1 ]
Fatima, S. Sameen [2 ]
机构
[1] VRSEC, Dept IT, Vijayawada, India
[2] Osmania Univ, Dept CSE, Hyderabad 500007, Andhra Pradesh, India
关键词
Extractive feature terms; HMM tagger; POS tagging; Term frequency;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
with ever growing content on World Wide Web, it has been increasingly difficult for users to search for relevant information. A rough estimation of world's famous search engine Google in year 2010 revealed that the total size of internet has now turned to 2 petabytes. Search engines that are supposed to satisfy user's information need, has too much information to offer than what is required. This problem is referred as information overload. The field of Information Extraction (IE) is offering a huge scope to concise and compact the information enabling the user to decide by mere check at snippets of each link. Automatic text summarization, a subset of IE is an important activity in the analysis of a high volume text documents. In this context, it has been increasingly important to develop information access solutions that can provide an easy and efficient access to users. Automatic summarization systems address information overload problem by producing a summary of related documents that provides an overall understanding of the topic without having to go through every document. In this paper, we propose a feature term based text summarization technique based on the analysis of Parts of Speech Tagging. A new approach of generating summary for a given input document is discussed based on identification and extraction of important sentences in the document. The system obtains the selective terms from the extracted terms and builds qualitative summary with appreciable compression ratio.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Turkish automatic text summarization system
    Altan, Z
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, VOLS 1AND 2, 2004, : 311 - 316
  • [2] Personalized Text Summarization Based on Important Terms Identification
    Moro, Robert
    Bielikova, Maria
    [J]. 2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 131 - 135
  • [3] Automatic Text Summarization
    Soumya, S.
    Kumar, Geethu S.
    Naseem, Rasia
    Mohan, Saumya
    [J]. COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 787 - 789
  • [4] Automatic Text Summarization
    Fattah, Mohamed Abdel
    Ren, Fuji
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 27, 2008, 27 : 192 - +
  • [5] A Novel Hybrid Text Summarization System for Punjabi Text
    Gupta, Vishal
    Kaur, Narvinder
    [J]. COGNITIVE COMPUTATION, 2016, 8 (02) : 261 - 277
  • [6] A Novel Hybrid Text Summarization System for Punjabi Text
    Vishal Gupta
    Narvinder Kaur
    [J]. Cognitive Computation, 2016, 8 : 261 - 277
  • [7] CRF Based Feature Extraction Applied for Supervised Automatic Text Summarization
    Batcha, Nowshath K.
    Aziz, Normaziah A.
    Shafie, Sharil I.
    [J]. 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI 2013), 2013, 11 : 426 - 436
  • [8] A chinese automatic text summarization system for mobile devices
    Faculty of Engineering, The University of Tokushima, 2-1 Minamijosanjima, Tokushima 770-8506, Japan
    不详
    [J]. PACLIC - Proc. Pacific Asia Conf. Lang., Inf. Comput., 2006, (426-429):
  • [9] A domain-based automatic text summarization system
    Geng, Zengmin
    Jia, Yunde
    Liu, Wanchun
    Du, Jianxia
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13 : 64 - 68
  • [10] A Chinese Automatic Text Summarization system for mobile devices
    Yu, Lei
    Liu, Mengge
    Ren, Fuji
    Kuroiwa, Shingo
    [J]. PACLIC 20: PROCEEDINGS OF THE 20TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2006, : 426 - 429