Evaluation of Persian Text Based on Huffman Data Compression

Authors
Jalilian, Omid [1 ]
Haghighat, Abolfazl Toroghi [2 ]
Rezvanian, Alireza [1 ]
Affiliations
[1] Islamic Azad Univ, Hamedan Branch, Tehran, Iran
[2] Islamic Azad Univ, Qazvin Branch, Tehran, Iran
Keywords
component; Data mining; Persian language; Persian web; Text compression; Huffman data compression; Persian texts compression;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202
Abstract
With the growth of information sources on the web in recent years, many web servers have been dedicated to storing them. Many methods have been presented so far for storing and transforming information on the web for parallel execution or processing. Nevertheless, one of the challenges researchers face when extracting and retrieving data in data mining and information retrieval is the huge amount of information that must be stored. One solution to this problem is to compress the information resources. According to published statistics, Persian is one of the oldest and most widespread languages in the world and on the web; given also its alphabet and the variety found in Persian texts, an evaluation of compression for Persian texts is useful. This paper first introduces the difficulties posed by the variety and volume of information on the web, the general aspects of Huffman compression methods, and some features of the Persian language. It then examines how the Persian text collections were chosen and compares the test results with experimental datasets from Persian, English, and Arabic. The experimental results are given at the end of the paper.
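The abstract does not say whether the evaluation codes characters or words, nor how the compression ratio is measured. As a rough illustration only, the following minimal sketch assumes character-level Huffman coding and a UTF-8 baseline; the function names `huffman_code_lengths` and `compression_ratio` are hypothetical and are not taken from the paper.

```python
import heapq
from collections import Counter


def huffman_code_lengths(text):
    """Return {character: Huffman code length in bits} for the given text."""
    freq = Counter(text)
    if not freq:
        return {}
    if len(freq) == 1:
        # A single distinct symbol still needs one bit per occurrence.
        return {next(iter(freq)): 1}
    # Heap entries are (weight, tie_breaker, {symbol: depth}); the unique
    # tie_breaker ensures the dictionaries are never compared.
    heap = [(w, i, {s: 0}) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        # Merge the two lightest subtrees; every symbol in them moves one level deeper.
        w1, _, d1 = heapq.heappop(heap)
        w2, _, d2 = heapq.heappop(heap)
        merged = {s: d + 1 for s, d in d1.items()}
        merged.update({s: d + 1 for s, d in d2.items()})
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]


def compression_ratio(text, encoding="utf-8"):
    """Encoded bits divided by original bits (assumed baseline: UTF-8 bytes).

    The cost of storing the code table is ignored, so the value is optimistic.
    """
    lengths = huffman_code_lengths(text)
    freq = Counter(text)
    compressed_bits = sum(freq[s] * lengths[s] for s in freq)
    original_bits = 8 * len(text.encode(encoding))
    return compressed_bits / original_bits


if __name__ == "__main__":
    for sample in ("سلام دنیا", "hello world"):
        print(sample, round(compression_ratio(sample), 3))
```

Because most Persian letters occupy two bytes in UTF-8, the ratio measured against a UTF-8 baseline may look more favourable than the same measurement on English text; a comparison across languages would need to fix the baseline encoding explicitly.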
Pages: 180 / +
Page count: 2