Results 1 to 3 of 3

Thread: Interesting Literature

  1. #1
    Programmer toffer's Avatar
    Join Date
    May 2008
    Location
    Erfurt, Germany
    Posts
    587
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Interesting Literature

    Sorry - i just didn't find any thread that fits. From time to time there're interesting documents appearing. I'd suggest to post stuff like that in this thread.

    S. Deorowicz, "Universal lossless data compression algorithms", Phd. Thesis, 2003
    http://citeseerx.ist.psu.edu/viewdoc...=rep1&type=pdf

    J. Abel, "Post BWT stages of the Burrows-Wheeler compression algorithms", Software - Practise and Experience, Vol. 40, 2010
    http://www.juergen-abel.info/Preprin...BWT_Stages.pdf
    Last edited by toffer; 20th July 2011 at 21:46.
    M1, CMM and other resources - http://sites.google.com/site/toffer86/ or toffer.tk

  2. #2
    Expert
    Matt Mahoney's Avatar
    Join Date
    May 2008
    Location
    Melbourne, Florida, USA
    Posts
    3,255
    Thanks
    306
    Thanked 779 Times in 486 Posts
    Interesting. I compared his results (DM and DW in table 4.15) with zpaq -m2 -b0, which are both BWT based methods. zpaq does better most of the Silesia corpus (all but sao) and most of the Calgary corpus. Testing on a 2.0 GHz T3200, 2 cores, Ubuntu:

    Code:
    matt@matt-M-7301U:~/zpaq$ zpaq -b0 -m2 c silesia-m2-b0 silesia/*
    Setting max block size for -m1 or -m2 to -b268.435199
    [10] silesia/webster 41458703 -> 6545981 (1.2631 bpc)
    [2] silesia/mozilla 51220480 -> 16201277 (2.5304 bpc)
    [8] silesia/samba 21606400 -> 4039911 (1.4958 bpc)
    Creating archive silesia-m2-b0.zpaq
    [4] silesia/nci 33553445 -> 1235922 (0.2947 bpc)
    [1] silesia/dickens 10192446 -> 2290551 (1.7978 bpc)
    [6] silesia/osdb 10085684 -> 2283990 (1.8117 bpc)
    [3] silesia/mr 9970564 -> 2147936 (1.7234 bpc)
    [12] silesia/x-ray 8474240 -> 3717472 (3.5094 bpc)
    [7] silesia/reymont 6627202 -> 1003522 (1.2114 bpc)
    [9] silesia/sao 7251944 -> 4730680 (5.2187 bpc)
    [5] silesia/ooffice 6152192 -> 2589388 (3.3671 bpc)
    [11] silesia/xml 5345280 -> 397102 (0.5943 bpc)
    91 seconds

  3. #3
    Expert
    Matt Mahoney's Avatar
    Join Date
    May 2008
    Location
    Melbourne, Florida, USA
    Posts
    3,255
    Thanks
    306
    Thanked 779 Times in 486 Posts
    Too bad I can't find his original sdc compressor.

    Also, zpaq -m4 -b0 beats the best results in table 4.15 on 9 out of 12 files

Similar Threads

  1. Interesting tools
    By lunaris in forum Data Compression
    Replies: 2
    Last Post: 26th August 2009, 00:50
  2. An interesting test set
    By nanoflooder in forum Data Compression
    Replies: 12
    Last Post: 13th April 2009, 02:33
  3. Interesting Deflate source
    By encode in forum Forum Archive
    Replies: 10
    Last Post: 21st April 2008, 16:30
  4. FreeArc is becoming more and more interesting...
    By Vacon in forum Forum Archive
    Replies: 65
    Last Post: 9th December 2007, 21:41
  5. Outdated, but maybe interesting...?
    By Vacon in forum Forum Archive
    Replies: 1
    Last Post: 20th October 2007, 21:13

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •