enwik8:
15,656,780 bytes, 9,351.442 sec., paq8sk9 -x15 -w -e1,english.dic
Darek (2nd May 2020)
@Sportman - will you try to compress enwik9 or I got this?
kampaster (4th May 2020)
It's improved and generate smaller output files.
I don't mind people copying/cloning open source or reverse enginering parts of none commercial software as long it do not violate the supplied license/copyright (if any) and credits are given or permission is asked and given (if alive).
Ok, we wait for the source then.
Despite this you also need to back to pre v84 (paq8pxd) version builds changes:
https://encode.su/threads/1464-Paq8p...ll=1#post64855
Paq8sk13
this is source and binary of paq8sk13. the result for dickens file(silesia benchmark) is
Total 10192446 bytes compressed to 1901677 bytes.
Time 2230.69 sec, used 1341 MB (1406994532 bytes) of memory
the decompression time is Time 2230.34 sec, used 1192 MB (1250618671 bytes) of memory
the checksum value is match
moisesmcardona (3rd July 2020),Sportman (12th May 2020)
moisesmcardona (3rd July 2020)
enwik8:
15,756,129 bytes, 11,220.447 sec., paq8sk13 -x15 -w
15,639,539 bytes, >11,010.312 sec., paq8sk13 -x15 -w -e1,english.dic
App crash at finish.
Last edited by Sportman; 7th May 2020 at 20:45.
What a difference between v10 and v13?
Which is more advanced?
Darek (8th May 2020)
I've started to test -x14 option, according to Sportman's information that -x15 options generate crash.
My estimate of paq8sk13 with-x14 option for enwik9 is about 122'5xx'xxx to 122'6xx'xxx. Time to compress on my laptiop = 36h (30h to go). Option -x15 isn't much better. Maybe 50KB, maybe 100KB less.
One question more - is the paq8pxd1.cpp file a source of paq8sk13?
Hmmm, -x14 option probably use much more memory than 32GB... compression start to huge slowdown due to use big swap file. I'm not sure if the process could go to the end. And it's "only" -x14 option then -x15 could require even much memory.
@suryakandau - could you limit next version memory usage to 32GB for -x15?
Maybe that's a reason of crashes - too much memory usage. From other side - crunching numbers by exdend memory usage is not the way we should go....
Paq8sk14
here is the latest version of paq8sk with a little improvement on hash function and reduced memory usage.@sportman/darek could you try it use -x15 or -x14 please ?
moisesmcardona (3rd July 2020),Sportman (12th May 2020)
enwik8:
15,755,537 bytes, 11,013.735 sec., paq8sk14 -x15 -w
15,638,616 bytes, >10,958.687 sec., paq8sk14 -x15 -w -e1,english.dic
App crash at finish (in time calculation part?).
Last edited by Sportman; 12th May 2020 at 16:40.
Darek (12th May 2020)
Paq8sk15
the result for dickens file (silesia benchmark) use -s6 -w -e1,english.dic option is
Total 10192446 bytes compressed to 1910568 bytes.Time 1697.33 sec, used 1103 MB (1157402890 bytes) of memory
moisesmcardona (3rd July 2020),Sportman (12th May 2020)