Results 31 to 60 of 149

Thread: GDC Competition: Discussions

  1. #31
    Member
    Join Date
    Aug 2008
    Location
    Planet Earth
    Posts
    1,040
    Thanks
    104
    Thanked 420 Times in 293 Posts
    Quote Originally Posted by Ms1 View Post
    Formally, there is that "Members of the competition committee are ineligible for awards." condition.
    Sorry, I missed that sentence when reading quickly.

  2. #32
    Member
    Join Date
    Aug 2008
    Location
    Planet Earth
    Posts
    1,040
    Thanks
    104
    Thanked 420 Times in 293 Posts
    Quote Originally Posted by Ms1 View Post
    Excuse me, what do you mean?
    The test PC needs to be behind a firewall that blocks Internet connections, or have no Internet connection at all, while running "alien" (de)compression software.

    For example, Krc could store and retrieve a dictionary in/from a remote cloud to save storage space and gain compression speed: https://encode.su/threads/1935-krc-k...ll=1#post39550

  3. #33
    Member
    Join Date
    Aug 2008
    Location
    Planet Earth
    Posts
    1,040
    Thanks
    104
    Thanked 420 Times in 293 Posts
    My preprocessor code crashed a little over halfway through the Test 1 file, Qualitative data (text only). I discovered some strange data there:

    ..."
    MARIE
    CECILIA
    MARIE
    CECILIA
    MARIE
    CECILIA (with a smile)
    MARIE
    ALBERT
    CECILIA
    AMADEUS
    CECILIA
    MARIE
    CECILIA
    ALBERT
    MARIE
    CECILIA
    AMADEUS (who has been standing a little way off)
    CECILIA
    MARIE
    ALBERT
    CECILIA
    AMADEUS
    CECILIA
    AMADEUS
    CECILIA
    AMADEUS
    CECILIA
    AMADEUS (with assumed brusqueness)
    CECILIA
    AMADEUS
    CECILIA
    AMADEUS
    CECILIA
    "....

    It looks like the sentences from the original are missing while the speaker names are still listed:

    ..."
    MARIE

    Oh, that would be fine!

    CECILIA

    Did you hear that, Amadeus?

    AMADEUS (who has been standing a little way off)

    Certainly. It would be very nice.... You can wait for us in the Tirol.

    CECILIA

    Could you come and see me to-morrow afternoon, Marie? Then we might settle the matter.

    MARIE

    Yes, indeed. I am always glad when you can spare me a little of your time.—Until to-morrow, then!

    ALBERT

    Good-by. (He and Marie go out)

    AMADEUS (is walking to and fro)

    CECILIA (who is sitting on the couch, follows him with her eyes)

    AMADEUS (after a turn to the window and back, speaking in a peculiarly dry tone) Well, how did it go? Have you got the finale into shape at last?

    CECILIA

    Oh, in a manner.

    AMADEUS

    The day before yesterday it had not yet been brought up to the proper level. I find, for one thing, that they don't let you assert yourself sufficiently. Your voice should be floating above the rest, instead of being submerged in the crowd.

    CECILIA

    Won't you come to the rehearsal to-morrow—just once more—if you can spare the time?

    AMADEUS

    Would it please you...?

    CECILIA

    I always feel more certain of myself when you are within reach. You know that, don't you?

    AMADEUS

    Yes—I'll come. I'll call off my appointments with Neumann and the Countess.

    CECILIA

    If it isn't too great a sacrifice....

    AMADEUS (with assumed brusqueness)

    Oh, I can make her come in the afternoon.

    CECILIA

    But then there will be no time left for your own work. No, better let it be.

    AMADEUS

    What had we better let be?

    CECILIA

    Don't come to the rehearsal to-morrow.
    "...

    https://www.gutenberg.org/files/2974...-h/29745-h.htm

  4. Thanks:

    schnaader (28th June 2020)

  5. #34
    Administrator Shelwien's Avatar
    Join Date
    May 2008
    Location
    Kharkov, Ukraine
    Posts
    4,133
    Thanks
    320
    Thanked 1,396 Times in 801 Posts
    Might be a bug in my filter script. But I don't think it's a good reason to change anything.
    In fact, isn't it a good dataset if it lets you detect preprocessor bugs? :)

  6. #35
    Member
    Join Date
    Aug 2008
    Location
    Planet Earth
    Posts
    1,040
    Thanks
    104
    Thanked 420 Times in 293 Posts
    Quote Originally Posted by Shelwien View Post
    In fact, isn't it a good dataset if it lets you detect preprocessor bugs?
    I knew the situation that caused the crash could happen in theory, but I didn't expect it with this text file, so indeed it's a good dataset for lazy programmers.

  7. #36
    Member
    Join Date
    Sep 2018
    Location
    Philippines
    Posts
    121
    Thanks
    31
    Thanked 2 Times in 2 Posts
    I might as well join this competition. We have up to Nov. 20, 2020, hmm?

    First, since my dual-core computer crashed, I have to buy a new computer. And I have to learn how to install g++ again, oh my! (But I still have bcc32.) After almost a decade, I might be coding again. Brave.

    Tried entering lzuf2 as a test submission, but Gmail failed to send. Now the email is "queued".

    Will the sponsor, Huawei, own the submitted compressors? If not, will it buy the winning compressor?

  8. #37
    Member lz77's Avatar
    Join Date
    Jan 2016
    Location
    Russia
    Posts
    176
    Thanks
    60
    Thanked 16 Times in 12 Posts
    Quote Originally Posted by compgt View Post
    If not, will it buy the winning compressor?
    Reverse engineering is enough.
    Therefore, I don't like to share compressors between competitors.
    Last edited by lz77; 9th July 2020 at 13:25.

  9. #38
    Programmer schnaader's Avatar
    Join Date
    May 2008
    Location
    Hessen, Germany
    Posts
    630
    Thanks
    288
    Thanked 252 Times in 128 Posts
    After a first quick glance, testset 3 looks fine. Looking for compressed leftovers, I have only found this one so far (a very small ZIP part, 210 bytes decompressed).

    This is the most interesting testset for me, because this kind of data dominates things like Android APK files and is a mix of structured pointer lists, program code and string sections (e.g. method names), so some preprocessing (parsing and reordering stuff, detecting structured data) should be the way to go and would help compress data like this.

    Left side is from the original file, right side is the output of "Precomp -cn"
    [Attached image: image_2020-06-30_15-22-20.png]
    http://schnaader.info
    Damn kids. They're all alike.

  10. #39
    Member lz77's Avatar
    Join Date
    Jan 2016
    Location
    Russia
    Posts
    176
    Thanks
    60
    Thanked 16 Times in 12 Posts
    Quote Originally Posted by Shelwien View Post
    Might be a bug in my filter script. But I don't think it's a good reason to change anything.
    In fact, isn't it a good dataset if it lets you detect preprocessor bugs?
    TS40.txt is not an ASCII file, and not even a Latin-1 file. It contains characters left over from UTF-8 (with decimal codes 128, 145, 147, 225, 226). For example, see the line after the line

    "Thank Heaven! . . . . Good-night."

    Or see the word after

    She learned of the great medicine,
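A claim like this can be checked by counting the bytes above 127 and testing whether the buffer still decodes as UTF-8. The sketch below is only an illustration: the function name `non_ascii_report` and the sample strings are made up, and the sample bytes are not taken from TS40.txt.

```python
from collections import Counter


def non_ascii_report(data: bytes) -> dict:
    """Summarise the non-ASCII bytes found in a buffer."""
    counts = Counter(b for b in data if b >= 128)
    try:
        data.decode("utf-8")
        valid_utf8 = True
    except UnicodeDecodeError:
        valid_utf8 = False
    return {
        "non_ascii_total": sum(counts.values()),
        "byte_values": sorted(counts),   # distinct byte values, ascending
        "valid_utf8": valid_utf8,
    }


# Hypothetical samples (not from the contest file).
# U+2019 (right single quote) encodes to the bytes 226, 128, 153.
intact = "don\u2019t".encode("utf-8")
# Dropping the last byte of the sequence leaves an orphaned 226, 128 pair.
damaged = intact.replace(b"\x99", b"")

print(non_ascii_report(intact))
print(non_ascii_report(damaged))
```

A preprocessor that assumes either pure ASCII or well-formed UTF-8 would need to treat the second case as raw bytes.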

  11. #40
    Administrator Shelwien's Avatar
    Join Date
    May 2008
    Location
    Kharkov, Ukraine
    Posts
    4,133
    Thanks
    320
    Thanked 1,396 Times in 801 Posts
    The description says "texts from Project Gutenberg in UTF-8 characters, so it’s essentially ASCII", not that it's ASCII.

    > It contains characters left over from UTF-8

    That's intentional in this case. No need to make it too simple.

  12. #41
    Administrator Shelwien's Avatar
    Join Date
    May 2008
    Location
    Kharkov, Ukraine
    Posts
    4,133
    Thanks
    320
    Thanked 1,396 Times in 801 Posts
    Any comments on the site design? https://globalcompetition.compression.ru/
    How do you think we can improve it to increase participation?

  13. #42
    Member
    Join Date
    Nov 2014
    Location
    California
    Posts
    175
    Thanks
    61
    Thanked 49 Times in 38 Posts
    Given the duration of the competition, there is little incentive to participate early. I think that participation will increase over time.
    Perhaps providing the leaderboards and specifying the interface for test 4 will speed things up?

  14. #43
    Programmer schnaader's Avatar
    Join Date
    May 2008
    Location
    Hessen, Germany
    Posts
    630
    Thanks
    288
    Thanked 252 Times in 128 Posts
    Guess there also is some latency involved that delays submissions. For example, I'm quite busy with other things at the moment, but aim at doing a submission (perhaps next month?). The first steps are collecting potential algorithms/combinations that fit the time limit and finding the resulting ratios, though; it doesn't make sense for me to submit anything before that. Also, using Precomp as a base or submitting its base score isn't useful, as it basically would be similar to a base score of pure lzma2 or bzip2 (and none of the Precomp overhead really helps for the contest).

  15. #44
    Administrator Shelwien's Avatar
    Join Date
    May 2008
    Location
    Kharkov, Ukraine
    Posts
    4,133
    Thanks
    320
    Thanked 1,396 Times in 801 Posts
    > Guess there also is some latency involved that delays submissions.

    We also don't get many people reading the Rules page, so I suspect that the current GDC site design causes TLDR syndrome.

    > Using Precomp as a base or submitting its base score isn't useful

    Yeah, it's not supposed to involve recompression, since it's a lot less popular than "universal compression",
    so we'd not get much participation.
    For example, there's a practical task (in storage) of applying recompression to small independent blocks -
    deflate and jpeg recompression for these circumstances would be quite helpful, but would you find
    time to write a custom deflate recompressor for the middle blocks of a deflate stream?..
    Or an algorithm for reconstructing the huffman code from jpeg data without a header?

    But I'm quite sure that you can patch up a working combination of WRT+zstd or a 2D delta for image data.
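As a rough illustration of what a "2D delta for image data" could look like, here is a minimal sketch. It assumes 8-bit single-channel pixels stored row by row and uses the plane predictor left + up - upleft; the function names are hypothetical and nothing here is taken from the contest baseline or from any particular tool.

```python
def _predict(buf, i: int, width: int) -> int:
    """Plane predictor from already-known neighbours; missing neighbours count as 0."""
    x = i % width
    left = buf[i - 1] if x > 0 else 0
    up = buf[i - width] if i >= width else 0
    upleft = buf[i - width - 1] if (x > 0 and i >= width) else 0
    return (left + up - upleft) & 0xFF


def delta2d_encode(pixels: bytes, width: int) -> bytes:
    """Replace every pixel by its difference from the prediction (mod 256)."""
    return bytes((pixels[i] - _predict(pixels, i, width)) & 0xFF
                 for i in range(len(pixels)))


def delta2d_decode(residuals: bytes, width: int) -> bytes:
    """Invert delta2d_encode by rebuilding pixels in the same scan order."""
    out = bytearray(len(residuals))
    for i in range(len(residuals)):
        out[i] = (residuals[i] + _predict(out, i, width)) & 0xFF
    return bytes(out)


# A small 4x3 gradient image: pixel = x + 2*y.
image = bytes((x + 2 * y) & 0xFF for y in range(3) for x in range(4))
residuals = delta2d_encode(image, width=4)
assert delta2d_decode(residuals, width=4) == image
```

On smooth data most residuals become zero, which is what makes the output easier for a general-purpose coder placed after the filter.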

  16. #45
    Member lz77's Avatar
    Join Date
    Jan 2016
    Location
    Russia
    Posts
    176
    Thanks
    60
    Thanked 16 Times in 12 Posts
    Quote Originally Posted by Shelwien View Post
    Might be a bug in my filter script. But I don't think it's a good reason to change anything.
    In fact, isn't it a good dataset if it lets you detect preprocessor bugs?
    Sportman's preprocessor crashes because the "strange data" in #33 is larger than 1024 bytes but contains zero spaces. Well, we will write compressors for broken, substantially English text.
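The failure mode described here can be sketched as follows. This is a guess at the general shape of the problem, not Sportman's actual code: it assumes a word-based preprocessor with a fixed 1024-byte word limit, and both function names are hypothetical.

```python
def longest_run_without_space(data: bytes) -> int:
    """Return the length of the longest stretch that contains no space byte."""
    longest = 0
    current = 0
    for b in data:
        if b == 0x20:
            current = 0
        else:
            current += 1
            if current > longest:
                longest = current
    return longest


def split_words_bounded(data: bytes, max_len: int = 1024) -> list:
    """Split on spaces, cutting any over-long run into pieces of at most max_len bytes."""
    words = []
    for run in data.split(b" "):
        for start in range(0, len(run), max_len):
            words.append(run[start:start + max_len])
    return words


# Speaker names separated only by newlines, as in the excerpt from #33.
sample = b"MARIE\nCECILIA\n" * 100
print(longest_run_without_space(sample))             # length of the space-free run
print([len(w) for w in split_words_bounded(sample)]) # pieces never exceed max_len
```

Capping the piece length keeps the word buffer bounded no matter what the input looks like, at the cost of occasionally splitting a "word" in the middle.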

  17. #46
    Member
    Join Date
    Aug 2015
    Location
    indonesia
    Posts
    511
    Thanks
    63
    Thanked 95 Times in 74 Posts
    Why can't I send a submission for GDCC? I emailed globalcompetition@compression.ru and my submission was blocked.

  18. #47
    Member lz77's Avatar
    Join Date
    Jan 2016
    Location
    Russia
    Posts
    176
    Thanks
    60
    Thanked 16 Times in 12 Posts
    Ms1: Hopefully we will have a sufficient number of participants 1-2 months later to build the real leaderboards.
    https://globalcompetition.compression.ru/#leaderboards

    In the leaderboards, for lzturbo you could select the option -b1000 (-b400 for the test data), which gives slightly better compression...

  19. #48
    Member
    Join Date
    Apr 2020
    Location
    Russia
    Posts
    48
    Thanks
    3
    Thanked 47 Times in 17 Posts
    Quote Originally Posted by suryakandau@yahoo.co.id View Post
    Why can't I send a submission for GDCC? I emailed globalcompetition@compression.ru and my submission was blocked.
    Not blocked, but you are heading that way. We are glad that you stopped ignoring GPL, and the next step is showing and proving that the submitted compressors essentially differ from the original works created by other people.

  20. #49
    Member
    Join Date
    Apr 2020
    Location
    Russia
    Posts
    48
    Thanks
    3
    Thanked 47 Times in 17 Posts
    Quote Originally Posted by compgt View Post
    Tried entering lzuf2 as a test submission, but Gmail failed to send. Now the email is "queued".

    Will the sponsor, Huawei, own the submitted compressors? If not, will it buy the winning compressor?
    If possible, avoid attaching files. Mail servers nowadays know better than users. A link is safe (probably).

    There is no license grant or whatever. A submitted compressor belongs to the author(s). Actually, we don't send the executables to Huawei, believe it or not.
    We can't say if Huawei will be interested in buying something. But, apparently, certain divisions of the company are interested in the topic now.
    My own opinion: Reverse engineering, if you care, does not make sense. I hardly see how any big company may be doing this in such situations. If there will be something useful to reverse engineer, it will be cheaper to buy the author. Lock, stock, and barrel.

  21. Thanks:

    Hakan Abbas (1st August 2020)

  22. #50
    Member
    Join Date
    Apr 2020
    Location
    Russia
    Posts
    48
    Thanks
    3
    Thanked 47 Times in 17 Posts
    Quote Originally Posted by lz77 View Post
    In the leaderboards, for lzturbo you could select the option -b1000 (-b400 for the test data), which gives slightly better compression...
    Thanks, noted.
    I guess there are much better options in other situations as well, and we can try to find them automatically. We would be happy to get submissions from authors, though.

  23. #51
    Member lz77's Avatar
    Join Date
    Jan 2016
    Location
    Russia
    Posts
    176
    Thanks
    60
    Thanked 16 Times in 12 Posts
    From https://globalcompetition.compression.ru/test4api/ : "The size of the input buffer is the block size to be used in the test (inSize = 32,768 bytes)."

    I think it would be great to add 32 bytes to this inSize, because an LZ77 compressor may try to read a few (two or more) bytes past the end of the input buffer. Checking for out-of-bounds access to the buffer will cost time...
    Last edited by lz77; 3rd August 2020 at 15:06.
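The trade-off can be sketched like this. The block size is the inSize quoted above and the 32 bytes of padding are the suggestion from this post; the hash function, its multiplier and all function names are assumptions chosen for illustration. Python raises an error on an out-of-range read, whereas in C the same read would simply be out of bounds, which is why real compressors either check or pad.

```python
import struct

BLOCK_SIZE = 32768   # inSize quoted from the test 4 API page
PADDING = 32         # extra bytes suggested in the post


def _mix(word: int) -> int:
    """Illustrative multiplicative hash producing a 12-bit value."""
    return ((word * 2654435761) & 0xFFFFFFFF) >> 20


def hash4_checked(buf: bytes, pos: int) -> int:
    """Hash the 4 bytes at pos, paying for a bounds check on every call."""
    if pos + 4 > len(buf):
        return 0  # too close to the end for a 4-byte read
    (word,) = struct.unpack_from("<I", buf, pos)
    return _mix(word)


def hash4_padded(padded: bytes, pos: int) -> int:
    """Same hash without the check; safe for any pos inside the block thanks to padding."""
    (word,) = struct.unpack_from("<I", padded, pos)
    return _mix(word)


def pad_block(block: bytes) -> bytes:
    """Copy the block into a buffer with PADDING zero bytes appended."""
    return block + bytes(PADDING)


block = bytes(range(256)) * 128          # a 32,768-byte test block
padded = pad_block(block)
last = hash4_padded(padded, BLOCK_SIZE - 1)   # reads 3 padding bytes, no check needed
```

If the API does not provide the extra bytes, the compressor has to do this copy itself or stop its fast loop a few bytes before the end of the block.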

  24. #52
    Member
    Join Date
    Aug 2015
    Location
    indonesia
    Posts
    511
    Thanks
    63
    Thanked 95 Times in 74 Posts
    Quote Originally Posted by Ms1 View Post
    Not blocked, but you are heading that way. We are glad that you stopped ignoring GPL, and the next step is showing and proving that the submitted compressors essentially differ from the original works created by other people.
    @Ms1, have you received my newest submission for GDCC? Thank you.

  25. #53
    Member
    Join Date
    Apr 2018
    Location
    Indonesia
    Posts
    85
    Thanks
    21
    Thanked 5 Times in 5 Posts
    Quote Originally Posted by Shelwien View Post
    Any comments on the site design? https://globalcompetition.compression.ru/
    How do you think we can improve it to increase participation?
    I think that participants are not very interested because the prize is small.

  26. #54
    Member lz77's Avatar
    Join Date
    Jan 2016
    Location
    Russia
    Posts
    176
    Thanks
    60
    Thanked 16 Times in 12 Posts
    Quote Originally Posted by bwt View Post
    I think that participants are not very interested because the prize is small.
    I think so too. An ordinary programmer in a Google office gets $10,000 per month. Not every programmer wants to write a custom program for several months...

  27. #55
    Member
    Join Date
    Apr 2018
    Location
    Indonesia
    Posts
    85
    Thanks
    21
    Thanked 5 Times in 5 Posts
    Quote Originally Posted by lz77 View Post
    I think so too. An ordinary programmer in a Google office gets $10,000 per month. Not every programmer wants to write a custom program for several months...
    Yes, for several months of work the programmer is rewarded with only 3000 euros. It is so dramatic...

  28. #56
    Administrator Shelwien's Avatar
    Join Date
    May 2008
    Location
    Kharkov, Ukraine
    Posts
    4,133
    Thanks
    320
    Thanked 1,396 Times in 801 Posts
    Combining open-source preprocessors and coders hardly requires months of work.
    And on the other hand, a unique new work could claim prizes in multiple categories.

  29. #57
    Member
    Join Date
    Apr 2018
    Location
    Indonesia
    Posts
    85
    Thanks
    21
    Thanked 5 Times in 5 Posts
    Quote Originally Posted by Shelwien View Post
    Combining open-source preprocessors and coders hardly requires months of work.
    And on the other hand, a unique new work could claim prizes in multiple categories.
    In the GDCC notices thread, Ms1 said to do your own work, which means we cannot combine open-source code. It means we must code from scratch, and that is rewarded with only 3000 euros.

  30. #58
    Member
    Join Date
    Apr 2015
    Location
    Greece
    Posts
    127
    Thanks
    43
    Thanked 33 Times in 22 Posts
    The problem is that there are too many categories. If there were half as many, the prize would be 6000.

  31. Thanks:

    lz77 (11th August 2020)

  32. #59
    Member lz77's Avatar
    Join Date
    Jan 2016
    Location
    Russia
    Posts
    176
    Thanks
    60
    Thanked 16 Times in 12 Posts
    Quote Originally Posted by Shelwien View Post
    a unique new work could claim prizes in multiple categories.
    A unique LZ77+Huffman can claim a prize only in Rapid Compression of English text...

  33. #60
    Member
    Join Date
    Apr 2018
    Location
    Indonesia
    Posts
    85
    Thanks
    21
    Thanked 5 Times in 5 Posts
    Quote Originally Posted by algorithm View Post
    The problem is that there are too many categories. If there were half as many, the prize would be 6000.
    The most important thing is that you must have a fresh idea, write it from scratch, and improve it for several months.
