Thread: DNA Corpus

  1. #1
    Join Date
    Jan 2014
    Bothell, Washington, USA
    Thanked 186 Times in 109 Posts

    DNA Corpus

    This DNA corpus is referenced in several grammar compression papers, but it is hard to find. Even once you have found it, each file needs to be decompressed (ever so slightly) with gzip and then run through dnau, for which I could only find source code. It consists of the following 11 files: chmpxx, chntxx, hehcmv, humdyst, humghcs, humhbb, humhdab, humprtb, mpomtcg, mtpacga, and vaccg. Here they all are in a single .zip file.
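
    For anyone starting from the original gzipped files instead, the gzip step could be scripted roughly like this. It is only a sketch: the function name gunzip_corpus and the ".gz" suffix are my assumptions, not something taken from the original distribution, and the dnau step is left out on purpose because I only know that tool from its source code.

```python
import gzip
import shutil
from pathlib import Path

# The 11 corpus file names listed in the post.
CORPUS_FILES = [
    "chmpxx", "chntxx", "hehcmv", "humdyst", "humghcs", "humhbb",
    "humhdab", "humprtb", "mpomtcg", "mtpacga", "vaccg",
]


def gunzip_corpus(src_dir, dst_dir, suffix=".gz"):
    """Gunzip every corpus file found in src_dir into dst_dir.

    The ".gz" suffix is an assumption about how the files are named.
    Returns the list of corpus names that were not found in src_dir.
    """
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    missing = []
    for name in CORPUS_FILES:
        packed = src / (name + suffix)
        if not packed.exists():
            missing.append(name)
            continue
        # Stream the decompressed bytes to a file named after the corpus entry.
        with gzip.open(packed, "rb") as fin, open(dst / name, "wb") as fout:
            shutil.copyfileobj(fin, fout)
    return missing
```

    Calling gunzip_corpus("downloads", "unpacked") would write the decompressed files into "unpacked" and return whichever of the 11 names were not found, so a partial download is easy to spot. The output still has to go through dnau afterwards.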
    Attached Files

  2. Thanks (6):

    byronknoll (28th March 2019),comp1 (6th January 2015),encode (8th April 2019),Gotty (1st February 2021),Paul W. (6th January 2015),Sabrina (23rd June 2018)
