Some time ago somebody was talking about a test data generator for testing compression ratios.
I could use such a generator: one that creates repeated sequences of different lengths at different distances apart, plus some RLE runs, with different probability distributions for the byte values.
I just don't remember the URL or who here developed it.
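
To make the request concrete, here is a rough Python sketch of the kind of output I mean. It is not the tool I am trying to find. The function name make_test_data, all of its parameters, and the 1/(b+1)^skew byte weighting are my own assumptions, chosen only for illustration.

```python
import random

def make_test_data(size, seed=0, skew=2.0, p_match=0.3, p_run=0.1,
                   max_len=64, max_dist=4096):
    """Return `size` bytes mixing literals, repeated sequences and RLE runs.

    Hypothetical helper: every parameter name here is an assumption.
    """
    rng = random.Random(seed)
    # Skewed byte distribution: byte value b gets weight 1 / (b + 1) ** skew.
    # skew=0.0 gives a uniform distribution; larger values favour low bytes.
    weights = [1.0 / (b + 1) ** skew for b in range(256)]
    out = bytearray()
    while len(out) < size:
        r = rng.random()
        length = rng.randint(3, max_len)
        if r < p_match and len(out) > 0:
            # Repeated sequence: copy `length` bytes starting `dist` bytes back.
            # Copying byte by byte lets the copy overlap its own output,
            # as an LZ77-style match may.
            dist = rng.randint(1, min(max_dist, len(out)))
            start = len(out) - dist
            for i in range(length):
                out.append(out[start + i])
        elif r < p_match + p_run:
            # RLE run: one byte value repeated `length` times.
            value = rng.choices(range(256), weights)[0]
            out.extend(bytes([value]) * length)
        else:
            # Literals drawn independently from the skewed distribution.
            out.extend(rng.choices(range(256), weights, k=length))
    # The last segment may overshoot, so trim to the requested size.
    return bytes(out[:size])

if __name__ == "__main__":
    with open("testdata.bin", "wb") as f:
        f.write(make_test_data(1 << 20, seed=1))
```

Varying p_match, p_run, max_len, max_dist and skew should give files with different mixes of matches, runs and byte statistics, and a fixed seed makes a given file reproducible.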