Most I guess?
Ratio-wise its not so simple, as BWT has more problems with local adaptation,
also PPM results on binary files can be improved with sparse models etc,
while its not really possible for BWT.
But sparse models are only available in ppmonstr, while ppmd is pretty similar to plain BWT.
Code:
21,388,296 33.473s 34.148s // ppmd -o10 -m256 -r1
22,325,416 19.387s 19.849s // ppmd -o10 -m256 -r0
27,343,261 12.084s 13.082s // ppms -o8
21,688,998 12.937s 8.294s // bsc -b51 -m0 -cp -e2 -p -t -T (256MB memory usage)
21,835,062 11.412s 6.656s // bsc -b51 -m0 -cp -e1 -p -t -T (faster entropy model)
21,820,628 11.600s 6.734s // bsc -b51 -m0 -cp -e1 -t -T (preproc enabled)
21,323,232 14.400s 15.763s // bcm 2.00a3 c51r
Maniscalco's and Lucas' works are probably better than old bsc,
but would likely require more preparation work for fair comparison.