http://nishi.dreamhosters.com/u/bsc240_qlfc_v0.rar
Looks like I've been too optimistic about it.
1. Actually there're 2-3 rc calls per byte, <1 is only possible with something highly redundant
2. There's no huge speed difference with (much simpler) direct CMs
Code:
book1bwt:
l = 768771 bytes = 6150168 bits, rc_count=2152534
enwik8bwt:
l = 100000004 bytes = 800000032 bits, rc_count=214320941
book1bwt enwik8bwt enctime dectime
216816 21334589 6.344s 6.688s // ic110_no_PGO, mixtest_v2, static linear mix
216098 21204128 6.797s 7.156s // ic110_no_PGO, mixtest_v2, adaptive linear mix
214695 21065497 9.954s 10.609s // ic110_no_PGO, mixtest_v3, adaptive logistic mix
216838 21337208 6.015s 6.797s // ic111, o01_v0, static linear mix
212088 20787423 5.813s 6.468s // ic110, bsc 2.4.0 "slow" qlfc
5.703s 6.500s // ic111
8.094s 8.000s // ic111_no_PGO
6.891s 8.297s // gcc450