What do state-of-the-art compressors do to compress x86 binaries? (Besides relative/absolute address conversion for a couple of opcodes.)
Does anybody have a fast table-driven stream-splitter for x86 code lying around? I found the filter in kkrunchy, but it's got all sorts of nested conditionals in it.
Seems like a 64K-element table would work to do aligned byte pairs fast, if you rig the table to handle byte pairs that my have have one two-byte opcode or two one-byte opcodes, or a one-byte opcode and a byte of operand. (But maybe not worth it cache-locality-wise, depending on how often you're hitting weird stuff in the table.)