When people write compression code. They general have several stages with often the last stage being an entropy compressor. The question is what is the best approach
for this last stage. Do you general assume a nonstationary data stream at this point so that you don't use a pure entropy coder so that you make up for the mistakes in models leading to this last stage by tuning it to various files. Or do you make it as pure as possible and tune the preceding stages to give a more stationary stream of data to this final state of entropy compression. Or does one just take a set of files and try to tune the whole set of passes to make it work well on some set of data.
I am just curious what other people think. I feel most know my thoughts in this area so please feel free to discuss your own. Or do most people think the same. Also do you like to work in binary for last stage or some larger set?