|
Yes, compression based on the (XML) Schema can indeed be much better than pure textual compression. This applies to data XML where the same message, with little or no structural variation, is transmitted many times. In this situation, the XML markup can be 80% of characters in the message. ZIP/GZIP have to compress the element/attribute names. Schema-based methods produce compress/decompressor pairs (on a per-Schema basis) that already know what the element/attribute names are, and so no bandwidth is wasted on encoding them. For many data messages, this makes a huge difference. For document XML the gains would be less, but for data XML it can be very worthwhile. Cheers, Tony. |