The ACM Journal of Experimental Algorithmics |
We focus on the application of FP in the context of the LZW variant of the Lempel-Ziv dictionary construction method, which is of considerable practical interest. We implement two compression algorithms which use (i) FP with LZW dictionary (LZW-FP) and (ii) FP with an alternative flexible dictionary (FPA). Our implementations are based on novel online data structures enabling us to use linear time and space. We test our implementations on a collection of input sequences which include textual files, DNA sequences, medical images, and speudorandom binary files, and compare our results with compress and gzip. Our results demonstrate that flexible parsing is especially useful for non-textual data, on which it improves over the compression rates of compress and gzip by up to 20% and 35%, respectively.
Keywords: Experimental Algorithms; parsing; data compression.
Sources: