\section{\module{zlib} ---
         Compression compatible with \program{gzip}}

\declaremodule{builtin}{zlib}
\modulesynopsis{Low-level interface to compression and decompression
                routines compatible with \program{gzip}.}


For applications that require data compression, the functions in this
module allow compression and decompression, using the zlib library.
The zlib library has its own home page at
\url{http://www.gzip.org/zlib/}.  Version 1.1.3 is the
most recent version as of September 2000; use a later version if one
is available.  There are known incompatibilities between the Python
module and earlier versions of the zlib library.

The available exception and functions in this module are:

\begin{excdesc}{error}
  Exception raised on compression and decompression errors.
\end{excdesc}


\begin{funcdesc}{adler32}{string\optional{, value}}
  Computes an Adler-32 checksum of \var{string}.  (An Adler-32
  checksum is almost as reliable as a CRC32 but can be computed much
  more quickly.)  If \var{value} is present, it is used as the
  starting value of the checksum; otherwise, a fixed default value is
  used.  This allows computing a running checksum over the
  concatenation of several input strings.  The algorithm is not
  cryptographically strong, and should not be used for
  authentication or digital signatures.
\end{funcdesc}
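
The running-checksum use of \var{value} can be sketched as follows.
This is only an illustration, assuming a current Python 3 interpreter
in which the string arguments are bytes objects; the names
\code{chunks}, \code{whole} and \code{running} are made up for the
example.

```python
import zlib

# Assumption: Python 3, so the "strings" are bytes objects.
chunks = [b"hello, ", b"zlib ", b"world"]

# Checksum of the whole input, computed in a single call.
whole = zlib.adler32(b"".join(chunks))

# Running checksum: start from the fixed default value (the checksum
# of empty input) and feed each result back in as the next start value.
running = zlib.adler32(b"")
for chunk in chunks:
    running = zlib.adler32(chunk, running)

print(running == whole)
```

Both routes should arrive at the same checksum, so the pieces never
have to be joined in memory.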

\begin{funcdesc}{compress}{string\optional{, level}}
  Compresses the data in \var{string}, returning a string containing
  compressed data.  \var{level} is an integer from \code{1} to
  \code{9} controlling the level of compression; \code{1} is fastest
  and produces the least compression, \code{9} is slowest and produces
  the most.  The default value is \code{6}.  Raises the
  \exception{error} exception if any error occurs.
\end{funcdesc}
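
A minimal sketch of one-shot compression at different levels,
assuming a current Python 3 interpreter where the data is a bytes
object.  The sample data and variable names are hypothetical, and the
sizes printed depend on the zlib library in use.

```python
import zlib

# Assumption: Python 3, so the data is a bytes object.
data = b"spam and eggs " * 1000

fastest = zlib.compress(data, 1)    # least effort
default = zlib.compress(data)       # library default level
smallest = zlib.compress(data, 9)   # most effort

for label, blob in (("level 1", fastest),
                    ("default", default),
                    ("level 9", smallest)):
    print(label, len(blob), "bytes")

# Every level produces a stream that decompresses to the same data.
restored = zlib.decompress(smallest)
```

The level affects only the time spent and the size of the result; the
decompressed data should be identical in each case.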

\begin{funcdesc}{compressobj}{\optional{level}}
  Returns a compression object, to be used for compressing data streams
  that won't fit into memory at once.  \var{level} is an integer from
  \code{1} to \code{9} controlling the level of compression; \code{1} is
  fastest and produces the least compression, \code{9} is slowest and
  produces the most.  The default value is \code{6}.
\end{funcdesc}

\begin{funcdesc}{crc32}{string\optional{, value}}
  Computes a CRC (Cyclic Redundancy Check)%
  \index{Cyclic Redundancy Check}
  \index{checksum!Cyclic Redundancy Check}
  checksum of \var{string}.  If
  \var{value} is present, it is used as the starting value of the
  checksum; otherwise, a fixed default value is used.  This allows
  computing a running checksum over the concatenation of several
  input strings.  The algorithm is not cryptographically strong, and
  should not be used for authentication or digital signatures.
\end{funcdesc}
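
As an illustration of the running checksum, the sketch below computes
the CRC of a file-like object one block at a time.  It assumes a
current Python 3 interpreter (bytes input); the helper
\code{crc32_of_stream()} and its \code{block_size} parameter are
hypothetical names, not part of this module.

```python
import io
import zlib

def crc32_of_stream(stream, block_size=4096):
    """Return the CRC of everything readable from a binary file-like
    object, without holding more than one block in memory.
    (Hypothetical helper, not part of the zlib module.)"""
    value = zlib.crc32(b"")          # fixed default starting value
    while True:
        block = stream.read(block_size)
        if not block:
            break
        # Feed the previous result back in as the starting value.
        value = zlib.crc32(block, value)
    return value

payload = b"0123456789" * 5000
streamed = crc32_of_stream(io.BytesIO(payload), block_size=1024)
print(streamed == zlib.crc32(payload))
```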

\begin{funcdesc}{decompress}{string\optional{, wbits\optional{, bufsize}}}
  Decompresses the data in \var{string}, returning a string containing
  the uncompressed data.  The \var{wbits} parameter controls the size of
  the window buffer.  If \var{bufsize} is given, it is used as the
  initial size of the output buffer.  Raises the \exception{error}
  exception if any error occurs.

The absolute value of \var{wbits} is the base two logarithm of the
size of the history buffer (the ``window size'') used when compressing
data.  Its absolute value should be between 8 and 15 for the most
recent versions of the zlib library, larger values resulting in better
compression at the expense of greater memory usage.  The default value
is 15.  When \var{wbits} is negative, the standard
\program{gzip} header is suppressed; this is an undocumented feature
of the zlib library, used for compatibility with \program{unzip}'s
compression file format.

\var{bufsize} is the initial size of the buffer used to hold
decompressed data.  If more space is required, the buffer size will be
increased as needed, so you don't have to get this value exactly
right; tuning it will only save a few calls to \cfunction{malloc()}.  The
default size is 16384.

\end{funcdesc}
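
The effect of \var{wbits} and \var{bufsize} can be sketched as
follows, assuming a current Python 3 interpreter (bytes input).  The
headerless stream is produced here with extra \var{method} and
\var{wbits} arguments to \function{compressobj()}; those arguments
exist in current Python releases but are not described in this
section, so treat their use as an assumption of the example.

```python
import zlib

# Assumption: Python 3, so the data is a bytes object.
data = b"window size demo " * 200

# Ordinary stream: the defaults and the explicit values behave alike.
packed = zlib.compress(data)
same = zlib.decompress(packed, 15, 65536)   # wbits=15, larger bufsize

# Headerless stream, written with a negative wbits value.
# Assumption: compressobj() accepts (level, method, wbits).
compressor = zlib.compressobj(6, zlib.DEFLATED, -15)
raw = compressor.compress(data) + compressor.flush()

# Reading it back needs a negative wbits as well.
unwrapped = zlib.decompress(raw, -15)

# With the default wbits the missing header is reported as an error.
try:
    zlib.decompress(raw)
    header_rejected = False
except zlib.error:
    header_rejected = True
```

Changing \var{bufsize} should not change the result, only how often
the output buffer has to grow.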

\begin{funcdesc}{decompressobj}{\optional{wbits}}
  Returns a decompression object, to be used for decompressing data
  streams that won't fit into memory at once.  The \var{wbits}
  parameter controls the size of the window buffer.
\end{funcdesc}

Compression objects support the following methods:

\begin{methoddesc}[Compress]{compress}{string}
Compress \var{string}, returning a string containing compressed data
for at least part of the data in \var{string}.  This data should be
concatenated to the output produced by any preceding calls to the
\method{compress()} method.  Some input may be kept in internal buffers
for later processing.
\end{methoddesc}

\begin{methoddesc}[Compress]{flush}{\optional{mode}}
All pending input is processed, and a string containing the remaining
compressed output is returned.  \var{mode} can be selected from the
constants \constant{Z_SYNC_FLUSH},  \constant{Z_FULL_FLUSH},  or
\constant{Z_FINISH}, defaulting to \constant{Z_FINISH}.  \constant{Z_SYNC_FLUSH} and
\constant{Z_FULL_FLUSH} allow compressing further strings of data and
are used to allow partial error recovery on decompression, while
\constant{Z_FINISH} finishes the compressed stream and
prevents compressing any more data.  After calling
\method{flush()} with \var{mode} set to \constant{Z_FINISH}, the
\method{compress()} method cannot be called again; the only realistic
action is to delete the object.
\end{methoddesc}
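
The two methods of a compression object might be combined as in the
sketch below, which assumes a current Python 3 interpreter (bytes
input).  The generator \code{compress_chunks()} is a hypothetical
helper written for this example, not part of the module.

```python
import zlib

def compress_chunks(chunks, level=6):
    """Yield the pieces of a single compressed stream built from an
    iterable of bytes chunks.  (Hypothetical helper.)"""
    compressor = zlib.compressobj(level)
    for chunk in chunks:
        piece = compressor.compress(chunk)
        if piece:   # may be empty while input sits in internal buffers
            yield piece
    yield compressor.flush()   # Z_FINISH: ends the stream

chunks = [b"first block ", b"second block ", b"third block"]
stream = b"".join(compress_chunks(chunks))

# Z_SYNC_FLUSH pushes out everything compressed so far, but leaves
# the compression object usable for further data.
compressor = zlib.compressobj()
part = compressor.compress(b"partial ") + compressor.flush(zlib.Z_SYNC_FLUSH)
rest = compressor.compress(b"message") + compressor.flush()

decompressor = zlib.decompressobj()
early = decompressor.decompress(part)   # available before the stream ends
late = decompressor.decompress(rest)
```

The pieces must be concatenated in the order they were produced; only
the final \method{flush()} call completes the stream.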

Decompression objects support the following methods, and a single attribute:

\begin{memberdesc}{unused_data}
A string which contains any unused data from the last string fed to
this decompression object.  If the whole string turned out to contain
compressed data, this is \code{""}, the empty string.

The only way to determine where a string of compressed data ends is by
actually decompressing it.  This means that when compressed data is
part of a larger file, you can only find the end of it by
reading data and feeding it into a decompression object's
\method{decompress()} method until the \member{unused_data} attribute is
no longer the empty string.
\end{memberdesc}
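
Locating the end of compressed data inside a larger string might look
like the sketch below.  It assumes a current Python 3 interpreter
(bytes input); the trailing bytes, the block size of 8 and the
variable names are made up for the example.

```python
import zlib

# Assumption: Python 3, so the data is a bytes object.
payload = zlib.compress(b"embedded record")
blob = payload + b"TRAILING BYTES"      # compressed data, then other data

decompressor = zlib.decompressobj()
output = b""
position = 0

# Feed small blocks until something is left over in unused_data.
while position < len(blob) and not decompressor.unused_data:
    block = blob[position:position + 8]
    output += decompressor.decompress(block)
    position += len(block)

# Bytes that were fed but not used lie beyond the compressed data.
end_of_stream = position - len(decompressor.unused_data)
remainder = blob[end_of_stream:]
```

Note that if nothing follows the compressed data,
\member{unused_data} stays empty and the loop simply stops at the end
of the input.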

\begin{methoddesc}[Decompress]{decompress}{string}
Decompress \var{string}, returning a string containing the
uncompressed data corresponding to at least part of the data in
\var{string}.  This data should be concatenated to the output produced
by any preceding calls to the
\method{decompress()} method.  Some of the input data may be preserved
in internal buffers for later processing.
\end{methoddesc}

\begin{methoddesc}[Decompress]{flush}{}
All pending input is processed, and a string containing the remaining
uncompressed output is returned.  After calling \method{flush()}, the
\method{decompress()} method cannot be called again; the only realistic
action is to delete the object.
\end{methoddesc}
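
A sketch of incremental decompression using both methods, assuming a
current Python 3 interpreter (bytes input).  The generator
\code{decompress_chunks()} and the chunk size of 64 are hypothetical
choices made for the example.

```python
import zlib

def decompress_chunks(chunks, wbits=15):
    """Yield uncompressed pieces for an iterable of compressed bytes
    chunks that together form one stream.  (Hypothetical helper.)"""
    decompressor = zlib.decompressobj(wbits)
    for chunk in chunks:
        piece = decompressor.decompress(chunk)
        if piece:   # some calls may return nothing yet
            yield piece
    tail = decompressor.flush()   # whatever output is still pending
    if tail:
        yield tail

original = b"streamed text " * 500
packed = zlib.compress(original)

# Pretend the compressed data arrives 64 bytes at a time.
pieces = [packed[i:i + 64] for i in range(0, len(packed), 64)]
restored = b"".join(decompress_chunks(pieces))
```

Because \method{flush()} is called only after the last chunk, the
decompression object is not used again afterwards.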

\begin{seealso}
  \seemodule{gzip}{Reading and writing \program{gzip}-format files.}
  \seeurl{http://www.gzip.org/zlib/}{The zlib library home page.}
\end{seealso}