| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 1 | :mod:`bz2` --- Support for :program:`bzip2` compression | 
 | 2 | ======================================================= | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 3 |  | 
 | 4 | .. module:: bz2 | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 5 |    :synopsis: Interfaces for bzip2 compression and decompression. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 6 | .. moduleauthor:: Gustavo Niemeyer <niemeyer@conectiva.com> | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 7 | .. moduleauthor:: Nadeem Vawda <nadeem.vawda@gmail.com> | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 8 | .. sectionauthor:: Gustavo Niemeyer <niemeyer@conectiva.com> | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 9 | .. sectionauthor:: Nadeem Vawda <nadeem.vawda@gmail.com> | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 10 |  | 
 | 11 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 12 | This module provides a comprehensive interface for compressing and | 
 | 13 | decompressing data using the bzip2 compression algorithm. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 14 |  | 
| Nadeem Vawda | 3ff069e | 2011-11-30 00:25:06 +0200 | [diff] [blame] | 15 | For related file formats, see the :mod:`gzip`, :mod:`lzma`, :mod:`zipfile`, and | 
| Guido van Rossum | 7767711 | 2007-11-05 19:43:04 +0000 | [diff] [blame] | 16 | :mod:`tarfile` modules. | 
 | 17 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 18 | The :mod:`bz2` module contains: | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 19 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 20 | * The :class:`BZ2File` class for reading and writing compressed files. | 
 | 21 | * The :class:`BZ2Compressor` and :class:`BZ2Decompressor` classes for | 
 | 22 |   incremental (de)compression. | 
 | 23 | * The :func:`compress` and :func:`decompress` functions for one-shot | 
 | 24 |   (de)compression. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 25 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 26 | All of the classes in this module may safely be accessed from multiple threads. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 27 |  | 
 | 28 |  | 
 | 29 | (De)compression of files | 
 | 30 | ------------------------ | 
 | 31 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 32 | .. class:: BZ2File(filename=None, mode='r', buffering=None, compresslevel=9, fileobj=None) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 33 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 34 |    Open a bzip2-compressed file. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 35 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 36 |    The :class:`BZ2File` can wrap an existing :term:`file object` (given by | 
 | 37 |    *fileobj*), or operate directly on a named file (named by *filename*). | 
 | 38 |    Exactly one of these two parameters should be provided. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 39 |  | 
| Nadeem Vawda | 200e00a | 2011-05-27 01:52:16 +0200 | [diff] [blame] | 40 |    The *mode* argument can be either ``'r'`` for reading (default), ``'w'`` for | 
 | 41 |    overwriting, or ``'a'`` for appending. If *fileobj* is provided, a mode of | 
 | 42 |    ``'w'`` does not truncate the file, and is instead equivalent to ``'a'``. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 43 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 44 |    The *buffering* argument is ignored. Its use is deprecated. | 
 | 45 |  | 
| Nadeem Vawda | 200e00a | 2011-05-27 01:52:16 +0200 | [diff] [blame] | 46 |    If *mode* is ``'w'`` or ``'a'``, *compresslevel* can be a number between | 
 | 47 |    ``1`` and ``9`` specifying the level of compression: ``1`` produces the | 
 | 48 |    least compression, and ``9`` (default) produces the most compression. | 
 | 49 |  | 
 | 50 |    If *mode* is ``'r'``, the input file may be the concatenation of multiple | 
 | 51 |    compressed streams. | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 52 |  | 
 | 53 |    :class:`BZ2File` provides all of the members specified by the | 
 | 54 |    :class:`io.BufferedIOBase`, except for :meth:`detach` and :meth:`truncate`. | 
 | 55 |    Iteration and the :keyword:`with` statement are supported. | 
 | 56 |  | 
 | 57 |    :class:`BZ2File` also provides the following method: | 
 | 58 |  | 
 | 59 |    .. method:: peek([n]) | 
 | 60 |  | 
 | 61 |       Return buffered data without advancing the file position. At least one | 
 | 62 |       byte of data will be returned (unless at EOF). The exact number of bytes | 
 | 63 |       returned is unspecified. | 
 | 64 |  | 
 | 65 |       .. versionadded:: 3.3 | 
| Benjamin Peterson | e0124bd | 2009-03-09 21:04:33 +0000 | [diff] [blame] | 66 |  | 
| Benjamin Peterson | 10745a9 | 2009-03-09 21:08:47 +0000 | [diff] [blame] | 67 |    .. versionchanged:: 3.1 | 
| Benjamin Peterson | e0124bd | 2009-03-09 21:04:33 +0000 | [diff] [blame] | 68 |       Support for the :keyword:`with` statement was added. | 
 | 69 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 70 |    .. versionchanged:: 3.3 | 
 | 71 |       The :meth:`fileno`, :meth:`readable`, :meth:`seekable`, :meth:`writable`, | 
 | 72 |       :meth:`read1` and :meth:`readinto` methods were added. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 73 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 74 |    .. versionchanged:: 3.3 | 
 | 75 |       The *fileobj* argument to the constructor was added. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 76 |  | 
| Nadeem Vawda | 200e00a | 2011-05-27 01:52:16 +0200 | [diff] [blame] | 77 |    .. versionchanged:: 3.3 | 
 | 78 |       The ``'a'`` (append) mode was added, along with support for reading | 
 | 79 |       multi-stream files. | 
 | 80 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 81 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 82 | Incremental (de)compression | 
 | 83 | --------------------------- | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 84 |  | 
| Georg Brandl | 0d8f073 | 2009-04-05 22:20:44 +0000 | [diff] [blame] | 85 | .. class:: BZ2Compressor(compresslevel=9) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 86 |  | 
 | 87 |    Create a new compressor object. This object may be used to compress data | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 88 |    incrementally. For one-shot compression, use the :func:`compress` function | 
 | 89 |    instead. | 
 | 90 |  | 
 | 91 |    *compresslevel*, if given, must be a number between ``1`` and ``9``. The | 
 | 92 |    default is ``9``. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 93 |  | 
| Benjamin Peterson | e41251e | 2008-04-25 01:59:09 +0000 | [diff] [blame] | 94 |    .. method:: compress(data) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 95 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 96 |       Provide data to the compressor object. Returns a chunk of compressed data | 
 | 97 |       if possible, or an empty byte string otherwise. | 
 | 98 |  | 
 | 99 |       When you have finished providing data to the compressor, call the | 
 | 100 |       :meth:`flush` method to finish the compression process. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 101 |  | 
 | 102 |  | 
| Benjamin Peterson | e41251e | 2008-04-25 01:59:09 +0000 | [diff] [blame] | 103 |    .. method:: flush() | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 104 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 105 |       Finish the compression process. Returns the compressed data left in | 
 | 106 |       internal buffers. | 
 | 107 |  | 
 | 108 |       The compressor object may not be used after this method has been called. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 109 |  | 
 | 110 |  | 
 | 111 | .. class:: BZ2Decompressor() | 
 | 112 |  | 
 | 113 |    Create a new decompressor object. This object may be used to decompress data | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 114 |    incrementally. For one-shot compression, use the :func:`decompress` function | 
 | 115 |    instead. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 116 |  | 
| Nadeem Vawda | 200e00a | 2011-05-27 01:52:16 +0200 | [diff] [blame] | 117 |    .. note:: | 
 | 118 |       This class does not transparently handle inputs containing multiple | 
 | 119 |       compressed streams, unlike :func:`decompress` and :class:`BZ2File`. If | 
 | 120 |       you need to decompress a multi-stream input with :class:`BZ2Decompressor`, | 
 | 121 |       you must use a new decompressor for each stream. | 
 | 122 |  | 
| Benjamin Peterson | e41251e | 2008-04-25 01:59:09 +0000 | [diff] [blame] | 123 |    .. method:: decompress(data) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 124 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 125 |       Provide data to the decompressor object. Returns a chunk of decompressed | 
 | 126 |       data if possible, or an empty byte string otherwise. | 
 | 127 |  | 
| Nadeem Vawda | 200e00a | 2011-05-27 01:52:16 +0200 | [diff] [blame] | 128 |       Attempting to decompress data after the end of the current stream is | 
 | 129 |       reached raises an :exc:`EOFError`. If any data is found after the end of | 
 | 130 |       the stream, it is ignored and saved in the :attr:`unused_data` attribute. | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 131 |  | 
 | 132 |  | 
 | 133 |    .. attribute:: eof | 
 | 134 |  | 
 | 135 |       True if the end-of-stream marker has been reached. | 
 | 136 |  | 
 | 137 |       .. versionadded:: 3.3 | 
 | 138 |  | 
 | 139 |  | 
 | 140 |    .. attribute:: unused_data | 
 | 141 |  | 
 | 142 |       Data found after the end of the compressed stream. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 143 |  | 
| Nadeem Vawda | 200e00a | 2011-05-27 01:52:16 +0200 | [diff] [blame] | 144 |       If this attribute is accessed before the end of the stream has been | 
 | 145 |       reached, its value will be ``b''``. | 
 | 146 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 147 |  | 
 | 148 | One-shot (de)compression | 
 | 149 | ------------------------ | 
 | 150 |  | 
| Georg Brandl | 0d8f073 | 2009-04-05 22:20:44 +0000 | [diff] [blame] | 151 | .. function:: compress(data, compresslevel=9) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 152 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 153 |    Compress *data*. | 
 | 154 |  | 
 | 155 |    *compresslevel*, if given, must be a number between ``1`` and ``9``. The | 
 | 156 |    default is ``9``. | 
 | 157 |  | 
 | 158 |    For incremental compression, use a :class:`BZ2Compressor` instead. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 159 |  | 
 | 160 |  | 
 | 161 | .. function:: decompress(data) | 
 | 162 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 163 |    Decompress *data*. | 
 | 164 |  | 
| Nadeem Vawda | 200e00a | 2011-05-27 01:52:16 +0200 | [diff] [blame] | 165 |    If *data* is the concatenation of multiple compressed streams, decompress | 
 | 166 |    all of the streams. | 
 | 167 |  | 
| Antoine Pitrou | 37dc5f8 | 2011-04-03 17:05:46 +0200 | [diff] [blame] | 168 |    For incremental decompression, use a :class:`BZ2Decompressor` instead. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 169 |  | 
| Nadeem Vawda | 200e00a | 2011-05-27 01:52:16 +0200 | [diff] [blame] | 170 |    .. versionchanged:: 3.3 | 
 | 171 |       Support for multi-stream inputs was added. | 
 | 172 |  |