Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 1 | :mod:`zipfile` --- Work with ZIP archives |
| 2 | ========================================= |
| 3 | |
| 4 | .. module:: zipfile |
| 5 | :synopsis: Read and write ZIP-format archive files. |
| 6 | .. moduleauthor:: James C. Ahlstrom <jim@interet.com> |
| 7 | .. sectionauthor:: James C. Ahlstrom <jim@interet.com> |
| 8 | |
Raymond Hettinger | 469271d | 2011-01-27 20:38:46 +0000 | [diff] [blame] | 9 | **Source code:** :source:`Lib/zipfile.py` |
| 10 | |
| 11 | -------------- |
| 12 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 13 | The ZIP file format is a common archive and compression standard. This module |
| 14 | provides tools to create, read, write, append, and list a ZIP file. Any |
| 15 | advanced use of this module will require an understanding of the format, as |
| 16 | defined in `PKZIP Application Note |
Christian Heimes | dd15f6c | 2008-03-16 00:07:10 +0000 | [diff] [blame] | 17 | <http://www.pkware.com/documents/casestudies/APPNOTE.TXT>`_. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 18 | |
Georg Brandl | 98be996 | 2010-08-02 20:52:10 +0000 | [diff] [blame] | 19 | This module does not currently handle multi-disk ZIP files. |
| 20 | It can handle ZIP files that use the ZIP64 extensions |
Guido van Rossum | 7767711 | 2007-11-05 19:43:04 +0000 | [diff] [blame] | 21 | (that is ZIP files that are more than 4 GByte in size). It supports |
| 22 | decryption of encrypted files in ZIP archives, but it currently cannot |
Christian Heimes | fdab48e | 2008-01-20 09:06:41 +0000 | [diff] [blame] | 23 | create an encrypted file. Decryption is extremely slow as it is |
Benjamin Peterson | 2021100 | 2009-11-25 18:34:42 +0000 | [diff] [blame] | 24 | implemented in native Python rather than C. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 25 | |
Guido van Rossum | 7767711 | 2007-11-05 19:43:04 +0000 | [diff] [blame] | 26 | For other archive formats, see the :mod:`bz2`, :mod:`gzip`, and |
| 27 | :mod:`tarfile` modules. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 28 | |
Guido van Rossum | 7767711 | 2007-11-05 19:43:04 +0000 | [diff] [blame] | 29 | The module defines the following items: |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 30 | |
Georg Brandl | 4d54088 | 2010-10-28 06:42:33 +0000 | [diff] [blame] | 31 | .. exception:: BadZipFile |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 32 | |
| 33 | The error raised for bad ZIP files (old name: ``zipfile.error``). |
| 34 | |
Georg Brandl | 4d54088 | 2010-10-28 06:42:33 +0000 | [diff] [blame] | 35 | .. versionadded:: 3.2 |
| 36 | |
| 37 | |
| 38 | .. exception:: BadZipfile |
| 39 | |
| 40 | This is an alias for :exc:`BadZipFile` that exists for compatibility with |
| 41 | Python versions prior to 3.2. Usage is deprecated. |
| 42 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 43 | |
| 44 | .. exception:: LargeZipFile |
| 45 | |
| 46 | The error raised when a ZIP file would require ZIP64 functionality but that has |
| 47 | not been enabled. |
| 48 | |
| 49 | |
| 50 | .. class:: ZipFile |
Georg Brandl | 5e92a50 | 2010-11-12 06:20:12 +0000 | [diff] [blame] | 51 | :noindex: |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 52 | |
| 53 | The class for reading and writing ZIP files. See section |
| 54 | :ref:`zipfile-objects` for constructor details. |
| 55 | |
| 56 | |
| 57 | .. class:: PyZipFile |
Georg Brandl | 8334fd9 | 2010-12-04 10:26:46 +0000 | [diff] [blame] | 58 | :noindex: |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 59 | |
| 60 | Class for creating ZIP archives containing Python libraries. |
| 61 | |
| 62 | |
Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 63 | .. class:: ZipInfo(filename='NoName', date_time=(1980,1,1,0,0,0)) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 64 | |
| 65 | Class used to represent information about a member of an archive. Instances |
| 66 | of this class are returned by the :meth:`getinfo` and :meth:`infolist` |
| 67 | methods of :class:`ZipFile` objects. Most users of the :mod:`zipfile` module |
| 68 | will not need to create these, but only use those created by this |
| 69 | module. *filename* should be the full name of the archive member, and |
| 70 | *date_time* should be a tuple containing six fields which describe the time |
| 71 | of the last modification to the file; the fields are described in section |
| 72 | :ref:`zipinfo-objects`. |
| 73 | |
| 74 | |
| 75 | .. function:: is_zipfile(filename) |
| 76 | |
| 77 | Returns ``True`` if *filename* is a valid ZIP file based on its magic number, |
Antoine Pitrou | db5fe66 | 2008-12-27 15:50:40 +0000 | [diff] [blame] | 78 | otherwise returns ``False``. *filename* may be a file or file-like object too. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 79 | |
Georg Brandl | 277a150 | 2009-01-04 00:28:14 +0000 | [diff] [blame] | 80 | .. versionchanged:: 3.1 |
| 81 | Support for file and file-like objects. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 82 | |
Georg Brandl | 67b21b7 | 2010-08-17 15:07:14 +0000 | [diff] [blame] | 83 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 84 | .. data:: ZIP_STORED |
| 85 | |
| 86 | The numeric constant for an uncompressed archive member. |
| 87 | |
| 88 | |
| 89 | .. data:: ZIP_DEFLATED |
| 90 | |
| 91 | The numeric constant for the usual ZIP compression method. This requires the |
| 92 | zlib module. No other compression methods are currently supported. |
| 93 | |
| 94 | |
| 95 | .. seealso:: |
| 96 | |
Christian Heimes | dd15f6c | 2008-03-16 00:07:10 +0000 | [diff] [blame] | 97 | `PKZIP Application Note <http://www.pkware.com/documents/casestudies/APPNOTE.TXT>`_ |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 98 | Documentation on the ZIP file format by Phil Katz, the creator of the format and |
| 99 | algorithms used. |
| 100 | |
| 101 | `Info-ZIP Home Page <http://www.info-zip.org/>`_ |
| 102 | Information about the Info-ZIP project's ZIP archive programs and development |
| 103 | libraries. |
| 104 | |
| 105 | |
| 106 | .. _zipfile-objects: |
| 107 | |
| 108 | ZipFile Objects |
| 109 | --------------- |
| 110 | |
| 111 | |
Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 112 | .. class:: ZipFile(file, mode='r', compression=ZIP_STORED, allowZip64=False) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 113 | |
| 114 | Open a ZIP file, where *file* can be either a path to a file (a string) or a |
| 115 | file-like object. The *mode* parameter should be ``'r'`` to read an existing |
| 116 | file, ``'w'`` to truncate and write a new file, or ``'a'`` to append to an |
Ezio Melotti | faa6b7f | 2009-12-30 12:34:59 +0000 | [diff] [blame] | 117 | existing file. If *mode* is ``'a'`` and *file* refers to an existing ZIP |
| 118 | file, then additional files are added to it. If *file* does not refer to a |
| 119 | ZIP file, then a new ZIP archive is appended to the file. This is meant for |
| 120 | adding a ZIP archive to another file (such as :file:`python.exe`). If |
| 121 | *mode* is ``a`` and the file does not exist at all, it is created. |
| 122 | *compression* is the ZIP compression method to use when writing the archive, |
| 123 | and should be :const:`ZIP_STORED` or :const:`ZIP_DEFLATED`; unrecognized |
| 124 | values will cause :exc:`RuntimeError` to be raised. If :const:`ZIP_DEFLATED` |
| 125 | is specified but the :mod:`zlib` module is not available, :exc:`RuntimeError` |
| 126 | is also raised. The default is :const:`ZIP_STORED`. If *allowZip64* is |
| 127 | ``True`` zipfile will create ZIP files that use the ZIP64 extensions when |
| 128 | the zipfile is larger than 2 GB. If it is false (the default) :mod:`zipfile` |
| 129 | will raise an exception when the ZIP file would require ZIP64 extensions. |
| 130 | ZIP64 extensions are disabled by default because the default :program:`zip` |
| 131 | and :program:`unzip` commands on Unix (the InfoZIP utilities) don't support |
| 132 | these extensions. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 133 | |
Georg Brandl | 268e4d4 | 2010-10-14 06:59:45 +0000 | [diff] [blame] | 134 | If the file is created with mode ``'a'`` or ``'w'`` and then |
| 135 | :meth:`close`\ d without adding any files to the archive, the appropriate |
| 136 | ZIP structures for an empty archive will be written to the file. |
| 137 | |
Ezio Melotti | faa6b7f | 2009-12-30 12:34:59 +0000 | [diff] [blame] | 138 | ZipFile is also a context manager and therefore supports the |
| 139 | :keyword:`with` statement. In the example, *myzip* is closed after the |
| 140 | :keyword:`with` statement's suite is finished---even if an exception occurs:: |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 141 | |
Ezio Melotti | faa6b7f | 2009-12-30 12:34:59 +0000 | [diff] [blame] | 142 | with ZipFile('spam.zip', 'w') as myzip: |
| 143 | myzip.write('eggs.txt') |
| 144 | |
| 145 | .. versionadded:: 3.2 |
| 146 | Added the ability to use :class:`ZipFile` as a context manager. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 147 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 148 | |
| 149 | .. method:: ZipFile.close() |
| 150 | |
| 151 | Close the archive file. You must call :meth:`close` before exiting your program |
| 152 | or essential records will not be written. |
| 153 | |
| 154 | |
| 155 | .. method:: ZipFile.getinfo(name) |
| 156 | |
| 157 | Return a :class:`ZipInfo` object with information about the archive member |
| 158 | *name*. Calling :meth:`getinfo` for a name not currently contained in the |
| 159 | archive will raise a :exc:`KeyError`. |
| 160 | |
| 161 | |
| 162 | .. method:: ZipFile.infolist() |
| 163 | |
| 164 | Return a list containing a :class:`ZipInfo` object for each member of the |
| 165 | archive. The objects are in the same order as their entries in the actual ZIP |
| 166 | file on disk if an existing archive was opened. |
| 167 | |
| 168 | |
| 169 | .. method:: ZipFile.namelist() |
| 170 | |
| 171 | Return a list of archive members by name. |
| 172 | |
| 173 | |
Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 174 | .. method:: ZipFile.open(name, mode='r', pwd=None) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 175 | |
| 176 | Extract a member from the archive as a file-like object (ZipExtFile). *name* is |
Georg Brandl | b533e26 | 2008-05-25 18:19:30 +0000 | [diff] [blame] | 177 | the name of the file in the archive, or a :class:`ZipInfo` object. The *mode* |
| 178 | parameter, if included, must be one of the following: ``'r'`` (the default), |
| 179 | ``'U'``, or ``'rU'``. Choosing ``'U'`` or ``'rU'`` will enable universal newline |
| 180 | support in the read-only object. *pwd* is the password used for encrypted files. |
| 181 | Calling :meth:`open` on a closed ZipFile will raise a :exc:`RuntimeError`. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 182 | |
| 183 | .. note:: |
| 184 | |
| 185 | The file-like object is read-only and provides the following methods: |
Georg Brandl | 8f358aa | 2010-11-30 07:43:28 +0000 | [diff] [blame] | 186 | :meth:`!read`, :meth:`!readline`, :meth:`!readlines`, :meth:`!__iter__`, |
| 187 | :meth:`!__next__`. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 188 | |
| 189 | .. note:: |
| 190 | |
| 191 | If the ZipFile was created by passing in a file-like object as the first |
Guido van Rossum | da27fd2 | 2007-08-17 00:24:54 +0000 | [diff] [blame] | 192 | argument to the constructor, then the object returned by :meth:`.open` shares the |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 193 | ZipFile's file pointer. Under these circumstances, the object returned by |
Guido van Rossum | da27fd2 | 2007-08-17 00:24:54 +0000 | [diff] [blame] | 194 | :meth:`.open` should not be used after any additional operations are performed |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 195 | on the ZipFile object. If the ZipFile was created by passing in a string (the |
Guido van Rossum | da27fd2 | 2007-08-17 00:24:54 +0000 | [diff] [blame] | 196 | filename) as the first argument to the constructor, then :meth:`.open` will |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 197 | create a new file object that will be held by the ZipExtFile, allowing it to |
| 198 | operate independently of the ZipFile. |
| 199 | |
Georg Brandl | b533e26 | 2008-05-25 18:19:30 +0000 | [diff] [blame] | 200 | .. note:: |
| 201 | |
| 202 | The :meth:`open`, :meth:`read` and :meth:`extract` methods can take a filename |
| 203 | or a :class:`ZipInfo` object. You will appreciate this when trying to read a |
| 204 | ZIP file that contains members with duplicate names. |
| 205 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 206 | |
Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 207 | .. method:: ZipFile.extract(member, path=None, pwd=None) |
Christian Heimes | 790c823 | 2008-01-07 21:14:23 +0000 | [diff] [blame] | 208 | |
Georg Brandl | b533e26 | 2008-05-25 18:19:30 +0000 | [diff] [blame] | 209 | Extract a member from the archive to the current working directory; *member* |
| 210 | must be its full name or a :class:`ZipInfo` object). Its file information is |
| 211 | extracted as accurately as possible. *path* specifies a different directory |
| 212 | to extract to. *member* can be a filename or a :class:`ZipInfo` object. |
| 213 | *pwd* is the password used for encrypted files. |
Christian Heimes | 790c823 | 2008-01-07 21:14:23 +0000 | [diff] [blame] | 214 | |
Christian Heimes | 790c823 | 2008-01-07 21:14:23 +0000 | [diff] [blame] | 215 | |
Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 216 | .. method:: ZipFile.extractall(path=None, members=None, pwd=None) |
Christian Heimes | 790c823 | 2008-01-07 21:14:23 +0000 | [diff] [blame] | 217 | |
Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 218 | Extract all members from the archive to the current working directory. *path* |
Christian Heimes | 790c823 | 2008-01-07 21:14:23 +0000 | [diff] [blame] | 219 | specifies a different directory to extract to. *members* is optional and must |
| 220 | be a subset of the list returned by :meth:`namelist`. *pwd* is the password |
| 221 | used for encrypted files. |
| 222 | |
Benjamin Peterson | a0dfa82 | 2009-11-13 02:25:08 +0000 | [diff] [blame] | 223 | .. warning:: |
| 224 | |
| 225 | Never extract archives from untrusted sources without prior inspection. |
| 226 | It is possible that files are created outside of *path*, e.g. members |
| 227 | that have absolute filenames starting with ``"/"`` or filenames with two |
| 228 | dots ``".."``. |
| 229 | |
Christian Heimes | 790c823 | 2008-01-07 21:14:23 +0000 | [diff] [blame] | 230 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 231 | .. method:: ZipFile.printdir() |
| 232 | |
| 233 | Print a table of contents for the archive to ``sys.stdout``. |
| 234 | |
| 235 | |
| 236 | .. method:: ZipFile.setpassword(pwd) |
| 237 | |
| 238 | Set *pwd* as default password to extract encrypted files. |
| 239 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 240 | |
Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 241 | .. method:: ZipFile.read(name, pwd=None) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 242 | |
Georg Brandl | b533e26 | 2008-05-25 18:19:30 +0000 | [diff] [blame] | 243 | Return the bytes of the file *name* in the archive. *name* is the name of the |
| 244 | file in the archive, or a :class:`ZipInfo` object. The archive must be open for |
| 245 | read or append. *pwd* is the password used for encrypted files and, if specified, |
| 246 | it will override the default password set with :meth:`setpassword`. Calling |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 247 | :meth:`read` on a closed ZipFile will raise a :exc:`RuntimeError`. |
| 248 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 249 | |
| 250 | .. method:: ZipFile.testzip() |
| 251 | |
| 252 | Read all the files in the archive and check their CRC's and file headers. |
| 253 | Return the name of the first bad file, or else return ``None``. Calling |
| 254 | :meth:`testzip` on a closed ZipFile will raise a :exc:`RuntimeError`. |
| 255 | |
| 256 | |
Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 257 | .. method:: ZipFile.write(filename, arcname=None, compress_type=None) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 258 | |
| 259 | Write the file named *filename* to the archive, giving it the archive name |
| 260 | *arcname* (by default, this will be the same as *filename*, but without a drive |
| 261 | letter and with leading path separators removed). If given, *compress_type* |
| 262 | overrides the value given for the *compression* parameter to the constructor for |
| 263 | the new entry. The archive must be open with mode ``'w'`` or ``'a'`` -- calling |
| 264 | :meth:`write` on a ZipFile created with mode ``'r'`` will raise a |
| 265 | :exc:`RuntimeError`. Calling :meth:`write` on a closed ZipFile will raise a |
| 266 | :exc:`RuntimeError`. |
| 267 | |
| 268 | .. note:: |
| 269 | |
| 270 | There is no official file name encoding for ZIP files. If you have unicode file |
Thomas Wouters | 47b49bf | 2007-08-30 22:15:33 +0000 | [diff] [blame] | 271 | names, you must convert them to byte strings in your desired encoding before |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 272 | passing them to :meth:`write`. WinZip interprets all file names as encoded in |
| 273 | CP437, also known as DOS Latin. |
| 274 | |
| 275 | .. note:: |
| 276 | |
| 277 | Archive names should be relative to the archive root, that is, they should not |
| 278 | start with a path separator. |
| 279 | |
| 280 | .. note:: |
| 281 | |
| 282 | If ``arcname`` (or ``filename``, if ``arcname`` is not given) contains a null |
| 283 | byte, the name of the file in the archive will be truncated at the null byte. |
| 284 | |
| 285 | |
Ronald Oussoren | ee5c885 | 2010-02-07 20:24:02 +0000 | [diff] [blame] | 286 | .. method:: ZipFile.writestr(zinfo_or_arcname, bytes[, compress_type]) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 287 | |
| 288 | Write the string *bytes* to the archive; *zinfo_or_arcname* is either the file |
| 289 | name it will be given in the archive, or a :class:`ZipInfo` instance. If it's |
| 290 | an instance, at least the filename, date, and time must be given. If it's a |
| 291 | name, the date and time is set to the current date and time. The archive must be |
| 292 | opened with mode ``'w'`` or ``'a'`` -- calling :meth:`writestr` on a ZipFile |
| 293 | created with mode ``'r'`` will raise a :exc:`RuntimeError`. Calling |
| 294 | :meth:`writestr` on a closed ZipFile will raise a :exc:`RuntimeError`. |
| 295 | |
Ronald Oussoren | ee5c885 | 2010-02-07 20:24:02 +0000 | [diff] [blame] | 296 | If given, *compress_type* overrides the value given for the *compression* |
| 297 | parameter to the constructor for the new entry, or in the *zinfo_or_arcname* |
| 298 | (if that is a :class:`ZipInfo` instance). |
| 299 | |
Christian Heimes | 790c823 | 2008-01-07 21:14:23 +0000 | [diff] [blame] | 300 | .. note:: |
| 301 | |
Éric Araujo | 0d4bcf4 | 2010-12-26 17:53:27 +0000 | [diff] [blame] | 302 | When passing a :class:`ZipInfo` instance as the *zinfo_or_arcname* parameter, |
Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 303 | the compression method used will be that specified in the *compress_type* |
| 304 | member of the given :class:`ZipInfo` instance. By default, the |
Christian Heimes | 790c823 | 2008-01-07 21:14:23 +0000 | [diff] [blame] | 305 | :class:`ZipInfo` constructor sets this member to :const:`ZIP_STORED`. |
| 306 | |
Ezio Melotti | f8754a6 | 2010-03-21 07:16:43 +0000 | [diff] [blame] | 307 | .. versionchanged:: 3.2 |
Ronald Oussoren | ee5c885 | 2010-02-07 20:24:02 +0000 | [diff] [blame] | 308 | The *compression_type* argument. |
| 309 | |
Martin v. Löwis | b09b844 | 2008-07-03 14:13:42 +0000 | [diff] [blame] | 310 | The following data attributes are also available: |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 311 | |
| 312 | |
| 313 | .. attribute:: ZipFile.debug |
| 314 | |
| 315 | The level of debug output to use. This may be set from ``0`` (the default, no |
| 316 | output) to ``3`` (the most output). Debugging information is written to |
| 317 | ``sys.stdout``. |
| 318 | |
Martin v. Löwis | b09b844 | 2008-07-03 14:13:42 +0000 | [diff] [blame] | 319 | .. attribute:: ZipFile.comment |
| 320 | |
Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 321 | The comment text associated with the ZIP file. If assigning a comment to a |
| 322 | :class:`ZipFile` instance created with mode 'a' or 'w', this should be a |
| 323 | string no longer than 65535 bytes. Comments longer than this will be |
Martin v. Löwis | b09b844 | 2008-07-03 14:13:42 +0000 | [diff] [blame] | 324 | truncated in the written archive when :meth:`ZipFile.close` is called. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 325 | |
Georg Brandl | 8334fd9 | 2010-12-04 10:26:46 +0000 | [diff] [blame] | 326 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 327 | .. _pyzipfile-objects: |
| 328 | |
| 329 | PyZipFile Objects |
| 330 | ----------------- |
| 331 | |
| 332 | The :class:`PyZipFile` constructor takes the same parameters as the |
Georg Brandl | 8334fd9 | 2010-12-04 10:26:46 +0000 | [diff] [blame] | 333 | :class:`ZipFile` constructor, and one additional parameter, *optimize*. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 334 | |
Georg Brandl | 8334fd9 | 2010-12-04 10:26:46 +0000 | [diff] [blame] | 335 | .. class:: PyZipFile(file, mode='r', compression=ZIP_STORED, allowZip64=False, \ |
| 336 | optimize=-1) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 337 | |
Georg Brandl | 8334fd9 | 2010-12-04 10:26:46 +0000 | [diff] [blame] | 338 | .. versionadded:: 3.2 |
| 339 | The *optimize* parameter. |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 340 | |
Georg Brandl | 8334fd9 | 2010-12-04 10:26:46 +0000 | [diff] [blame] | 341 | Instances have one method in addition to those of :class:`ZipFile` objects: |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 342 | |
Georg Brandl | 8334fd9 | 2010-12-04 10:26:46 +0000 | [diff] [blame] | 343 | .. method:: PyZipFile.writepy(pathname, basename='') |
| 344 | |
| 345 | Search for files :file:`\*.py` and add the corresponding file to the |
| 346 | archive. |
| 347 | |
| 348 | If the *optimize* parameter to :class:`PyZipFile` was not given or ``-1``, |
| 349 | the corresponding file is a :file:`\*.pyo` file if available, else a |
| 350 | :file:`\*.pyc` file, compiling if necessary. |
| 351 | |
| 352 | If the *optimize* parameter to :class:`PyZipFile` was ``0``, ``1`` or |
| 353 | ``2``, only files with that optimization level (see :func:`compile`) are |
| 354 | added to the archive, compiling if necessary. |
| 355 | |
| 356 | If the pathname is a file, the filename must end with :file:`.py`, and |
| 357 | just the (corresponding :file:`\*.py[co]`) file is added at the top level |
| 358 | (no path information). If the pathname is a file that does not end with |
| 359 | :file:`.py`, a :exc:`RuntimeError` will be raised. If it is a directory, |
| 360 | and the directory is not a package directory, then all the files |
| 361 | :file:`\*.py[co]` are added at the top level. If the directory is a |
| 362 | package directory, then all :file:`\*.py[co]` are added under the package |
| 363 | name as a file path, and if any subdirectories are package directories, |
| 364 | all of these are added recursively. *basename* is intended for internal |
| 365 | use only. The :meth:`writepy` method makes archives with file names like |
| 366 | this:: |
| 367 | |
| 368 | string.pyc # Top level name |
| 369 | test/__init__.pyc # Package directory |
| 370 | test/testall.pyc # Module test.testall |
| 371 | test/bogus/__init__.pyc # Subpackage directory |
| 372 | test/bogus/myfile.pyc # Submodule test.bogus.myfile |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 373 | |
| 374 | |
| 375 | .. _zipinfo-objects: |
| 376 | |
| 377 | ZipInfo Objects |
| 378 | --------------- |
| 379 | |
| 380 | Instances of the :class:`ZipInfo` class are returned by the :meth:`getinfo` and |
| 381 | :meth:`infolist` methods of :class:`ZipFile` objects. Each object stores |
| 382 | information about a single member of the ZIP archive. |
| 383 | |
| 384 | Instances have the following attributes: |
| 385 | |
| 386 | |
| 387 | .. attribute:: ZipInfo.filename |
| 388 | |
| 389 | Name of the file in the archive. |
| 390 | |
| 391 | |
| 392 | .. attribute:: ZipInfo.date_time |
| 393 | |
| 394 | The time and date of the last modification to the archive member. This is a |
| 395 | tuple of six values: |
| 396 | |
| 397 | +-------+--------------------------+ |
| 398 | | Index | Value | |
| 399 | +=======+==========================+ |
| 400 | | ``0`` | Year | |
| 401 | +-------+--------------------------+ |
| 402 | | ``1`` | Month (one-based) | |
| 403 | +-------+--------------------------+ |
| 404 | | ``2`` | Day of month (one-based) | |
| 405 | +-------+--------------------------+ |
| 406 | | ``3`` | Hours (zero-based) | |
| 407 | +-------+--------------------------+ |
| 408 | | ``4`` | Minutes (zero-based) | |
| 409 | +-------+--------------------------+ |
| 410 | | ``5`` | Seconds (zero-based) | |
| 411 | +-------+--------------------------+ |
| 412 | |
| 413 | |
| 414 | .. attribute:: ZipInfo.compress_type |
| 415 | |
| 416 | Type of compression for the archive member. |
| 417 | |
| 418 | |
| 419 | .. attribute:: ZipInfo.comment |
| 420 | |
| 421 | Comment for the individual archive member. |
| 422 | |
| 423 | |
| 424 | .. attribute:: ZipInfo.extra |
| 425 | |
| 426 | Expansion field data. The `PKZIP Application Note |
Christian Heimes | dd15f6c | 2008-03-16 00:07:10 +0000 | [diff] [blame] | 427 | <http://www.pkware.com/documents/casestudies/APPNOTE.TXT>`_ contains |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 428 | some comments on the internal structure of the data contained in this string. |
| 429 | |
| 430 | |
| 431 | .. attribute:: ZipInfo.create_system |
| 432 | |
| 433 | System which created ZIP archive. |
| 434 | |
| 435 | |
| 436 | .. attribute:: ZipInfo.create_version |
| 437 | |
| 438 | PKZIP version which created ZIP archive. |
| 439 | |
| 440 | |
| 441 | .. attribute:: ZipInfo.extract_version |
| 442 | |
| 443 | PKZIP version needed to extract archive. |
| 444 | |
| 445 | |
| 446 | .. attribute:: ZipInfo.reserved |
| 447 | |
| 448 | Must be zero. |
| 449 | |
| 450 | |
| 451 | .. attribute:: ZipInfo.flag_bits |
| 452 | |
| 453 | ZIP flag bits. |
| 454 | |
| 455 | |
| 456 | .. attribute:: ZipInfo.volume |
| 457 | |
| 458 | Volume number of file header. |
| 459 | |
| 460 | |
| 461 | .. attribute:: ZipInfo.internal_attr |
| 462 | |
| 463 | Internal attributes. |
| 464 | |
| 465 | |
| 466 | .. attribute:: ZipInfo.external_attr |
| 467 | |
| 468 | External file attributes. |
| 469 | |
| 470 | |
| 471 | .. attribute:: ZipInfo.header_offset |
| 472 | |
| 473 | Byte offset to the file header. |
| 474 | |
| 475 | |
| 476 | .. attribute:: ZipInfo.CRC |
| 477 | |
| 478 | CRC-32 of the uncompressed file. |
| 479 | |
| 480 | |
| 481 | .. attribute:: ZipInfo.compress_size |
| 482 | |
| 483 | Size of the compressed data. |
| 484 | |
| 485 | |
| 486 | .. attribute:: ZipInfo.file_size |
| 487 | |
| 488 | Size of the uncompressed file. |
| 489 | |