blob: a1457d070b02ce1ddde257d3fe545b1c1985e9d7 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`shutil` --- High-level file operations
2============================================
3
4.. module:: shutil
5 :synopsis: High-level file operations, including copying.
6.. sectionauthor:: Fred L. Drake, Jr. <fdrake@acm.org>
Christian Heimes5b5e81c2007-12-31 16:14:33 +00007.. partly based on the docstrings
Georg Brandl116aa622007-08-15 14:28:22 +00008
9.. index::
10 single: file; copying
11 single: copying files
12
Raymond Hettinger10480942011-01-10 03:26:08 +000013**Source code:** :source:`Lib/shutil.py`
14
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000015--------------
16
Georg Brandl116aa622007-08-15 14:28:22 +000017The :mod:`shutil` module offers a number of high-level operations on files and
18collections of files. In particular, functions are provided which support file
Guido van Rossum2cc30da2007-11-02 23:46:40 +000019copying and removal. For operations on individual files, see also the
20:mod:`os` module.
Georg Brandl116aa622007-08-15 14:28:22 +000021
Guido van Rossumda27fd22007-08-17 00:24:54 +000022.. warning::
Christian Heimes7f044312008-01-06 17:05:40 +000023
Senthil Kumaran7f728c12012-02-13 23:30:47 +080024 Even the higher-level file copying functions (:func:`shutil.copy`,
25 :func:`shutil.copy2`) cannot copy all file metadata.
Georg Brandl48310cd2009-01-03 21:18:54 +000026
Christian Heimes7f044312008-01-06 17:05:40 +000027 On POSIX platforms, this means that file owner and group are lost as well
Georg Brandlc575c902008-09-13 17:46:05 +000028 as ACLs. On Mac OS, the resource fork and other metadata are not used.
Christian Heimes7f044312008-01-06 17:05:40 +000029 This means that resources will be lost and file type and creator codes will
30 not be correct. On Windows, file owners, ACLs and alternate data streams
31 are not copied.
Georg Brandl116aa622007-08-15 14:28:22 +000032
Éric Araujo6e6cb8e2010-11-16 19:13:50 +000033
Éric Araujof2fbb9c2012-01-16 16:55:55 +010034.. _file-operations:
35
Tarek Ziadé396fad72010-02-23 05:30:31 +000036Directory and files operations
37------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +000038
Georg Brandl116aa622007-08-15 14:28:22 +000039.. function:: copyfileobj(fsrc, fdst[, length])
40
41 Copy the contents of the file-like object *fsrc* to the file-like object *fdst*.
42 The integer *length*, if given, is the buffer size. In particular, a negative
43 *length* value means to copy the data without looping over the source data in
44 chunks; by default the data is read in chunks to avoid uncontrolled memory
45 consumption. Note that if the current file position of the *fsrc* object is not
46 0, only the contents from the current file position to the end of the file will
47 be copied.
48
49
Larry Hastingsb4038062012-07-15 10:57:38 -070050.. function:: copyfile(src, dst, *, follow_symlinks=True)
Christian Heimesa342c012008-04-20 21:01:16 +000051
Senthil Kumaran7f728c12012-02-13 23:30:47 +080052 Copy the contents (no metadata) of the file named *src* to a file named
Larry Hastings60eba572012-09-21 10:12:14 -070053 *dst* and return *dst*. *src* and *dst* are path names given as strings.
54 *dst* must be the complete target file name; look at :func:`shutil.copy`
55 for a copy that accepts a target directory path. If *src* and *dst*
56 specify the same file, :exc:`Error` is raised.
Senthil Kumaran1fd64822012-02-13 23:35:44 +080057
Larry Hastings60eba572012-09-21 10:12:14 -070058 The destination location must be writable; otherwise, an :exc:`OSError`
59 exception will be raised. If *dst* already exists, it will be replaced.
60 Special files such as character or block devices and pipes cannot be
61 copied with this function.
Christian Heimesa342c012008-04-20 21:01:16 +000062
Larry Hastings7aa2c8b2012-07-15 16:58:29 -070063 If *follow_symlinks* is false and *src* is a symbolic link,
64 a new symbolic link will be created instead of copying the
65 file *src* points to.
Antoine Pitrou78091e62011-12-29 18:54:15 +010066
Antoine Pitrou62ab10a02011-10-12 20:10:51 +020067 .. versionchanged:: 3.3
68 :exc:`IOError` used to be raised instead of :exc:`OSError`.
Larry Hastings7aa2c8b2012-07-15 16:58:29 -070069 Added *follow_symlinks* argument.
70 Now returns *dst*.
Antoine Pitrou62ab10a02011-10-12 20:10:51 +020071
Larry Hastings7aa2c8b2012-07-15 16:58:29 -070072.. function:: copymode(src, dst, *, follow_symlinks=True)
Georg Brandl116aa622007-08-15 14:28:22 +000073
74 Copy the permission bits from *src* to *dst*. The file contents, owner, and
Larry Hastings60eba572012-09-21 10:12:14 -070075 group are unaffected. *src* and *dst* are path names given as strings.
76 If *follow_symlinks* is false, and both *src* and *dst* are symbolic links,
77 :func:`copymode` will attempt to modify the mode of *dst* itself (rather
78 than the file it points to). This functionality is not available on every
79 platform; please see :func:`copystat` for more information. If
80 :func:`copymode` cannot modify symbolic links on the local platform, and it
81 is asked to do so, it will do nothing and return.
Georg Brandl116aa622007-08-15 14:28:22 +000082
Antoine Pitrou78091e62011-12-29 18:54:15 +010083 .. versionchanged:: 3.3
Larry Hastings7aa2c8b2012-07-15 16:58:29 -070084 Added *follow_symlinks* argument.
Georg Brandl116aa622007-08-15 14:28:22 +000085
Larry Hastings7aa2c8b2012-07-15 16:58:29 -070086.. function:: copystat(src, dst, *, follow_symlinks=True)
Georg Brandl116aa622007-08-15 14:28:22 +000087
Larry Hastings60eba572012-09-21 10:12:14 -070088 Copy the permission bits, last access time, last modification time, and
89 flags from *src* to *dst*. On Linux, :func:`copystat` also copies the
90 "extended attributes" where possible. The file contents, owner, and
91 group are unaffected. *src* and *dst* are path names given as strings.
92
93 If *follow_symlinks* is false, and *src* and *dst* both
94 refer to symbolic links, :func:`copystat` will operate on
95 the symbolic links themselves rather than the files the
96 symbolic links refer to--reading the information from the
97 *src* symbolic link, and writing the information to the
98 *dst* symbolic link.
99
100 .. note::
101
102 Not all platforms provide the ability to examine and
103 modify symbolic links. Python itself can tell you what
104 functionality is locally available.
105
106 * If ``os.chmod in os.supports_follow_symlinks`` is
107 ``True``, :func:`copystat` can modify the permission
108 bits of a symbolic link.
109
110 * If ``os.utime in os.supports_follow_symlinks`` is
111 ``True``, :func:`copystat` can modify the last access
112 and modification times of a symbolic link.
113
114 * If ``os.chflags in os.supports_follow_symlinks`` is
115 ``True``, :func:`copystat` can modify the flags of
116 a symbolic link. (``os.chflags`` is not available on
117 all platforms.)
118
119 On platforms where some or all of this functionality
120 is unavailable, when asked to modify a symbolic link,
121 :func:`copystat` will copy everything it can.
122 :func:`copystat` never returns failure.
123
124 Please see :data:`os.supports_follow_symlinks`
125 for more information.
Georg Brandl116aa622007-08-15 14:28:22 +0000126
Antoine Pitrou78091e62011-12-29 18:54:15 +0100127 .. versionchanged:: 3.3
Larry Hastings7aa2c8b2012-07-15 16:58:29 -0700128 Added *follow_symlinks* argument and support for Linux extended attributes.
Georg Brandl116aa622007-08-15 14:28:22 +0000129
Larry Hastings7aa2c8b2012-07-15 16:58:29 -0700130.. function:: copy(src, dst, *, follow_symlinks=True)
Georg Brandl116aa622007-08-15 14:28:22 +0000131
Larry Hastings60eba572012-09-21 10:12:14 -0700132 Copies the file *src* to the file or directory *dst*. *src* and *dst*
133 should be strings. If *dst* specifies a directory, the file will be
134 copied into *dst* using the base filename from *src*. Returns the
135 path to the newly created file.
136
137 If *follow_symlinks* is false, and *src* is a symbolic link,
138 *dst* will be created as a symbolic link. If *follow_symlinks*
139 is true and *src* is a symbolic link, *dst* will be a copy of
140 the file *src* refers to.
141
142 :func:`copy` copies the file data and the file's permission
143 mode (see :func:`os.chmod`). Other metadata, like the
144 file's creation and modification times, is not preserved.
145 To preserve all file metadata from the original, use
146 :func:`~shutil.copy2` instead.
Georg Brandl116aa622007-08-15 14:28:22 +0000147
Antoine Pitrou78091e62011-12-29 18:54:15 +0100148 .. versionchanged:: 3.3
Larry Hastings7aa2c8b2012-07-15 16:58:29 -0700149 Added *follow_symlinks* argument.
Larry Hastings60eba572012-09-21 10:12:14 -0700150 Now returns path to the newly created file.
Georg Brandl116aa622007-08-15 14:28:22 +0000151
Larry Hastings7aa2c8b2012-07-15 16:58:29 -0700152.. function:: copy2(src, dst, *, follow_symlinks=True)
Georg Brandl116aa622007-08-15 14:28:22 +0000153
Larry Hastings60eba572012-09-21 10:12:14 -0700154 Identical to :func:`~shutil.copy` except that :func:`copy2`
155 also attempts to preserve all file metadata.
156
157 When *follow_symlinks* is false, and *src* is a symbolic
158 link, :func:`copy2` attempts to copy all metadata from the
159 *src* symbolic link to the newly-created *dst* symbolic link.
160 However, this functionality is not available on all platforms.
161 On platforms where some or all of this functionality is
162 unavailable, :func:`copy2` will preserve all the metadata
163 it can; :func:`copy2` never returns failure.
164
165 :func:`copy2` uses :func:`copystat` to copy the file metadata.
166 Please see :func:`copystat` for more information
167 about platform support for modifying symbolic link metadata.
Georg Brandl116aa622007-08-15 14:28:22 +0000168
Antoine Pitrou78091e62011-12-29 18:54:15 +0100169 .. versionchanged:: 3.3
Larry Hastings7aa2c8b2012-07-15 16:58:29 -0700170 Added *follow_symlinks* argument, try to copy extended
171 file system attributes too (currently Linux only).
Larry Hastings60eba572012-09-21 10:12:14 -0700172 Now returns path to the newly created file.
Brian Curtin066dacf2012-06-19 10:03:05 -0500173
Georg Brandl86b2fb92008-07-16 03:43:04 +0000174.. function:: ignore_patterns(\*patterns)
175
176 This factory function creates a function that can be used as a callable for
177 :func:`copytree`\'s *ignore* argument, ignoring files and directories that
178 match one of the glob-style *patterns* provided. See the example below.
179
180
Ezio Melotticb999a32010-04-20 11:26:51 +0000181.. function:: copytree(src, dst, symlinks=False, ignore=None, copy_function=copy2, ignore_dangling_symlinks=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000182
Brian Curtin0d0a1de2012-06-18 18:41:07 -0500183 Recursively copy an entire directory tree rooted at *src*, returning the
184 destination directory. The destination
Senthil Kumaran7f728c12012-02-13 23:30:47 +0800185 directory, named by *dst*, must not already exist; it will be created as
186 well as missing parent directories. Permissions and times of directories
187 are copied with :func:`copystat`, individual files are copied using
188 :func:`shutil.copy2`.
Georg Brandl116aa622007-08-15 14:28:22 +0000189
Georg Brandl86b2fb92008-07-16 03:43:04 +0000190 If *symlinks* is true, symbolic links in the source tree are represented as
Antoine Pitrou78091e62011-12-29 18:54:15 +0100191 symbolic links in the new tree and the metadata of the original links will
192 be copied as far as the platform allows; if false or omitted, the contents
193 and metadata of the linked files are copied to the new tree.
Georg Brandl86b2fb92008-07-16 03:43:04 +0000194
Tarek Ziadéfb437512010-04-20 08:57:33 +0000195 When *symlinks* is false, if the file pointed by the symlink doesn't
196 exist, a exception will be added in the list of errors raised in
197 a :exc:`Error` exception at the end of the copy process.
198 You can set the optional *ignore_dangling_symlinks* flag to true if you
Tarek Ziadé8c26c7d2010-04-23 13:03:50 +0000199 want to silence this exception. Notice that this option has no effect
200 on platforms that don't support :func:`os.symlink`.
Tarek Ziadéfb437512010-04-20 08:57:33 +0000201
Georg Brandl86b2fb92008-07-16 03:43:04 +0000202 If *ignore* is given, it must be a callable that will receive as its
203 arguments the directory being visited by :func:`copytree`, and a list of its
204 contents, as returned by :func:`os.listdir`. Since :func:`copytree` is
205 called recursively, the *ignore* callable will be called once for each
206 directory that is copied. The callable must return a sequence of directory
207 and file names relative to the current directory (i.e. a subset of the items
208 in its second argument); these names will then be ignored in the copy
209 process. :func:`ignore_patterns` can be used to create such a callable that
210 ignores names based on glob-style patterns.
211
212 If exception(s) occur, an :exc:`Error` is raised with a list of reasons.
213
Senthil Kumaran7f728c12012-02-13 23:30:47 +0800214 If *copy_function* is given, it must be a callable that will be used to copy
215 each file. It will be called with the source path and the destination path
216 as arguments. By default, :func:`shutil.copy2` is used, but any function
Senthil Kumaran1fd64822012-02-13 23:35:44 +0800217 that supports the same signature (like :func:`shutil.copy`) can be used.
Georg Brandl116aa622007-08-15 14:28:22 +0000218
Larry Hastings7aa2c8b2012-07-15 16:58:29 -0700219 .. versionchanged:: 3.3
220 Copy metadata when *symlinks* is false.
221 Now returns *dst*.
222
Tarek Ziadé5340db32010-04-19 22:30:51 +0000223 .. versionchanged:: 3.2
224 Added the *copy_function* argument to be able to provide a custom copy
225 function.
Tarek Ziadéfb437512010-04-20 08:57:33 +0000226 Added the *ignore_dangling_symlinks* argument to silent dangling symlinks
227 errors when *symlinks* is false.
228
Georg Brandl96acb732012-06-24 17:39:05 +0200229
Georg Brandl18244152009-09-02 20:34:52 +0000230.. function:: rmtree(path, ignore_errors=False, onerror=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000231
232 .. index:: single: directory; deleting
233
Christian Heimes9bd667a2008-01-20 15:14:11 +0000234 Delete an entire directory tree; *path* must point to a directory (but not a
235 symbolic link to a directory). If *ignore_errors* is true, errors resulting
236 from failed removals will be ignored; if false or omitted, such errors are
237 handled by calling a handler specified by *onerror* or, if that is omitted,
238 they raise an exception.
Georg Brandl116aa622007-08-15 14:28:22 +0000239
Nick Coghlan5b0eca12012-06-24 16:43:06 +1000240 .. note::
Hynek Schlawack67be92b2012-06-23 17:58:42 +0200241
Nick Coghlan5b0eca12012-06-24 16:43:06 +1000242 On platforms that support the necessary fd-based functions a symlink
Georg Brandl96acb732012-06-24 17:39:05 +0200243 attack resistant version of :func:`rmtree` is used by default. On other
244 platforms, the :func:`rmtree` implementation is susceptible to a symlink
245 attack: given proper timing and circumstances, attackers can manipulate
246 symlinks on the filesystem to delete files they wouldn't be able to access
247 otherwise. Applications can use the :data:`rmtree.avoids_symlink_attacks`
248 function attribute to determine which case applies.
Hynek Schlawack67be92b2012-06-23 17:58:42 +0200249
Christian Heimes9bd667a2008-01-20 15:14:11 +0000250 If *onerror* is provided, it must be a callable that accepts three
Hynek Schlawack67be92b2012-06-23 17:58:42 +0200251 parameters: *function*, *path*, and *excinfo*.
252
253 The first parameter, *function*, is the function which raised the exception;
254 it depends on the platform and implementation. The second parameter,
255 *path*, will be the path name passed to *function*. The third parameter,
256 *excinfo*, will be the exception information returned by
257 :func:`sys.exc_info`. Exceptions raised by *onerror* will not be caught.
258
259 .. versionchanged:: 3.3
Nick Coghlan5b0eca12012-06-24 16:43:06 +1000260 Added a symlink attack resistant version that is used automatically
261 if platform supports fd-based functions.
Christian Heimes9bd667a2008-01-20 15:14:11 +0000262
Éric Araujo544e13d2012-06-24 13:53:48 -0400263 .. attribute:: rmtree.avoids_symlink_attacks
Hynek Schlawack2100b422012-06-23 20:28:32 +0200264
Nick Coghlan5b0eca12012-06-24 16:43:06 +1000265 Indicates whether the current platform and implementation provides a
Georg Brandl96acb732012-06-24 17:39:05 +0200266 symlink attack resistant version of :func:`rmtree`. Currently this is
Nick Coghlan5b0eca12012-06-24 16:43:06 +1000267 only true for platforms supporting fd-based directory access functions.
Hynek Schlawack2100b422012-06-23 20:28:32 +0200268
Nick Coghlan5b0eca12012-06-24 16:43:06 +1000269 .. versionadded:: 3.3
Georg Brandl116aa622007-08-15 14:28:22 +0000270
Georg Brandl96acb732012-06-24 17:39:05 +0200271
Georg Brandl116aa622007-08-15 14:28:22 +0000272.. function:: move(src, dst)
273
Brian Curtin0d0a1de2012-06-18 18:41:07 -0500274 Recursively move a file or directory (*src*) to another location (*dst*)
275 and return the destination.
Georg Brandl116aa622007-08-15 14:28:22 +0000276
Éric Araujo14382dc2011-07-28 22:49:11 +0200277 If the destination is a directory or a symlink to a directory, then *src* is
278 moved inside that directory.
279
280 The destination directory must not already exist. If the destination already
281 exists but is not a directory, it may be overwritten depending on
282 :func:`os.rename` semantics.
283
284 If the destination is on the current filesystem, then :func:`os.rename` is
Senthil Kumaran7f728c12012-02-13 23:30:47 +0800285 used. Otherwise, *src* is copied (using :func:`shutil.copy2`) to *dst* and
Senthil Kumaran1fd64822012-02-13 23:35:44 +0800286 then removed. In case of symlinks, a new symlink pointing to the target of
287 *src* will be created in or as *dst* and *src* will be removed.
Antoine Pitrou0a08d7a2012-01-06 20:16:19 +0100288
289 .. versionchanged:: 3.3
290 Added explicit symlink handling for foreign filesystems, thus adapting
291 it to the behavior of GNU's :program:`mv`.
Larry Hastings7aa2c8b2012-07-15 16:58:29 -0700292 Now returns *dst*.
Brian Curtin066dacf2012-06-19 10:03:05 -0500293
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200294.. function:: disk_usage(path)
295
Éric Araujoe4d5b8e2011-08-08 16:51:11 +0200296 Return disk usage statistics about the given path as a :term:`named tuple`
297 with the attributes *total*, *used* and *free*, which are the amount of
298 total, used and free space, in bytes.
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200299
300 .. versionadded:: 3.3
301
302 Availability: Unix, Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000303
Sandro Tosid902a142011-08-22 23:28:27 +0200304.. function:: chown(path, user=None, group=None)
305
306 Change owner *user* and/or *group* of the given *path*.
307
308 *user* can be a system user name or a uid; the same applies to *group*. At
309 least one argument is required.
310
311 See also :func:`os.chown`, the underlying function.
312
313 Availability: Unix.
314
315 .. versionadded:: 3.3
316
Georg Brandl4a7e25f2012-06-24 17:37:07 +0200317
Brian Curtinc57a3452012-06-22 16:00:30 -0500318.. function:: which(cmd, mode=os.F_OK | os.X_OK, path=None)
319
Georg Brandl4a7e25f2012-06-24 17:37:07 +0200320 Return the path to an executable which would be run if the given *cmd* was
321 called. If no *cmd* would be called, return ``None``.
Brian Curtinc57a3452012-06-22 16:00:30 -0500322
323 *mode* is a permission mask passed a to :func:`os.access`, by default
324 determining if the file exists and executable.
325
Georg Brandl4a7e25f2012-06-24 17:37:07 +0200326 When no *path* is specified, the results of :func:`os.environ` are used,
327 returning either the "PATH" value or a fallback of :attr:`os.defpath`.
Brian Curtinc57a3452012-06-22 16:00:30 -0500328
Georg Brandl4a7e25f2012-06-24 17:37:07 +0200329 On Windows, the current directory is always prepended to the *path* whether
330 or not you use the default or provide your own, which is the behavior the
331 command shell uses when finding executables. Additionaly, when finding the
332 *cmd* in the *path*, the ``PATHEXT`` environment variable is checked. For
333 example, if you call ``shutil.which("python")``, :func:`which` will search
334 ``PATHEXT`` to know that it should look for ``python.exe`` within the *path*
335 directories. For example, on Windows::
Brian Curtinc57a3452012-06-22 16:00:30 -0500336
Georg Brandl4a7e25f2012-06-24 17:37:07 +0200337 >>> shutil.which("python")
Serhiy Storchaka80c88f42013-01-22 10:31:36 +0200338 'C:\\Python33\\python.EXE'
Brian Curtinc57a3452012-06-22 16:00:30 -0500339
340 .. versionadded:: 3.3
Sandro Tosid902a142011-08-22 23:28:27 +0200341
Georg Brandl4a7e25f2012-06-24 17:37:07 +0200342
Georg Brandl116aa622007-08-15 14:28:22 +0000343.. exception:: Error
344
Éric Araujo14382dc2011-07-28 22:49:11 +0200345 This exception collects exceptions that are raised during a multi-file
346 operation. For :func:`copytree`, the exception argument is a list of 3-tuples
347 (*srcname*, *dstname*, *exception*).
Georg Brandl116aa622007-08-15 14:28:22 +0000348
Georg Brandl116aa622007-08-15 14:28:22 +0000349
Éric Araujof2fbb9c2012-01-16 16:55:55 +0100350.. _shutil-copytree-example:
Georg Brandl116aa622007-08-15 14:28:22 +0000351
Tarek Ziadé396fad72010-02-23 05:30:31 +0000352copytree example
Georg Brandl03b9ad02012-06-24 18:09:40 +0200353~~~~~~~~~~~~~~~~
Georg Brandl116aa622007-08-15 14:28:22 +0000354
355This example is the implementation of the :func:`copytree` function, described
356above, with the docstring omitted. It demonstrates many of the other functions
357provided by this module. ::
358
359 def copytree(src, dst, symlinks=False):
360 names = os.listdir(src)
361 os.makedirs(dst)
362 errors = []
363 for name in names:
364 srcname = os.path.join(src, name)
365 dstname = os.path.join(dst, name)
366 try:
367 if symlinks and os.path.islink(srcname):
368 linkto = os.readlink(srcname)
369 os.symlink(linkto, dstname)
370 elif os.path.isdir(srcname):
371 copytree(srcname, dstname, symlinks)
372 else:
373 copy2(srcname, dstname)
374 # XXX What about devices, sockets etc.?
Andrew Svetlov618c2e12012-12-15 22:59:24 +0200375 except OSError as why:
Georg Brandl116aa622007-08-15 14:28:22 +0000376 errors.append((srcname, dstname, str(why)))
377 # catch the Error from the recursive copytree so that we can
378 # continue with other files
379 except Error as err:
380 errors.extend(err.args[0])
381 try:
382 copystat(src, dst)
383 except WindowsError:
384 # can't copy file access times on Windows
385 pass
386 except OSError as why:
387 errors.extend((src, dst, str(why)))
388 if errors:
Collin Winterc79461b2007-09-01 23:34:30 +0000389 raise Error(errors)
Georg Brandl116aa622007-08-15 14:28:22 +0000390
Tarek Ziadé396fad72010-02-23 05:30:31 +0000391Another example that uses the :func:`ignore_patterns` helper::
392
393 from shutil import copytree, ignore_patterns
394
395 copytree(source, destination, ignore=ignore_patterns('*.pyc', 'tmp*'))
396
397This will copy everything except ``.pyc`` files and files or directories whose
398name starts with ``tmp``.
399
400Another example that uses the *ignore* argument to add a logging call::
401
402 from shutil import copytree
403 import logging
404
405 def _logpath(path, names):
406 logging.info('Working in %s' % path)
407 return [] # nothing will be ignored
408
409 copytree(source, destination, ignore=_logpath)
410
411
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000412.. _archiving-operations:
413
414Archiving operations
415--------------------
Tarek Ziadé396fad72010-02-23 05:30:31 +0000416
Georg Brandl03b9ad02012-06-24 18:09:40 +0200417.. versionadded:: 3.2
418
Éric Araujof2fbb9c2012-01-16 16:55:55 +0100419High-level utilities to create and read compressed and archived files are also
420provided. They rely on the :mod:`zipfile` and :mod:`tarfile` modules.
421
Tarek Ziadé396fad72010-02-23 05:30:31 +0000422.. function:: make_archive(base_name, format, [root_dir, [base_dir, [verbose, [dry_run, [owner, [group, [logger]]]]]]])
423
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000424 Create an archive file (such as zip or tar) and return its name.
Tarek Ziadé396fad72010-02-23 05:30:31 +0000425
426 *base_name* is the name of the file to create, including the path, minus
427 any format-specific extension. *format* is the archive format: one of
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000428 "zip", "tar", "bztar" (if the :mod:`bz2` module is available) or "gztar".
Tarek Ziadé396fad72010-02-23 05:30:31 +0000429
430 *root_dir* is a directory that will be the root directory of the
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000431 archive; for example, we typically chdir into *root_dir* before creating the
Tarek Ziadé396fad72010-02-23 05:30:31 +0000432 archive.
433
434 *base_dir* is the directory where we start archiving from;
Ezio Melotticb999a32010-04-20 11:26:51 +0000435 i.e. *base_dir* will be the common prefix of all files and
Tarek Ziadé396fad72010-02-23 05:30:31 +0000436 directories in the archive.
437
438 *root_dir* and *base_dir* both default to the current directory.
439
440 *owner* and *group* are used when creating a tar archive. By default,
441 uses the current owner and group.
442
Éric Araujo06c42a32011-11-07 17:31:07 +0100443 *logger* must be an object compatible with :pep:`282`, usually an instance of
444 :class:`logging.Logger`.
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000445
Tarek Ziadé396fad72010-02-23 05:30:31 +0000446
447.. function:: get_archive_formats()
448
Éric Araujo14382dc2011-07-28 22:49:11 +0200449 Return a list of supported formats for archiving.
Tarek Ziadé396fad72010-02-23 05:30:31 +0000450 Each element of the returned sequence is a tuple ``(name, description)``
451
452 By default :mod:`shutil` provides these formats:
453
454 - *gztar*: gzip'ed tar-file
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000455 - *bztar*: bzip2'ed tar-file (if the :mod:`bz2` module is available.)
Tarek Ziadé396fad72010-02-23 05:30:31 +0000456 - *tar*: uncompressed tar file
457 - *zip*: ZIP file
458
459 You can register new formats or provide your own archiver for any existing
460 formats, by using :func:`register_archive_format`.
461
Tarek Ziadé396fad72010-02-23 05:30:31 +0000462
463.. function:: register_archive_format(name, function, [extra_args, [description]])
464
Éric Araujo14382dc2011-07-28 22:49:11 +0200465 Register an archiver for the format *name*. *function* is a callable that
Tarek Ziadé396fad72010-02-23 05:30:31 +0000466 will be used to invoke the archiver.
467
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000468 If given, *extra_args* is a sequence of ``(name, value)`` pairs that will be
Tarek Ziadé396fad72010-02-23 05:30:31 +0000469 used as extra keywords arguments when the archiver callable is used.
470
471 *description* is used by :func:`get_archive_formats` which returns the
472 list of archivers. Defaults to an empty list.
473
Tarek Ziadé396fad72010-02-23 05:30:31 +0000474
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000475.. function:: unregister_archive_format(name)
Tarek Ziadé396fad72010-02-23 05:30:31 +0000476
477 Remove the archive format *name* from the list of supported formats.
478
Tarek Ziadé396fad72010-02-23 05:30:31 +0000479
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000480.. function:: unpack_archive(filename[, extract_dir[, format]])
481
482 Unpack an archive. *filename* is the full path of the archive.
483
484 *extract_dir* is the name of the target directory where the archive is
485 unpacked. If not provided, the current working directory is used.
486
487 *format* is the archive format: one of "zip", "tar", or "gztar". Or any
488 other format registered with :func:`register_unpack_format`. If not
489 provided, :func:`unpack_archive` will use the archive file name extension
490 and see if an unpacker was registered for that extension. In case none is
491 found, a :exc:`ValueError` is raised.
492
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000493
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000494.. function:: register_unpack_format(name, extensions, function[, extra_args[, description]])
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000495
496 Registers an unpack format. *name* is the name of the format and
497 *extensions* is a list of extensions corresponding to the format, like
498 ``.zip`` for Zip files.
499
500 *function* is the callable that will be used to unpack archives. The
501 callable will receive the path of the archive, followed by the directory
502 the archive must be extracted to.
503
504 When provided, *extra_args* is a sequence of ``(name, value)`` tuples that
505 will be passed as keywords arguments to the callable.
506
507 *description* can be provided to describe the format, and will be returned
508 by the :func:`get_unpack_formats` function.
509
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000510
511.. function:: unregister_unpack_format(name)
512
513 Unregister an unpack format. *name* is the name of the format.
514
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000515
516.. function:: get_unpack_formats()
517
518 Return a list of all registered formats for unpacking.
519 Each element of the returned sequence is a tuple
520 ``(name, extensions, description)``.
521
522 By default :mod:`shutil` provides these formats:
523
524 - *gztar*: gzip'ed tar-file
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000525 - *bztar*: bzip2'ed tar-file (if the :mod:`bz2` module is available.)
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000526 - *tar*: uncompressed tar file
527 - *zip*: ZIP file
528
529 You can register new formats or provide your own unpacker for any existing
530 formats, by using :func:`register_unpack_format`.
531
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000532
Éric Araujof2fbb9c2012-01-16 16:55:55 +0100533.. _shutil-archiving-example:
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000534
Tarek Ziadé396fad72010-02-23 05:30:31 +0000535Archiving example
Georg Brandl03b9ad02012-06-24 18:09:40 +0200536~~~~~~~~~~~~~~~~~
Tarek Ziadé396fad72010-02-23 05:30:31 +0000537
538In this example, we create a gzip'ed tar-file archive containing all files
539found in the :file:`.ssh` directory of the user::
540
541 >>> from shutil import make_archive
542 >>> import os
543 >>> archive_name = os.path.expanduser(os.path.join('~', 'myarchive'))
544 >>> root_dir = os.path.expanduser(os.path.join('~', '.ssh'))
545 >>> make_archive(archive_name, 'gztar', root_dir)
546 '/Users/tarek/myarchive.tar.gz'
547
548The resulting archive contains::
549
550 $ tar -tzvf /Users/tarek/myarchive.tar.gz
551 drwx------ tarek/staff 0 2010-02-01 16:23:40 ./
552 -rw-r--r-- tarek/staff 609 2008-06-09 13:26:54 ./authorized_keys
553 -rwxr-xr-x tarek/staff 65 2008-06-09 13:26:54 ./config
554 -rwx------ tarek/staff 668 2008-06-09 13:26:54 ./id_dsa
555 -rwxr-xr-x tarek/staff 609 2008-06-09 13:26:54 ./id_dsa.pub
556 -rw------- tarek/staff 1675 2008-06-09 13:26:54 ./id_rsa
557 -rw-r--r-- tarek/staff 397 2008-06-09 13:26:54 ./id_rsa.pub
558 -rw-r--r-- tarek/staff 37192 2010-02-06 18:23:10 ./known_hosts
Antoine Pitroubcf2b592012-02-08 23:28:36 +0100559
560
561Querying the size of the output terminal
562----------------------------------------
563
564.. versionadded:: 3.3
565
566.. function:: get_terminal_size(fallback=(columns, lines))
567
568 Get the size of the terminal window.
569
570 For each of the two dimensions, the environment variable, ``COLUMNS``
571 and ``LINES`` respectively, is checked. If the variable is defined and
572 the value is a positive integer, it is used.
573
574 When ``COLUMNS`` or ``LINES`` is not defined, which is the common case,
575 the terminal connected to :data:`sys.__stdout__` is queried
576 by invoking :func:`os.get_terminal_size`.
577
578 If the terminal size cannot be successfully queried, either because
579 the system doesn't support querying, or because we are not
580 connected to a terminal, the value given in ``fallback`` parameter
581 is used. ``fallback`` defaults to ``(80, 24)`` which is the default
582 size used by many terminal emulators.
583
584 The value returned is a named tuple of type :class:`os.terminal_size`.
585
586 See also: The Single UNIX Specification, Version 2,
587 `Other Environment Variables`_.
588
589.. _`Other Environment Variables`:
590 http://pubs.opengroup.org/onlinepubs/7908799/xbd/envvar.html#tag_002_003
591