blob: 6cb03b86648b2c4717def10d20e455990f76f606 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`shutil` --- High-level file operations
2============================================
3
4.. module:: shutil
5 :synopsis: High-level file operations, including copying.
6.. sectionauthor:: Fred L. Drake, Jr. <fdrake@acm.org>
Christian Heimes5b5e81c2007-12-31 16:14:33 +00007.. partly based on the docstrings
Georg Brandl116aa622007-08-15 14:28:22 +00008
9.. index::
10 single: file; copying
11 single: copying files
12
Raymond Hettinger10480942011-01-10 03:26:08 +000013**Source code:** :source:`Lib/shutil.py`
14
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000015--------------
16
Georg Brandl116aa622007-08-15 14:28:22 +000017The :mod:`shutil` module offers a number of high-level operations on files and
18collections of files. In particular, functions are provided which support file
Guido van Rossum2cc30da2007-11-02 23:46:40 +000019copying and removal. For operations on individual files, see also the
20:mod:`os` module.
Georg Brandl116aa622007-08-15 14:28:22 +000021
Guido van Rossumda27fd22007-08-17 00:24:54 +000022.. warning::
Christian Heimes7f044312008-01-06 17:05:40 +000023
24 Even the higher-level file copying functions (:func:`copy`, :func:`copy2`)
Raymond Hettinger10480942011-01-10 03:26:08 +000025 cannot copy all file metadata.
Georg Brandl48310cd2009-01-03 21:18:54 +000026
Christian Heimes7f044312008-01-06 17:05:40 +000027 On POSIX platforms, this means that file owner and group are lost as well
Georg Brandlc575c902008-09-13 17:46:05 +000028 as ACLs. On Mac OS, the resource fork and other metadata are not used.
Christian Heimes7f044312008-01-06 17:05:40 +000029 This means that resources will be lost and file type and creator codes will
30 not be correct. On Windows, file owners, ACLs and alternate data streams
31 are not copied.
Georg Brandl116aa622007-08-15 14:28:22 +000032
Éric Araujo6e6cb8e2010-11-16 19:13:50 +000033
Tarek Ziadé396fad72010-02-23 05:30:31 +000034Directory and files operations
35------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +000036
Georg Brandl116aa622007-08-15 14:28:22 +000037.. function:: copyfileobj(fsrc, fdst[, length])
38
39 Copy the contents of the file-like object *fsrc* to the file-like object *fdst*.
40 The integer *length*, if given, is the buffer size. In particular, a negative
41 *length* value means to copy the data without looping over the source data in
42 chunks; by default the data is read in chunks to avoid uncontrolled memory
43 consumption. Note that if the current file position of the *fsrc* object is not
44 0, only the contents from the current file position to the end of the file will
45 be copied.
46
47
Christian Heimesa342c012008-04-20 21:01:16 +000048.. function:: copyfile(src, dst)
49
50 Copy the contents (no metadata) of the file named *src* to a file named *dst*.
51 *dst* must be the complete target file name; look at :func:`copy` for a copy that
Georg Brandlaf265f42008-12-07 15:06:20 +000052 accepts a target directory path. If *src* and *dst* are the same files,
53 :exc:`Error` is raised.
Antoine Pitrou62ab10a2011-10-12 20:10:51 +020054 The destination location must be writable; otherwise, an :exc:`OSError` exception
Christian Heimesa342c012008-04-20 21:01:16 +000055 will be raised. If *dst* already exists, it will be replaced. Special files
56 such as character or block devices and pipes cannot be copied with this
57 function. *src* and *dst* are path names given as strings.
58
Antoine Pitrou62ab10a2011-10-12 20:10:51 +020059 .. versionchanged:: 3.3
60 :exc:`IOError` used to be raised instead of :exc:`OSError`.
61
Christian Heimesa342c012008-04-20 21:01:16 +000062
Georg Brandl116aa622007-08-15 14:28:22 +000063.. function:: copymode(src, dst)
64
65 Copy the permission bits from *src* to *dst*. The file contents, owner, and
66 group are unaffected. *src* and *dst* are path names given as strings.
67
68
69.. function:: copystat(src, dst)
70
71 Copy the permission bits, last access time, last modification time, and flags
72 from *src* to *dst*. The file contents, owner, and group are unaffected. *src*
73 and *dst* are path names given as strings.
74
75
76.. function:: copy(src, dst)
77
78 Copy the file *src* to the file or directory *dst*. If *dst* is a directory, a
79 file with the same basename as *src* is created (or overwritten) in the
80 directory specified. Permission bits are copied. *src* and *dst* are path
81 names given as strings.
82
83
84.. function:: copy2(src, dst)
85
Alexandre Vassalottibee32532008-05-16 18:15:12 +000086 Similar to :func:`copy`, but metadata is copied as well -- in fact, this is just
87 :func:`copy` followed by :func:`copystat`. This is similar to the
88 Unix command :program:`cp -p`.
Georg Brandl116aa622007-08-15 14:28:22 +000089
90
Georg Brandl86b2fb92008-07-16 03:43:04 +000091.. function:: ignore_patterns(\*patterns)
92
93 This factory function creates a function that can be used as a callable for
94 :func:`copytree`\'s *ignore* argument, ignoring files and directories that
95 match one of the glob-style *patterns* provided. See the example below.
96
97
Ezio Melotticb999a32010-04-20 11:26:51 +000098.. function:: copytree(src, dst, symlinks=False, ignore=None, copy_function=copy2, ignore_dangling_symlinks=False)
Georg Brandl116aa622007-08-15 14:28:22 +000099
100 Recursively copy an entire directory tree rooted at *src*. The destination
Georg Brandl86b2fb92008-07-16 03:43:04 +0000101 directory, named by *dst*, must not already exist; it will be created as well
102 as missing parent directories. Permissions and times of directories are
103 copied with :func:`copystat`, individual files are copied using
104 :func:`copy2`.
Georg Brandl116aa622007-08-15 14:28:22 +0000105
Georg Brandl86b2fb92008-07-16 03:43:04 +0000106 If *symlinks* is true, symbolic links in the source tree are represented as
Senthil Kumaranef5c7162011-08-02 18:52:28 +0800107 symbolic links in the new tree, but the metadata of the original links is NOT
108 copied; if false or omitted, the contents and metadata of the linked files
109 are copied to the new tree.
Georg Brandl86b2fb92008-07-16 03:43:04 +0000110
Tarek Ziadéfb437512010-04-20 08:57:33 +0000111 When *symlinks* is false, if the file pointed by the symlink doesn't
112 exist, a exception will be added in the list of errors raised in
113 a :exc:`Error` exception at the end of the copy process.
114 You can set the optional *ignore_dangling_symlinks* flag to true if you
Tarek Ziadé8c26c7d2010-04-23 13:03:50 +0000115 want to silence this exception. Notice that this option has no effect
116 on platforms that don't support :func:`os.symlink`.
Tarek Ziadéfb437512010-04-20 08:57:33 +0000117
Georg Brandl86b2fb92008-07-16 03:43:04 +0000118 If *ignore* is given, it must be a callable that will receive as its
119 arguments the directory being visited by :func:`copytree`, and a list of its
120 contents, as returned by :func:`os.listdir`. Since :func:`copytree` is
121 called recursively, the *ignore* callable will be called once for each
122 directory that is copied. The callable must return a sequence of directory
123 and file names relative to the current directory (i.e. a subset of the items
124 in its second argument); these names will then be ignored in the copy
125 process. :func:`ignore_patterns` can be used to create such a callable that
126 ignores names based on glob-style patterns.
127
128 If exception(s) occur, an :exc:`Error` is raised with a list of reasons.
129
Tarek Ziadé5340db32010-04-19 22:30:51 +0000130 If *copy_function* is given, it must be a callable that will be used
131 to copy each file. It will be called with the source path and the
132 destination path as arguments. By default, :func:`copy2` is used, but any
133 function that supports the same signature (like :func:`copy`) can be used.
Georg Brandl116aa622007-08-15 14:28:22 +0000134
Tarek Ziadé5340db32010-04-19 22:30:51 +0000135 .. versionchanged:: 3.2
136 Added the *copy_function* argument to be able to provide a custom copy
137 function.
Georg Brandl116aa622007-08-15 14:28:22 +0000138
Ezio Melotticb999a32010-04-20 11:26:51 +0000139 .. versionchanged:: 3.2
Tarek Ziadéfb437512010-04-20 08:57:33 +0000140 Added the *ignore_dangling_symlinks* argument to silent dangling symlinks
141 errors when *symlinks* is false.
142
143
Georg Brandl18244152009-09-02 20:34:52 +0000144.. function:: rmtree(path, ignore_errors=False, onerror=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000145
146 .. index:: single: directory; deleting
147
Christian Heimes9bd667a2008-01-20 15:14:11 +0000148 Delete an entire directory tree; *path* must point to a directory (but not a
149 symbolic link to a directory). If *ignore_errors* is true, errors resulting
150 from failed removals will be ignored; if false or omitted, such errors are
151 handled by calling a handler specified by *onerror* or, if that is omitted,
152 they raise an exception.
Georg Brandl116aa622007-08-15 14:28:22 +0000153
Christian Heimes9bd667a2008-01-20 15:14:11 +0000154 If *onerror* is provided, it must be a callable that accepts three
155 parameters: *function*, *path*, and *excinfo*. The first parameter,
156 *function*, is the function which raised the exception; it will be
157 :func:`os.path.islink`, :func:`os.listdir`, :func:`os.remove` or
158 :func:`os.rmdir`. The second parameter, *path*, will be the path name passed
159 to *function*. The third parameter, *excinfo*, will be the exception
160 information return by :func:`sys.exc_info`. Exceptions raised by *onerror*
161 will not be caught.
162
Georg Brandl116aa622007-08-15 14:28:22 +0000163
164.. function:: move(src, dst)
165
Éric Araujo14382dc2011-07-28 22:49:11 +0200166 Recursively move a file or directory (*src*) to another location (*dst*).
Georg Brandl116aa622007-08-15 14:28:22 +0000167
Éric Araujo14382dc2011-07-28 22:49:11 +0200168 If the destination is a directory or a symlink to a directory, then *src* is
169 moved inside that directory.
170
171 The destination directory must not already exist. If the destination already
172 exists but is not a directory, it may be overwritten depending on
173 :func:`os.rename` semantics.
174
175 If the destination is on the current filesystem, then :func:`os.rename` is
176 used. Otherwise, *src* is copied (using :func:`copy2`) to *dst* and then
177 removed.
Georg Brandl116aa622007-08-15 14:28:22 +0000178
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200179.. function:: disk_usage(path)
180
Éric Araujoe4d5b8e2011-08-08 16:51:11 +0200181 Return disk usage statistics about the given path as a :term:`named tuple`
182 with the attributes *total*, *used* and *free*, which are the amount of
183 total, used and free space, in bytes.
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200184
185 .. versionadded:: 3.3
186
187 Availability: Unix, Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000188
Sandro Tosid902a142011-08-22 23:28:27 +0200189.. function:: chown(path, user=None, group=None)
190
191 Change owner *user* and/or *group* of the given *path*.
192
193 *user* can be a system user name or a uid; the same applies to *group*. At
194 least one argument is required.
195
196 See also :func:`os.chown`, the underlying function.
197
198 Availability: Unix.
199
200 .. versionadded:: 3.3
201
202
Georg Brandl116aa622007-08-15 14:28:22 +0000203.. exception:: Error
204
Éric Araujo14382dc2011-07-28 22:49:11 +0200205 This exception collects exceptions that are raised during a multi-file
206 operation. For :func:`copytree`, the exception argument is a list of 3-tuples
207 (*srcname*, *dstname*, *exception*).
Georg Brandl116aa622007-08-15 14:28:22 +0000208
Georg Brandl116aa622007-08-15 14:28:22 +0000209
210.. _shutil-example:
211
Tarek Ziadé396fad72010-02-23 05:30:31 +0000212copytree example
213::::::::::::::::
Georg Brandl116aa622007-08-15 14:28:22 +0000214
215This example is the implementation of the :func:`copytree` function, described
216above, with the docstring omitted. It demonstrates many of the other functions
217provided by this module. ::
218
219 def copytree(src, dst, symlinks=False):
220 names = os.listdir(src)
221 os.makedirs(dst)
222 errors = []
223 for name in names:
224 srcname = os.path.join(src, name)
225 dstname = os.path.join(dst, name)
226 try:
227 if symlinks and os.path.islink(srcname):
228 linkto = os.readlink(srcname)
229 os.symlink(linkto, dstname)
230 elif os.path.isdir(srcname):
231 copytree(srcname, dstname, symlinks)
232 else:
233 copy2(srcname, dstname)
234 # XXX What about devices, sockets etc.?
235 except (IOError, os.error) as why:
236 errors.append((srcname, dstname, str(why)))
237 # catch the Error from the recursive copytree so that we can
238 # continue with other files
239 except Error as err:
240 errors.extend(err.args[0])
241 try:
242 copystat(src, dst)
243 except WindowsError:
244 # can't copy file access times on Windows
245 pass
246 except OSError as why:
247 errors.extend((src, dst, str(why)))
248 if errors:
Collin Winterc79461b2007-09-01 23:34:30 +0000249 raise Error(errors)
Georg Brandl116aa622007-08-15 14:28:22 +0000250
Tarek Ziadé396fad72010-02-23 05:30:31 +0000251Another example that uses the :func:`ignore_patterns` helper::
252
253 from shutil import copytree, ignore_patterns
254
255 copytree(source, destination, ignore=ignore_patterns('*.pyc', 'tmp*'))
256
257This will copy everything except ``.pyc`` files and files or directories whose
258name starts with ``tmp``.
259
260Another example that uses the *ignore* argument to add a logging call::
261
262 from shutil import copytree
263 import logging
264
265 def _logpath(path, names):
266 logging.info('Working in %s' % path)
267 return [] # nothing will be ignored
268
269 copytree(source, destination, ignore=_logpath)
270
271
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000272.. _archiving-operations:
273
274Archiving operations
275--------------------
Tarek Ziadé396fad72010-02-23 05:30:31 +0000276
277.. function:: make_archive(base_name, format, [root_dir, [base_dir, [verbose, [dry_run, [owner, [group, [logger]]]]]]])
278
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000279 Create an archive file (such as zip or tar) and return its name.
Tarek Ziadé396fad72010-02-23 05:30:31 +0000280
281 *base_name* is the name of the file to create, including the path, minus
282 any format-specific extension. *format* is the archive format: one of
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000283 "zip", "tar", "bztar" (if the :mod:`bz2` module is available) or "gztar".
Tarek Ziadé396fad72010-02-23 05:30:31 +0000284
285 *root_dir* is a directory that will be the root directory of the
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000286 archive; for example, we typically chdir into *root_dir* before creating the
Tarek Ziadé396fad72010-02-23 05:30:31 +0000287 archive.
288
289 *base_dir* is the directory where we start archiving from;
Ezio Melotticb999a32010-04-20 11:26:51 +0000290 i.e. *base_dir* will be the common prefix of all files and
Tarek Ziadé396fad72010-02-23 05:30:31 +0000291 directories in the archive.
292
293 *root_dir* and *base_dir* both default to the current directory.
294
295 *owner* and *group* are used when creating a tar archive. By default,
296 uses the current owner and group.
297
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000298 *logger* is an instance of :class:`logging.Logger`.
299
Ezio Melottif8754a62010-03-21 07:16:43 +0000300 .. versionadded:: 3.2
Tarek Ziadé396fad72010-02-23 05:30:31 +0000301
302
303.. function:: get_archive_formats()
304
Éric Araujo14382dc2011-07-28 22:49:11 +0200305 Return a list of supported formats for archiving.
Tarek Ziadé396fad72010-02-23 05:30:31 +0000306 Each element of the returned sequence is a tuple ``(name, description)``
307
308 By default :mod:`shutil` provides these formats:
309
310 - *gztar*: gzip'ed tar-file
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000311 - *bztar*: bzip2'ed tar-file (if the :mod:`bz2` module is available.)
Tarek Ziadé396fad72010-02-23 05:30:31 +0000312 - *tar*: uncompressed tar file
313 - *zip*: ZIP file
314
315 You can register new formats or provide your own archiver for any existing
316 formats, by using :func:`register_archive_format`.
317
Ezio Melottif8754a62010-03-21 07:16:43 +0000318 .. versionadded:: 3.2
Tarek Ziadé396fad72010-02-23 05:30:31 +0000319
320
321.. function:: register_archive_format(name, function, [extra_args, [description]])
322
Éric Araujo14382dc2011-07-28 22:49:11 +0200323 Register an archiver for the format *name*. *function* is a callable that
Tarek Ziadé396fad72010-02-23 05:30:31 +0000324 will be used to invoke the archiver.
325
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000326 If given, *extra_args* is a sequence of ``(name, value)`` pairs that will be
Tarek Ziadé396fad72010-02-23 05:30:31 +0000327 used as extra keywords arguments when the archiver callable is used.
328
329 *description* is used by :func:`get_archive_formats` which returns the
330 list of archivers. Defaults to an empty list.
331
Ezio Melottif8754a62010-03-21 07:16:43 +0000332 .. versionadded:: 3.2
Tarek Ziadé396fad72010-02-23 05:30:31 +0000333
334
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000335.. function:: unregister_archive_format(name)
Tarek Ziadé396fad72010-02-23 05:30:31 +0000336
337 Remove the archive format *name* from the list of supported formats.
338
Ezio Melottif8754a62010-03-21 07:16:43 +0000339 .. versionadded:: 3.2
Tarek Ziadé396fad72010-02-23 05:30:31 +0000340
341
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000342.. function:: unpack_archive(filename[, extract_dir[, format]])
343
344 Unpack an archive. *filename* is the full path of the archive.
345
346 *extract_dir* is the name of the target directory where the archive is
347 unpacked. If not provided, the current working directory is used.
348
349 *format* is the archive format: one of "zip", "tar", or "gztar". Or any
350 other format registered with :func:`register_unpack_format`. If not
351 provided, :func:`unpack_archive` will use the archive file name extension
352 and see if an unpacker was registered for that extension. In case none is
353 found, a :exc:`ValueError` is raised.
354
355 .. versionadded:: 3.2
356
357
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000358.. function:: register_unpack_format(name, extensions, function[, extra_args[, description]])
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000359
360 Registers an unpack format. *name* is the name of the format and
361 *extensions* is a list of extensions corresponding to the format, like
362 ``.zip`` for Zip files.
363
364 *function* is the callable that will be used to unpack archives. The
365 callable will receive the path of the archive, followed by the directory
366 the archive must be extracted to.
367
368 When provided, *extra_args* is a sequence of ``(name, value)`` tuples that
369 will be passed as keywords arguments to the callable.
370
371 *description* can be provided to describe the format, and will be returned
372 by the :func:`get_unpack_formats` function.
373
374 .. versionadded:: 3.2
375
376
377.. function:: unregister_unpack_format(name)
378
379 Unregister an unpack format. *name* is the name of the format.
380
381 .. versionadded:: 3.2
382
383
384.. function:: get_unpack_formats()
385
386 Return a list of all registered formats for unpacking.
387 Each element of the returned sequence is a tuple
388 ``(name, extensions, description)``.
389
390 By default :mod:`shutil` provides these formats:
391
392 - *gztar*: gzip'ed tar-file
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000393 - *bztar*: bzip2'ed tar-file (if the :mod:`bz2` module is available.)
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000394 - *tar*: uncompressed tar file
395 - *zip*: ZIP file
396
397 You can register new formats or provide your own unpacker for any existing
398 formats, by using :func:`register_unpack_format`.
399
400 .. versionadded:: 3.2
401
402
403
Tarek Ziadé396fad72010-02-23 05:30:31 +0000404Archiving example
405:::::::::::::::::
406
407In this example, we create a gzip'ed tar-file archive containing all files
408found in the :file:`.ssh` directory of the user::
409
410 >>> from shutil import make_archive
411 >>> import os
412 >>> archive_name = os.path.expanduser(os.path.join('~', 'myarchive'))
413 >>> root_dir = os.path.expanduser(os.path.join('~', '.ssh'))
414 >>> make_archive(archive_name, 'gztar', root_dir)
415 '/Users/tarek/myarchive.tar.gz'
416
417The resulting archive contains::
418
419 $ tar -tzvf /Users/tarek/myarchive.tar.gz
420 drwx------ tarek/staff 0 2010-02-01 16:23:40 ./
421 -rw-r--r-- tarek/staff 609 2008-06-09 13:26:54 ./authorized_keys
422 -rwxr-xr-x tarek/staff 65 2008-06-09 13:26:54 ./config
423 -rwx------ tarek/staff 668 2008-06-09 13:26:54 ./id_dsa
424 -rwxr-xr-x tarek/staff 609 2008-06-09 13:26:54 ./id_dsa.pub
425 -rw------- tarek/staff 1675 2008-06-09 13:26:54 ./id_rsa
426 -rw-r--r-- tarek/staff 397 2008-06-09 13:26:54 ./id_rsa.pub
427 -rw-r--r-- tarek/staff 37192 2010-02-06 18:23:10 ./known_hosts
428
429