blob: d87e605b99ffb6991b00af0a9c27892f83921b16 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`shutil` --- High-level file operations
2============================================
3
4.. module:: shutil
5 :synopsis: High-level file operations, including copying.
6.. sectionauthor:: Fred L. Drake, Jr. <fdrake@acm.org>
Christian Heimes5b5e81c2007-12-31 16:14:33 +00007.. partly based on the docstrings
Georg Brandl116aa622007-08-15 14:28:22 +00008
9.. index::
10 single: file; copying
11 single: copying files
12
Raymond Hettinger10480942011-01-10 03:26:08 +000013**Source code:** :source:`Lib/shutil.py`
14
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000015--------------
16
Georg Brandl116aa622007-08-15 14:28:22 +000017The :mod:`shutil` module offers a number of high-level operations on files and
18collections of files. In particular, functions are provided which support file
Guido van Rossum2cc30da2007-11-02 23:46:40 +000019copying and removal. For operations on individual files, see also the
20:mod:`os` module.
Georg Brandl116aa622007-08-15 14:28:22 +000021
Guido van Rossumda27fd22007-08-17 00:24:54 +000022.. warning::
Christian Heimes7f044312008-01-06 17:05:40 +000023
24 Even the higher-level file copying functions (:func:`copy`, :func:`copy2`)
Raymond Hettinger10480942011-01-10 03:26:08 +000025 cannot copy all file metadata.
Georg Brandl48310cd2009-01-03 21:18:54 +000026
Christian Heimes7f044312008-01-06 17:05:40 +000027 On POSIX platforms, this means that file owner and group are lost as well
Georg Brandlc575c902008-09-13 17:46:05 +000028 as ACLs. On Mac OS, the resource fork and other metadata are not used.
Christian Heimes7f044312008-01-06 17:05:40 +000029 This means that resources will be lost and file type and creator codes will
30 not be correct. On Windows, file owners, ACLs and alternate data streams
31 are not copied.
Georg Brandl116aa622007-08-15 14:28:22 +000032
Éric Araujo6e6cb8e2010-11-16 19:13:50 +000033
Tarek Ziadé396fad72010-02-23 05:30:31 +000034Directory and files operations
35------------------------------
Georg Brandl116aa622007-08-15 14:28:22 +000036
Georg Brandl116aa622007-08-15 14:28:22 +000037.. function:: copyfileobj(fsrc, fdst[, length])
38
39 Copy the contents of the file-like object *fsrc* to the file-like object *fdst*.
40 The integer *length*, if given, is the buffer size. In particular, a negative
41 *length* value means to copy the data without looping over the source data in
42 chunks; by default the data is read in chunks to avoid uncontrolled memory
43 consumption. Note that if the current file position of the *fsrc* object is not
44 0, only the contents from the current file position to the end of the file will
45 be copied.
46
47
Christian Heimesa342c012008-04-20 21:01:16 +000048.. function:: copyfile(src, dst)
49
50 Copy the contents (no metadata) of the file named *src* to a file named *dst*.
51 *dst* must be the complete target file name; look at :func:`copy` for a copy that
Georg Brandlaf265f42008-12-07 15:06:20 +000052 accepts a target directory path. If *src* and *dst* are the same files,
53 :exc:`Error` is raised.
Christian Heimesa342c012008-04-20 21:01:16 +000054 The destination location must be writable; otherwise, an :exc:`IOError` exception
55 will be raised. If *dst* already exists, it will be replaced. Special files
56 such as character or block devices and pipes cannot be copied with this
57 function. *src* and *dst* are path names given as strings.
58
59
Georg Brandl116aa622007-08-15 14:28:22 +000060.. function:: copymode(src, dst)
61
62 Copy the permission bits from *src* to *dst*. The file contents, owner, and
63 group are unaffected. *src* and *dst* are path names given as strings.
64
65
66.. function:: copystat(src, dst)
67
68 Copy the permission bits, last access time, last modification time, and flags
69 from *src* to *dst*. The file contents, owner, and group are unaffected. *src*
70 and *dst* are path names given as strings.
71
72
73.. function:: copy(src, dst)
74
75 Copy the file *src* to the file or directory *dst*. If *dst* is a directory, a
76 file with the same basename as *src* is created (or overwritten) in the
77 directory specified. Permission bits are copied. *src* and *dst* are path
78 names given as strings.
79
80
81.. function:: copy2(src, dst)
82
Alexandre Vassalottibee32532008-05-16 18:15:12 +000083 Similar to :func:`copy`, but metadata is copied as well -- in fact, this is just
84 :func:`copy` followed by :func:`copystat`. This is similar to the
85 Unix command :program:`cp -p`.
Georg Brandl116aa622007-08-15 14:28:22 +000086
87
Georg Brandl86b2fb92008-07-16 03:43:04 +000088.. function:: ignore_patterns(\*patterns)
89
90 This factory function creates a function that can be used as a callable for
91 :func:`copytree`\'s *ignore* argument, ignoring files and directories that
92 match one of the glob-style *patterns* provided. See the example below.
93
94
Ezio Melotticb999a32010-04-20 11:26:51 +000095.. function:: copytree(src, dst, symlinks=False, ignore=None, copy_function=copy2, ignore_dangling_symlinks=False)
Georg Brandl116aa622007-08-15 14:28:22 +000096
97 Recursively copy an entire directory tree rooted at *src*. The destination
Georg Brandl86b2fb92008-07-16 03:43:04 +000098 directory, named by *dst*, must not already exist; it will be created as well
99 as missing parent directories. Permissions and times of directories are
100 copied with :func:`copystat`, individual files are copied using
101 :func:`copy2`.
Georg Brandl116aa622007-08-15 14:28:22 +0000102
Georg Brandl86b2fb92008-07-16 03:43:04 +0000103 If *symlinks* is true, symbolic links in the source tree are represented as
Senthil Kumaranef5c7162011-08-02 18:52:28 +0800104 symbolic links in the new tree, but the metadata of the original links is NOT
105 copied; if false or omitted, the contents and metadata of the linked files
106 are copied to the new tree.
Georg Brandl86b2fb92008-07-16 03:43:04 +0000107
Tarek Ziadéfb437512010-04-20 08:57:33 +0000108 When *symlinks* is false, if the file pointed by the symlink doesn't
109 exist, a exception will be added in the list of errors raised in
110 a :exc:`Error` exception at the end of the copy process.
111 You can set the optional *ignore_dangling_symlinks* flag to true if you
Tarek Ziadé8c26c7d2010-04-23 13:03:50 +0000112 want to silence this exception. Notice that this option has no effect
113 on platforms that don't support :func:`os.symlink`.
Tarek Ziadéfb437512010-04-20 08:57:33 +0000114
Georg Brandl86b2fb92008-07-16 03:43:04 +0000115 If *ignore* is given, it must be a callable that will receive as its
116 arguments the directory being visited by :func:`copytree`, and a list of its
117 contents, as returned by :func:`os.listdir`. Since :func:`copytree` is
118 called recursively, the *ignore* callable will be called once for each
119 directory that is copied. The callable must return a sequence of directory
120 and file names relative to the current directory (i.e. a subset of the items
121 in its second argument); these names will then be ignored in the copy
122 process. :func:`ignore_patterns` can be used to create such a callable that
123 ignores names based on glob-style patterns.
124
125 If exception(s) occur, an :exc:`Error` is raised with a list of reasons.
126
Tarek Ziadé5340db32010-04-19 22:30:51 +0000127 If *copy_function* is given, it must be a callable that will be used
128 to copy each file. It will be called with the source path and the
129 destination path as arguments. By default, :func:`copy2` is used, but any
130 function that supports the same signature (like :func:`copy`) can be used.
Georg Brandl116aa622007-08-15 14:28:22 +0000131
Tarek Ziadé5340db32010-04-19 22:30:51 +0000132 .. versionchanged:: 3.2
133 Added the *copy_function* argument to be able to provide a custom copy
134 function.
Georg Brandl116aa622007-08-15 14:28:22 +0000135
Ezio Melotticb999a32010-04-20 11:26:51 +0000136 .. versionchanged:: 3.2
Tarek Ziadéfb437512010-04-20 08:57:33 +0000137 Added the *ignore_dangling_symlinks* argument to silent dangling symlinks
138 errors when *symlinks* is false.
139
140
Georg Brandl18244152009-09-02 20:34:52 +0000141.. function:: rmtree(path, ignore_errors=False, onerror=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000142
143 .. index:: single: directory; deleting
144
Christian Heimes9bd667a2008-01-20 15:14:11 +0000145 Delete an entire directory tree; *path* must point to a directory (but not a
146 symbolic link to a directory). If *ignore_errors* is true, errors resulting
147 from failed removals will be ignored; if false or omitted, such errors are
148 handled by calling a handler specified by *onerror* or, if that is omitted,
149 they raise an exception.
Georg Brandl116aa622007-08-15 14:28:22 +0000150
Christian Heimes9bd667a2008-01-20 15:14:11 +0000151 If *onerror* is provided, it must be a callable that accepts three
152 parameters: *function*, *path*, and *excinfo*. The first parameter,
153 *function*, is the function which raised the exception; it will be
154 :func:`os.path.islink`, :func:`os.listdir`, :func:`os.remove` or
155 :func:`os.rmdir`. The second parameter, *path*, will be the path name passed
156 to *function*. The third parameter, *excinfo*, will be the exception
157 information return by :func:`sys.exc_info`. Exceptions raised by *onerror*
158 will not be caught.
159
Georg Brandl116aa622007-08-15 14:28:22 +0000160
161.. function:: move(src, dst)
162
Éric Araujo14382dc2011-07-28 22:49:11 +0200163 Recursively move a file or directory (*src*) to another location (*dst*).
Georg Brandl116aa622007-08-15 14:28:22 +0000164
Éric Araujo14382dc2011-07-28 22:49:11 +0200165 If the destination is a directory or a symlink to a directory, then *src* is
166 moved inside that directory.
167
168 The destination directory must not already exist. If the destination already
169 exists but is not a directory, it may be overwritten depending on
170 :func:`os.rename` semantics.
171
172 If the destination is on the current filesystem, then :func:`os.rename` is
173 used. Otherwise, *src* is copied (using :func:`copy2`) to *dst* and then
174 removed.
Georg Brandl116aa622007-08-15 14:28:22 +0000175
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200176.. function:: disk_usage(path)
177
Éric Araujoe4d5b8e2011-08-08 16:51:11 +0200178 Return disk usage statistics about the given path as a :term:`named tuple`
179 with the attributes *total*, *used* and *free*, which are the amount of
180 total, used and free space, in bytes.
Giampaolo Rodola'210e7ca2011-07-01 13:55:36 +0200181
182 .. versionadded:: 3.3
183
184 Availability: Unix, Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000185
Sandro Tosid902a142011-08-22 23:28:27 +0200186.. function:: chown(path, user=None, group=None)
187
188 Change owner *user* and/or *group* of the given *path*.
189
190 *user* can be a system user name or a uid; the same applies to *group*. At
191 least one argument is required.
192
193 See also :func:`os.chown`, the underlying function.
194
195 Availability: Unix.
196
197 .. versionadded:: 3.3
198
199
Georg Brandl116aa622007-08-15 14:28:22 +0000200.. exception:: Error
201
Éric Araujo14382dc2011-07-28 22:49:11 +0200202 This exception collects exceptions that are raised during a multi-file
203 operation. For :func:`copytree`, the exception argument is a list of 3-tuples
204 (*srcname*, *dstname*, *exception*).
Georg Brandl116aa622007-08-15 14:28:22 +0000205
Georg Brandl116aa622007-08-15 14:28:22 +0000206
207.. _shutil-example:
208
Tarek Ziadé396fad72010-02-23 05:30:31 +0000209copytree example
210::::::::::::::::
Georg Brandl116aa622007-08-15 14:28:22 +0000211
212This example is the implementation of the :func:`copytree` function, described
213above, with the docstring omitted. It demonstrates many of the other functions
214provided by this module. ::
215
216 def copytree(src, dst, symlinks=False):
217 names = os.listdir(src)
218 os.makedirs(dst)
219 errors = []
220 for name in names:
221 srcname = os.path.join(src, name)
222 dstname = os.path.join(dst, name)
223 try:
224 if symlinks and os.path.islink(srcname):
225 linkto = os.readlink(srcname)
226 os.symlink(linkto, dstname)
227 elif os.path.isdir(srcname):
228 copytree(srcname, dstname, symlinks)
229 else:
230 copy2(srcname, dstname)
231 # XXX What about devices, sockets etc.?
232 except (IOError, os.error) as why:
233 errors.append((srcname, dstname, str(why)))
234 # catch the Error from the recursive copytree so that we can
235 # continue with other files
236 except Error as err:
237 errors.extend(err.args[0])
238 try:
239 copystat(src, dst)
240 except WindowsError:
241 # can't copy file access times on Windows
242 pass
243 except OSError as why:
244 errors.extend((src, dst, str(why)))
245 if errors:
Collin Winterc79461b2007-09-01 23:34:30 +0000246 raise Error(errors)
Georg Brandl116aa622007-08-15 14:28:22 +0000247
Tarek Ziadé396fad72010-02-23 05:30:31 +0000248Another example that uses the :func:`ignore_patterns` helper::
249
250 from shutil import copytree, ignore_patterns
251
252 copytree(source, destination, ignore=ignore_patterns('*.pyc', 'tmp*'))
253
254This will copy everything except ``.pyc`` files and files or directories whose
255name starts with ``tmp``.
256
257Another example that uses the *ignore* argument to add a logging call::
258
259 from shutil import copytree
260 import logging
261
262 def _logpath(path, names):
263 logging.info('Working in %s' % path)
264 return [] # nothing will be ignored
265
266 copytree(source, destination, ignore=_logpath)
267
268
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000269.. _archiving-operations:
270
271Archiving operations
272--------------------
Tarek Ziadé396fad72010-02-23 05:30:31 +0000273
274.. function:: make_archive(base_name, format, [root_dir, [base_dir, [verbose, [dry_run, [owner, [group, [logger]]]]]]])
275
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000276 Create an archive file (such as zip or tar) and return its name.
Tarek Ziadé396fad72010-02-23 05:30:31 +0000277
278 *base_name* is the name of the file to create, including the path, minus
279 any format-specific extension. *format* is the archive format: one of
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000280 "zip", "tar", "bztar" (if the :mod:`bz2` module is available) or "gztar".
Tarek Ziadé396fad72010-02-23 05:30:31 +0000281
282 *root_dir* is a directory that will be the root directory of the
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000283 archive; for example, we typically chdir into *root_dir* before creating the
Tarek Ziadé396fad72010-02-23 05:30:31 +0000284 archive.
285
286 *base_dir* is the directory where we start archiving from;
Ezio Melotticb999a32010-04-20 11:26:51 +0000287 i.e. *base_dir* will be the common prefix of all files and
Tarek Ziadé396fad72010-02-23 05:30:31 +0000288 directories in the archive.
289
290 *root_dir* and *base_dir* both default to the current directory.
291
292 *owner* and *group* are used when creating a tar archive. By default,
293 uses the current owner and group.
294
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000295 *logger* is an instance of :class:`logging.Logger`.
296
Ezio Melottif8754a62010-03-21 07:16:43 +0000297 .. versionadded:: 3.2
Tarek Ziadé396fad72010-02-23 05:30:31 +0000298
299
300.. function:: get_archive_formats()
301
Éric Araujo14382dc2011-07-28 22:49:11 +0200302 Return a list of supported formats for archiving.
Tarek Ziadé396fad72010-02-23 05:30:31 +0000303 Each element of the returned sequence is a tuple ``(name, description)``
304
305 By default :mod:`shutil` provides these formats:
306
307 - *gztar*: gzip'ed tar-file
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000308 - *bztar*: bzip2'ed tar-file (if the :mod:`bz2` module is available.)
Tarek Ziadé396fad72010-02-23 05:30:31 +0000309 - *tar*: uncompressed tar file
310 - *zip*: ZIP file
311
312 You can register new formats or provide your own archiver for any existing
313 formats, by using :func:`register_archive_format`.
314
Ezio Melottif8754a62010-03-21 07:16:43 +0000315 .. versionadded:: 3.2
Tarek Ziadé396fad72010-02-23 05:30:31 +0000316
317
318.. function:: register_archive_format(name, function, [extra_args, [description]])
319
Éric Araujo14382dc2011-07-28 22:49:11 +0200320 Register an archiver for the format *name*. *function* is a callable that
Tarek Ziadé396fad72010-02-23 05:30:31 +0000321 will be used to invoke the archiver.
322
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000323 If given, *extra_args* is a sequence of ``(name, value)`` pairs that will be
Tarek Ziadé396fad72010-02-23 05:30:31 +0000324 used as extra keywords arguments when the archiver callable is used.
325
326 *description* is used by :func:`get_archive_formats` which returns the
327 list of archivers. Defaults to an empty list.
328
Ezio Melottif8754a62010-03-21 07:16:43 +0000329 .. versionadded:: 3.2
Tarek Ziadé396fad72010-02-23 05:30:31 +0000330
331
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000332.. function:: unregister_archive_format(name)
Tarek Ziadé396fad72010-02-23 05:30:31 +0000333
334 Remove the archive format *name* from the list of supported formats.
335
Ezio Melottif8754a62010-03-21 07:16:43 +0000336 .. versionadded:: 3.2
Tarek Ziadé396fad72010-02-23 05:30:31 +0000337
338
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000339.. function:: unpack_archive(filename[, extract_dir[, format]])
340
341 Unpack an archive. *filename* is the full path of the archive.
342
343 *extract_dir* is the name of the target directory where the archive is
344 unpacked. If not provided, the current working directory is used.
345
346 *format* is the archive format: one of "zip", "tar", or "gztar". Or any
347 other format registered with :func:`register_unpack_format`. If not
348 provided, :func:`unpack_archive` will use the archive file name extension
349 and see if an unpacker was registered for that extension. In case none is
350 found, a :exc:`ValueError` is raised.
351
352 .. versionadded:: 3.2
353
354
Raymond Hettinger0929b1f2011-01-23 11:29:08 +0000355.. function:: register_unpack_format(name, extensions, function[, extra_args[, description]])
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000356
357 Registers an unpack format. *name* is the name of the format and
358 *extensions* is a list of extensions corresponding to the format, like
359 ``.zip`` for Zip files.
360
361 *function* is the callable that will be used to unpack archives. The
362 callable will receive the path of the archive, followed by the directory
363 the archive must be extracted to.
364
365 When provided, *extra_args* is a sequence of ``(name, value)`` tuples that
366 will be passed as keywords arguments to the callable.
367
368 *description* can be provided to describe the format, and will be returned
369 by the :func:`get_unpack_formats` function.
370
371 .. versionadded:: 3.2
372
373
374.. function:: unregister_unpack_format(name)
375
376 Unregister an unpack format. *name* is the name of the format.
377
378 .. versionadded:: 3.2
379
380
381.. function:: get_unpack_formats()
382
383 Return a list of all registered formats for unpacking.
384 Each element of the returned sequence is a tuple
385 ``(name, extensions, description)``.
386
387 By default :mod:`shutil` provides these formats:
388
389 - *gztar*: gzip'ed tar-file
Tarek Ziadéffa155a2010-04-29 13:34:35 +0000390 - *bztar*: bzip2'ed tar-file (if the :mod:`bz2` module is available.)
Tarek Ziadé6ac91722010-04-28 17:51:36 +0000391 - *tar*: uncompressed tar file
392 - *zip*: ZIP file
393
394 You can register new formats or provide your own unpacker for any existing
395 formats, by using :func:`register_unpack_format`.
396
397 .. versionadded:: 3.2
398
399
400
Tarek Ziadé396fad72010-02-23 05:30:31 +0000401Archiving example
402:::::::::::::::::
403
404In this example, we create a gzip'ed tar-file archive containing all files
405found in the :file:`.ssh` directory of the user::
406
407 >>> from shutil import make_archive
408 >>> import os
409 >>> archive_name = os.path.expanduser(os.path.join('~', 'myarchive'))
410 >>> root_dir = os.path.expanduser(os.path.join('~', '.ssh'))
411 >>> make_archive(archive_name, 'gztar', root_dir)
412 '/Users/tarek/myarchive.tar.gz'
413
414The resulting archive contains::
415
416 $ tar -tzvf /Users/tarek/myarchive.tar.gz
417 drwx------ tarek/staff 0 2010-02-01 16:23:40 ./
418 -rw-r--r-- tarek/staff 609 2008-06-09 13:26:54 ./authorized_keys
419 -rwxr-xr-x tarek/staff 65 2008-06-09 13:26:54 ./config
420 -rwx------ tarek/staff 668 2008-06-09 13:26:54 ./id_dsa
421 -rwxr-xr-x tarek/staff 609 2008-06-09 13:26:54 ./id_dsa.pub
422 -rw------- tarek/staff 1675 2008-06-09 13:26:54 ./id_rsa
423 -rw-r--r-- tarek/staff 397 2008-06-09 13:26:54 ./id_rsa.pub
424 -rw-r--r-- tarek/staff 37192 2010-02-06 18:23:10 ./known_hosts
425
426