:mod:`shutil` --- High-level file operations
============================================

.. module:: shutil
   :synopsis: High-level file operations, including copying.
.. sectionauthor:: Fred L. Drake, Jr. <fdrake@acm.org>
.. partly based on the docstrings

.. index::
   single: file; copying
   single: copying files

The :mod:`shutil` module offers a number of high-level operations on files and
collections of files.  In particular, functions are provided which support file
copying and removal. For operations on individual files, see also the
:mod:`os` module.

.. warning::

   Even the higher-level file copying functions (:func:`copy`, :func:`copy2`)
   can't copy all file metadata.

   On POSIX platforms, this means that file owner and group are lost as well
   as ACLs.  On Mac OS, the resource fork and other metadata are not used.
   This means that resources will be lost and file type and creator codes will
   not be correct. On Windows, file owners, ACLs and alternate data streams
   are not copied.

Directory and file operations
-----------------------------

.. function:: copyfileobj(fsrc, fdst[, length])

   Copy the contents of the file-like object *fsrc* to the file-like object *fdst*.
   The integer *length*, if given, is the buffer size. In particular, a negative
   *length* value means to copy the data without looping over the source data in
   chunks; by default the data is read in chunks to avoid uncontrolled memory
   consumption. Note that if the current file position of the *fsrc* object is not
   0, only the contents from the current file position to the end of the file will
   be copied.
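
   As a minimal sketch of the position-dependent behaviour described above
   (the in-memory streams and the sample bytes are illustrative assumptions,
   not part of the module)::

      import io
      import shutil

      # An in-memory "file" whose first 7 bytes we do not want to copy.
      source = io.BytesIO(b"header:payload")
      source.seek(len(b"header:"))

      target = io.BytesIO()
      # Copy from the current position to the end, 4 bytes at a time.
      shutil.copyfileobj(source, target, 4)

      copied = target.getvalue()

   Here ``copied`` holds only ``b"payload"``, because copying starts at the
   current position of *fsrc* rather than at the beginning of the stream.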


.. function:: copyfile(src, dst)

   Copy the contents (no metadata) of the file named *src* to a file named *dst*.
   *dst* must be the complete target file name; look at :func:`copy` for a copy that
   accepts a target directory path.  If *src* and *dst* are the same file,
   :exc:`Error` is raised.
   The destination location must be writable; otherwise, an :exc:`IOError` exception
   will be raised. If *dst* already exists, it will be replaced.   Special files
   such as character or block devices and pipes cannot be copied with this
   function.  *src* and *dst* are path names given as strings.


.. function:: copymode(src, dst)

   Copy the permission bits from *src* to *dst*.  The file contents, owner, and
   group are unaffected.  *src* and *dst* are path names given as strings.


.. function:: copystat(src, dst)

   Copy the permission bits, last access time, last modification time, and flags
   from *src* to *dst*.  The file contents, owner, and group are unaffected.  *src*
   and *dst* are path names given as strings.


.. function:: copy(src, dst)

   Copy the file *src* to the file or directory *dst*.  If *dst* is a directory, a
   file with the same basename as *src*  is created (or overwritten) in the
   directory specified.  Permission bits are copied.  *src* and *dst* are path
   names given as strings.


.. function:: copy2(src, dst)

   Similar to :func:`copy`, but metadata is copied as well -- in fact, this is just
   :func:`copy` followed by :func:`copystat`.  This is similar to the
   Unix command :program:`cp -p`.
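
   The differences between :func:`copyfile`, :func:`copy` and :func:`copy2`
   can be sketched as follows.  The temporary directory, the file names and
   the timestamp are assumptions chosen only for illustration::

      import os
      import shutil
      import tempfile

      workdir = tempfile.mkdtemp()
      src = os.path.join(workdir, "report.txt")
      with open(src, "w") as f:
          f.write("quarterly numbers\n")
      # Give the source an old, easily recognisable modification time.
      os.utime(src, (1000000000, 1000000000))

      outdir = os.path.join(workdir, "out")
      os.mkdir(outdir)

      # copyfile() needs a complete target file name and copies contents only.
      shutil.copyfile(src, os.path.join(outdir, "contents-only.txt"))
      # copy() accepts a directory and keeps the basename of the source.
      shutil.copy(src, outdir)
      # copy2() additionally copies the access and modification times.
      shutil.copy2(src, os.path.join(outdir, "with-times.txt"))

      created = sorted(os.listdir(outdir))
      mtime_copy2 = os.stat(os.path.join(outdir, "with-times.txt")).st_mtime

      shutil.rmtree(workdir)  # clean up the scratch directory

   After this runs, ``created`` lists three files, and only the one written by
   :func:`copy2` is expected to carry the modification time of the source.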


.. function:: ignore_patterns(\*patterns)

   This factory function creates a function that can be used as a callable for
   :func:`copytree`\'s *ignore* argument, ignoring files and directories that
   match one of the glob-style *patterns* provided.  See the example below.


.. function:: copytree(src, dst, symlinks=False, ignore=None, copy_function=copy2)

   Recursively copy an entire directory tree rooted at *src*.  The destination
   directory, named by *dst*, must not already exist; it will be created as well
   as missing parent directories.  Permissions and times of directories are
   copied with :func:`copystat`; individual files are copied using
   :func:`copy2`.

   If *symlinks* is true, symbolic links in the source tree are represented as
   symbolic links in the new tree; if false or omitted, the contents of the
   linked files are copied to the new tree.

   If *ignore* is given, it must be a callable that will receive as its
   arguments the directory being visited by :func:`copytree`, and a list of its
   contents, as returned by :func:`os.listdir`.  Since :func:`copytree` is
   called recursively, the *ignore* callable will be called once for each
   directory that is copied.  The callable must return a sequence of directory
   and file names relative to the current directory (i.e. a subset of the items
   in its second argument); these names will then be ignored in the copy
   process.  :func:`ignore_patterns` can be used to create such a callable that
   ignores names based on glob-style patterns.

   If one or more exceptions occur, an :exc:`Error` is raised with a list of
   reasons.

   If *copy_function* is given, it must be a callable that will be used
   to copy each file. It will be called with the source path and the
   destination path as arguments. By default, :func:`copy2` is used, but any
   function that supports the same signature (like :func:`copy`) can be used.

   .. versionchanged:: 3.2
      Added the *copy_function* argument to be able to provide a custom copy
      function.
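
   A custom *copy_function* might look like the sketch below.  The helper
   ``logging_copy``, the directory layout and the file names are hypothetical
   and serve only to show that the callable receives a source path and a
   destination path for every file::

      import os
      import shutil
      import tempfile

      copied_files = []

      def logging_copy(src, dst):
          """Hypothetical copy function: record the file name, then copy."""
          copied_files.append(os.path.basename(src))
          shutil.copy(src, dst)

      # Build a small source tree: src/a.txt and src/pkg/b.txt.
      workdir = tempfile.mkdtemp()
      src = os.path.join(workdir, "src")
      os.makedirs(os.path.join(src, "pkg"))
      for name in ("a.txt", os.path.join("pkg", "b.txt")):
          with open(os.path.join(src, name), "w") as f:
              f.write(name)

      dst = os.path.join(workdir, "dst")  # must not exist yet
      shutil.copytree(src, dst, copy_function=logging_copy)

      result = sorted(copied_files)
      copied_nested = os.path.exists(os.path.join(dst, "pkg", "b.txt"))

      shutil.rmtree(workdir)  # clean up the scratch directory

   Directories are still created by :func:`copytree` itself; only regular
   files are handed to the custom callable.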

.. function:: rmtree(path, ignore_errors=False, onerror=None)

   .. index:: single: directory; deleting

   Delete an entire directory tree; *path* must point to a directory (but not a
   symbolic link to a directory).  If *ignore_errors* is true, errors resulting
   from failed removals will be ignored; if false or omitted, such errors are
   handled by calling a handler specified by *onerror* or, if that is omitted,
   they raise an exception.

   If *onerror* is provided, it must be a callable that accepts three
   parameters: *function*, *path*, and *excinfo*. The first parameter,
   *function*, is the function which raised the exception; it will be
   :func:`os.path.islink`, :func:`os.listdir`, :func:`os.remove` or
   :func:`os.rmdir`.  The second parameter, *path*, will be the path name passed
   to *function*.  The third parameter, *excinfo*, will be the exception
   information returned by :func:`sys.exc_info`.  Exceptions raised by *onerror*
   will not be caught.
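
   One possible use of *onerror* is to retry a removal that failed because a
   file is read-only.  The handler name ``make_writable_and_retry`` and the
   file layout are assumptions made for this sketch; on platforms where
   read-only files can be removed directly, the handler is simply never
   called::

      import os
      import shutil
      import stat
      import tempfile

      def make_writable_and_retry(function, path, excinfo):
          """Hypothetical handler: clear the read-only bit and retry once."""
          os.chmod(path, stat.S_IWRITE)
          function(path)

      workdir = tempfile.mkdtemp()
      locked = os.path.join(workdir, "locked.txt")
      with open(locked, "w") as f:
          f.write("read-only data")
      os.chmod(locked, stat.S_IREAD)  # mark the file read-only

      shutil.rmtree(workdir, onerror=make_writable_and_retry)
      removed = not os.path.exists(workdir)

   If the retry inside the handler fails as well, the resulting exception
   propagates to the caller, since exceptions raised by *onerror* are not
   caught.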


.. function:: move(src, dst)

   Recursively move a file or directory to another location.

   If the destination is on the current filesystem, :func:`os.rename` is
   simply used.  Otherwise, *src* is copied (with :func:`copy2`) to *dst* and
   then removed.
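
   A short sketch of moving a directory; the directory and file names are
   illustrative assumptions::

      import os
      import shutil
      import tempfile

      workdir = tempfile.mkdtemp()
      src = os.path.join(workdir, "incoming")
      os.mkdir(src)
      with open(os.path.join(src, "data.txt"), "w") as f:
          f.write("payload")

      dst = os.path.join(workdir, "archive")  # does not exist yet
      shutil.move(src, dst)

      source_still_there = os.path.exists(src)
      moved_names = os.listdir(dst)

      shutil.rmtree(workdir)  # clean up the scratch directory

   Because *dst* does not exist beforehand, the whole ``incoming`` directory
   ends up under the new name, and the source path is gone afterwards.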


.. exception:: Error

   This exception collects exceptions that are raised during a multi-file
   operation. For :func:`copytree`, the exception argument is a list of
   3-tuples (*srcname*, *dstname*, *exception*).


.. _shutil-example:

copytree example
::::::::::::::::

This example is a simplified version of the implementation of the
:func:`copytree` function, described above, without the *ignore* and
*copy_function* arguments and with the docstring omitted.  It demonstrates many
of the other functions provided by this module. ::

   import os
   from shutil import copy2, copystat, Error

   def copytree(src, dst, symlinks=False):
       names = os.listdir(src)
       os.makedirs(dst)
       errors = []
       for name in names:
           srcname = os.path.join(src, name)
           dstname = os.path.join(dst, name)
           try:
               if symlinks and os.path.islink(srcname):
                   linkto = os.readlink(srcname)
                   os.symlink(linkto, dstname)
               elif os.path.isdir(srcname):
                   copytree(srcname, dstname, symlinks)
               else:
                   copy2(srcname, dstname)
               # XXX What about devices, sockets etc.?
           except (IOError, os.error) as why:
               errors.append((srcname, dstname, str(why)))
           # catch the Error from the recursive copytree so that we can
           # continue with other files
           except Error as err:
               errors.extend(err.args[0])
       try:
           copystat(src, dst)
       except OSError as why:
           # can't copy file access times on Windows, so ignore Windows errors
           if getattr(why, 'winerror', None) is None:
               errors.append((src, dst, str(why)))
       if errors:
           raise Error(errors)

Another example that uses the :func:`ignore_patterns` helper::

   from shutil import copytree, ignore_patterns

   copytree(source, destination, ignore=ignore_patterns('*.pyc', 'tmp*'))

This will copy everything except ``.pyc`` files and files or directories whose
name starts with ``tmp``.

Another example that uses the *ignore* argument to add a logging call::

   from shutil import copytree
   import logging

   def _logpath(path, names):
       logging.info('Working in %s', path)
       return []   # nothing will be ignored

   copytree(source, destination, ignore=_logpath)


Archive operations
------------------

.. function:: make_archive(base_name, format[, root_dir[, base_dir[, verbose[, dry_run[, owner[, group[, logger]]]]]]])

   Create an archive file (e.g. zip or tar) and return its name.

   *base_name* is the name of the file to create, including the path, minus
   any format-specific extension. *format* is the archive format: one of
   "zip", "tar", "ztar", "bztar" or "gztar".

   *root_dir* is a directory that will be the root directory of the
   archive; i.e. we typically chdir into *root_dir* before creating the
   archive.

   *base_dir* is the directory where we start archiving from;
   i.e. *base_dir* will be the common prefix of all files and
   directories in the archive.

   *root_dir* and *base_dir* both default to the current directory.

   *owner* and *group* are used when creating a tar archive. By default,
   the current owner and group are used.
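
   The effect of *root_dir* and *base_dir* on the names stored in the archive
   can be sketched with a zip file.  The ``project`` directory and the archive
   names are assumptions made for this illustration::

      import os
      import shutil
      import tempfile
      import zipfile

      workdir = tempfile.mkdtemp()
      project = os.path.join(workdir, "project")
      os.mkdir(project)
      with open(os.path.join(project, "a.txt"), "w") as f:
          f.write("hello")

      # Archive the contents of project: names are relative to root_dir.
      flat = shutil.make_archive(os.path.join(workdir, "flat"), "zip",
                                 root_dir=project)

      # Archive the project directory itself: names start with base_dir.
      nested = shutil.make_archive(os.path.join(workdir, "nested"), "zip",
                                   root_dir=workdir, base_dir="project")

      with zipfile.ZipFile(flat) as zf:
          flat_names = zf.namelist()
      with zipfile.ZipFile(nested) as zf:
          nested_names = zf.namelist()

      shutil.rmtree(workdir)  # clean up the scratch directory

   In the first archive the file is stored as ``a.txt``; in the second it is
   stored as ``project/a.txt``, since *base_dir* becomes the common prefix.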

   .. versionadded:: 3.2


.. function:: get_archive_formats()

   Returns a list of supported formats for archiving.
   Each element of the returned sequence is a tuple ``(name, description)``.

   By default :mod:`shutil` provides these formats:

   - *gztar*: gzip'ed tar-file
   - *bztar*: bzip2'ed tar-file
   - *ztar*: compressed tar file
   - *tar*: uncompressed tar file
   - *zip*: ZIP file

   You can register new formats or provide your own archiver for any existing
   formats, by using :func:`register_archive_format`.

   .. versionadded:: 3.2


.. function:: register_archive_format(name, function[, extra_args[, description]])

   Registers an archiver for the format *name*. *function* is a callable that
   will be used to invoke the archiver.

   If given, *extra_args* is a sequence of ``(name, value)`` pairs that will be
   used as extra keyword arguments when the archiver callable is used.

   *description* is used by :func:`get_archive_formats` which returns the
   list of archivers.  Defaults to an empty string.
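
   The following sketch registers a toy archiver, uses it and removes it
   again.  The format name ``filelist`` and the function ``make_file_list``
   are hypothetical.  The sketch also assumes, as CPython's implementation
   does, that the archiver is called with *base_name* and *base_dir* as
   positional arguments plus further keyword arguments::

      import os
      import shutil
      import tempfile

      def make_file_list(base_name, base_dir, **kwargs):
          """Hypothetical archiver: write all file names to a text file."""
          target = base_name + ".list"
          names = []
          for dirpath, dirnames, filenames in os.walk(base_dir):
              for filename in filenames:
                  names.append(os.path.normpath(os.path.join(dirpath, filename)))
          with open(target, "w") as f:
              f.write("\n".join(sorted(names)))
          return target

      shutil.register_archive_format("filelist", make_file_list,
                                     description="plain-text list of file names")
      registered = ("filelist", "plain-text list of file names") \
          in shutil.get_archive_formats()

      workdir = tempfile.mkdtemp()
      src = os.path.join(workdir, "src")
      os.mkdir(src)
      with open(os.path.join(src, "a.txt"), "w") as f:
          f.write("hello")

      listing = shutil.make_archive(os.path.join(workdir, "manifest"),
                                    "filelist", root_dir=src)
      with open(listing) as f:
          contents = f.read()

      shutil.unregister_archive_format("filelist")
      still_registered = "filelist" in [name for name, _ in
                                        shutil.get_archive_formats()]

      shutil.rmtree(workdir)  # clean up the scratch directory

   Accepting ``**kwargs`` keeps the toy archiver tolerant of whatever extra
   keyword arguments :func:`make_archive` passes along.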

   .. versionadded:: 3.2


.. function:: unregister_archive_format(name)

   Remove the archive format *name* from the list of supported formats.

   .. versionadded:: 3.2


Archiving example
:::::::::::::::::

In this example, we create a gzip'ed tar-file archive containing all files
found in the :file:`.ssh` directory of the user::

    >>> from shutil import make_archive
    >>> import os
    >>> archive_name = os.path.expanduser(os.path.join('~', 'myarchive'))
    >>> root_dir = os.path.expanduser(os.path.join('~', '.ssh'))
    >>> make_archive(archive_name, 'gztar', root_dir)
    '/Users/tarek/myarchive.tar.gz'

The resulting archive contains::

    $ tar -tzvf /Users/tarek/myarchive.tar.gz
    drwx------ tarek/staff       0 2010-02-01 16:23:40 ./
    -rw-r--r-- tarek/staff     609 2008-06-09 13:26:54 ./authorized_keys
    -rwxr-xr-x tarek/staff      65 2008-06-09 13:26:54 ./config
    -rwx------ tarek/staff     668 2008-06-09 13:26:54 ./id_dsa
    -rwxr-xr-x tarek/staff     609 2008-06-09 13:26:54 ./id_dsa.pub
    -rw------- tarek/staff    1675 2008-06-09 13:26:54 ./id_rsa
    -rw-r--r-- tarek/staff     397 2008-06-09 13:26:54 ./id_rsa.pub
    -rw-r--r-- tarek/staff   37192 2010-02-06 18:23:10 ./known_hosts
