blob: 20a84b618a37df2f61063e78551397095da0d065 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`os.path` --- Common pathname manipulations
2================================================
3
4.. module:: os.path
5 :synopsis: Operations on pathnames.
6
Georg Brandl116aa622007-08-15 14:28:22 +00007.. index:: single: path; operations
8
9This module implements some useful functions on pathnames. To read or
10write files see :func:`open`, and for accessing the filesystem see the
Martin v. Löwis651423c2008-10-07 07:03:04 +000011:mod:`os` module. The path parameters can be passed as either strings,
12or bytes. Applications are encouraged to represent file names as
13(Unicode) character strings. Unfortunately, some file names may not be
14representable as strings on Unix, so applications that need to support
15arbitrary file names on Unix should use bytes objects to represent
16path names. Vice versa, using bytes objects cannot represent all file
17names on Windows (in the standard ``mbcs`` encoding), hence Windows
18applications should use string objects to access all files.
Georg Brandl116aa622007-08-15 14:28:22 +000019
Georg Brandl76e55382008-10-08 16:34:57 +000020.. note::
21
22 All of these functions accept either only bytes or only string objects as
23 their parameters. The result is an object of the same type, if a path or
24 file name is returned.
25
Georg Brandl116aa622007-08-15 14:28:22 +000026
Benjamin Petersond23f8222009-04-05 19:13:16 +000027.. note::
28
29 Since different operating systems have different path name conventions, there
30 are several versions of this module in the standard library. The
31 :mod:`os.path` module is always the path module suitable for the operating
32 system Python is running on, and therefore usable for local paths. However,
33 you can also import and use the individual modules if you want to manipulate
34 a path that is *always* in one of the different formats. They all have the
35 same interface:
36
37 * :mod:`posixpath` for UNIX-style paths
38 * :mod:`ntpath` for Windows paths
39 * :mod:`macpath` for old-style MacOS paths
40 * :mod:`os2emxpath` for OS/2 EMX paths
41
42
Georg Brandl116aa622007-08-15 14:28:22 +000043.. function:: abspath(path)
44
45 Return a normalized absolutized version of the pathname *path*. On most
46 platforms, this is equivalent to ``normpath(join(os.getcwd(), path))``.
47
Georg Brandl116aa622007-08-15 14:28:22 +000048
49.. function:: basename(path)
50
51 Return the base name of pathname *path*. This is the second half of the pair
52 returned by ``split(path)``. Note that the result of this function is different
53 from the Unix :program:`basename` program; where :program:`basename` for
54 ``'/foo/bar/'`` returns ``'bar'``, the :func:`basename` function returns an
55 empty string (``''``).
56
57
58.. function:: commonprefix(list)
59
60 Return the longest path prefix (taken character-by-character) that is a prefix
61 of all paths in *list*. If *list* is empty, return the empty string (``''``).
62 Note that this may return invalid paths because it works a character at a time.
63
64
65.. function:: dirname(path)
66
67 Return the directory name of pathname *path*. This is the first half of the
68 pair returned by ``split(path)``.
69
70
71.. function:: exists(path)
72
Richard Oudkerk2240ac12012-07-06 12:05:32 +010073 Return ``True`` if *path* refers to an existing path or an open
74 file descriptor. Returns ``False`` for broken symbolic links. On
75 some platforms, this function may return ``False`` if permission is
76 not granted to execute :func:`os.stat` on the requested file, even
Georg Brandl116aa622007-08-15 14:28:22 +000077 if the *path* physically exists.
78
Richard Oudkerk2240ac12012-07-06 12:05:32 +010079 .. versionchanged:: 3.3
80 *path* can now be an integer: ``True`` is returned if it is an
81 open file descriptor, ``False`` otherwise.
82
Georg Brandl116aa622007-08-15 14:28:22 +000083
84.. function:: lexists(path)
85
86 Return ``True`` if *path* refers to an existing path. Returns ``True`` for
87 broken symbolic links. Equivalent to :func:`exists` on platforms lacking
88 :func:`os.lstat`.
89
Georg Brandl116aa622007-08-15 14:28:22 +000090
91.. function:: expanduser(path)
92
93 On Unix and Windows, return the argument with an initial component of ``~`` or
94 ``~user`` replaced by that *user*'s home directory.
95
96 .. index:: module: pwd
97
98 On Unix, an initial ``~`` is replaced by the environment variable :envvar:`HOME`
99 if it is set; otherwise the current user's home directory is looked up in the
100 password directory through the built-in module :mod:`pwd`. An initial ``~user``
101 is looked up directly in the password directory.
102
103 On Windows, :envvar:`HOME` and :envvar:`USERPROFILE` will be used if set,
104 otherwise a combination of :envvar:`HOMEPATH` and :envvar:`HOMEDRIVE` will be
105 used. An initial ``~user`` is handled by stripping the last directory component
106 from the created user path derived above.
107
108 If the expansion fails or if the path does not begin with a tilde, the path is
109 returned unchanged.
110
111
112.. function:: expandvars(path)
113
114 Return the argument with environment variables expanded. Substrings of the form
115 ``$name`` or ``${name}`` are replaced by the value of environment variable
116 *name*. Malformed variable names and references to non-existing variables are
117 left unchanged.
118
119 On Windows, ``%name%`` expansions are supported in addition to ``$name`` and
120 ``${name}``.
121
122
123.. function:: getatime(path)
124
125 Return the time of last access of *path*. The return value is a number giving
126 the number of seconds since the epoch (see the :mod:`time` module). Raise
127 :exc:`os.error` if the file does not exist or is inaccessible.
128
Georg Brandl55ac8f02007-09-01 13:51:09 +0000129 If :func:`os.stat_float_times` returns True, the result is a floating point
130 number.
Georg Brandl116aa622007-08-15 14:28:22 +0000131
132
133.. function:: getmtime(path)
134
135 Return the time of last modification of *path*. The return value is a number
136 giving the number of seconds since the epoch (see the :mod:`time` module).
137 Raise :exc:`os.error` if the file does not exist or is inaccessible.
138
Georg Brandl55ac8f02007-09-01 13:51:09 +0000139 If :func:`os.stat_float_times` returns True, the result is a floating point
140 number.
Georg Brandl116aa622007-08-15 14:28:22 +0000141
142
143.. function:: getctime(path)
144
145 Return the system's ctime which, on some systems (like Unix) is the time of the
146 last change, and, on others (like Windows), is the creation time for *path*.
147 The return value is a number giving the number of seconds since the epoch (see
148 the :mod:`time` module). Raise :exc:`os.error` if the file does not exist or
149 is inaccessible.
150
Georg Brandl116aa622007-08-15 14:28:22 +0000151
152.. function:: getsize(path)
153
154 Return the size, in bytes, of *path*. Raise :exc:`os.error` if the file does
155 not exist or is inaccessible.
156
Georg Brandl116aa622007-08-15 14:28:22 +0000157
158.. function:: isabs(path)
159
Christian Heimesaf98da12008-01-27 15:18:18 +0000160 Return ``True`` if *path* is an absolute pathname. On Unix, that means it
161 begins with a slash, on Windows that it begins with a (back)slash after chopping
162 off a potential drive letter.
Georg Brandl116aa622007-08-15 14:28:22 +0000163
164
165.. function:: isfile(path)
166
167 Return ``True`` if *path* is an existing regular file. This follows symbolic
168 links, so both :func:`islink` and :func:`isfile` can be true for the same path.
169
170
171.. function:: isdir(path)
172
173 Return ``True`` if *path* is an existing directory. This follows symbolic
174 links, so both :func:`islink` and :func:`isdir` can be true for the same path.
175
176
177.. function:: islink(path)
178
179 Return ``True`` if *path* refers to a directory entry that is a symbolic link.
180 Always ``False`` if symbolic links are not supported.
181
182
183.. function:: ismount(path)
184
185 Return ``True`` if pathname *path* is a :dfn:`mount point`: a point in a file
186 system where a different file system has been mounted. The function checks
187 whether *path*'s parent, :file:`path/..`, is on a different device than *path*,
188 or whether :file:`path/..` and *path* point to the same i-node on the same
189 device --- this should detect mount points for all Unix and POSIX variants.
190
191
192.. function:: join(path1[, path2[, ...]])
193
194 Join one or more path components intelligently. If any component is an absolute
195 path, all previous components (on Windows, including the previous drive letter,
196 if there was one) are thrown away, and joining continues. The return value is
197 the concatenation of *path1*, and optionally *path2*, etc., with exactly one
R David Murray24eb4bc2011-06-23 21:26:13 -0400198 directory separator (``os.sep``) following each non-empty part except the last.
199 (This means that an empty last part will result in a path that ends with a
200 separator.) Note that on Windows, since there is a current directory for
201 each drive, ``os.path.join("c:", "foo")`` represents a path relative to the
202 current directory on drive :file:`C:` (:file:`c:foo`), not :file:`c:\\foo`.
Georg Brandl116aa622007-08-15 14:28:22 +0000203
204
205.. function:: normcase(path)
206
Benjamin Petersond23f8222009-04-05 19:13:16 +0000207 Normalize the case of a pathname. On Unix and Mac OS X, this returns the
208 path unchanged; on case-insensitive filesystems, it converts the path to
209 lowercase. On Windows, it also converts forward slashes to backward slashes.
Ezio Melotti5a3ef5b2010-06-25 10:56:11 +0000210 Raise a TypeError if the type of *path* is not ``str`` or ``bytes``.
Georg Brandl116aa622007-08-15 14:28:22 +0000211
212
213.. function:: normpath(path)
214
215 Normalize a pathname. This collapses redundant separators and up-level
Georg Brandl353ebce2010-08-02 19:19:26 +0000216 references so that ``A//B``, ``A/B/``, ``A/./B`` and ``A/foo/../B`` all become
217 ``A/B``.
218
Georg Brandl116aa622007-08-15 14:28:22 +0000219 It does not normalize the case (use :func:`normcase` for that). On Windows, it
220 converts forward slashes to backward slashes. It should be understood that this
221 may change the meaning of the path if it contains symbolic links!
222
223
224.. function:: realpath(path)
225
226 Return the canonical path of the specified filename, eliminating any symbolic
227 links encountered in the path (if they are supported by the operating system).
228
Georg Brandl116aa622007-08-15 14:28:22 +0000229
Georg Brandl18244152009-09-02 20:34:52 +0000230.. function:: relpath(path, start=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000231
232 Return a relative filepath to *path* either from the current directory or from
233 an optional *start* point.
234
Benjamin Petersonf650e462010-05-06 23:03:05 +0000235 *start* defaults to :attr:`os.curdir`.
236
Antoine Pitrouf10f1622010-12-12 20:17:29 +0000237 Availability: Unix, Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000238
Georg Brandl116aa622007-08-15 14:28:22 +0000239
240.. function:: samefile(path1, path2)
241
Brian Curtind40e6f72010-07-08 21:39:08 +0000242 Return ``True`` if both pathname arguments refer to the same file or directory.
243 On Unix, this is determined by the device number and i-node number and raises an
244 exception if a :func:`os.stat` call on either pathname fails.
Benjamin Petersonf650e462010-05-06 23:03:05 +0000245
Brian Curtind40e6f72010-07-08 21:39:08 +0000246 On Windows, two files are the same if they resolve to the same final path
Brian Curtinc7395692010-07-09 15:15:09 +0000247 name using the Windows API call GetFinalPathNameByHandle. This function
Brian Curtind40e6f72010-07-08 21:39:08 +0000248 raises an exception if handles cannot be obtained to either file.
249
Antoine Pitrouf10f1622010-12-12 20:17:29 +0000250 Availability: Unix, Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000251
Georg Brandlb3823372010-07-10 08:58:37 +0000252 .. versionchanged:: 3.2
253 Added Windows support.
Brian Curtinc7395692010-07-09 15:15:09 +0000254
Georg Brandl116aa622007-08-15 14:28:22 +0000255
256.. function:: sameopenfile(fp1, fp2)
257
258 Return ``True`` if the file descriptors *fp1* and *fp2* refer to the same file.
Benjamin Petersonf650e462010-05-06 23:03:05 +0000259
Brian Curtin62857742010-09-06 17:07:27 +0000260 Availability: Unix, Windows.
261
Georg Brandl61063cc2012-06-24 22:48:30 +0200262 .. versionchanged:: 3.2
263 Added Windows support.
Georg Brandl116aa622007-08-15 14:28:22 +0000264
265
266.. function:: samestat(stat1, stat2)
267
268 Return ``True`` if the stat tuples *stat1* and *stat2* refer to the same file.
269 These structures may have been returned by :func:`fstat`, :func:`lstat`, or
270 :func:`stat`. This function implements the underlying comparison used by
Benjamin Petersonf650e462010-05-06 23:03:05 +0000271 :func:`samefile` and :func:`sameopenfile`.
272
273 Availability: Unix.
Georg Brandl116aa622007-08-15 14:28:22 +0000274
275
276.. function:: split(path)
277
Georg Brandl539c1652010-10-14 06:46:08 +0000278 Split the pathname *path* into a pair, ``(head, tail)`` where *tail* is the
279 last pathname component and *head* is everything leading up to that. The
280 *tail* part will never contain a slash; if *path* ends in a slash, *tail*
281 will be empty. If there is no slash in *path*, *head* will be empty. If
282 *path* is empty, both *head* and *tail* are empty. Trailing slashes are
283 stripped from *head* unless it is the root (one or more slashes only). In
284 all cases, ``join(head, tail)`` returns a path to the same location as *path*
285 (but the strings may differ).
Georg Brandl116aa622007-08-15 14:28:22 +0000286
287
288.. function:: splitdrive(path)
289
290 Split the pathname *path* into a pair ``(drive, tail)`` where *drive* is either
Mark Hammond5a607a32009-05-06 08:04:54 +0000291 a mount point or the empty string. On systems which do not use drive
Georg Brandl116aa622007-08-15 14:28:22 +0000292 specifications, *drive* will always be the empty string. In all cases, ``drive
293 + tail`` will be the same as *path*.
294
Mark Hammond5a607a32009-05-06 08:04:54 +0000295 On Windows, splits a pathname into drive/UNC sharepoint and relative path.
296
297 If the path contains a drive letter, drive will contain everything
298 up to and including the colon.
299 e.g. ``splitdrive("c:/dir")`` returns ``("c:", "/dir")``
300
301 If the path contains a UNC path, drive will contain the host name
302 and share, up to but not including the fourth separator.
303 e.g. ``splitdrive("//host/computer/dir")`` returns ``("//host/computer", "/dir")``
304
Georg Brandl116aa622007-08-15 14:28:22 +0000305
306.. function:: splitext(path)
307
308 Split the pathname *path* into a pair ``(root, ext)`` such that ``root + ext ==
309 path``, and *ext* is empty or begins with a period and contains at most one
310 period. Leading periods on the basename are ignored; ``splitext('.cshrc')``
311 returns ``('.cshrc', '')``.
312
Georg Brandl116aa622007-08-15 14:28:22 +0000313
314.. function:: splitunc(path)
315
Mark Hammond5a607a32009-05-06 08:04:54 +0000316 .. deprecated:: 3.1
317 Use *splitdrive* instead.
318
Georg Brandl116aa622007-08-15 14:28:22 +0000319 Split the pathname *path* into a pair ``(unc, rest)`` so that *unc* is the UNC
320 mount point (such as ``r'\\host\mount'``), if present, and *rest* the rest of
321 the path (such as ``r'\path\file.ext'``). For paths containing drive letters,
Benjamin Petersonf650e462010-05-06 23:03:05 +0000322 *unc* will always be the empty string.
323
324 Availability: Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000325
326
Georg Brandl116aa622007-08-15 14:28:22 +0000327.. data:: supports_unicode_filenames
328
329 True if arbitrary Unicode strings can be used as file names (within limitations
Victor Stinnerb55e4982010-09-11 00:22:12 +0000330 imposed by the file system).