blob: 7a17d23845356f7d5a75c36fff9e7711910a28d5 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`os.path` --- Common pathname manipulations
2================================================
3
4.. module:: os.path
5 :synopsis: Operations on pathnames.
6
Georg Brandl116aa622007-08-15 14:28:22 +00007.. index:: single: path; operations
8
9This module implements some useful functions on pathnames. To read or
10write files see :func:`open`, and for accessing the filesystem see the
Martin v. Löwis651423c2008-10-07 07:03:04 +000011:mod:`os` module. The path parameters can be passed as either strings,
12or bytes. Applications are encouraged to represent file names as
13(Unicode) character strings. Unfortunately, some file names may not be
14representable as strings on Unix, so applications that need to support
15arbitrary file names on Unix should use bytes objects to represent
16path names. Vice versa, using bytes objects cannot represent all file
17names on Windows (in the standard ``mbcs`` encoding), hence Windows
18applications should use string objects to access all files.
Georg Brandl116aa622007-08-15 14:28:22 +000019
R David Murraya4e700c2013-01-06 16:13:10 -050020Unlike a unix shell, Python does not do any *automatic* path expansions.
21Functions such as :func:`expanduser` and :func:`expandvars` can be invoked
22explicitly when an application desires shell-like path expansion. (See also
23the :mod:`glob` module.)
24
Georg Brandl76e55382008-10-08 16:34:57 +000025.. note::
26
27 All of these functions accept either only bytes or only string objects as
28 their parameters. The result is an object of the same type, if a path or
29 file name is returned.
30
Georg Brandl116aa622007-08-15 14:28:22 +000031
Benjamin Petersond23f8222009-04-05 19:13:16 +000032.. note::
33
34 Since different operating systems have different path name conventions, there
35 are several versions of this module in the standard library. The
36 :mod:`os.path` module is always the path module suitable for the operating
37 system Python is running on, and therefore usable for local paths. However,
38 you can also import and use the individual modules if you want to manipulate
39 a path that is *always* in one of the different formats. They all have the
40 same interface:
41
42 * :mod:`posixpath` for UNIX-style paths
43 * :mod:`ntpath` for Windows paths
44 * :mod:`macpath` for old-style MacOS paths
45 * :mod:`os2emxpath` for OS/2 EMX paths
46
47
Georg Brandl116aa622007-08-15 14:28:22 +000048.. function:: abspath(path)
49
50 Return a normalized absolutized version of the pathname *path*. On most
Chris Jerdonek0b502ff2012-11-25 20:38:01 -080051 platforms, this is equivalent to calling the function :func:`normpath` as
52 follows: ``normpath(join(os.getcwd(), path))``.
Georg Brandl116aa622007-08-15 14:28:22 +000053
Georg Brandl116aa622007-08-15 14:28:22 +000054
55.. function:: basename(path)
56
Chris Jerdonek0b502ff2012-11-25 20:38:01 -080057 Return the base name of pathname *path*. This is the second element of the
58 pair returned by passing *path* to the function :func:`split`. Note that
59 the result of this function is different
Georg Brandl116aa622007-08-15 14:28:22 +000060 from the Unix :program:`basename` program; where :program:`basename` for
61 ``'/foo/bar/'`` returns ``'bar'``, the :func:`basename` function returns an
62 empty string (``''``).
63
64
65.. function:: commonprefix(list)
66
67 Return the longest path prefix (taken character-by-character) that is a prefix
68 of all paths in *list*. If *list* is empty, return the empty string (``''``).
69 Note that this may return invalid paths because it works a character at a time.
70
71
72.. function:: dirname(path)
73
Chris Jerdonek0b502ff2012-11-25 20:38:01 -080074 Return the directory name of pathname *path*. This is the first element of
75 the pair returned by passing *path* to the function :func:`split`.
Georg Brandl116aa622007-08-15 14:28:22 +000076
77
78.. function:: exists(path)
79
80 Return ``True`` if *path* refers to an existing path. Returns ``False`` for
81 broken symbolic links. On some platforms, this function may return ``False`` if
82 permission is not granted to execute :func:`os.stat` on the requested file, even
83 if the *path* physically exists.
84
85
86.. function:: lexists(path)
87
88 Return ``True`` if *path* refers to an existing path. Returns ``True`` for
89 broken symbolic links. Equivalent to :func:`exists` on platforms lacking
90 :func:`os.lstat`.
91
Georg Brandl116aa622007-08-15 14:28:22 +000092
93.. function:: expanduser(path)
94
95 On Unix and Windows, return the argument with an initial component of ``~`` or
96 ``~user`` replaced by that *user*'s home directory.
97
98 .. index:: module: pwd
99
100 On Unix, an initial ``~`` is replaced by the environment variable :envvar:`HOME`
101 if it is set; otherwise the current user's home directory is looked up in the
102 password directory through the built-in module :mod:`pwd`. An initial ``~user``
103 is looked up directly in the password directory.
104
105 On Windows, :envvar:`HOME` and :envvar:`USERPROFILE` will be used if set,
106 otherwise a combination of :envvar:`HOMEPATH` and :envvar:`HOMEDRIVE` will be
107 used. An initial ``~user`` is handled by stripping the last directory component
108 from the created user path derived above.
109
110 If the expansion fails or if the path does not begin with a tilde, the path is
111 returned unchanged.
112
113
114.. function:: expandvars(path)
115
116 Return the argument with environment variables expanded. Substrings of the form
117 ``$name`` or ``${name}`` are replaced by the value of environment variable
118 *name*. Malformed variable names and references to non-existing variables are
119 left unchanged.
120
121 On Windows, ``%name%`` expansions are supported in addition to ``$name`` and
122 ``${name}``.
123
124
125.. function:: getatime(path)
126
127 Return the time of last access of *path*. The return value is a number giving
128 the number of seconds since the epoch (see the :mod:`time` module). Raise
129 :exc:`os.error` if the file does not exist or is inaccessible.
130
Georg Brandl55ac8f02007-09-01 13:51:09 +0000131 If :func:`os.stat_float_times` returns True, the result is a floating point
132 number.
Georg Brandl116aa622007-08-15 14:28:22 +0000133
134
135.. function:: getmtime(path)
136
137 Return the time of last modification of *path*. The return value is a number
138 giving the number of seconds since the epoch (see the :mod:`time` module).
139 Raise :exc:`os.error` if the file does not exist or is inaccessible.
140
Georg Brandl55ac8f02007-09-01 13:51:09 +0000141 If :func:`os.stat_float_times` returns True, the result is a floating point
142 number.
Georg Brandl116aa622007-08-15 14:28:22 +0000143
144
145.. function:: getctime(path)
146
147 Return the system's ctime which, on some systems (like Unix) is the time of the
148 last change, and, on others (like Windows), is the creation time for *path*.
149 The return value is a number giving the number of seconds since the epoch (see
150 the :mod:`time` module). Raise :exc:`os.error` if the file does not exist or
151 is inaccessible.
152
Georg Brandl116aa622007-08-15 14:28:22 +0000153
154.. function:: getsize(path)
155
156 Return the size, in bytes, of *path*. Raise :exc:`os.error` if the file does
157 not exist or is inaccessible.
158
Georg Brandl116aa622007-08-15 14:28:22 +0000159
160.. function:: isabs(path)
161
Christian Heimesaf98da12008-01-27 15:18:18 +0000162 Return ``True`` if *path* is an absolute pathname. On Unix, that means it
163 begins with a slash, on Windows that it begins with a (back)slash after chopping
164 off a potential drive letter.
Georg Brandl116aa622007-08-15 14:28:22 +0000165
166
167.. function:: isfile(path)
168
169 Return ``True`` if *path* is an existing regular file. This follows symbolic
170 links, so both :func:`islink` and :func:`isfile` can be true for the same path.
171
172
173.. function:: isdir(path)
174
175 Return ``True`` if *path* is an existing directory. This follows symbolic
176 links, so both :func:`islink` and :func:`isdir` can be true for the same path.
177
178
179.. function:: islink(path)
180
181 Return ``True`` if *path* refers to a directory entry that is a symbolic link.
182 Always ``False`` if symbolic links are not supported.
183
184
185.. function:: ismount(path)
186
187 Return ``True`` if pathname *path* is a :dfn:`mount point`: a point in a file
188 system where a different file system has been mounted. The function checks
189 whether *path*'s parent, :file:`path/..`, is on a different device than *path*,
190 or whether :file:`path/..` and *path* point to the same i-node on the same
191 device --- this should detect mount points for all Unix and POSIX variants.
192
193
194.. function:: join(path1[, path2[, ...]])
195
196 Join one or more path components intelligently. If any component is an absolute
197 path, all previous components (on Windows, including the previous drive letter,
198 if there was one) are thrown away, and joining continues. The return value is
199 the concatenation of *path1*, and optionally *path2*, etc., with exactly one
R David Murray24eb4bc2011-06-23 21:26:13 -0400200 directory separator (``os.sep``) following each non-empty part except the last.
201 (This means that an empty last part will result in a path that ends with a
202 separator.) Note that on Windows, since there is a current directory for
203 each drive, ``os.path.join("c:", "foo")`` represents a path relative to the
204 current directory on drive :file:`C:` (:file:`c:foo`), not :file:`c:\\foo`.
Georg Brandl116aa622007-08-15 14:28:22 +0000205
206
207.. function:: normcase(path)
208
Benjamin Petersond23f8222009-04-05 19:13:16 +0000209 Normalize the case of a pathname. On Unix and Mac OS X, this returns the
210 path unchanged; on case-insensitive filesystems, it converts the path to
211 lowercase. On Windows, it also converts forward slashes to backward slashes.
Ezio Melotti5a3ef5b2010-06-25 10:56:11 +0000212 Raise a TypeError if the type of *path* is not ``str`` or ``bytes``.
Georg Brandl116aa622007-08-15 14:28:22 +0000213
214
215.. function:: normpath(path)
216
Terry Jan Reedyec6e1322013-03-17 15:21:26 -0400217 Normalize a pathname by collapsing redundant separators and up-level
218 references so that ``A//B``, ``A/B/``, ``A/./B`` and ``A/foo/../B`` all
219 become ``A/B``. This string manipulation may change the meaning of a path
220 that contains symbolic links. On Windows, it converts forward slashes to
221 backward slashes. To normalize case, use :func:`normcase`.
Georg Brandl116aa622007-08-15 14:28:22 +0000222
223
224.. function:: realpath(path)
225
226 Return the canonical path of the specified filename, eliminating any symbolic
227 links encountered in the path (if they are supported by the operating system).
228
Georg Brandl116aa622007-08-15 14:28:22 +0000229
Georg Brandl18244152009-09-02 20:34:52 +0000230.. function:: relpath(path, start=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000231
232 Return a relative filepath to *path* either from the current directory or from
233 an optional *start* point.
234
Benjamin Petersonf650e462010-05-06 23:03:05 +0000235 *start* defaults to :attr:`os.curdir`.
236
Antoine Pitrouf10f1622010-12-12 20:17:29 +0000237 Availability: Unix, Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000238
Georg Brandl116aa622007-08-15 14:28:22 +0000239
240.. function:: samefile(path1, path2)
241
Brian Curtind40e6f72010-07-08 21:39:08 +0000242 Return ``True`` if both pathname arguments refer to the same file or directory.
243 On Unix, this is determined by the device number and i-node number and raises an
244 exception if a :func:`os.stat` call on either pathname fails.
Benjamin Petersonf650e462010-05-06 23:03:05 +0000245
Brian Curtind40e6f72010-07-08 21:39:08 +0000246 On Windows, two files are the same if they resolve to the same final path
Brian Curtinc7395692010-07-09 15:15:09 +0000247 name using the Windows API call GetFinalPathNameByHandle. This function
Brian Curtind40e6f72010-07-08 21:39:08 +0000248 raises an exception if handles cannot be obtained to either file.
249
Antoine Pitrouf10f1622010-12-12 20:17:29 +0000250 Availability: Unix, Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000251
Georg Brandlb3823372010-07-10 08:58:37 +0000252 .. versionchanged:: 3.2
253 Added Windows support.
Brian Curtinc7395692010-07-09 15:15:09 +0000254
Georg Brandl116aa622007-08-15 14:28:22 +0000255
256.. function:: sameopenfile(fp1, fp2)
257
258 Return ``True`` if the file descriptors *fp1* and *fp2* refer to the same file.
Benjamin Petersonf650e462010-05-06 23:03:05 +0000259
Brian Curtin62857742010-09-06 17:07:27 +0000260 Availability: Unix, Windows.
261
262 .. versionchanged:: 3.2 Added Windows support.
Georg Brandl116aa622007-08-15 14:28:22 +0000263
264
265.. function:: samestat(stat1, stat2)
266
267 Return ``True`` if the stat tuples *stat1* and *stat2* refer to the same file.
268 These structures may have been returned by :func:`fstat`, :func:`lstat`, or
269 :func:`stat`. This function implements the underlying comparison used by
Benjamin Petersonf650e462010-05-06 23:03:05 +0000270 :func:`samefile` and :func:`sameopenfile`.
271
272 Availability: Unix.
Georg Brandl116aa622007-08-15 14:28:22 +0000273
274
275.. function:: split(path)
276
Georg Brandl539c1652010-10-14 06:46:08 +0000277 Split the pathname *path* into a pair, ``(head, tail)`` where *tail* is the
278 last pathname component and *head* is everything leading up to that. The
279 *tail* part will never contain a slash; if *path* ends in a slash, *tail*
280 will be empty. If there is no slash in *path*, *head* will be empty. If
281 *path* is empty, both *head* and *tail* are empty. Trailing slashes are
282 stripped from *head* unless it is the root (one or more slashes only). In
283 all cases, ``join(head, tail)`` returns a path to the same location as *path*
Chris Jerdonek0b502ff2012-11-25 20:38:01 -0800284 (but the strings may differ). Also see the functions :func:`dirname` and
285 :func:`basename`.
Georg Brandl116aa622007-08-15 14:28:22 +0000286
287
288.. function:: splitdrive(path)
289
290 Split the pathname *path* into a pair ``(drive, tail)`` where *drive* is either
Mark Hammond5a607a32009-05-06 08:04:54 +0000291 a mount point or the empty string. On systems which do not use drive
Georg Brandl116aa622007-08-15 14:28:22 +0000292 specifications, *drive* will always be the empty string. In all cases, ``drive
293 + tail`` will be the same as *path*.
294
Mark Hammond5a607a32009-05-06 08:04:54 +0000295 On Windows, splits a pathname into drive/UNC sharepoint and relative path.
296
297 If the path contains a drive letter, drive will contain everything
298 up to and including the colon.
299 e.g. ``splitdrive("c:/dir")`` returns ``("c:", "/dir")``
300
301 If the path contains a UNC path, drive will contain the host name
302 and share, up to but not including the fourth separator.
303 e.g. ``splitdrive("//host/computer/dir")`` returns ``("//host/computer", "/dir")``
304
Georg Brandl116aa622007-08-15 14:28:22 +0000305
306.. function:: splitext(path)
307
308 Split the pathname *path* into a pair ``(root, ext)`` such that ``root + ext ==
309 path``, and *ext* is empty or begins with a period and contains at most one
310 period. Leading periods on the basename are ignored; ``splitext('.cshrc')``
311 returns ``('.cshrc', '')``.
312
Georg Brandl116aa622007-08-15 14:28:22 +0000313
314.. function:: splitunc(path)
315
Mark Hammond5a607a32009-05-06 08:04:54 +0000316 .. deprecated:: 3.1
317 Use *splitdrive* instead.
318
Georg Brandl116aa622007-08-15 14:28:22 +0000319 Split the pathname *path* into a pair ``(unc, rest)`` so that *unc* is the UNC
320 mount point (such as ``r'\\host\mount'``), if present, and *rest* the rest of
321 the path (such as ``r'\path\file.ext'``). For paths containing drive letters,
Benjamin Petersonf650e462010-05-06 23:03:05 +0000322 *unc* will always be the empty string.
323
324 Availability: Windows.
Georg Brandl116aa622007-08-15 14:28:22 +0000325
326
Georg Brandl116aa622007-08-15 14:28:22 +0000327.. data:: supports_unicode_filenames
328
329 True if arbitrary Unicode strings can be used as file names (within limitations
Victor Stinnerb55e4982010-09-11 00:22:12 +0000330 imposed by the file system).