blob: 78256390b7c4794863973a5a5fb2e4cc4facd82e [file] [log] [blame]
Fred Drakeb23ee1d1999-02-01 20:20:39 +00001\section{\module{os.path} ---
2 Common pathname manipulations}
3\declaremodule{standard}{os.path}
Fred Drakeb91e9341998-07-23 17:59:49 +00004
Fred Drakeb23ee1d1999-02-01 20:20:39 +00005\modulesynopsis{Common pathname manipulations.}
Guido van Rossum470be141995-03-17 16:07:09 +00006
Fred Drakeb23ee1d1999-02-01 20:20:39 +00007This module implements some useful functions on pathnames.
Fred Drake203b4f11998-05-14 15:16:12 +00008\index{path!operations}
Guido van Rossum470be141995-03-17 16:07:09 +00009
Fred Drake0aa811c2001-10-20 04:24:09 +000010\warning{On Windows, many of these functions do not properly
Fred Drakebbf7a402001-09-28 16:14:18 +000011support UNC pathnames. \function{splitunc()} and \function{ismount()}
Fred Drake0aa811c2001-10-20 04:24:09 +000012do handle them correctly.}
Fred Drakebbf7a402001-09-28 16:14:18 +000013
Fred Drakeb23ee1d1999-02-01 20:20:39 +000014
Fred Drakea9b9bf91999-02-02 18:58:33 +000015\begin{funcdesc}{abspath}{path}
16Return a normalized absolutized version of the pathname \var{path}.
17On most platforms, this is equivalent to
Fred Drake39d4a021999-10-18 14:10:06 +000018\code{normpath(join(os.getcwd(), \var{path}))}.
Fred Drake154d9091999-03-17 22:25:11 +000019\versionadded{1.5.2}
Guido van Rossum1804dc31999-01-29 18:05:05 +000020\end{funcdesc}
21
Fred Drakea9b9bf91999-02-02 18:58:33 +000022\begin{funcdesc}{basename}{path}
23Return the base name of pathname \var{path}. This is the second half
Fred Drake3aecfc92000-10-26 21:38:23 +000024of the pair returned by \code{split(\var{path})}. Note that the
25result of this function is different from the
26\UNIX{} \program{basename} program; where \program{basename} for
27\code{'/foo/bar/'} returns \code{'bar'}, the \function{basename()}
28function returns an empty string (\code{''}).
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000029\end{funcdesc}
30
31\begin{funcdesc}{commonprefix}{list}
Skip Montanaro297bf7c2000-08-23 16:58:32 +000032Return the longest path prefix (taken character-by-character) that is a
33prefix of all paths in
Fred Drakeb23ee1d1999-02-01 20:20:39 +000034\var{list}. If \var{list} is empty, return the empty string
Skip Montanaro297bf7c2000-08-23 16:58:32 +000035(\code{''}). Note that this may return invalid paths because it works a
36character at a time.
Fred Drakeb23ee1d1999-02-01 20:20:39 +000037\end{funcdesc}
38
Fred Drakea9b9bf91999-02-02 18:58:33 +000039\begin{funcdesc}{dirname}{path}
40Return the directory name of pathname \var{path}. This is the first
41half of the pair returned by \code{split(\var{path})}.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000042\end{funcdesc}
43
Fred Drakea9b9bf91999-02-02 18:58:33 +000044\begin{funcdesc}{exists}{path}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +000045Return \code{True} if \var{path} refers to an existing path.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000046\end{funcdesc}
47
Fred Drakea9b9bf91999-02-02 18:58:33 +000048\begin{funcdesc}{expanduser}{path}
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000049Return the argument with an initial component of \samp{\~} or
50\samp{\~\var{user}} replaced by that \var{user}'s home directory. An
Fred Drake203b4f11998-05-14 15:16:12 +000051initial \samp{\~{}} is replaced by the environment variable
Fred Drake23a16341998-08-06 15:33:55 +000052\envvar{HOME}; an initial \samp{\~\var{user}} is looked up in the
53password directory through the built-in module
Fred Drakeb23ee1d1999-02-01 20:20:39 +000054\refmodule{pwd}\refbimodindex{pwd}. If the expansion fails, or if the
55path does not begin with a tilde, the path is returned unchanged. On
Fred Drakea9b9bf91999-02-02 18:58:33 +000056the Macintosh, this always returns \var{path} unchanged.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000057\end{funcdesc}
58
Fred Drakea9b9bf91999-02-02 18:58:33 +000059\begin{funcdesc}{expandvars}{path}
Guido van Rossum17383111994-04-21 10:32:28 +000060Return the argument with environment variables expanded. Substrings
61of the form \samp{\$\var{name}} or \samp{\$\{\var{name}\}} are
62replaced by the value of environment variable \var{name}. Malformed
63variable names and references to non-existing variables are left
Fred Drakea9b9bf91999-02-02 18:58:33 +000064unchanged. On the Macintosh, this always returns \var{path}
65unchanged.
Guido van Rossum17383111994-04-21 10:32:28 +000066\end{funcdesc}
67
Fred Draked8a41e61999-02-19 17:54:10 +000068\begin{funcdesc}{getatime}{path}
69Return the time of last access of \var{filename}. The return
70value is integer giving the number of seconds since the epoch (see the
71\refmodule{time} module). Raise \exception{os.error} if the file does
72not exist or is inaccessible.
Fred Drakea9b9bf91999-02-02 18:58:33 +000073\versionadded{1.5.2}
Guido van Rossum2babd7b1998-07-24 20:49:39 +000074\end{funcdesc}
75
Fred Drakea9b9bf91999-02-02 18:58:33 +000076\begin{funcdesc}{getmtime}{path}
Guido van Rossum2babd7b1998-07-24 20:49:39 +000077Return the time of last modification of \var{filename}. The return
78value is integer giving the number of seconds since the epoch (see the
Fred Drakeb23ee1d1999-02-01 20:20:39 +000079\refmodule{time} module). Raise \exception{os.error} if the file does
80not exist or is inaccessible.
Fred Drakea9b9bf91999-02-02 18:58:33 +000081\versionadded{1.5.2}
Guido van Rossum2babd7b1998-07-24 20:49:39 +000082\end{funcdesc}
83
Fred Draked8a41e61999-02-19 17:54:10 +000084\begin{funcdesc}{getsize}{path}
85Return the size, in bytes, of \var{filename}. Raise
86\exception{os.error} if the file does not exist or is inaccessible.
Fred Drakea9b9bf91999-02-02 18:58:33 +000087\versionadded{1.5.2}
Guido van Rossum2babd7b1998-07-24 20:49:39 +000088\end{funcdesc}
89
Fred Drakea9b9bf91999-02-02 18:58:33 +000090\begin{funcdesc}{isabs}{path}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +000091Return \code{True} if \var{path} is an absolute pathname (begins with a
Fred Drakea9b9bf91999-02-02 18:58:33 +000092slash).
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000093\end{funcdesc}
94
Fred Drakea9b9bf91999-02-02 18:58:33 +000095\begin{funcdesc}{isfile}{path}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +000096Return \code{True} if \var{path} is an existing regular file. This follows
Fred Drakedb9693e1998-03-11 05:50:42 +000097symbolic links, so both \function{islink()} and \function{isfile()}
98can be true for the same path.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000099\end{funcdesc}
100
Fred Drakea9b9bf91999-02-02 18:58:33 +0000101\begin{funcdesc}{isdir}{path}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +0000102Return \code{True} if \var{path} is an existing directory. This follows
Fred Drakedb9693e1998-03-11 05:50:42 +0000103symbolic links, so both \function{islink()} and \function{isdir()} can
104be true for the same path.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000105\end{funcdesc}
106
Fred Drakea9b9bf91999-02-02 18:58:33 +0000107\begin{funcdesc}{islink}{path}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +0000108Return \code{True} if \var{path} refers to a directory entry that is a
109symbolic link. Always \code{False} if symbolic links are not supported.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000110\end{funcdesc}
111
Fred Drakea9b9bf91999-02-02 18:58:33 +0000112\begin{funcdesc}{ismount}{path}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +0000113Return \code{True} if pathname \var{path} is a \dfn{mount point}: a point in
Fred Drakea9b9bf91999-02-02 18:58:33 +0000114a file system where a different file system has been mounted. The
115function checks whether \var{path}'s parent, \file{\var{path}/..}, is
116on a different device than \var{path}, or whether \file{\var{path}/..}
117and \var{path} point to the same i-node on the same device --- this
118should detect mount points for all \UNIX{} and \POSIX{} variants.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000119\end{funcdesc}
120
Fred Drakea9b9bf91999-02-02 18:58:33 +0000121\begin{funcdesc}{join}{path1\optional{, path2\optional{, ...}}}
Barry Warsaw75745871997-02-18 21:53:53 +0000122Joins one or more path components intelligently. If any component is
123an absolute path, all previous components are thrown away, and joining
Fred Drakea9b9bf91999-02-02 18:58:33 +0000124continues. The return value is the concatenation of \var{path1}, and
Fred Drakec1ddc502002-09-12 18:01:26 +0000125optionally \var{path2}, etc., with exactly one directory separator
Martin v. Löwis04791042002-12-11 12:55:53 +0000126(\code{os.sep}) inserted between components, unless \var{path2} is
Fred Drakec1ddc502002-09-12 18:01:26 +0000127empty. Note that on Windows, since there is a current directory for
128each drive, \function{os.path.join("c:", "foo")} represents a path
129relative to the current directory on drive \file{C:} (\file{c:foo}), not
130\file{c:\textbackslash\textbackslash foo}.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000131\end{funcdesc}
132
Fred Drakea9b9bf91999-02-02 18:58:33 +0000133\begin{funcdesc}{normcase}{path}
Fred Drakec37b65e2001-11-28 07:26:15 +0000134Normalize the case of a pathname. On \UNIX, this returns the path
Guido van Rossum1931c0c1998-02-18 14:00:05 +0000135unchanged; on case-insensitive filesystems, it converts the path to
136lowercase. On Windows, it also converts forward slashes to backward
137slashes.
138\end{funcdesc}
139
Fred Drakea9b9bf91999-02-02 18:58:33 +0000140\begin{funcdesc}{normpath}{path}
Guido van Rossum1931c0c1998-02-18 14:00:05 +0000141Normalize a pathname. This collapses redundant separators and
142up-level references, e.g. \code{A//B}, \code{A/./B} and
143\code{A/foo/../B} all become \code{A/B}. It does not normalize the
Fred Drake38e5d272000-04-03 20:13:55 +0000144case (use \function{normcase()} for that). On Windows, it converts
145forward slashes to backward slashes.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000146\end{funcdesc}
147
Guido van Rossum83eeef42001-09-17 15:16:09 +0000148\begin{funcdesc}{realpath}{path}
149Return the canonical path of the specified filename, eliminating any
150symbolic links encountered in the path.
Fred Drakec37b65e2001-11-28 07:26:15 +0000151Availability: \UNIX.
Guido van Rossum83eeef42001-09-17 15:16:09 +0000152\versionadded{2.2}
153\end{funcdesc}
154
Fred Drakea9b9bf91999-02-02 18:58:33 +0000155\begin{funcdesc}{samefile}{path1, path2}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +0000156Return \code{True} if both pathname arguments refer to the same file or
Fred Drakedb9693e1998-03-11 05:50:42 +0000157directory (as indicated by device number and i-node number).
158Raise an exception if a \function{os.stat()} call on either pathname
159fails.
Fred Drakec37b65e2001-11-28 07:26:15 +0000160Availability: Macintosh, \UNIX.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000161\end{funcdesc}
162
Fred Draked673d481999-02-03 22:31:30 +0000163\begin{funcdesc}{sameopenfile}{fp1, fp2}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +0000164Return \code{True} if the file objects \var{fp1} and \var{fp2} refer to the
Fred Draked673d481999-02-03 22:31:30 +0000165same file. The two file objects may represent different file
166descriptors.
Fred Drakec37b65e2001-11-28 07:26:15 +0000167Availability: Macintosh, \UNIX.
Fred Draked673d481999-02-03 22:31:30 +0000168\end{funcdesc}
169
170\begin{funcdesc}{samestat}{stat1, stat2}
Neal Norwitzd3dab2b2002-04-05 02:21:09 +0000171Return \code{True} if the stat tuples \var{stat1} and \var{stat2} refer to
Fred Draked673d481999-02-03 22:31:30 +0000172the same file. These structures may have been returned by
173\function{fstat()}, \function{lstat()}, or \function{stat()}. This
174function implements the underlying comparison used by
175\function{samefile()} and \function{sameopenfile()}.
Fred Drakec37b65e2001-11-28 07:26:15 +0000176Availability: Macintosh, \UNIX.
Fred Draked673d481999-02-03 22:31:30 +0000177\end{funcdesc}
178
Fred Drakea9b9bf91999-02-02 18:58:33 +0000179\begin{funcdesc}{split}{path}
Fred Draked673d481999-02-03 22:31:30 +0000180Split the pathname \var{path} into a pair, \code{(\var{head},
181\var{tail})} where \var{tail} is the last pathname component and
Fred Drakea9b9bf91999-02-02 18:58:33 +0000182\var{head} is everything leading up to that. The \var{tail} part will
183never contain a slash; if \var{path} ends in a slash, \var{tail} will
184be empty. If there is no slash in \var{path}, \var{head} will be
185empty. If \var{path} is empty, both \var{head} and \var{tail} are
186empty. Trailing slashes are stripped from \var{head} unless it is the
187root (one or more slashes only). In nearly all cases,
188\code{join(\var{head}, \var{tail})} equals \var{path} (the only
189exception being when there were multiple slashes separating \var{head}
190from \var{tail}).
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000191\end{funcdesc}
192
Fred Drake0256c1f1999-02-03 19:24:44 +0000193\begin{funcdesc}{splitdrive}{path}
194Split the pathname \var{path} into a pair \code{(\var{drive},
Fred Draked673d481999-02-03 22:31:30 +0000195\var{tail})} where \var{drive} is either a drive specification or the
Fred Drake0256c1f1999-02-03 19:24:44 +0000196empty string. On systems which do not use drive specifications,
197\var{drive} will always be the empty string. In all cases,
198\code{\var{drive} + \var{tail}} will be the same as \var{path}.
Fred Drake56a71ee2001-05-25 16:21:00 +0000199\versionadded{1.3}
Fred Drake0256c1f1999-02-03 19:24:44 +0000200\end{funcdesc}
201
Fred Drakea9b9bf91999-02-02 18:58:33 +0000202\begin{funcdesc}{splitext}{path}
Fred Drake0256c1f1999-02-03 19:24:44 +0000203Split the pathname \var{path} into a pair \code{(\var{root}, \var{ext})}
Fred Drakea9b9bf91999-02-02 18:58:33 +0000204such that \code{\var{root} + \var{ext} == \var{path}},
Guido van Rossum56b30ea1996-08-19 23:00:50 +0000205and \var{ext} is empty or begins with a period and contains
206at most one period.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000207\end{funcdesc}
208
Fred Drakea9b9bf91999-02-02 18:58:33 +0000209\begin{funcdesc}{walk}{path, visit, arg}
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000210Calls the function \var{visit} with arguments
211\code{(\var{arg}, \var{dirname}, \var{names})} for each directory in the
Fred Drakea9b9bf91999-02-02 18:58:33 +0000212directory tree rooted at \var{path} (including \var{path} itself, if it
213is a directory). The argument \var{dirname} specifies the visited
214directory, the argument \var{names} lists the files in the directory
215(gotten from \code{os.listdir(\var{dirname})}).
Guido van Rossume8e87991997-03-25 15:25:54 +0000216The \var{visit} function may modify \var{names} to
Guido van Rossum470be141995-03-17 16:07:09 +0000217influence the set of directories visited below \var{dirname}, e.g., to
218avoid visiting certain parts of the tree. (The object referred to by
Fred Drakedb9693e1998-03-11 05:50:42 +0000219\var{names} must be modified in place, using \keyword{del} or slice
Guido van Rossum470be141995-03-17 16:07:09 +0000220assignment.)
Steve Holden545092b2002-08-06 16:07:07 +0000221
Mark Hammond8696ebc2002-10-08 02:44:31 +0000222\begin{datadesc}{supports_unicode_filenames}
223True if arbitrary Unicode strings can be used as file names (within
224limitations imposed by the file system), and if os.listdir returns
225Unicode strings for a Unicode argument.
226\versionadded{2.3}
227\end{datadesc}
228
Fred Drake95fa4dd2002-08-07 12:39:33 +0000229\begin{notice}
230Symbolic links to directories are not treated as subdirectories, and
231that \function{walk()} therefore will not visit them. To visit linked
232directories you must identify them with
233\code{os.path.islink(\var{file})} and
234\code{os.path.isdir(\var{file})}, and invoke \function{walk()} as
235necessary.
236\end{notice}
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000237\end{funcdesc}