Blame - Doc/lib/librexec.tex - platform/external/python/cpython2

blob: 32a565fdbeea34b5eee30263329efd396e9f1bfd [file] [log] [blame]

Guido van Rossum	be0a8a6	1996-09-10 17:37:05 +0000	[diff] [blame]	1	\section{Standard Module \sectcode{rexec}}
				2	\stmodindex{rexec}
				3	\renewcommand{\indexsubitem}{(in module rexec)}
				4
Guido van Rossum	095538d	1996-10-22 01:11:19 +0000	[diff] [blame]	5	This module contains the \code{RExec} class, which supports
				6	\code{r_exec()}, \code{r_eval()}, \code{r_execfile()}, and
				7	\code{r_import()} methods, which are restricted versions of the standard
				8	Python functions \code{exec()}, \code{eval()}, \code{execfile()}, and
				9	\code{import()}. Code executed in this restricted environment will
				10	only have access to modules and functions that are deemed safe; you
				11	can subclass \code{RExec} to add or remove capabilities as desired.
				12
				13	\emph{Note:} The \code{RExec} class can prevent code from performing
				14	unsafe operations like reading or writing disk files, or using TCP/IP
				15	sockets. However, it does not protect against code using extremely
				16	large amounts of memory or CPU time.
				17	% XXX is there any protection against this?
				18
				19	\begin{funcdesc}{RExec}{\optional{hooks\, verbose} }
				20	Returns an instance of the \code{RExec} class.
				21
				22	% XXX is ihooks.py documented? If yes, there should be a ref here
				23
				24	\var{hooks} is an instance of the \code{RHooks} class or a subclass of it.
				25	Whenever the RExec module searches for a module (even a built-in one)
				26	or reads a module's code, it doesn't actually go out to the file
				27	system itself. Rather, it calls methods of an RHooks instance that
				28	was passed to or created by its constructor. (Actually, the RExec
				29	object doesn't make these calls---they are made by a module loader
				30	object that's part of the RExec object. This allows another level of
				31	flexibility, e.g. using packages.)
				32
				33	By providing an alternate RHooks object, we can control the actual
				34	file system accesses made to import a module, without changing the
				35	actual algorithm that controls the order in which those accesses are
				36	made. For instance, we could substitute an RHooks object that passes
				37	all filesystem requests to a file server elsewhere, via some RPC
				38	mechanism such as ILU. Grail's applet loader uses this to support
				39	importing applets from a URL for a directory.
				40
				41	% XXX does verbose actually do anything at the moment?
				42	If \var{verbose} is true, additional debugging output will be sent to
				43	standard output.
				44	\end{funcdesc}
				45
				46	RExec instances have the following attributes, which are used by the
				47	\code{__init__} method. Changing them on an existing instance won't
				48	have any effect; instead, create a subclass of \code{RExec} and assign
				49	them new values in the class definition. Instances of the new class
				50	will then use those new values. All these attributes are tuples of
				51	strings.
				52
				53	\renewcommand{\indexsubitem}{(RExec object attribute)}
				54	\begin{datadesc}{nok_builtin_names}
				55	Contains the names of built-in functions which will \emph{not} be
				56	available to programs running in the restricted environment. The
				57	value for \code{RExec} is \code{('open',} \code{reload',}
				58	\code{__import__')}.
				59	\end{datadesc}
				60
				61	\begin{datadesc}{ok_builtin_modules}
				62	Contains the names of built-in modules which can be safely imported.
				63	The value for \code{RExec} is \code{('array',} \code{'binascii',} \code{'audioop',}
				64	\code{'imageop',} \code{'marshal',} \code{'math',} \code{'md5',} \code{'parser',} \code{'regex',} \code{'rotor',}
				65	\code{'select',} \code{'strop',} \code{'struct',} \code{'time')}.
				66	\end{datadesc}
				67
				68	\begin{datadesc}{ok_path}
				69	Contains the directories which will be searched when an \code{import}
				70	is performed in the restricted environment.
				71	The value for \code{RExec} is the same as \code{sys.path} for
				72	unrestricted code.
				73	\end{datadesc}
				74
				75	\begin{datadesc}{ok_posix_names}
				76	% Should this be called ok_os_names?
				77	Contains the names of the functions in the \code{os} module which will be
				78	available to programs running in the restricted environment. The
				79	value for \code{RExec} is \code{('error',} \code{'fstat',}
				80	\code{'listdir',} \code{'lstat',} \code{'readlink',} \code{'stat',}
				81	\code{'times',} \code{'uname',} \code{'getpid',} \code{'getppid',}
				82	\code{'getcwd',} \code{'getuid',} \code{'getgid',} \code{'geteuid',}
				83	\code{'getegid')}.
				84	\end{datadesc}
				85
				86	\begin{datadesc}{ok_sys_names}
				87	Contains the names of the functions and variables in the \code{sys} module which will be
				88	available to programs running in the restricted environment. The
				89	value for \code{RExec} is \code{('ps1',} \code{'ps2',}
				90	\code{'copyright',} \code{'version',} \code{'platform',} \code{'exit',}
				91	\code{'maxint')}.
				92	\end{datadesc}
				93
				94	RExec instances support the following methods:
				95	\renewcommand{\indexsubitem}{(RExec object method)}
				96
				97	\begin{funcdesc}{r_eval}{code}
				98	\var{code} must either be a string containing a Python expression, or a compiled code object, which will
				99	be evaluated in the restricted environment. The value of the expression or code object will be returned.
				100	\end{funcdesc}
				101
				102	\begin{funcdesc}{r_exec}{code}
				103	\var{code} must either be a string containing one or more lines of Python code, or a compiled code object,
				104	which will be executed in the restricted environment.
				105	\end{funcdesc}
				106
				107	\begin{funcdesc}{r_execfile}{filename}
				108	Execute the Python code contained in the file \var{filename} in the
				109	restricted environment.
				110	\end{funcdesc}
				111
				112	Methods whose names begin with \code{s_} are similar to the functions
				113	beginning with \code{r_}, but the code will be granted access to
				114	restricted versions of \code{sys.stdin}, \code{sys.stderr}, and
				115	\code{sys.stdout}.
				116
				117	\begin{funcdesc}{s_eval}{code}
				118	\var{code} must be a string containing a Python expression, which will
				119	be evaluated in the restricted environment.
				120	\end{funcdesc}
				121
				122	\begin{funcdesc}{s_exec}{code}
				123	\var{code} must be a string containing one or more lines of Python code,
				124	which will be executed in the restricted environment.
				125	\end{funcdesc}
				126
				127	\begin{funcdesc}{s_execfile}{code}
				128	Execute the Python code contained in the file \var{filename} in the
				129	restricted environment.
				130	\end{funcdesc}
				131
				132	\code{RExec} objects must also support various methods which will be implicitly called
				133	by code executing in the restricted environment. Overriding these
				134	methods in a subclass is used to change the policies enforced by a restricted environment.
				135
				136	\begin{funcdesc}{r_import}{modulename\optional{\, globals, locals, fromlist}}
				137	Import the module \var{modulename}, raising an \code{ImportError} exception
				138	if the module is considered unsafe.
				139	\end{funcdesc}
				140
				141	\begin{funcdesc}{r_open}{filename\optional{\, mode\optional{\, bufsize}}}
				142	Method called when \code{open()} is called in the restricted
				143	environment. The arguments are identical to those of \code{open()},
				144	and a file object (or a class instance compatible with file objects)
				145	should be returned. \code{RExec}'s default behaviour is allow opening
				146	any file for reading, but forbidding any attempt to write a file. See
				147	the example below for an implementation of a less restrictive \code{r_open()}.
				148	\end{funcdesc}
				149
				150	\begin{funcdesc}{r_reload}{module}
				151	Reload the module object \var{module}, re-parsing and re-initializing it.
				152	\end{funcdesc}
				153
				154	\begin{funcdesc}{r_unload}{module}
				155	Unload the module object \var{module}.
				156	% XXX what are the semantics of this?
				157	\end{funcdesc}
				158
				159	\begin{funcdesc}{s_import}{modulename\optional{\, globals, locals, fromlist}}
				160	Import the module \var{modulename}, raising an \code{ImportError} exception
				161	if the module is considered unsafe.
				162	\end{funcdesc}
				163
				164	\begin{funcdesc}{s_reload}{module}
				165	Reload the module object \var{module}, re-parsing and re-initializing it.
				166	\end{funcdesc}
				167
				168	\begin{funcdesc}{s_unload}{module}
				169	Unload the module object \var{module}.
				170	% XXX what are the semantics of this?
				171	\end{funcdesc}
				172
				173	\subsection{An example}
				174
				175	Let us say that we want a slightly more relaxed policy than the
				176	standard RExec class. For example, if we're willing to allow files in
				177	\file{/tmp} to be written, we can subclass the \code{RExec} class:
				178
				179	\bcode\begin{verbatim}
				180	class TmpWriterRExec(rexec.RExec):
				181	def r_open(self, file, mode='r', buf=-1):
				182	if mode in ('r', 'rb'): pass
				183	elif mode in ('w', 'wb'):
				184	# check filename : must begin with /tmp/
				185	if file[0:5]!='/tmp/':
				186	raise IOError, "can't open files for writing outside of /tmp"
				187	elif string.find(file, '/../')!=-1:
				188	raise IOError, "'..' in filename; open for writing forbidden"
				189	return open(file, mode, buf)
				190	\end{verbatim}\ecode
				191
				192	Notice that the above code will occasionally forbid a perfectly valid
				193	filename; for example, code in the restricted environment won't be
				194	able to open a file called \file{/tmp/foo/../bar}. To fix this, the
				195	\code{r_open} method would have to simplify the filename to
				196	\file{/tmp/bar}, which would require splitting apart the filename and
				197	performing various operations on it. In cases where security is at
				198	stake, it may be preferable to write simple code which is sometimes
				199	overly restrictive, instead of more general code that is also more
				200	complex and may harbor a subtle security hole.