\section{Standard Module \sectcode{rexec}}
% Committed by Guido van Rossum
\renewcommand{\indexsubitem}{(in module rexec)}

This module contains the \code{RExec} class, which supports
\code{r_exec()}, \code{r_eval()}, \code{r_execfile()}, and
\code{r_import()} methods, which are restricted versions of the standard
Python functions \code{exec()}, \code{eval()}, \code{execfile()}, and
the \code{import} statement.
Code executed in this restricted environment will
only have access to modules and functions that are deemed safe; you
can subclass \code{RExec} to add or remove capabilities as desired.

\emph{Note:} The \code{RExec} class can prevent code from performing
unsafe operations like reading or writing disk files, or using TCP/IP
sockets.  However, it does not protect against code using extremely
large amounts of memory or CPU time.  

\begin{funcdesc}{RExec}{\optional{hooks\optional{\, verbose}}}
Returns an instance of the \code{RExec} class.  

\var{hooks} is an instance of the \code{RHooks} class or a subclass of it.
If it is omitted or \code{None}, the default \code{RHooks} class is
instantiated.
Whenever the RExec module searches for a module (even a built-in one)
or reads a module's code, it doesn't actually go out to the file
system itself.  Rather, it calls methods of an RHooks instance that
was passed to or created by its constructor.  (Actually, the RExec
object doesn't make these calls---they are made by a module loader
object that's part of the RExec object.  This allows another level of
flexibility, e.g. using packages.)

By providing an alternate RHooks object, we can control the
file system accesses made to import a module, without changing the
actual algorithm that controls the order in which those accesses are
made.  For instance, we could substitute an RHooks object that passes
all filesystem requests to a file server elsewhere, via some RPC
mechanism such as ILU.  Grail's applet loader uses this to support
importing applets from a URL for a directory.

If \var{verbose} is true, additional debugging output may be sent to
standard output.
\end{funcdesc}

The RExec class has the following class attributes, which are used by the
\code{__init__} method.  Changing them on an existing instance won't
have any effect; instead, create a subclass of \code{RExec} and assign
them new values in the class definition.  Instances of the new class
will then use those new values.  All these attributes are tuples of
strings.
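
The effect of reading class attributes in \code{__init__} can be
illustrated with a minimal sketch.  It does not use the real
\code{RExec} class: \code{Sandbox}, \code{ok_names}, and
\code{is_allowed()} are hypothetical stand-ins, invented here only to
show why a subclass is needed and how a subclass can start from the
base class value and concatenate to it.

```python
class Sandbox:
    # Hypothetical stand-in for a class whose policy is a tuple of strings.
    ok_names = ('math', 'time')

    def __init__(self):
        # The class attribute is read once, when the instance is created.
        self._allowed = frozenset(self.ok_names)

    def is_allowed(self, name):
        return name in self._allowed


class RelaxedSandbox(Sandbox):
    # Start from the base class value and concatenate the additions.
    ok_names = Sandbox.ok_names + ('struct',)


box = Sandbox()
# Assigning on an existing instance comes too late: __init__ already ran.
box.ok_names = box.ok_names + ('struct',)
print(box.is_allowed('struct'))
print(RelaxedSandbox().is_allowed('struct'))
```

In this sketch the first call reports that \code{'struct'} is still
refused, while the instance of the subclass accepts it.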

\renewcommand{\indexsubitem}{(RExec object attribute)}
Contains the names of built-in functions which will \emph{not} be
available to programs running in the restricted environment.  The
value for \code{RExec} is \code{('open',} \code{'reload',}
\code{'__import__')}.  (This gives the exceptions, because by far the
majority of built-in functions are harmless.  A subclass that wants to
override this variable should probably start with the value from the
base class and concatenate additional forbidden functions --- when new
dangerous built-in functions are added to Python, they will also be
added to this module.)

Contains the names of built-in modules which can be safely imported.
The value for \code{RExec} is \code{('audioop',} \code{'array',}
\code{'binascii',} \code{'cmath',} \code{'errno',} \code{'imageop',}
\code{'marshal',} \code{'math',} \code{'md5',} \code{'operator',}
\code{'parser',} \code{'regex',} \code{'rotor',} \code{'select',}
\code{'strop',} \code{'struct',} \code{'time')}.  A similar remark
about overriding this variable applies --- use the value from the base
class as a starting point.

Contains the directories which will be searched when an \code{import}
is performed in the restricted environment.  
The value for \code{RExec} is the same as \code{sys.path} (at the time
the module is loaded) for unrestricted code.

% Should this be called ok_os_names?
Contains the names of the functions in the \code{os} module which will be
available to programs running in the restricted environment.  The
value for \code{RExec} is \code{('error',} \code{'fstat',}
\code{'listdir',} \code{'lstat',} \code{'readlink',} \code{'stat',}
\code{'times',} \code{'uname',} \code{'getpid',} \code{'getppid',}
\code{'getcwd',} \code{'getuid',} \code{'getgid',} \code{'geteuid',}

Contains the names of the functions and variables in the \code{sys}
module which will be available to programs running in the restricted
environment.  The value for \code{RExec} is \code{('ps1',}
\code{'ps2',} \code{'copyright',} \code{'version',} \code{'platform',}
\code{'exit',} \code{'maxint')}.

RExec instances support the following methods:
\renewcommand{\indexsubitem}{(RExec object method)}

\var{code} must either be a string containing a Python expression, or
a compiled code object, which will be evaluated in the restricted
environment's \code{__main__} module.  The value of the expression or
code object will be returned.

\var{code} must either be a string containing one or more lines of
Python code, or a compiled code object, which will be executed in the
restricted environment's \code{__main__} module.

Execute the Python code contained in the file \var{filename} in the
restricted environment's \code{__main__} module.

Methods whose names begin with \code{s_} are similar to the functions
beginning with \code{r_}, but the code will be granted access to
restricted versions of the standard I/O streams \code{sys.stdin},
\code{sys.stderr}, and \code{sys.stdout}.

\var{code} must be a string containing a Python expression, which will
be evaluated in the restricted environment.  

\var{code} must be a string containing one or more lines of Python code,
which will be executed in the restricted environment.  

Execute the Python code contained in the file \var{filename} in the
restricted environment.

\code{RExec} objects must also support various methods which will be
implicitly called by code executing in the restricted environment.
Override these methods in a subclass to change the policies
enforced by a restricted environment.

\begin{funcdesc}{r_import}{modulename\optional{\, globals\, locals\, fromlist}}
Import the module \var{modulename}, raising an \code{ImportError}
exception if the module is considered unsafe.
\end{funcdesc}

\begin{funcdesc}{r_open}{filename\optional{\, mode\optional{\, bufsize}}}
Method called when \code{open()} is called in the restricted
environment.  The arguments are identical to those of \code{open()},
and a file object (or a class instance compatible with file objects)
should be returned.  \code{RExec}'s default behaviour is to allow opening
any file for reading, but to forbid any attempt to write a file.  See
the example below for an implementation of a less restrictive
\code{r_open()}.
\end{funcdesc}

Reload the module object \var{module}, re-parsing and re-initializing it.  

Unload the module object \var{module} (i.e., remove it from the
restricted environment's \code{sys.modules} dictionary).

And their equivalents with access to restricted standard I/O streams:

\begin{funcdesc}{s_import}{modulename\optional{\, globals\, locals\, fromlist}}
Import the module \var{modulename}, raising an \code{ImportError}
exception if the module is considered unsafe.
\end{funcdesc}

Reload the module object \var{module}, re-parsing and re-initializing it.  

Unload the module object \var{module}.   
% XXX what are the semantics of this?  

\subsection{An example}

Let us say that we want a slightly more relaxed policy than the
standard RExec class.  For example, if we're willing to allow files in
\file{/tmp} to be written, we can subclass the \code{RExec} class:

\begin{verbatim}
import rexec, string

class TmpWriterRExec(rexec.RExec):
    def r_open(self, file, mode='r', buf=-1):
        if mode in ('r', 'rb'):
            pass    # reading is always allowed
        elif mode in ('w', 'wb', 'a', 'ab'):
            # check filename: must begin with /tmp/
            if file[:5] != '/tmp/':
                raise IOError, "can't write outside /tmp"
            elif (string.find(file, '/../') >= 0 or
                  file[:3] == '../' or file[-3:] == '/..'):
                raise IOError, "'..' in filename forbidden"
        else:
            raise IOError, "Illegal open() mode"
        return open(file, mode, buf)
\end{verbatim}
Notice that the above code will occasionally forbid a perfectly valid
filename; for example, code in the restricted environment won't be
able to open a file called \file{/tmp/foo/../bar}.  To fix this, the
\code{r_open} method would have to simplify the filename to
\file{/tmp/bar}, which would require splitting apart the filename and
performing various operations on it.  In cases where security is at
stake, it may be preferable to write simple code which is sometimes
overly restrictive, instead of more general code that is also more
complex and may harbor a subtle security hole.
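
For comparison, the simplification step mentioned above could be
sketched as follows.  This is only an illustration under stated
assumptions: \code{is_writable_tmp_path()} is a hypothetical helper,
not part of \code{RExec}, and it relies on \code{posixpath.normpath()},
which collapses \file{..} components purely textually and does not
resolve symbolic links, so a link inside \file{/tmp} could still point
elsewhere.

```python
import posixpath


def is_writable_tmp_path(filename):
    """Return True if filename, once simplified, lies inside /tmp/.

    Hypothetical helper for illustration; it does not consult the
    file system and therefore does not follow symbolic links.
    """
    simplified = posixpath.normpath(filename)
    return simplified.startswith('/tmp/')


print(is_writable_tmp_path('/tmp/foo/../bar'))
print(is_writable_tmp_path('/tmp/../etc/passwd'))
```

Under these assumptions the first name is accepted, because it
simplifies to \file{/tmp/bar}, and the second is refused.  The caveat
about symbolic links shows the kind of subtlety that the simpler, more
restrictive check avoids.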