| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 1 | \section{Standard Module \sectcode{cgi}} | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 2 | \label{module-cgi} | 
| Guido van Rossum | a12ef94 | 1995-02-27 17:53:25 +0000 | [diff] [blame] | 3 | \stmodindex{cgi} | 
|  | 4 | \indexii{WWW}{server} | 
|  | 5 | \indexii{CGI}{protocol} | 
|  | 6 | \indexii{HTTP}{protocol} | 
|  | 7 | \indexii{MIME}{headers} | 
|  | 8 | \index{URL} | 
|  | 9 |  | 
| Guido van Rossum | 8675115 | 1995-02-28 17:14:32 +0000 | [diff] [blame] | 10 | \renewcommand{\indexsubitem}{(in module cgi)} | 
|  | 11 |  | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 12 | Support module for CGI (Common Gateway Interface) scripts. | 
| Guido van Rossum | a12ef94 | 1995-02-27 17:53:25 +0000 | [diff] [blame] | 13 |  | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 14 | This module defines a number of utilities for use by CGI scripts | 
|  | 15 | written in Python. | 
| Guido van Rossum | a12ef94 | 1995-02-27 17:53:25 +0000 | [diff] [blame] | 16 |  | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 17 | \subsection{Introduction} | 
|  | 18 | \nodename{Introduction to the CGI module} | 
| Guido van Rossum | a12ef94 | 1995-02-27 17:53:25 +0000 | [diff] [blame] | 19 |  | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 20 | A CGI script is invoked by an HTTP server, usually to process user | 
|  | 21 | input submitted through an HTML \code{<FORM>} or \code{<ISINPUT>} element. | 
|  | 22 |  | 
|  | 23 | Most often, CGI scripts live in the server's special \code{cgi-bin} | 
|  | 24 | directory.  The HTTP server places all sorts of information about the | 
|  | 25 | request (such as the client's hostname, the requested URL, the query | 
|  | 26 | string, and lots of other goodies) in the script's shell environment, | 
|  | 27 | executes the script, and sends the script's output back to the client. | 
|  | 28 |  | 
|  | 29 | The script's input is connected to the client too, and sometimes the | 
|  | 30 | form data is read this way; at other times the form data is passed via | 
|  | 31 | the ``query string'' part of the URL.  This module (\code{cgi.py}) is intended | 
|  | 32 | to take care of the different cases and provide a simpler interface to | 
|  | 33 | the Python script.  It also provides a number of utilities that help | 
|  | 34 | in debugging scripts, and the latest addition is support for file | 
|  | 35 | uploads from a form (if your browser supports it -- Grail 0.3 and | 
|  | 36 | Netscape 2.0 do). | 
|  | 37 |  | 
|  | 38 | The output of a CGI script should consist of two sections, separated | 
|  | 39 | by a blank line.  The first section contains a number of headers, | 
|  | 40 | telling the client what kind of data is following.  Python code to | 
|  | 41 | generate a minimal header section looks like this: | 
| Guido van Rossum | a12ef94 | 1995-02-27 17:53:25 +0000 | [diff] [blame] | 42 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 43 | \bcode\begin{verbatim} | 
|  | 44 | print "Content-type: text/html"     # HTML is following | 
|  | 45 | print                               # blank line, end of headers | 
|  | 46 | \end{verbatim}\ecode | 
|  | 47 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 48 | The second section is usually HTML, which allows the client software | 
|  | 49 | to display nicely formatted text with header, in-line images, etc. | 
|  | 50 | Here's Python code that prints a simple piece of HTML: | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 51 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 52 | \bcode\begin{verbatim} | 
|  | 53 | print "<TITLE>CGI script output</TITLE>" | 
|  | 54 | print "<H1>This is my first CGI script</H1>" | 
|  | 55 | print "Hello, world!" | 
|  | 56 | \end{verbatim}\ecode | 
|  | 57 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 58 | (It may not be fully legal HTML according to the letter of the | 
|  | 59 | standard, but any browser will understand it.) | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 60 |  | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 61 | \subsection{Using the cgi module} | 
|  | 62 | \nodename{Using the cgi module} | 
|  | 63 |  | 
|  | 64 | Begin by writing \code{import cgi}.  Don't use \code{from cgi import *} -- the | 
|  | 65 | module defines all sorts of names for its own use or for backward | 
|  | 66 | compatibility that you don't want in your namespace. | 
|  | 67 |  | 
|  | 68 | It's best to use the \code{FieldStorage} class.  The other classes define in this | 
|  | 69 | module are provided mostly for backward compatibility.  Instantiate it | 
|  | 70 | exactly once, without arguments.  This reads the form contents from | 
|  | 71 | standard input or the environment (depending on the value of various | 
|  | 72 | environment variables set according to the CGI standard).  Since it may | 
|  | 73 | consume standard input, it should be instantiated only once. | 
|  | 74 |  | 
|  | 75 | The \code{FieldStorage} instance can be accessed as if it were a Python | 
|  | 76 | dictionary.  For instance, the following code (which assumes that the | 
|  | 77 | \code{Content-type} header and blank line have already been printed) checks that | 
|  | 78 | the fields \code{name} and \code{addr} are both set to a non-empty string: | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 79 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 80 | \bcode\begin{verbatim} | 
|  | 81 | form = cgi.FieldStorage() | 
|  | 82 | form_ok = 0 | 
|  | 83 | if form.has_key("name") and form.has_key("addr"): | 
|  | 84 | if form["name"].value != "" and form["addr"].value != "": | 
|  | 85 | form_ok = 1 | 
|  | 86 | if not form_ok: | 
|  | 87 | print "<H1>Error</H1>" | 
|  | 88 | print "Please fill in the name and addr fields." | 
|  | 89 | return | 
|  | 90 | ...further form processing here... | 
|  | 91 | \end{verbatim}\ecode | 
|  | 92 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 93 | Here the fields, accessed through \code{form[key]}, are themselves instances | 
|  | 94 | of \code{FieldStorage} (or \code{MiniFieldStorage}, depending on the form encoding). | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 95 |  | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 96 | If the submitted form data contains more than one field with the same | 
|  | 97 | name, the object retrieved by \code{form[key]} is not a \code{(Mini)FieldStorage} | 
|  | 98 | instance but a list of such instances.  If you expect this possibility | 
|  | 99 | (i.e., when your HTML form comtains multiple fields with the same | 
|  | 100 | name), use the \code{type()} function to determine whether you have a single | 
|  | 101 | instance or a list of instances.  For example, here's code that | 
|  | 102 | concatenates any number of username fields, separated by commas: | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 103 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 104 | \bcode\begin{verbatim} | 
|  | 105 | username = form["username"] | 
|  | 106 | if type(username) is type([]): | 
|  | 107 | # Multiple username fields specified | 
|  | 108 | usernames = "" | 
|  | 109 | for item in username: | 
|  | 110 | if usernames: | 
|  | 111 | # Next item -- insert comma | 
|  | 112 | usernames = usernames + "," + item.value | 
|  | 113 | else: | 
|  | 114 | # First item -- don't insert comma | 
|  | 115 | usernames = item.value | 
|  | 116 | else: | 
|  | 117 | # Single username field specified | 
|  | 118 | usernames = username.value | 
|  | 119 | \end{verbatim}\ecode | 
|  | 120 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 121 | If a field represents an uploaded file, the value attribute reads the | 
|  | 122 | entire file in memory as a string.  This may not be what you want.  You can | 
|  | 123 | test for an uploaded file by testing either the filename attribute or the | 
|  | 124 | file attribute.  You can then read the data at leasure from the file | 
|  | 125 | attribute: | 
|  | 126 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 127 | \bcode\begin{verbatim} | 
|  | 128 | fileitem = form["userfile"] | 
|  | 129 | if fileitem.file: | 
|  | 130 | # It's an uploaded file; count lines | 
|  | 131 | linecount = 0 | 
|  | 132 | while 1: | 
|  | 133 | line = fileitem.file.readline() | 
|  | 134 | if not line: break | 
|  | 135 | linecount = linecount + 1 | 
|  | 136 | \end{verbatim}\ecode | 
|  | 137 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 138 | The file upload draft standard entertains the possibility of uploading | 
|  | 139 | multiple files from one field (using a recursive \code{multipart/*} | 
|  | 140 | encoding).  When this occurs, the item will be a dictionary-like | 
|  | 141 | FieldStorage item.  This can be determined by testing its type | 
|  | 142 | attribute, which should have the value \code{multipart/form-data} (or | 
|  | 143 | perhaps another string beginning with \code{multipart/}  It this case, it | 
|  | 144 | can be iterated over recursively just like the top-level form object. | 
|  | 145 |  | 
|  | 146 | When a form is submitted in the ``old'' format (as the query string or as a | 
|  | 147 | single data part of type \code{application/x-www-form-urlencoded}), the items | 
|  | 148 | will actually be instances of the class \code{MiniFieldStorage}.  In this case, | 
|  | 149 | the list, file and filename attributes are always \code{None}. | 
|  | 150 |  | 
|  | 151 |  | 
|  | 152 | \subsection{Old classes} | 
|  | 153 |  | 
|  | 154 | These classes, present in earlier versions of the \code{cgi} module, are still | 
| Guido van Rossum | a5a4c2a | 1996-10-24 14:47:44 +0000 | [diff] [blame] | 155 | supported for backward compatibility.  New applications should use the | 
|  | 156 | FieldStorage class. | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 157 |  | 
|  | 158 | \code{SvFormContentDict}: single value form content as dictionary; assumes each | 
|  | 159 | field name occurs in the form only once. | 
|  | 160 |  | 
|  | 161 | \code{FormContentDict}: multiple value form content as dictionary (the form | 
|  | 162 | items are lists of values).  Useful if your form contains multiple | 
|  | 163 | fields with the same name. | 
|  | 164 |  | 
|  | 165 | Other classes (\code{FormContent}, \code{InterpFormContentDict}) are present for | 
|  | 166 | backwards compatibility with really old applications only.  If you still | 
|  | 167 | use these and would be inconvenienced when they disappeared from a next | 
|  | 168 | version of this module, drop me a note. | 
|  | 169 |  | 
|  | 170 |  | 
|  | 171 | \subsection{Functions} | 
| Fred Drake | 4b3f031 | 1996-12-13 22:04:31 +0000 | [diff] [blame] | 172 | \nodename{Functions in cgi module} | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 173 |  | 
|  | 174 | These are useful if you want more control, or if you want to employ | 
|  | 175 | some of the algorithms implemented in this module in other | 
|  | 176 | circumstances. | 
|  | 177 |  | 
|  | 178 | \begin{funcdesc}{parse}{fp}: Parse a query in the environment or from a file (default \code{sys.stdin}). | 
|  | 179 | \end{funcdesc} | 
|  | 180 |  | 
|  | 181 | \begin{funcdesc}{parse_qs}{qs}: parse a query string given as a string argument (data of type | 
|  | 182 | \code{application/x-www-form-urlencoded}). | 
|  | 183 | \end{funcdesc} | 
|  | 184 |  | 
|  | 185 | \begin{funcdesc}{parse_multipart}{fp\, pdict}: parse input of type \code{multipart/form-data} (for | 
|  | 186 | file uploads).  Arguments are \code{fp} for the input file and | 
|  | 187 | \code{pdict} for the dictionary containing other parameters of \code{content-type} header | 
|  | 188 |  | 
|  | 189 | Returns a dictionary just like \code{parse_qs()}: keys are the field names, each | 
|  | 190 | value is a list of values for that field.  This is easy to use but not | 
|  | 191 | much good if you are expecting megabytes to be uploaded -- in that case, | 
|  | 192 | use the \code{FieldStorage} class instead which is much more flexible.  Note | 
|  | 193 | that \code{content-type} is the raw, unparsed contents of the \code{content-type} | 
|  | 194 | header. | 
|  | 195 |  | 
|  | 196 | Note that this does not parse nested multipart parts -- use \code{FieldStorage} for | 
|  | 197 | that. | 
|  | 198 | \end{funcdesc} | 
|  | 199 |  | 
|  | 200 | \begin{funcdesc}{parse_header}{string}: parse a header like \code{Content-type} into a main | 
|  | 201 | content-type and a dictionary of parameters. | 
|  | 202 | \end{funcdesc} | 
|  | 203 |  | 
|  | 204 | \begin{funcdesc}{test}{}: robust test CGI script, usable as main program. | 
|  | 205 | Writes minimal HTTP headers and formats all information provided to | 
|  | 206 | the script in HTML form. | 
|  | 207 | \end{funcdesc} | 
|  | 208 |  | 
|  | 209 | \begin{funcdesc}{print_environ}{}: format the shell environment in HTML. | 
|  | 210 | \end{funcdesc} | 
|  | 211 |  | 
|  | 212 | \begin{funcdesc}{print_form}{form}: format a form in HTML. | 
|  | 213 | \end{funcdesc} | 
|  | 214 |  | 
|  | 215 | \begin{funcdesc}{print_directory}{}: format the current directory in HTML. | 
|  | 216 | \end{funcdesc} | 
|  | 217 |  | 
|  | 218 | \begin{funcdesc}{print_environ_usage}{}: print a list of useful (used by CGI) environment variables in | 
|  | 219 | HTML. | 
|  | 220 | \end{funcdesc} | 
|  | 221 |  | 
|  | 222 | \begin{funcdesc}{escape}{}: convert the characters ``\code{\&}'', ``\code{<}'' and ``\code{>}'' to HTML-safe | 
|  | 223 | sequences.  Use this if you need to display text that might contain | 
|  | 224 | such characters in HTML.  To translate URLs for inclusion in the HREF | 
|  | 225 | attribute of an \code{<A>} tag, use \code{urllib.quote()}. | 
|  | 226 | \end{funcdesc} | 
|  | 227 |  | 
|  | 228 |  | 
|  | 229 | \subsection{Caring about security} | 
|  | 230 |  | 
|  | 231 | There's one important rule: if you invoke an external program (e.g. | 
|  | 232 | via the \code{os.system()} or \code{os.popen()} functions), make very sure you don't | 
|  | 233 | pass arbitrary strings received from the client to the shell.  This is | 
|  | 234 | a well-known security hole whereby clever hackers anywhere on the web | 
|  | 235 | can exploit a gullible CGI script to invoke arbitrary shell commands. | 
|  | 236 | Even parts of the URL or field names cannot be trusted, since the | 
|  | 237 | request doesn't have to come from your form! | 
|  | 238 |  | 
|  | 239 | To be on the safe side, if you must pass a string gotten from a form | 
|  | 240 | to a shell command, you should make sure the string contains only | 
|  | 241 | alphanumeric characters, dashes, underscores, and periods. | 
|  | 242 |  | 
|  | 243 |  | 
|  | 244 | \subsection{Installing your CGI script on a Unix system} | 
|  | 245 |  | 
|  | 246 | Read the documentation for your HTTP server and check with your local | 
|  | 247 | system administrator to find the directory where CGI scripts should be | 
|  | 248 | installed; usually this is in a directory \code{cgi-bin} in the server tree. | 
|  | 249 |  | 
|  | 250 | Make sure that your script is readable and executable by ``others''; the | 
|  | 251 | Unix file mode should be 755 (use \code{chmod 755 filename}).  Make sure | 
|  | 252 | that the first line of the script contains \code{\#!} starting in column 1 | 
|  | 253 | followed by the pathname of the Python interpreter, for instance: | 
|  | 254 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 255 | \bcode\begin{verbatim} | 
|  | 256 | #!/usr/local/bin/python | 
|  | 257 | \end{verbatim}\ecode | 
|  | 258 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 259 | Make sure the Python interpreter exists and is executable by ``others''. | 
|  | 260 |  | 
|  | 261 | Make sure that any files your script needs to read or write are | 
|  | 262 | readable or writable, respectively, by ``others'' -- their mode should | 
|  | 263 | be 644 for readable and 666 for writable.  This is because, for | 
|  | 264 | security reasons, the HTTP server executes your script as user | 
|  | 265 | ``nobody'', without any special privileges.  It can only read (write, | 
|  | 266 | execute) files that everybody can read (write, execute).  The current | 
|  | 267 | directory at execution time is also different (it is usually the | 
|  | 268 | server's cgi-bin directory) and the set of environment variables is | 
|  | 269 | also different from what you get at login.  in particular, don't count | 
|  | 270 | on the shell's search path for executables (\code{\$PATH}) or the Python | 
|  | 271 | module search path (\code{\$PYTHONPATH}) to be set to anything interesting. | 
|  | 272 |  | 
|  | 273 | If you need to load modules from a directory which is not on Python's | 
|  | 274 | default module search path, you can change the path in your script, | 
|  | 275 | before importing other modules, e.g.: | 
|  | 276 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 277 | \bcode\begin{verbatim} | 
|  | 278 | import sys | 
|  | 279 | sys.path.insert(0, "/usr/home/joe/lib/python") | 
|  | 280 | sys.path.insert(0, "/usr/local/lib/python") | 
|  | 281 | \end{verbatim}\ecode | 
|  | 282 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 283 | (This way, the directory inserted last will be searched first!) | 
|  | 284 |  | 
|  | 285 | Instructions for non-Unix systems will vary; check your HTTP server's | 
|  | 286 | documentation (it will usually have a section on CGI scripts). | 
|  | 287 |  | 
|  | 288 |  | 
|  | 289 | \subsection{Testing your CGI script} | 
|  | 290 |  | 
|  | 291 | Unfortunately, a CGI script will generally not run when you try it | 
|  | 292 | from the command line, and a script that works perfectly from the | 
|  | 293 | command line may fail mysteriously when run from the server.  There's | 
|  | 294 | one reason why you should still test your script from the command | 
|  | 295 | line: if it contains a syntax error, the python interpreter won't | 
|  | 296 | execute it at all, and the HTTP server will most likely send a cryptic | 
|  | 297 | error to the client. | 
|  | 298 |  | 
|  | 299 | Assuming your script has no syntax errors, yet it does not work, you | 
|  | 300 | have no choice but to read the next section: | 
|  | 301 |  | 
|  | 302 |  | 
|  | 303 | \subsection{Debugging CGI scripts} | 
|  | 304 |  | 
|  | 305 | First of all, check for trivial installation errors -- reading the | 
|  | 306 | section above on installing your CGI script carefully can save you a | 
|  | 307 | lot of time.  If you wonder whether you have understood the | 
|  | 308 | installation procedure correctly, try installing a copy of this module | 
|  | 309 | file (\code{cgi.py}) as a CGI script.  When invoked as a script, the file | 
|  | 310 | will dump its environment and the contents of the form in HTML form. | 
|  | 311 | Give it the right mode etc, and send it a request.  If it's installed | 
|  | 312 | in the standard \code{cgi-bin} directory, it should be possible to send it a | 
|  | 313 | request by entering a URL into your browser of the form: | 
|  | 314 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 315 | \bcode\begin{verbatim} | 
|  | 316 | http://yourhostname/cgi-bin/cgi.py?name=Joe+Blow&addr=At+Home | 
|  | 317 | \end{verbatim}\ecode | 
|  | 318 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 319 | If this gives an error of type 404, the server cannot find the script | 
|  | 320 | -- perhaps you need to install it in a different directory.  If it | 
|  | 321 | gives another error (e.g.  500), there's an installation problem that | 
|  | 322 | you should fix before trying to go any further.  If you get a nicely | 
|  | 323 | formatted listing of the environment and form content (in this | 
|  | 324 | example, the fields should be listed as ``addr'' with value ``At Home'' | 
|  | 325 | and ``name'' with value ``Joe Blow''), the \code{cgi.py} script has been | 
|  | 326 | installed correctly.  If you follow the same procedure for your own | 
|  | 327 | script, you should now be able to debug it. | 
|  | 328 |  | 
|  | 329 | The next step could be to call the \code{cgi} module's test() function from | 
|  | 330 | your script: replace its main code with the single statement | 
|  | 331 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 332 | \bcode\begin{verbatim} | 
|  | 333 | cgi.test() | 
|  | 334 | \end{verbatim}\ecode | 
|  | 335 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 336 | This should produce the same results as those gotten from installing | 
|  | 337 | the \code{cgi.py} file itself. | 
|  | 338 |  | 
|  | 339 | When an ordinary Python script raises an unhandled exception | 
|  | 340 | (e.g. because of a typo in a module name, a file that can't be opened, | 
|  | 341 | etc.), the Python interpreter prints a nice traceback and exits. | 
|  | 342 | While the Python interpreter will still do this when your CGI script | 
|  | 343 | raises an exception, most likely the traceback will end up in one of | 
|  | 344 | the HTTP server's log file, or be discarded altogether. | 
|  | 345 |  | 
|  | 346 | Fortunately, once you have managed to get your script to execute | 
|  | 347 | *some* code, it is easy to catch exceptions and cause a traceback to | 
|  | 348 | be printed.  The \code{test()} function below in this module is an example. | 
|  | 349 | Here are the rules: | 
|  | 350 |  | 
|  | 351 | \begin{enumerate} | 
|  | 352 | \item Import the traceback module (before entering the | 
|  | 353 | try-except!) | 
|  | 354 |  | 
|  | 355 | \item Make sure you finish printing the headers and the blank | 
|  | 356 | line early | 
|  | 357 |  | 
|  | 358 | \item Assign \code{sys.stderr} to \code{sys.stdout} | 
|  | 359 |  | 
|  | 360 | \item Wrap all remaining code in a try-except statement | 
|  | 361 |  | 
|  | 362 | \item In the except clause, call \code{traceback.print_exc()} | 
|  | 363 | \end{enumerate} | 
|  | 364 |  | 
|  | 365 | For example: | 
|  | 366 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 367 | \bcode\begin{verbatim} | 
|  | 368 | import sys | 
|  | 369 | import traceback | 
|  | 370 | print "Content-type: text/html" | 
|  | 371 | print | 
|  | 372 | sys.stderr = sys.stdout | 
|  | 373 | try: | 
|  | 374 | ...your code here... | 
|  | 375 | except: | 
|  | 376 | print "\n\n<PRE>" | 
|  | 377 | traceback.print_exc() | 
|  | 378 | \end{verbatim}\ecode | 
|  | 379 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 380 | Notes: The assignment to \code{sys.stderr} is needed because the traceback | 
|  | 381 | prints to \code{sys.stderr}.  The \code{print "$\backslash$n$\backslash$n<PRE>"} statement is necessary to | 
|  | 382 | disable the word wrapping in HTML. | 
|  | 383 |  | 
|  | 384 | If you suspect that there may be a problem in importing the traceback | 
|  | 385 | module, you can use an even more robust approach (which only uses | 
|  | 386 | built-in modules): | 
|  | 387 |  | 
| Guido van Rossum | e47da0a | 1997-07-17 16:34:52 +0000 | [diff] [blame^] | 388 | \bcode\begin{verbatim} | 
|  | 389 | import sys | 
|  | 390 | sys.stderr = sys.stdout | 
|  | 391 | print "Content-type: text/plain" | 
|  | 392 | print | 
|  | 393 | ...your code here... | 
|  | 394 | \end{verbatim}\ecode | 
|  | 395 | % | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 396 | This relies on the Python interpreter to print the traceback.  The | 
|  | 397 | content type of the output is set to plain text, which disables all | 
|  | 398 | HTML processing.  If your script works, the raw HTML will be displayed | 
|  | 399 | by your client.  If it raises an exception, most likely after the | 
|  | 400 | first two lines have been printed, a traceback will be displayed. | 
|  | 401 | Because no HTML interpretation is going on, the traceback will | 
|  | 402 | readable. | 
|  | 403 |  | 
|  | 404 |  | 
|  | 405 | \subsection{Common problems and solutions} | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 406 |  | 
|  | 407 | \begin{itemize} | 
| Guido van Rossum | a29cc97 | 1996-07-30 18:22:07 +0000 | [diff] [blame] | 408 | \item Most HTTP servers buffer the output from CGI scripts until the | 
|  | 409 | script is completed.  This means that it is not possible to display a | 
|  | 410 | progress report on the client's display while the script is running. | 
|  | 411 |  | 
|  | 412 | \item Check the installation instructions above. | 
|  | 413 |  | 
|  | 414 | \item Check the HTTP server's log files.  (\code{tail -f logfile} in a separate | 
|  | 415 | window may be useful!) | 
|  | 416 |  | 
|  | 417 | \item Always check a script for syntax errors first, by doing something | 
|  | 418 | like \code{python script.py}. | 
|  | 419 |  | 
|  | 420 | \item When using any of the debugging techniques, don't forget to add | 
|  | 421 | \code{import sys} to the top of the script. | 
|  | 422 |  | 
|  | 423 | \item When invoking external programs, make sure they can be found. | 
|  | 424 | Usually, this means using absolute path names -- \code{\$PATH} is usually not | 
|  | 425 | set to a very useful value in a CGI script. | 
|  | 426 |  | 
|  | 427 | \item When reading or writing external files, make sure they can be read | 
|  | 428 | or written by every user on the system. | 
|  | 429 |  | 
|  | 430 | \item Don't try to give a CGI script a set-uid mode.  This doesn't work on | 
|  | 431 | most systems, and is a security liability as well. | 
| Guido van Rossum | 470be14 | 1995-03-17 16:07:09 +0000 | [diff] [blame] | 432 | \end{itemize} | 
|  | 433 |  |