\section{\module{cgi} ---
         Common Gateway Interface support.}
\declaremodule{standard}{cgi}

\modulesynopsis{Common Gateway Interface support, used to interpret
forms in server-side scripts.}

\indexii{WWW}{server}
\indexii{CGI}{protocol}
\indexii{HTTP}{protocol}
\indexii{MIME}{headers}
\index{URL}


Support module for CGI (Common Gateway Interface) scripts.%
\index{Common Gateway Interface}

This module defines a number of utilities for use by CGI scripts
written in Python.

\subsection{Introduction}
\nodename{cgi-intro}

A CGI script is invoked by an HTTP server, usually to process user
input submitted through an HTML \code{<FORM>} or \code{<ISINDEX>} element.

Most often, CGI scripts live in the server's special \file{cgi-bin}
directory.  The HTTP server places information about the
request (such as the client's hostname, the requested URL, and the query
string) in the script's shell environment,
executes the script, and sends the script's output back to the client.

The script's standard input is connected to the client too, and sometimes the
form data is read this way; at other times the form data is passed via
the ``query string'' part of the URL.  This module is intended
to take care of the different cases and provide a simpler interface to
the Python script.  It also provides a number of utilities that help
in debugging scripts, and the latest addition is support for file
uploads from a form (if your browser supports it --- Grail 0.3 and
Netscape 2.0 do).

The output of a CGI script should consist of two sections, separated
by a blank line.  The first section contains a number of headers,
telling the client what kind of data is following.  Python code to
generate a minimal header section looks like this:

\begin{verbatim}
print "Content-type: text/html"     # HTML is following
print                               # blank line, end of headers
\end{verbatim}

The second section is usually HTML, which allows the client software
to display nicely formatted text with headings, in-line images, etc.
Here's Python code that prints a simple piece of HTML:

\begin{verbatim}
print "<TITLE>CGI script output</TITLE>"
print "<H1>This is my first CGI script</H1>"
print "Hello, world!"
\end{verbatim}

(It may not be fully legal HTML according to the letter of the
standard, but any browser will understand it.)

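The two output sections described above can be put together in one
small script.  The following is a minimal sketch written for current
Python 3 (where \code{print} is a function), not for the interpreter
the examples above target; the helper name \code{write_response} is
hypothetical and not part of the \module{cgi} module.

```python
import sys


def write_response(out, title, body):
    """Write a minimal CGI response: headers, a blank line, then HTML."""
    # Header section: tells the client what kind of data follows.
    out.write("Content-Type: text/html\n")
    # A blank line marks the end of the headers.
    out.write("\n")
    # Body section: the HTML itself.
    out.write("<TITLE>%s</TITLE>\n" % title)
    out.write("<H1>%s</H1>\n" % title)
    out.write(body + "\n")


if __name__ == "__main__":
    write_response(sys.stdout, "CGI script output", "Hello, world!")
```

Passing the output stream as an argument is only a convenience for
testing; a real script would normally write straight to standard output.
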
\subsection{Using the cgi module}
\nodename{Using the cgi module}

Begin by writing \samp{import cgi}.  Do not use \samp{from cgi import
*} --- the module defines all sorts of names for its own use or for
backward compatibility that you don't want in your namespace.

It's best to use the \class{FieldStorage} class.  The other classes
defined in this module are provided mostly for backward compatibility.
Instantiate \class{FieldStorage} exactly once, without arguments.  This
reads the form
contents from standard input or the environment (depending on the
value of various environment variables set according to the CGI
standard).  Since it may consume standard input, it should be
instantiated only once.

The \class{FieldStorage} instance can be accessed as if it were a Python
dictionary.  For instance, the following code (which assumes that the
\code{content-type} header and blank line have already been printed)
checks that the fields \code{name} and \code{addr} are both set to a
non-empty string:

\begin{verbatim}
form = cgi.FieldStorage()
form_ok = 0
if form.has_key("name") and form.has_key("addr"):
    if form["name"].value != "" and form["addr"].value != "":
        form_ok = 1
if not form_ok:
    print "<H1>Error</H1>"
    print "Please fill in the name and addr fields."
    return    # assumes this fragment is inside a function
...further form processing here...
\end{verbatim}

Here the fields, accessed through \samp{form[\var{key}]}, are
themselves instances of \class{FieldStorage} (or
\class{MiniFieldStorage}, depending on the form encoding).

If the submitted form data contains more than one field with the same
name, the object retrieved by \samp{form[\var{key}]} is not a
\class{FieldStorage} or \class{MiniFieldStorage}
instance but a list of such instances.  If you expect this possibility
(i.e., when your HTML form contains multiple fields with the same
name), use the \function{type()} function to determine whether you
have a single instance or a list of instances.  For example, here's
code that concatenates any number of username fields, separated by
commas:

\begin{verbatim}
username = form["username"]
if type(username) is type([]):
    # Multiple username fields specified
    usernames = ""
    for item in username:
        if usernames:
            # Next item -- insert comma
            usernames = usernames + "," + item.value
        else:
            # First item -- don't insert comma
            usernames = item.value
else:
    # Single username field specified
    usernames = username.value
\end{verbatim}

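The same single-or-list handling can be written more compactly by
first normalising the value to a list.  This is only a sketch for
current Python 3: \code{Field} is a hypothetical stand-in that models
nothing but the \code{value} attribute, and \code{join_values} is a
name invented for the illustration.

```python
class Field:
    """Hypothetical stand-in for a form field; only .value is modelled."""

    def __init__(self, value):
        self.value = value


def join_values(item, sep=","):
    """Join the values of a single field or of a list of fields."""
    # Normalise the single-instance case to a one-element list.
    items = item if isinstance(item, list) else [item]
    return sep.join(field.value for field in items)


print(join_values(Field("joe")))
print(join_values([Field("joe"), Field("ann")]))
```
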
If a field represents an uploaded file, the \member{value} attribute reads
the entire file into memory as a string.  This may not be what you want.
You can test for an uploaded file by testing either the \member{filename}
attribute or the \member{file} attribute.  You can then read the data at
leisure from the \member{file} attribute:

\begin{verbatim}
fileitem = form["userfile"]
if fileitem.file:
    # It's an uploaded file; count lines
    linecount = 0
    while 1:
        line = fileitem.file.readline()
        if not line: break
        linecount = linecount + 1
\end{verbatim}

The file upload draft standard entertains the possibility of uploading
multiple files from one field (using a recursive
\mimetype{multipart/*} encoding).  When this occurs, the item will be
a dictionary-like \class{FieldStorage} item.  This can be determined
by testing its \member{type} attribute, which should be
\mimetype{multipart/form-data} (or perhaps another MIME type matching
\mimetype{multipart/*}).  In this case, it can be iterated over
recursively just like the top-level form object.

When a form is submitted in the ``old'' format (as the query string or
as a single data part of type
\mimetype{application/x-www-form-urlencoded}), the items will actually
be instances of the class \class{MiniFieldStorage}.  In this case, the
\member{list}, \member{file} and \member{filename} attributes are
always \code{None}.


\subsection{Old classes}

These classes, present in earlier versions of the \module{cgi} module,
are still supported for backward compatibility.  New applications
should use the \class{FieldStorage} class.

\class{SvFormContentDict} stores single-value form content as a
dictionary; it assumes each field name occurs in the form only once.

\class{FormContentDict} stores multiple-value form content as a
dictionary (the form items are lists of values).  Useful if your form
contains multiple fields with the same name.

Other classes (\class{FormContent}, \class{InterpFormContentDict}) are
present for backward compatibility with really old applications only.
If you still use these and would be inconvenienced if they
disappeared from a future version of this module, drop me a note.


\subsection{Functions}
\nodename{Functions in cgi module}

These are useful if you want more control, or if you want to employ
some of the algorithms implemented in this module in other
circumstances.

\begin{funcdesc}{parse}{fp}
Parse a query in the environment or from a file (default
\code{sys.stdin}).
\end{funcdesc}

\begin{funcdesc}{parse_qs}{qs\optional{, keep_blank_values, strict_parsing}}
Parse a query string given as a string argument (data of type
\mimetype{application/x-www-form-urlencoded}).  Data are
returned as a dictionary.  The dictionary keys are the unique query
variable names and the values are lists of values for each name.

The optional argument \var{keep_blank_values} is
a flag indicating whether blank values in
URL encoded queries should be treated as blank strings.
A true value indicates that blanks should be retained as
blank strings.  The default false value indicates that
blank values are to be ignored and treated as if they were
not included.

The optional argument \var{strict_parsing} is a flag indicating what
to do with parsing errors.  If false (the default), errors
are silently ignored.  If true, errors raise a \code{ValueError}
exception.
\end{funcdesc}

\begin{funcdesc}{parse_qsl}{qs\optional{, keep_blank_values, strict_parsing}}
Parse a query string given as a string argument (data of type
\mimetype{application/x-www-form-urlencoded}).  Data are
returned as a list of name, value pairs.

The optional argument \var{keep_blank_values} is
a flag indicating whether blank values in
URL encoded queries should be treated as blank strings.
A true value indicates that blanks should be retained as
blank strings.  The default false value indicates that
blank values are to be ignored and treated as if they were
not included.

The optional argument \var{strict_parsing} is a flag indicating what
to do with parsing errors.  If false (the default), errors
are silently ignored.  If true, errors raise a \code{ValueError}
exception.
\end{funcdesc}

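The difference between the dictionary and list forms, and the effect
of the two flags, may be easier to see in a short session.  The sketch
below assumes a current Python 3 installation, where functions with
these names and arguments are provided by \module{urllib.parse}; it
imports them from there rather than from \module{cgi}.  The query
string is made up for the illustration.

```python
from urllib.parse import parse_qs, parse_qsl

query = "name=Joe+Blow&addr=At+Home&name=Jane&note="

# Dictionary form: each key maps to a list of values.
as_dict = parse_qs(query)
# List form: (name, value) pairs in the order they appear.
as_pairs = parse_qsl(query)
# Keep the blank "note" field instead of dropping it.
with_blanks = parse_qs(query, keep_blank_values=True)

print(as_dict)
print(as_pairs)
print(with_blanks)

# With strict_parsing true, a field without "=" raises ValueError.
try:
    parse_qs("name", strict_parsing=True)
except ValueError as exc:
    print("parse error:", exc)
```
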
\begin{funcdesc}{parse_multipart}{fp, pdict}
Parse input of type \mimetype{multipart/form-data} (for
file uploads).  Arguments are \var{fp} for the input file and
\var{pdict} for the dictionary containing other parameters of the
\code{content-type} header.

Returns a dictionary just like \function{parse_qs()}: keys are the
field names, each value is a list of values for that field.  This is
easy to use but not much good if you are expecting megabytes to be
uploaded --- in that case, use the \class{FieldStorage} class instead,
which is much more flexible.  Note that \code{content-type} is the
raw, unparsed contents of the \code{content-type} header.

Note that this does not parse nested multipart parts --- use
\class{FieldStorage} for that.
\end{funcdesc}

\begin{funcdesc}{parse_header}{string}
Parse a header like \code{content-type} into a main
content-type and a dictionary of parameters.
\end{funcdesc}

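To illustrate what ``a main content-type and a dictionary of
parameters'' means, here is a sketch for current Python 3.  It does
not call \function{parse_header()}; it assumes the standard
\module{email} package is available and uses it to perform a
comparable split.  The helper name \code{split_header} is hypothetical.

```python
from email.message import Message


def split_header(value):
    """Split a header value into its main value and a parameter dict."""
    msg = Message()
    msg["content-type"] = value
    # get_params() returns (name, value) pairs; the first pair is the
    # main value itself, so it is skipped when building the dictionary.
    params = dict(msg.get_params()[1:])
    return msg.get_content_type(), params


print(split_header('text/html; charset="utf-8"'))
```
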
\begin{funcdesc}{test}{}
Robust test CGI script, usable as main program.
Writes minimal HTTP headers and formats all information provided to
the script in HTML form.
\end{funcdesc}

\begin{funcdesc}{print_environ}{}
Format the shell environment in HTML.
\end{funcdesc}

\begin{funcdesc}{print_form}{form}
Format a form in HTML.
\end{funcdesc}

\begin{funcdesc}{print_directory}{}
Format the current directory in HTML.
\end{funcdesc}

\begin{funcdesc}{print_environ_usage}{}
Print a list of useful (used by CGI) environment variables in
HTML.
\end{funcdesc}

\begin{funcdesc}{escape}{s\optional{, quote}}
Convert the characters
\character{\&}, \character{<} and \character{>} in string \var{s} to
HTML-safe sequences.  Use this if you need to display text that might
contain such characters in HTML.  If the optional flag \var{quote} is
true, the double quote character (\character{"}) is also translated;
this helps for inclusion in an HTML attribute value, e.g. in \code{<A
HREF="...">}.
\end{funcdesc}


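The effect of the \var{quote} flag can be sketched as follows.  This
example assumes current Python 3 and uses \function{html.escape()}
from the standard library rather than this module's
\function{escape()}; note that there \var{quote} defaults to true and
also covers the single quote character.  The sample text is made up.

```python
import html

text = 'if a < b && c > d: say "hi"'

# Escape only &, < and > (quote=False), for text between tags.
body_safe = html.escape(text, quote=False)
# Also escape quote characters, for use inside an attribute value.
attr_safe = html.escape(text, quote=True)

print(body_safe)
print(attr_safe)
```
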
\subsection{Caring about security}

There's one important rule: if you invoke an external program (e.g.
via the \function{os.system()} or \function{os.popen()} functions),
make very sure you don't pass arbitrary strings received from the
client to the shell.  This is a well-known security hole whereby
clever hackers anywhere on the web can exploit a gullible CGI script
to invoke arbitrary shell commands.  Even parts of the URL or field
names cannot be trusted, since the request doesn't have to come from
your form!

To be on the safe side, if you must pass a string obtained from a form
to a shell command, you should make sure the string contains only
alphanumeric characters, dashes, underscores, and periods.


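The character check described above might be sketched like this, for
current Python 3.  The function name \code{is_shell_safe} is
hypothetical, and ``alphanumeric'' is interpreted here as ASCII
letters and digits only, which is an assumption.

```python
import re

# Only ASCII letters, digits, dashes, underscores and periods.
_SAFE = re.compile(r"[A-Za-z0-9._-]+")


def is_shell_safe(value):
    """Return True if value contains only the permitted characters."""
    # fullmatch() requires the whole string to match, so an embedded
    # newline or a trailing shell metacharacter is rejected.
    return _SAFE.fullmatch(value) is not None


print(is_shell_safe("report-2.txt"))
print(is_shell_safe("x; rm -rf /"))
```

A string that fails the check should be rejected outright rather than
repaired.
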
\subsection{Installing your CGI script on a Unix system}

Read the documentation for your HTTP server and check with your local
system administrator to find the directory where CGI scripts should be
installed; usually this is in a directory \file{cgi-bin} in the server tree.

Make sure that your script is readable and executable by ``others''; the
\UNIX{} file mode should be \code{0755} octal (use \samp{chmod 0755
\var{filename}}).  Make sure that the first line of the script contains
\code{\#!} starting in column 1 followed by the pathname of the Python
interpreter, for instance:

\begin{verbatim}
#!/usr/local/bin/python
\end{verbatim}

Make sure the Python interpreter exists and is executable by ``others''.

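Whether a file is readable and executable by ``others'' can also be
checked from Python.  The following sketch assumes current Python 3 on
a \UNIX{} system; the function name \code{others_can_run} is
hypothetical.

```python
import os
import stat
import sys


def others_can_run(path):
    """Return True if "others" may both read and execute path."""
    mode = os.stat(path).st_mode
    # Both the read bit and the execute bit for "others" must be set.
    wanted = stat.S_IROTH | stat.S_IXOTH
    return (mode & wanted) == wanted


if __name__ == "__main__":
    # Check the interpreter that is running this sketch.
    print(others_can_run(sys.executable))
```
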
Make sure that any files your script needs to read or write are
readable or writable, respectively, by ``others'' --- their mode
should be \code{0644} for readable and \code{0666} for writable.  This
is because, for security reasons, the HTTP server executes your script
as user ``nobody'', without any special privileges.  It can only read
(write, execute) files that everybody can read (write, execute).  The
current directory at execution time is also different (it is usually
the server's \file{cgi-bin} directory) and the set of environment variables
is also different from what you get at login.  In particular, don't
count on the shell's search path for executables (\envvar{PATH}) or
the Python module search path (\envvar{PYTHONPATH}) to be set to
anything interesting.

If you need to load modules from a directory which is not on Python's
default module search path, you can change the path in your script,
before importing other modules, e.g.:

\begin{verbatim}
import sys
sys.path.insert(0, "/usr/home/joe/lib/python")
sys.path.insert(0, "/usr/local/lib/python")
\end{verbatim}

(This way, the directory inserted last will be searched first!)

Instructions for non-\UNIX{} systems will vary; check your HTTP server's
documentation (it will usually have a section on CGI scripts).


\subsection{Testing your CGI script}

Unfortunately, a CGI script will generally not run when you try it
from the command line, and a script that works perfectly from the
command line may fail mysteriously when run from the server.  There's
one reason why you should still test your script from the command
line: if it contains a syntax error, the Python interpreter won't
execute it at all, and the HTTP server will most likely send a cryptic
error to the client.

Assuming your script has no syntax errors, yet it does not work, you
have no choice but to read the next section.


\subsection{Debugging CGI scripts}

First of all, check for trivial installation errors --- reading the
section above on installing your CGI script carefully can save you a
lot of time.  If you wonder whether you have understood the
installation procedure correctly, try installing a copy of this module
file (\file{cgi.py}) as a CGI script.  When invoked as a script, the file
will dump its environment and the contents of the form in HTML form.
Give it the right mode, etc., and send it a request.  If it's installed
in the standard \file{cgi-bin} directory, it should be possible to send it a
request by entering a URL into your browser of the form:

\begin{verbatim}
http://yourhostname/cgi-bin/cgi.py?name=Joe+Blow&addr=At+Home
\end{verbatim}

If this gives an error of type 404, the server cannot find the script;
perhaps you need to install it in a different directory.  If it
gives another error (e.g.  500), there's an installation problem that
you should fix before trying to go any further.  If you get a nicely
formatted listing of the environment and form content (in this
example, the fields should be listed as ``addr'' with value ``At Home''
and ``name'' with value ``Joe Blow''), the \file{cgi.py} script has been
installed correctly.  If you follow the same procedure for your own
script, you should now be able to debug it.

The next step could be to call the \module{cgi} module's
\function{test()} function from your script: replace its main code
with the single statement

\begin{verbatim}
cgi.test()
\end{verbatim}

This should produce the same results as those obtained from installing
the \file{cgi.py} file itself.

When an ordinary Python script raises an unhandled exception
(e.g. because of a typo in a module name, a file that can't be opened,
etc.), the Python interpreter prints a nice traceback and exits.
While the Python interpreter will still do this when your CGI script
raises an exception, most likely the traceback will end up in one of
the HTTP server's log files, or be discarded altogether.

Fortunately, once you have managed to get your script to execute
\emph{some} code, it is easy to catch exceptions and cause a traceback
to be printed.  The \function{test()} function in this module is
an example.  Here are the rules:

\begin{enumerate}
\item Import the traceback module before entering the \keyword{try}
   ... \keyword{except} statement

\item Assign \code{sys.stderr} to be \code{sys.stdout}

\item Make sure you finish printing the headers and the blank line
   early

\item Wrap all remaining code in a \keyword{try} ... \keyword{except}
   statement

\item In the except clause, call \function{traceback.print_exc()}
\end{enumerate}

For example:

\begin{verbatim}
import sys
import traceback
print "Content-type: text/html"
print
sys.stderr = sys.stdout
try:
    ...your code here...
except:
    print "\n\n<PRE>"
    traceback.print_exc()
\end{verbatim}

Notes: The assignment to \code{sys.stderr} is needed because the
traceback prints to \code{sys.stderr}.
The \code{print "{\e}n{\e}n<PRE>"} statement is necessary to
disable the word wrapping in HTML.

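For comparison, the same pattern can be sketched for current Python 3.
Instead of reassigning \code{sys.stderr}, this variant passes the
output stream to \function{traceback.print_exc()} through its
\var{file} argument, which is a different technique from the one
shown above.  The names \code{run_guarded} and \code{broken_main} are
hypothetical.

```python
import sys
import traceback


def run_guarded(main, out=sys.stdout):
    """Print the headers early, then run main() and report any exception."""
    out.write("Content-Type: text/html\n\n")
    try:
        main()
    except Exception:
        # <PRE> keeps the browser from re-wrapping the traceback lines.
        out.write("\n\n<PRE>\n")
        traceback.print_exc(file=out)


def broken_main():
    # Hypothetical script body that fails.
    raise RuntimeError("something went wrong")


if __name__ == "__main__":
    run_guarded(broken_main)
```
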
If you suspect that there may be a problem in importing the traceback
module, you can use an even more robust approach (which only uses
built-in modules):

\begin{verbatim}
import sys
sys.stderr = sys.stdout
print "Content-type: text/plain"
print
...your code here...
\end{verbatim}

This relies on the Python interpreter to print the traceback.  The
content type of the output is set to plain text, which disables all
HTML processing.  If your script works, the raw HTML will be displayed
by your client.  If it raises an exception, most likely after the
first two lines have been printed, a traceback will be displayed.
Because no HTML interpretation is going on, the traceback will be
readable.


\subsection{Common problems and solutions}

\begin{itemize}
\item Most HTTP servers buffer the output from CGI scripts until the
script is completed.  This means that it is not possible to display a
progress report on the client's display while the script is running.

\item Check the installation instructions above.

\item Check the HTTP server's log files.  (\samp{tail -f logfile} in a
separate window may be useful!)

\item Always check a script for syntax errors first, by doing something
like \samp{python script.py}.

\item When using any of the debugging techniques, don't forget to add
\samp{import sys} to the top of the script.

\item When invoking external programs, make sure they can be found.
Usually, this means using absolute path names --- \envvar{PATH} is
usually not set to a very useful value in a CGI script.

\item When reading or writing external files, make sure they can be read
or written by every user on the system.

\item Don't try to give a CGI script a set-uid mode.  This doesn't work on
most systems, and is a security liability as well.
\end{itemize}
