blob: 24d6afdc5a8a948f5f132d92adfa6a270410946f [file] [log] [blame]
Fred Drake295da241998-08-10 19:42:37 +00001\section{\module{cgi} ---
2 Common Gateway Interface support.}
Fred Drakeb91e9341998-07-23 17:59:49 +00003\declaremodule{standard}{cgi}
4
Fred Drake295da241998-08-10 19:42:37 +00005\modulesynopsis{Common Gateway Interface support, used to interpret
6forms in server-side scripts.}
Fred Drakeb91e9341998-07-23 17:59:49 +00007
Guido van Rossuma12ef941995-02-27 17:53:25 +00008\indexii{WWW}{server}
9\indexii{CGI}{protocol}
10\indexii{HTTP}{protocol}
11\indexii{MIME}{headers}
12\index{URL}
13
Guido van Rossum86751151995-02-28 17:14:32 +000014
Fred Drake8ee679f2001-07-14 02:50:55 +000015Support module for Common Gateway Interface (CGI) scripts.%
Fred Drake6a79be81998-04-03 03:47:03 +000016\index{Common Gateway Interface}
Guido van Rossuma12ef941995-02-27 17:53:25 +000017
Guido van Rossuma29cc971996-07-30 18:22:07 +000018This module defines a number of utilities for use by CGI scripts
19written in Python.
Guido van Rossuma12ef941995-02-27 17:53:25 +000020
Guido van Rossuma29cc971996-07-30 18:22:07 +000021\subsection{Introduction}
Fred Drake12d9fc91998-04-14 17:19:54 +000022\nodename{cgi-intro}
Guido van Rossuma12ef941995-02-27 17:53:25 +000023
Guido van Rossuma29cc971996-07-30 18:22:07 +000024A CGI script is invoked by an HTTP server, usually to process user
Fred Drake637af131998-08-21 20:02:06 +000025input submitted through an HTML \code{<FORM>} or \code{<ISINDEX>} element.
Guido van Rossuma29cc971996-07-30 18:22:07 +000026
Fred Drakea2e268a1997-12-09 03:28:42 +000027Most often, CGI scripts live in the server's special \file{cgi-bin}
Guido van Rossuma29cc971996-07-30 18:22:07 +000028directory. The HTTP server places all sorts of information about the
29request (such as the client's hostname, the requested URL, the query
30string, and lots of other goodies) in the script's shell environment,
31executes the script, and sends the script's output back to the client.
32
33The script's input is connected to the client too, and sometimes the
34form data is read this way; at other times the form data is passed via
Fred Drake6ef871c1998-03-12 06:52:05 +000035the ``query string'' part of the URL. This module is intended
Guido van Rossuma29cc971996-07-30 18:22:07 +000036to take care of the different cases and provide a simpler interface to
37the Python script. It also provides a number of utilities that help
38in debugging scripts, and the latest addition is support for file
Fred Drake6ef871c1998-03-12 06:52:05 +000039uploads from a form (if your browser supports it --- Grail 0.3 and
Guido van Rossuma29cc971996-07-30 18:22:07 +000040Netscape 2.0 do).
41
42The output of a CGI script should consist of two sections, separated
43by a blank line. The first section contains a number of headers,
44telling the client what kind of data is following. Python code to
45generate a minimal header section looks like this:
Guido van Rossuma12ef941995-02-27 17:53:25 +000046
Fred Drake19479911998-02-13 06:58:54 +000047\begin{verbatim}
Moshe Zadkaa1a4b592000-08-25 21:47:56 +000048print "Content-Type: text/html" # HTML is following
Guido van Rossume47da0a1997-07-17 16:34:52 +000049print # blank line, end of headers
Fred Drake19479911998-02-13 06:58:54 +000050\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +000051
Guido van Rossuma29cc971996-07-30 18:22:07 +000052The second section is usually HTML, which allows the client software
53to display nicely formatted text with header, in-line images, etc.
54Here's Python code that prints a simple piece of HTML:
Guido van Rossum470be141995-03-17 16:07:09 +000055
Fred Drake19479911998-02-13 06:58:54 +000056\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +000057print "<TITLE>CGI script output</TITLE>"
58print "<H1>This is my first CGI script</H1>"
59print "Hello, world!"
Fred Drake19479911998-02-13 06:58:54 +000060\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +000061
Guido van Rossuma29cc971996-07-30 18:22:07 +000062\subsection{Using the cgi module}
63\nodename{Using the cgi module}
64
Fred Drake6ef871c1998-03-12 06:52:05 +000065Begin by writing \samp{import cgi}. Do not use \samp{from cgi import
66*} --- the module defines all sorts of names for its own use or for
67backward compatibility that you don't want in your namespace.
Guido van Rossuma29cc971996-07-30 18:22:07 +000068
Fred Drake6ef871c1998-03-12 06:52:05 +000069It's best to use the \class{FieldStorage} class. The other classes
70defined in this module are provided mostly for backward compatibility.
71Instantiate it exactly once, without arguments. This reads the form
72contents from standard input or the environment (depending on the
73value of various environment variables set according to the CGI
74standard). Since it may consume standard input, it should be
75instantiated only once.
Guido van Rossuma29cc971996-07-30 18:22:07 +000076
Moshe Zadkaa1a4b592000-08-25 21:47:56 +000077The \class{FieldStorage} instance can be indexed like a Python
78dictionary, and also supports the standard dictionary methods
Fred Drake84e58ab2001-08-11 03:28:41 +000079\method{has_key()} and \method{keys()}. The built-in \function{len()}
80is also supported. Form fields containing empty strings are ignored
Moshe Zadkaa1a4b592000-08-25 21:47:56 +000081and do not appear in the dictionary; to keep such values, provide
Fred Drake84e58ab2001-08-11 03:28:41 +000082a true value for the the optional \var{keep_blank_values} keyword
83parameter when creating the \class{FieldStorage} instance.
Moshe Zadkaa1a4b592000-08-25 21:47:56 +000084
85For instance, the following code (which assumes that the
Fred Drake84e58ab2001-08-11 03:28:41 +000086\mailheader{Content-Type} header and blank line have already been
87printed) checks that the fields \code{name} and \code{addr} are both
88set to a non-empty string:
Guido van Rossum470be141995-03-17 16:07:09 +000089
Fred Drake19479911998-02-13 06:58:54 +000090\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +000091form = cgi.FieldStorage()
Fred Drake9f9bd6a2001-06-29 14:59:01 +000092if not (form.has_key("name") and form.has_key("addr")):
Guido van Rossume47da0a1997-07-17 16:34:52 +000093 print "<H1>Error</H1>"
94 print "Please fill in the name and addr fields."
95 return
Moshe Zadkaa1a4b592000-08-25 21:47:56 +000096print "<p>name:", form["name"].value
97print "<p>addr:", form["addr"].value
Guido van Rossume47da0a1997-07-17 16:34:52 +000098...further form processing here...
Fred Drake19479911998-02-13 06:58:54 +000099\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +0000100
101Here the fields, accessed through \samp{form[\var{key}]}, are
102themselves instances of \class{FieldStorage} (or
103\class{MiniFieldStorage}, depending on the form encoding).
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000104The \member{value} attribute of the instance yields the string value
Fred Drake84e58ab2001-08-11 03:28:41 +0000105of the field. The \method{getvalue()} method returns this string value
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000106directly; it also accepts an optional second argument as a default to
107return if the requested key is not present.
Guido van Rossum470be141995-03-17 16:07:09 +0000108
Guido van Rossuma29cc971996-07-30 18:22:07 +0000109If the submitted form data contains more than one field with the same
Fred Drake6ef871c1998-03-12 06:52:05 +0000110name, the object retrieved by \samp{form[\var{key}]} is not a
111\class{FieldStorage} or \class{MiniFieldStorage}
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000112instance but a list of such instances. Similarly, in this situation,
113\samp{form.getvalue(\var{key})} would return a list of strings.
114If you expect this possibility
Fred Drake84e58ab2001-08-11 03:28:41 +0000115(when your HTML form contains multiple fields with the same name), use
116the \function{type()} built-in function to determine whether you
117have a single instance or a list of instances. For example, this
118code concatenates any number of username fields, separated by
Fred Drake6ef871c1998-03-12 06:52:05 +0000119commas:
Guido van Rossum470be141995-03-17 16:07:09 +0000120
Fred Drake19479911998-02-13 06:58:54 +0000121\begin{verbatim}
Fred Drake84e58ab2001-08-11 03:28:41 +0000122ListType = type([])
123
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000124value = form.getvalue("username", "")
Fred Drake84e58ab2001-08-11 03:28:41 +0000125if isinstance(value, ListType):
Guido van Rossume47da0a1997-07-17 16:34:52 +0000126 # Multiple username fields specified
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000127 usernames = ",".join(value)
Guido van Rossume47da0a1997-07-17 16:34:52 +0000128else:
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000129 # Single or no username field specified
130 usernames = value
Fred Drake19479911998-02-13 06:58:54 +0000131\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +0000132
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000133If a field represents an uploaded file, accessing the value via the
134\member{value} attribute or the \function{getvalue()} method reads the
Fred Drake6ef871c1998-03-12 06:52:05 +0000135entire file in memory as a string. This may not be what you want.
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000136You can test for an uploaded file by testing either the \member{filename}
137attribute or the \member{file} attribute. You can then read the data at
138leisure from the \member{file} attribute:
Guido van Rossuma29cc971996-07-30 18:22:07 +0000139
Fred Drake19479911998-02-13 06:58:54 +0000140\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +0000141fileitem = form["userfile"]
142if fileitem.file:
143 # It's an uploaded file; count lines
144 linecount = 0
145 while 1:
146 line = fileitem.file.readline()
147 if not line: break
148 linecount = linecount + 1
Fred Drake19479911998-02-13 06:58:54 +0000149\end{verbatim}
Guido van Rossuma29cc971996-07-30 18:22:07 +0000150
Fred Drake6ef871c1998-03-12 06:52:05 +0000151The file upload draft standard entertains the possibility of uploading
152multiple files from one field (using a recursive
153\mimetype{multipart/*} encoding). When this occurs, the item will be
154a dictionary-like \class{FieldStorage} item. This can be determined
155by testing its \member{type} attribute, which should be
156\mimetype{multipart/form-data} (or perhaps another MIME type matching
Fred Drake7eca8e51999-01-18 15:46:02 +0000157\mimetype{multipart/*}). In this case, it can be iterated over
Fred Drake6ef871c1998-03-12 06:52:05 +0000158recursively just like the top-level form object.
159
160When a form is submitted in the ``old'' format (as the query string or
161as a single data part of type
162\mimetype{application/x-www-form-urlencoded}), the items will actually
163be instances of the class \class{MiniFieldStorage}. In this case, the
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000164\member{list}, \member{file}, and \member{filename} attributes are
165always \code{None}.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000166
167
168\subsection{Old classes}
169
Fred Drake6ef871c1998-03-12 06:52:05 +0000170These classes, present in earlier versions of the \module{cgi} module,
171are still supported for backward compatibility. New applications
172should use the \class{FieldStorage} class.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000173
Fred Drake6ef871c1998-03-12 06:52:05 +0000174\class{SvFormContentDict} stores single value form content as
175dictionary; it assumes each field name occurs in the form only once.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000176
Fred Drake6ef871c1998-03-12 06:52:05 +0000177\class{FormContentDict} stores multiple value form content as a
178dictionary (the form items are lists of values). Useful if your form
179contains multiple fields with the same name.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000180
Fred Drake6ef871c1998-03-12 06:52:05 +0000181Other classes (\class{FormContent}, \class{InterpFormContentDict}) are
182present for backwards compatibility with really old applications only.
183If you still use these and would be inconvenienced when they
184disappeared from a next version of this module, drop me a note.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000185
186
187\subsection{Functions}
Fred Drake4b3f0311996-12-13 22:04:31 +0000188\nodename{Functions in cgi module}
Guido van Rossuma29cc971996-07-30 18:22:07 +0000189
190These are useful if you want more control, or if you want to employ
191some of the algorithms implemented in this module in other
192circumstances.
193
Guido van Rossum81e479a1997-08-25 18:28:03 +0000194\begin{funcdesc}{parse}{fp}
Fred Drake6ef871c1998-03-12 06:52:05 +0000195Parse a query in the environment or from a file (default
196\code{sys.stdin}).
Guido van Rossuma29cc971996-07-30 18:22:07 +0000197\end{funcdesc}
198
Guido van Rossum66ab4e81999-06-10 03:11:41 +0000199\begin{funcdesc}{parse_qs}{qs\optional{, keep_blank_values, strict_parsing}}
Fred Drake6ef871c1998-03-12 06:52:05 +0000200Parse a query string given as a string argument (data of type
Guido van Rossum66ab4e81999-06-10 03:11:41 +0000201\mimetype{application/x-www-form-urlencoded}). Data are
202returned as a dictionary. The dictionary keys are the unique query
Fred Drake38e5d272000-04-03 20:13:55 +0000203variable names and the values are lists of values for each name.
Guido van Rossum66ab4e81999-06-10 03:11:41 +0000204
205The optional argument \var{keep_blank_values} is
206a flag indicating whether blank values in
207URL encoded queries should be treated as blank strings.
208A true value indicates that blanks should be retained as
209blank strings. The default false value indicates that
210blank values are to be ignored and treated as if they were
211not included.
212
213The optional argument \var{strict_parsing} is a flag indicating what
214to do with parsing errors. If false (the default), errors
215are silently ignored. If true, errors raise a ValueError
216exception.
217\end{funcdesc}
218
219\begin{funcdesc}{parse_qsl}{qs\optional{, keep_blank_values, strict_parsing}}
220Parse a query string given as a string argument (data of type
221\mimetype{application/x-www-form-urlencoded}). Data are
222returned as a list of name, value pairs.
223
224The optional argument \var{keep_blank_values} is
225a flag indicating whether blank values in
226URL encoded queries should be treated as blank strings.
227A true value indicates that blanks should be retained as
228blank strings. The default false value indicates that
229blank values are to be ignored and treated as if they were
230not included.
231
232The optional argument \var{strict_parsing} is a flag indicating what
233to do with parsing errors. If false (the default), errors
234are silently ignored. If true, errors raise a ValueError
235exception.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000236\end{funcdesc}
237
Fred Drakecce10901998-03-17 06:33:25 +0000238\begin{funcdesc}{parse_multipart}{fp, pdict}
Fred Drake6ef871c1998-03-12 06:52:05 +0000239Parse input of type \mimetype{multipart/form-data} (for
240file uploads). Arguments are \var{fp} for the input file and
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000241\var{pdict} for a dictionary containing other parameters in
Fred Drake84e58ab2001-08-11 03:28:41 +0000242the \mailheader{Content-Type} header.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000243
Fred Drake6ef871c1998-03-12 06:52:05 +0000244Returns a dictionary just like \function{parse_qs()} keys are the
245field names, each value is a list of values for that field. This is
246easy to use but not much good if you are expecting megabytes to be
247uploaded --- in that case, use the \class{FieldStorage} class instead
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000248which is much more flexible.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000249
Fred Drake6ef871c1998-03-12 06:52:05 +0000250Note that this does not parse nested multipart parts --- use
251\class{FieldStorage} for that.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000252\end{funcdesc}
253
Guido van Rossum81e479a1997-08-25 18:28:03 +0000254\begin{funcdesc}{parse_header}{string}
Fred Drake84e58ab2001-08-11 03:28:41 +0000255Parse a MIME header (such as \mailheader{Content-Type}) into a main
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000256value and a dictionary of parameters.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000257\end{funcdesc}
258
Guido van Rossum81e479a1997-08-25 18:28:03 +0000259\begin{funcdesc}{test}{}
Fred Drake6ef871c1998-03-12 06:52:05 +0000260Robust test CGI script, usable as main program.
261Writes minimal HTTP headers and formats all information provided to
262the script in HTML form.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000263\end{funcdesc}
264
Guido van Rossum81e479a1997-08-25 18:28:03 +0000265\begin{funcdesc}{print_environ}{}
Fred Drake6ef871c1998-03-12 06:52:05 +0000266Format the shell environment in HTML.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000267\end{funcdesc}
268
Guido van Rossum81e479a1997-08-25 18:28:03 +0000269\begin{funcdesc}{print_form}{form}
Fred Drake6ef871c1998-03-12 06:52:05 +0000270Format a form in HTML.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000271\end{funcdesc}
272
Guido van Rossum81e479a1997-08-25 18:28:03 +0000273\begin{funcdesc}{print_directory}{}
Fred Drake6ef871c1998-03-12 06:52:05 +0000274Format the current directory in HTML.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000275\end{funcdesc}
276
Guido van Rossum81e479a1997-08-25 18:28:03 +0000277\begin{funcdesc}{print_environ_usage}{}
Fred Drake6ef871c1998-03-12 06:52:05 +0000278Print a list of useful (used by CGI) environment variables in
Guido van Rossuma29cc971996-07-30 18:22:07 +0000279HTML.
280\end{funcdesc}
281
Fred Drakecce10901998-03-17 06:33:25 +0000282\begin{funcdesc}{escape}{s\optional{, quote}}
Fred Drake6ef871c1998-03-12 06:52:05 +0000283Convert the characters
284\character{\&}, \character{<} and \character{>} in string \var{s} to
285HTML-safe sequences. Use this if you need to display text that might
286contain such characters in HTML. If the optional flag \var{quote} is
Fred Drake84e58ab2001-08-11 03:28:41 +0000287true, the double-quote character (\character{"}) is also translated;
Fred Drake91f2f262001-07-06 19:28:48 +0000288this helps for inclusion in an HTML attribute value, as in \code{<A
Fred Drake84e58ab2001-08-11 03:28:41 +0000289HREF="...">}. If the value to be qouted might include single- or
290double-quote characters, or both, consider using the
291\function{quoteattr()} function in the \refmodule{xml.sax.saxutils}
292module instead.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000293\end{funcdesc}
294
295
296\subsection{Caring about security}
297
Fred Drake91f2f262001-07-06 19:28:48 +0000298There's one important rule: if you invoke an external program (via the
299\function{os.system()} or \function{os.popen()} functions. or others
300with similar functionality), make very sure you don't pass arbitrary
301strings received from the client to the shell. This is a well-known
Fred Drake8ee679f2001-07-14 02:50:55 +0000302security hole whereby clever hackers anywhere on the Web can exploit a
Fred Drake91f2f262001-07-06 19:28:48 +0000303gullible CGI script to invoke arbitrary shell commands. Even parts of
304the URL or field names cannot be trusted, since the request doesn't
305have to come from your form!
Guido van Rossuma29cc971996-07-30 18:22:07 +0000306
307To be on the safe side, if you must pass a string gotten from a form
308to a shell command, you should make sure the string contains only
309alphanumeric characters, dashes, underscores, and periods.
310
311
312\subsection{Installing your CGI script on a Unix system}
313
314Read the documentation for your HTTP server and check with your local
315system administrator to find the directory where CGI scripts should be
Fred Drakea2e268a1997-12-09 03:28:42 +0000316installed; usually this is in a directory \file{cgi-bin} in the server tree.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000317
318Make sure that your script is readable and executable by ``others''; the
Fred Drake6ef871c1998-03-12 06:52:05 +0000319\UNIX{} file mode should be \code{0755} octal (use \samp{chmod 0755
Fred Drake7eca8e51999-01-18 15:46:02 +0000320\var{filename}}). Make sure that the first line of the script contains
Fred Drake6ef871c1998-03-12 06:52:05 +0000321\code{\#!} starting in column 1 followed by the pathname of the Python
322interpreter, for instance:
Guido van Rossuma29cc971996-07-30 18:22:07 +0000323
Fred Drake19479911998-02-13 06:58:54 +0000324\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +0000325#!/usr/local/bin/python
Fred Drake19479911998-02-13 06:58:54 +0000326\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +0000327
Guido van Rossuma29cc971996-07-30 18:22:07 +0000328Make sure the Python interpreter exists and is executable by ``others''.
329
330Make sure that any files your script needs to read or write are
Fred Drake6ef871c1998-03-12 06:52:05 +0000331readable or writable, respectively, by ``others'' --- their mode
332should be \code{0644} for readable and \code{0666} for writable. This
333is because, for security reasons, the HTTP server executes your script
334as user ``nobody'', without any special privileges. It can only read
335(write, execute) files that everybody can read (write, execute). The
336current directory at execution time is also different (it is usually
337the server's cgi-bin directory) and the set of environment variables
Fred Drake8ee679f2001-07-14 02:50:55 +0000338is also different from what you get when you log in. In particular, don't
Fred Drake6ef871c1998-03-12 06:52:05 +0000339count on the shell's search path for executables (\envvar{PATH}) or
340the Python module search path (\envvar{PYTHONPATH}) to be set to
341anything interesting.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000342
343If you need to load modules from a directory which is not on Python's
344default module search path, you can change the path in your script,
Fred Drake91f2f262001-07-06 19:28:48 +0000345before importing other modules. For example:
Guido van Rossuma29cc971996-07-30 18:22:07 +0000346
Fred Drake19479911998-02-13 06:58:54 +0000347\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +0000348import sys
349sys.path.insert(0, "/usr/home/joe/lib/python")
350sys.path.insert(0, "/usr/local/lib/python")
Fred Drake19479911998-02-13 06:58:54 +0000351\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +0000352
Guido van Rossuma29cc971996-07-30 18:22:07 +0000353(This way, the directory inserted last will be searched first!)
354
Fred Drakeefc1e0f1998-01-13 19:00:33 +0000355Instructions for non-\UNIX{} systems will vary; check your HTTP server's
Guido van Rossuma29cc971996-07-30 18:22:07 +0000356documentation (it will usually have a section on CGI scripts).
357
358
359\subsection{Testing your CGI script}
360
361Unfortunately, a CGI script will generally not run when you try it
362from the command line, and a script that works perfectly from the
363command line may fail mysteriously when run from the server. There's
364one reason why you should still test your script from the command
Fred Drake6a79be81998-04-03 03:47:03 +0000365line: if it contains a syntax error, the Python interpreter won't
Guido van Rossuma29cc971996-07-30 18:22:07 +0000366execute it at all, and the HTTP server will most likely send a cryptic
367error to the client.
368
369Assuming your script has no syntax errors, yet it does not work, you
Fred Drake6ef871c1998-03-12 06:52:05 +0000370have no choice but to read the next section.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000371
372
373\subsection{Debugging CGI scripts}
374
Fred Drake6ef871c1998-03-12 06:52:05 +0000375First of all, check for trivial installation errors --- reading the
Guido van Rossuma29cc971996-07-30 18:22:07 +0000376section above on installing your CGI script carefully can save you a
377lot of time. If you wonder whether you have understood the
378installation procedure correctly, try installing a copy of this module
Fred Drakea2e268a1997-12-09 03:28:42 +0000379file (\file{cgi.py}) as a CGI script. When invoked as a script, the file
Guido van Rossuma29cc971996-07-30 18:22:07 +0000380will dump its environment and the contents of the form in HTML form.
381Give it the right mode etc, and send it a request. If it's installed
Fred Drakea2e268a1997-12-09 03:28:42 +0000382in the standard \file{cgi-bin} directory, it should be possible to send it a
Guido van Rossuma29cc971996-07-30 18:22:07 +0000383request by entering a URL into your browser of the form:
384
Fred Drake19479911998-02-13 06:58:54 +0000385\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +0000386http://yourhostname/cgi-bin/cgi.py?name=Joe+Blow&addr=At+Home
Fred Drake19479911998-02-13 06:58:54 +0000387\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +0000388
Guido van Rossuma29cc971996-07-30 18:22:07 +0000389If this gives an error of type 404, the server cannot find the script
390-- perhaps you need to install it in a different directory. If it
Fred Drake91f2f262001-07-06 19:28:48 +0000391gives another error, there's an installation problem that
Guido van Rossuma29cc971996-07-30 18:22:07 +0000392you should fix before trying to go any further. If you get a nicely
393formatted listing of the environment and form content (in this
394example, the fields should be listed as ``addr'' with value ``At Home''
Fred Drakea2e268a1997-12-09 03:28:42 +0000395and ``name'' with value ``Joe Blow''), the \file{cgi.py} script has been
Guido van Rossuma29cc971996-07-30 18:22:07 +0000396installed correctly. If you follow the same procedure for your own
397script, you should now be able to debug it.
398
Fred Drake6ef871c1998-03-12 06:52:05 +0000399The next step could be to call the \module{cgi} module's
400\function{test()} function from your script: replace its main code
401with the single statement
Guido van Rossuma29cc971996-07-30 18:22:07 +0000402
Fred Drake19479911998-02-13 06:58:54 +0000403\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +0000404cgi.test()
Fred Drake19479911998-02-13 06:58:54 +0000405\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +0000406
Guido van Rossuma29cc971996-07-30 18:22:07 +0000407This should produce the same results as those gotten from installing
Fred Drakea2e268a1997-12-09 03:28:42 +0000408the \file{cgi.py} file itself.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000409
Fred Drake91f2f262001-07-06 19:28:48 +0000410When an ordinary Python script raises an unhandled exception (for
411whatever reason: of a typo in a module name, a file that can't be
412opened, etc.), the Python interpreter prints a nice traceback and
413exits. While the Python interpreter will still do this when your CGI
414script raises an exception, most likely the traceback will end up in
415one of the HTTP server's log file, or be discarded altogether.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000416
417Fortunately, once you have managed to get your script to execute
Fred Drake6ef871c1998-03-12 06:52:05 +0000418\emph{some} code, it is easy to catch exceptions and cause a traceback
419to be printed. The \function{test()} function below in this module is
420an example. Here are the rules:
Guido van Rossuma29cc971996-07-30 18:22:07 +0000421
422\begin{enumerate}
Fred Drake6ef871c1998-03-12 06:52:05 +0000423\item Import the traceback module before entering the \keyword{try}
424 ... \keyword{except} statement
425
426\item Assign \code{sys.stderr} to be \code{sys.stdout}
427
428\item Make sure you finish printing the headers and the blank line
429 early
430
431\item Wrap all remaining code in a \keyword{try} ... \keyword{except}
432 statement
433
434\item In the except clause, call \function{traceback.print_exc()}
Guido van Rossuma29cc971996-07-30 18:22:07 +0000435\end{enumerate}
436
437For example:
438
Fred Drake19479911998-02-13 06:58:54 +0000439\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +0000440import sys
441import traceback
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000442print "Content-Type: text/html"
Guido van Rossume47da0a1997-07-17 16:34:52 +0000443print
444sys.stderr = sys.stdout
445try:
446 ...your code here...
447except:
448 print "\n\n<PRE>"
449 traceback.print_exc()
Fred Drake19479911998-02-13 06:58:54 +0000450\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +0000451
452Notes: The assignment to \code{sys.stderr} is needed because the
453traceback prints to \code{sys.stderr}.
Guido van Rossum9d62e801997-11-25 00:35:44 +0000454The \code{print "{\e}n{\e}n<PRE>"} statement is necessary to
Guido van Rossuma29cc971996-07-30 18:22:07 +0000455disable the word wrapping in HTML.
456
457If you suspect that there may be a problem in importing the traceback
458module, you can use an even more robust approach (which only uses
459built-in modules):
460
Fred Drake19479911998-02-13 06:58:54 +0000461\begin{verbatim}
Guido van Rossume47da0a1997-07-17 16:34:52 +0000462import sys
463sys.stderr = sys.stdout
Moshe Zadkaa1a4b592000-08-25 21:47:56 +0000464print "Content-Type: text/plain"
Guido van Rossume47da0a1997-07-17 16:34:52 +0000465print
466...your code here...
Fred Drake19479911998-02-13 06:58:54 +0000467\end{verbatim}
Fred Drake6ef871c1998-03-12 06:52:05 +0000468
Guido van Rossuma29cc971996-07-30 18:22:07 +0000469This relies on the Python interpreter to print the traceback. The
470content type of the output is set to plain text, which disables all
471HTML processing. If your script works, the raw HTML will be displayed
472by your client. If it raises an exception, most likely after the
473first two lines have been printed, a traceback will be displayed.
474Because no HTML interpretation is going on, the traceback will
475readable.
476
477
478\subsection{Common problems and solutions}
Guido van Rossum470be141995-03-17 16:07:09 +0000479
480\begin{itemize}
Guido van Rossuma29cc971996-07-30 18:22:07 +0000481\item Most HTTP servers buffer the output from CGI scripts until the
482script is completed. This means that it is not possible to display a
483progress report on the client's display while the script is running.
484
485\item Check the installation instructions above.
486
Fred Drake6ef871c1998-03-12 06:52:05 +0000487\item Check the HTTP server's log files. (\samp{tail -f logfile} in a
488separate window may be useful!)
Guido van Rossuma29cc971996-07-30 18:22:07 +0000489
490\item Always check a script for syntax errors first, by doing something
Fred Drake6ef871c1998-03-12 06:52:05 +0000491like \samp{python script.py}.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000492
493\item When using any of the debugging techniques, don't forget to add
Fred Drake6ef871c1998-03-12 06:52:05 +0000494\samp{import sys} to the top of the script.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000495
496\item When invoking external programs, make sure they can be found.
Fred Drake6ef871c1998-03-12 06:52:05 +0000497Usually, this means using absolute path names --- \envvar{PATH} is
498usually not set to a very useful value in a CGI script.
Guido van Rossuma29cc971996-07-30 18:22:07 +0000499
500\item When reading or writing external files, make sure they can be read
501or written by every user on the system.
502
503\item Don't try to give a CGI script a set-uid mode. This doesn't work on
504most systems, and is a security liability as well.
Guido van Rossum470be141995-03-17 16:07:09 +0000505\end{itemize}
506