*******************************
  HOWTO Use Python in the web
*******************************

:Author: Marek Kubica

.. topic:: Abstract

   This document shows how Python fits into the web. It presents some ways
   to integrate Python with a web server, and general practices useful for
   developing web sites.


Programming for the Web has become a hot topic since the rise of "Web 2.0",
which focuses on user-generated content on web sites. It has always been
possible to use Python for creating web sites, but it was a rather tedious task.
Therefore, many frameworks and helper tools have been created to assist
developers in creating faster and more robust sites. This HOWTO describes
some of the methods used to combine Python with a web server to create
dynamic content. It is not meant as a complete introduction, as this topic is
far too broad to be covered in one single document. However, a short overview
of the most popular libraries is provided.

.. seealso::

   While this HOWTO tries to give an overview of Python in the web, it cannot
   always be as up to date as desired. Web development in Python is rapidly
   moving forward, so the wiki page on `Web Programming
   <http://wiki.python.org/moin/WebProgramming>`_ may be more in sync with
   recent development.


The Low-Level View
==================

When a user visits a web site, their browser makes a connection to the site's
web server and asks for a resource (this is called the *request*). For a
static site, the server looks up the corresponding file in the file system and
sends it back to the user's browser, which displays it (this is the
*response*). This is roughly how the underlying protocol, HTTP, works.

Dynamic web sites are not based on files in the file system, but rather on
programs which are run by the web server when a request comes in, and which
*generate* the content that is returned to the user. They can do all sorts of
useful things, like display the postings of a bulletin board, show your email,
configure software, or just display the current time. These programs can be
written in any programming language the server supports. Since most servers
support Python, it is easy to use Python to create dynamic web sites.

Most HTTP servers are written in C or C++, so they cannot execute Python code
directly -- a bridge is needed between the server and the program. These
bridges, or rather interfaces, define how programs interact with the server.
There have been numerous attempts to create the best possible interface, but
there are only a few worth mentioning.

Not every web server supports every interface. Many web servers only support
old, now-obsolete interfaces; however, they can often be extended using
third-party modules to support newer ones.
Georg Brandl0f3629d2008-09-07 17:00:17 +000058
59
60Common Gateway Interface
61------------------------
62
Ezio Melottic0deec12010-04-05 03:51:38 +000063This interface, most commonly referred to as "CGI", is the oldest, and is
64supported by nearly every web server out of the box. Programs using CGI to
65communicate with their web server need to be started by the server for every
66request. So, every request starts a new Python interpreter -- which takes some
67time to start up -- thus making the whole interface only usable for low load
68situations.
Georg Brandl0f3629d2008-09-07 17:00:17 +000069
Ezio Melottic0deec12010-04-05 03:51:38 +000070The upside of CGI is that it is simple -- writing a Python program which uses
71CGI is a matter of about three lines of code. This simplicity comes at a
72price: it does very few things to help the developer.
Georg Brandl0f3629d2008-09-07 17:00:17 +000073
Ezio Melottic0deec12010-04-05 03:51:38 +000074Writing CGI programs, while still possible, is no longer recommended. With
75:ref:`WSGI <WSGI>`, a topic covered later in this document, it is possible to write
76programs that emulate CGI, so they can be run as CGI if no better option is
77available.

.. seealso::

   The Python standard library includes some modules that are helpful for
   creating plain CGI programs:

   * :mod:`cgi` -- Handling of user input in CGI scripts
   * :mod:`cgitb` -- Displays nice tracebacks when errors happen in CGI
     applications, instead of presenting a "500 Internal Server Error" message

   The Python wiki features a page on `CGI scripts
   <http://wiki.python.org/moin/CgiScripts>`_ with some additional information
   about CGI in Python.


Simple script for testing CGI
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

To test whether your web server works with CGI, you can use this short and
simple CGI program::

    #!/usr/bin/env python
    # -*- coding: UTF-8 -*-

    # enable debugging
    import cgitb
    cgitb.enable()

    print "Content-Type: text/plain;charset=utf-8"
    print

    print "Hello World!"

Depending on your web server configuration, you may need to save this code with
a ``.py`` or ``.cgi`` extension. This file may also need to be in a
``cgi-bin`` folder, for security reasons.

You might wonder what the ``cgitb`` line is about. This line makes it possible
to display a nice traceback instead of just crashing and displaying an "Internal
Server Error" in the user's browser. This is useful for debugging, but it might
risk exposing some confidential data to the user. You should not use ``cgitb``
in production code for this reason. You should *always* catch exceptions, and
display proper error pages -- end-users don't like to see nondescript "Internal
Server Errors" in their browsers.
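
One way to follow that advice is sketched below. It is written in Python 3
syntax, unlike the Python 2 test script above, and the names ``run_cgi`` and
``hello`` are made up for this illustration. The idea is to catch any
exception, log the traceback where only the administrator can see it, and
send the visitor a plain error page:

```python
import sys
import traceback


def run_cgi(handler, out=sys.stdout, log=sys.stderr):
    """Run ``handler`` and write a complete CGI response to ``out``."""
    try:
        body = handler()
        status = "200 OK"
    except Exception:
        # Record the details for the administrator, not for the visitor.
        traceback.print_exc(file=log)
        body = "Sorry, something went wrong. Please try again later.\n"
        status = "500 Internal Server Error"
    out.write("Status: %s\r\n" % status)
    out.write("Content-Type: text/plain;charset=utf-8\r\n")
    out.write("\r\n")  # a blank line separates the headers from the body
    out.write(body)


def hello():
    """Hypothetical page handler used only for illustration."""
    return "Hello World!\n"
```

A script would end with a call such as ``run_cgi(hello)``. Where the logged
traceback ends up depends on the web server; many servers copy a CGI
program's standard error stream into their error log.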


Setting up CGI on your own server
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

If you do not run your own web server, this section does not apply to you.
You can check whether CGI works as-is, and if it does not, you will need to
talk to the administrator of your web server. If it is a big host, you can try
filing a ticket asking for Python support.

If you are your own administrator or want to set up CGI for testing purposes on
your own computers, you have to configure it yourself. There is no single
way to configure CGI, as there are many web servers with different
configuration options. Currently the most widely used free web server is
`Apache HTTPd <http://httpd.apache.org/>`_, or Apache for short. Apache can be
easily installed on nearly every system using the system's package management
tool. `lighttpd <http://www.lighttpd.net>`_ is an alternative and is
said to have better performance. On many systems this server can also be
installed using the package management tool, so manually compiling the web
server may not be needed.

* On Apache you can take a look at the `Dynamic Content with CGI
  <http://httpd.apache.org/docs/2.2/howto/cgi.html>`_ tutorial, where everything
  is described. Most of the time it is enough just to set ``+ExecCGI``. The
  tutorial also describes the most common gotchas that might arise.

* On lighttpd you need to use the `CGI module
  <http://redmine.lighttpd.net/wiki/lighttpd/Docs:ModCGI>`_, which can be configured
  in a straightforward way. It boils down to setting ``cgi.assign`` properly.


Common problems with CGI scripts
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Using CGI sometimes leads to small annoyances while trying to get these
scripts to run. Sometimes a seemingly correct script does not work as
expected, the cause being some small hidden problem that's difficult to spot.

Some of these potential problems are:

* The Python script is not marked as executable. When a CGI script is not
  executable, most web servers will let the user download it instead of
  running it and sending the output to the user. For CGI scripts to run
  properly on Unix-like operating systems, the ``+x`` bit needs to be set.
  Using ``chmod a+x your_script.py`` may solve this problem.

* On a Unix-like system, the line endings in the program file must be Unix
  style line endings. This is important because the web server checks the
  first line of the script (called shebang) and tries to run the program
  specified there. It gets easily confused by Windows line endings (Carriage
  Return & Line Feed, also called CRLF), so you have to convert the file to
  Unix line endings (only Line Feed, LF). This can be done automatically by
  uploading the file via FTP in text mode instead of binary mode, but the
  preferred way is just telling your editor to save the files with Unix line
  endings. Most editors support this.

* Your web server must be able to read the file, and you need to make sure the
  permissions are correct. On Unix-like systems, the server often runs as user
  and group ``www-data``, so it might be worth a try to change the file
  ownership, or to make the file world readable by using ``chmod a+r
  your_script.py``.

* The web server must know that the file you're trying to access is a CGI script.
  Check the configuration of your web server, as it may be configured
  to expect a specific file extension for CGI scripts.

* On Unix-like systems, the path to the interpreter in the shebang
  (``#!/usr/bin/env python``) must be correct. This line calls
  ``/usr/bin/env`` to find Python, but it will fail if there is no
  ``/usr/bin/env``, or if Python is not in the web server's path. If you know
  where your Python is installed, you can also use that full path. The
  commands ``whereis python`` and ``type -p python`` could help you find
  where it is installed. Once you know the path, you can change the shebang
  accordingly: ``#!/usr/bin/python``.

* The file must not contain a BOM (Byte Order Mark). The BOM is meant for
  determining the byte order of UTF-16 and UTF-32 encodings, but some editors
  write this also into UTF-8 files. The BOM interferes with the shebang line,
  so be sure to tell your editor not to write the BOM.

* If the web server is using :ref:`mod-python`, ``mod_python`` may be having
  problems. ``mod_python`` is able to handle CGI scripts by itself, but it can
  also be a source of issues.
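
Several of these checks can be automated. The helper below is only a sketch
in Python 3 syntax: the name ``check_cgi_script`` is invented for this
example, it assumes a Unix-like system, and it covers only the problems that
can be detected by looking at the file itself (permissions, BOM, shebang and
line endings):

```python
import os
import stat

UTF8_BOM = b"\xef\xbb\xbf"


def check_cgi_script(path):
    """Return a list of human-readable problems found in the script at ``path``."""
    problems = []
    mode = os.stat(path).st_mode
    if not mode & (stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH):
        problems.append("not executable (try chmod a+x)")
    if not mode & stat.S_IROTH:
        problems.append("not world readable (try chmod a+r)")
    # Read the first line as bytes so that the BOM and CR are visible.
    with open(path, "rb") as script:
        first_line = script.readline()
    if first_line.startswith(UTF8_BOM):
        problems.append("starts with a UTF-8 byte order mark")
        first_line = first_line[len(UTF8_BOM):]
    if not first_line.startswith(b"#!"):
        problems.append("first line is not a shebang")
    if first_line.endswith(b"\r\n"):
        problems.append("uses Windows (CRLF) line endings")
    return problems
```

Calling ``check_cgi_script("your_script.py")`` returns an empty list when
none of these problems is present. Server-side issues, such as a missing
handler for the file extension, still have to be checked in the server
configuration.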


.. _mod-python:

mod_python
----------

People coming from PHP often find it hard to grasp how to use Python in the web.
Their first thought is usually `mod_python <http://www.modpython.org/>`_,
because they think that this is the equivalent of ``mod_php``. Actually, there
are many differences. What ``mod_python`` does is embed the interpreter into
the Apache process, thus speeding up requests by not having to start a Python
interpreter for each request. On the other hand, it is not "Python intermixed
with HTML" in the way that PHP is often intermixed with HTML. The Python
equivalent of that is a template engine. ``mod_python`` itself is much more
powerful and provides more access to Apache internals. It can emulate CGI,
work in a "Python Server Pages" mode (similar to JSP) which is "HTML
intermingled with Python", and it has a "Publisher" which designates one file
to accept all requests and decide what to do with them.

``mod_python`` does have some problems. Unlike the PHP interpreter, the Python
interpreter uses caching when executing files, so changes to a file will
require the web server to be restarted. Another problem is the basic concept
-- Apache starts child processes to handle the requests, and unfortunately
every child process needs to load the whole Python interpreter even if it does
not use it. This makes the whole web server slower. Another problem is that,
because ``mod_python`` is linked against a specific version of ``libpython``,
it is not possible to switch from an older version to a newer one (e.g. 2.4 to
2.5) without recompiling ``mod_python``. ``mod_python`` is also bound to the
Apache web server, so programs written for ``mod_python`` cannot easily run on
other web servers.

These are the reasons why ``mod_python`` should be avoided when writing new
programs. In some circumstances it still might be a good idea to use
``mod_python`` for deployment, but WSGI makes it possible to run WSGI programs
under ``mod_python`` as well.


FastCGI and SCGI
----------------

FastCGI and SCGI try to solve the performance problem of CGI in another way.
Instead of embedding the interpreter into the web server, they create
long-running background processes. There is still a module in the web server
which makes it possible for the web server to "speak" with the background
process. As the background process is independent of the server, it can be
written in any language, including Python. The language just needs to have a
library which handles the communication with the web server.

The difference between FastCGI and SCGI is very small, as SCGI is essentially
just a "simpler FastCGI". As the web server support for SCGI is limited,
most people use FastCGI instead, which works the same way. Almost everything
that applies to SCGI also applies to FastCGI, so we'll only cover the latter.

These days, FastCGI is rarely used directly. Just like ``mod_python``, it is
mostly used for the deployment of WSGI applications.

.. seealso::

   * `FastCGI, SCGI, and Apache: Background and Future
     <http://www.vmunix.com/mark/blog/archives/2006/01/02/fastcgi-scgi-and-apache-background-and-future/>`_
     is a discussion on why the concept of FastCGI and SCGI is better than that
     of mod_python.


Setting up FastCGI
^^^^^^^^^^^^^^^^^^

Each web server requires a specific module.

* Apache has both `mod_fastcgi <http://www.fastcgi.com/drupal/>`_ and `mod_fcgid
  <http://httpd.apache.org/mod_fcgid/>`_. ``mod_fastcgi`` is the original one, but it
  has some licensing issues, which is why it is sometimes considered non-free.
  ``mod_fcgid`` is a smaller, compatible alternative. One of these modules needs
  to be loaded by Apache.

* lighttpd ships its own `FastCGI module
  <http://redmine.lighttpd.net/wiki/lighttpd/Docs:ModFastCGI>`_ as well as an
  `SCGI module <http://redmine.lighttpd.net/wiki/lighttpd/Docs:ModSCGI>`_.

* `nginx <http://nginx.org/>`_ also supports `FastCGI
  <http://wiki.nginx.org/NginxSimplePythonFCGI>`_.

Once you have installed and configured the module, you can test it with the
following WSGI application::

    #!/usr/bin/env python
    # -*- coding: UTF-8 -*-

    from cgi import escape
    from flup.server.fcgi import WSGIServer

    def app(environ, start_response):
        start_response('200 OK', [('Content-Type', 'text/html')])

        yield '<h1>FastCGI Environment</h1>'
        yield '<table>'
        for k, v in sorted(environ.items()):
            # Some values (such as wsgi.input) are not strings, so convert
            # them before escaping.
            yield '<tr><th>%s</th><td>%s</td></tr>' % (escape(k), escape(str(v)))
        yield '</table>'

    WSGIServer(app).run()

This is a simple WSGI application, but you need to install `flup
<http://pypi.python.org/pypi/flup/1.0>`_ first, as flup handles the low level
FastCGI access.

.. seealso::

   There is some documentation on `setting up Django with FastCGI
   <http://docs.djangoproject.com/en/dev/howto/deployment/fastcgi/>`_, most of
   which can be reused for other WSGI-compliant frameworks and libraries.
   Only the ``manage.py`` part has to be changed; the example used here can be
   used instead. Django does more or less the exact same thing.


mod_wsgi
--------

`mod_wsgi <http://code.google.com/p/modwsgi/>`_ is an attempt to get rid of the
low level gateways. Given that FastCGI, SCGI, and mod_python are mostly used to
deploy WSGI applications, mod_wsgi was started to directly embed WSGI applications
into the Apache web server. mod_wsgi is specifically designed to host WSGI
applications. It makes the deployment of WSGI applications much easier than
deployment using other low level methods, which need glue code. The downside
is that mod_wsgi is limited to the Apache web server; other servers would need
their own implementations of mod_wsgi.

mod_wsgi supports two modes: embedded mode, in which it integrates with the
Apache process, and daemon mode, which is more FastCGI-like. Unlike FastCGI,
mod_wsgi handles the worker-processes by itself, which makes administration
easier.


.. _WSGI:

Step back: WSGI
===============

WSGI has already been mentioned several times, so it has to be something
important. In fact it really is, and now it is time to explain it.

The *Web Server Gateway Interface*, or WSGI for short, is defined in
:pep:`333` and is currently the best way to do Python web programming. While
it is great for programmers writing frameworks, a normal web developer does not
need to get in direct contact with it. When choosing a framework for web
development it is a good idea to choose one which supports WSGI.

The big benefit of WSGI is the unification of the application programming
interface. When your program is compatible with WSGI -- which at the outer
level means that the framework you are using has support for WSGI -- your
program can be deployed via any web server interface for which there are WSGI
wrappers. You do not need to care about whether the application user uses
mod_python or FastCGI or mod_wsgi -- with WSGI your application will work on
any gateway interface. The Python standard library contains its own WSGI
server, :mod:`wsgiref`, which is a small web server that can be used for
testing.
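
To make this concrete, here is a minimal sketch of a WSGI application in
Python 3 syntax, together with a helper that serves it using
:mod:`wsgiref`. The names ``application`` and ``serve``, and the host and
port, are choices made for this illustration only:

```python
from wsgiref.simple_server import make_server


def application(environ, start_response):
    """A minimal WSGI application: a callable taking two arguments."""
    body = b"Hello World!\n"
    start_response("200 OK", [
        ("Content-Type", "text/plain; charset=utf-8"),
        ("Content-Length", str(len(body))),
    ])
    # The return value is an iterable of byte strings.
    return [body]


def serve(host="localhost", port=8000):
    """Serve ``application`` with the standard library's test server."""
    httpd = make_server(host, port, application)
    httpd.serve_forever()
```

Calling ``serve()`` and opening ``http://localhost:8000/`` in a browser
should show the greeting. The same ``application`` callable can be handed
unchanged to any other WSGI server.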

A really great WSGI feature is middleware. Middleware is a layer around your
program which can add various functionality to it. There is quite a bit of
`middleware <http://www.wsgi.org/en/latest/libraries.html>`_ already
available. For example, instead of writing your own session management (HTTP
is a stateless protocol, so to associate multiple HTTP requests with a single
user your application must create and manage such state via a session), you can
just download middleware which does that, plug it in, and get on with coding
the unique parts of your application. The same goes for compression -- there
is existing middleware which handles compressing your HTML using gzip to save
on your server's bandwidth. Authentication is another problem easily solved
using existing middleware.
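
Middleware is simply a WSGI application that wraps another one. The
following sketch, in Python 3 syntax, adds one HTTP header to every response;
``add_header_middleware`` and ``hello_app`` are hypothetical names used only
here, and real middleware for sessions or compression is considerably more
involved:

```python
def add_header_middleware(app, name, value):
    """Wrap ``app`` so that every response carries an extra HTTP header."""
    def wrapped(environ, start_response):
        def custom_start_response(status, headers, exc_info=None):
            # Append our header, then hand over to the server's callable.
            headers = list(headers) + [(name, value)]
            return start_response(status, headers, exc_info)
        return app(environ, custom_start_response)
    return wrapped


def hello_app(environ, start_response):
    """Hypothetical application used only to demonstrate the wrapper."""
    start_response("200 OK", [("Content-Type", "text/plain")])
    return [b"Hello World!\n"]


application = add_header_middleware(hello_app, "X-Powered-By", "WSGI middleware")
```

Because the wrapped object is itself a WSGI application, several layers of
middleware can be stacked, and the server does not need to know about any of
them.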

Although WSGI may seem complex, the initial phase of learning can be very
rewarding because WSGI and the associated middleware already have solutions to
many problems that might arise while developing web sites.


WSGI Servers
------------

The code that is used to connect to various low level gateways like CGI or
mod_python is called a *WSGI server*. One of these servers is ``flup``, which
supports FastCGI and SCGI, as well as `AJP
<http://en.wikipedia.org/wiki/Apache_JServ_Protocol>`_. Some of these servers
are written in Python, as ``flup`` is, but there also exist others which are
written in C and can be used as drop-in replacements.

There are many servers already available, so a Python web application
can be deployed nearly anywhere. This is one big advantage that Python has
compared with other web technologies.
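
To get a feeling for what such a server does, the sketch below (Python 3
syntax) pushes one fake request through the CGI-style handler that ships
with :mod:`wsgiref` and captures the raw output in memory.
``run_through_cgi_gateway`` and ``hello_app`` are names invented for this
example; a real CGI script would normally use
``wsgiref.handlers.CGIHandler().run(app)`` instead, which reads the process
environment and writes to standard output:

```python
import io
from wsgiref.handlers import BaseCGIHandler
from wsgiref.util import setup_testing_defaults


def hello_app(environ, start_response):
    """Hypothetical WSGI application used only for illustration."""
    start_response("200 OK", [("Content-Type", "text/plain")])
    return [b"Hello World!\n"]


def run_through_cgi_gateway(app):
    """Run ``app`` once through a CGI-style gateway and return the raw output."""
    environ = {}
    setup_testing_defaults(environ)  # fill in a plausible fake request
    stdin, stdout, stderr = io.BytesIO(), io.BytesIO(), io.StringIO()
    BaseCGIHandler(stdin, stdout, stderr, environ).run(app)
    return stdout.getvalue()
```

The returned bytes contain a ``Status:`` line, the response headers, a blank
line and the body, which is the form a web server expects from a CGI
program. The application itself never had to know that CGI was involved.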

.. seealso::

   A good overview of WSGI-related code can be found in the `WSGI homepage
   <http://www.wsgi.org/en/latest/index.html>`_, which contains an extensive list of `WSGI servers
   <http://www.wsgi.org/en/latest/servers.html>`_ which can be used by *any* application
   supporting WSGI.

   You might be interested in some WSGI-supporting modules already contained in
   the standard library, namely:

   * :mod:`wsgiref` -- some tiny utilities and servers for WSGI


Case study: MoinMoin
--------------------

What does WSGI give the web application developer? Let's take a look at
an application that's been around for a while, which was written in
Python without using WSGI.

One of the most widely used wiki software packages is `MoinMoin
<http://moinmo.in/>`_. It was created in 2000, so it predates WSGI by about
three years. Older versions needed separate code to run on CGI, mod_python,
FastCGI and standalone.

It now includes support for WSGI. Using WSGI, it is possible to deploy
MoinMoin on any WSGI compliant server, with no additional glue code.
Unlike the pre-WSGI versions, this could include WSGI servers that the
authors of MoinMoin know nothing about.


Model-View-Controller
=====================

The term *MVC* is often encountered in statements such as "framework *foo*
supports MVC". MVC is more about the overall organization of code, rather than
any particular API. Many web frameworks use this model to help the developer
bring structure to their program. Bigger web applications can have lots of
code, so it is a good idea to have an effective structure right from the beginning.
That way, even users of other frameworks (or even other languages, since MVC is
not Python-specific) can easily understand the code, given that they are
already familiar with the MVC structure.

MVC stands for three components:

* The *model*. This is the data that will be displayed and modified. In
  Python frameworks, this component is often represented by the classes used by
  an object-relational mapper.

* The *view*. This component's job is to display the data of the model to the
  user. Typically this component is implemented via templates.

* The *controller*. This is the layer between the user and the model. The
  controller reacts to user actions (like opening some specific URL), tells
  the model to modify the data if necessary, and tells the view code what to
  display.

While one might think that MVC is a complex design pattern, in fact it is not.
It is used in Python because it has turned out to be useful for creating clean,
maintainable web sites.
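
The three roles can be sketched in a few lines of plain Python 3. This is
not how any particular framework does it; ``GuestBook``, ``render`` and
``handle_request`` are hypothetical names, and a real site would use an
object-relational mapper and a template engine instead:

```python
from html import escape
from string import Template


# Model: the data and the operations that change it.
class GuestBook:
    def __init__(self):
        self.entries = []

    def add(self, author):
        self.entries.append(author)


# View: turns model data into output, here with a trivial template.
PAGE = Template("<h1>Guest book</h1><p>Entries: $count, latest: $latest</p>")


def render(book):
    latest = book.entries[-1] if book.entries else "nobody"
    return PAGE.substitute(count=len(book.entries), latest=escape(latest))


# Controller: maps a user action to a model change and picks the view.
def handle_request(book, path, author=None):
    if path == "/sign" and author:
        book.add(author)
    return render(book)
```

For example, ``handle_request(book, "/sign", author="Dinsdale")`` updates the
model and returns the rendered page. The model knows nothing about HTML and
the view contains no application logic, which is the separation MVC is
about.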

.. note::

   While not all Python frameworks explicitly support MVC, it is often trivial
   to create a web site which uses the MVC pattern by separating the data logic
   (the model) from the user interaction logic (the controller) and the
   templates (the view). That's why it is important not to write unnecessary
   Python code in the templates -- it works against the MVC model and creates
   chaos in the code base, making it harder to understand and modify.

.. seealso::

   The English Wikipedia has an article about the `Model-View-Controller pattern
   <http://en.wikipedia.org/wiki/Model-view-controller>`_. It includes a long
   list of web frameworks for various programming languages.


Ingredients for Websites
========================

Websites are complex constructs, so tools have been created to help web
developers make their code easier to write and more maintainable. Tools like
these exist for all web frameworks in all languages. Developers are not forced
to use these tools, and often there is no "best" tool. It is worth learning
about the available tools because they can greatly simplify the process of
developing a web site.


.. seealso::

   There are far more components than can be presented here. The Python wiki
   has a page about these components, called
   `Web Components <http://wiki.python.org/moin/WebComponents>`_.


Templates
---------

Mixing of HTML and Python code is made possible by a few libraries. While
convenient at first, it leads to horribly unmaintainable code. That's why
templates exist. Templates are, in the simplest case, just HTML files with
placeholders. The HTML is sent to the user's browser after filling in the
placeholders.

Python already includes two ways to build simple templates::

    >>> template = "<html><body><h1>Hello %s!</h1></body></html>"
    >>> print template % "Reader"
    <html><body><h1>Hello Reader!</h1></body></html>

    >>> from string import Template
    >>> template = Template("<html><body><h1>Hello ${name}!</h1></body></html>")
    >>> print template.substitute(dict(name='Dinsdale'))
    <html><body><h1>Hello Dinsdale!</h1></body></html>

To generate complex HTML based on non-trivial model data, conditional
and looping constructs like Python's *for* and *if* are generally needed.
*Template engines* support templates of this complexity.
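
To see why, consider what rendering a list looks like with only the standard
library. In the sketch below (Python 3 syntax, with ``render_postings`` and
the data layout made up for this example) the loop and the condition have to
live in Python code rather than in the template:

```python
from html import escape
from string import Template

ROW = Template("<li>$title by $author</li>")
PAGE = Template("<html><body><ul>$rows</ul></body></html>")


def render_postings(postings):
    """Render a list of ``{"title": ..., "author": ...}`` dictionaries."""
    if not postings:
        rows = "<li>No postings yet.</li>"
    else:
        # One substitution per posting; user data is escaped by hand.
        rows = "".join(
            ROW.substitute(title=escape(p["title"]), author=escape(p["author"]))
            for p in postings
        )
    return PAGE.substitute(rows=rows)
```

This works, but the page layout is now split between two templates and a
function, and every value must be escaped manually. A template engine lets
the loop and the condition move into the template itself.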

There are a lot of template engines available for Python which can be used with
or without a `framework`_. Some of these define a plain-text programming
language which is easy to learn, partly because it is limited in scope.
Others use XML, and the template output is guaranteed to always be valid
XML. There are many other variations.

Some `frameworks`_ ship their own template engine or recommend one in
particular. In the absence of a reason to use a different template engine,
using the one provided by or recommended by the framework is a good idea.

Popular template engines include:

   * `Mako <http://www.makotemplates.org/>`_
   * `Genshi <http://genshi.edgewall.org/>`_
   * `Jinja <http://jinja.pocoo.org/2/>`_

.. seealso::

   There are many template engines competing for attention, because it is
   pretty easy to create them in Python. The page `Templating
   <http://wiki.python.org/moin/Templating>`_ in the wiki lists a big,
   ever-growing number of these. The three listed above are considered "second
   generation" template engines and are a good place to start.
Georg Brandl0f3629d2008-09-07 17:00:17 +0000538
539
540Data persistence
541----------------
542
Ezio Melottic0deec12010-04-05 03:51:38 +0000543*Data persistence*, while sounding very complicated, is just about storing data.
544This data might be the text of blog entries, the postings on a bulletin board or
545the text of a wiki page. There are, of course, a number of different ways to store
546information on a web server.

Often, relational database engines like `MySQL <http://www.mysql.com/>`_ or
`PostgreSQL <http://www.postgresql.org/>`_ are used because of their good
performance when handling very large databases consisting of millions of
entries. There is also a small database engine called `SQLite
<http://www.sqlite.org/>`_, which is bundled with Python in the :mod:`sqlite3`
module, and which uses only one file. It has no other dependencies. For
smaller sites, SQLite is often all that is needed.
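
As a minimal sketch of the :mod:`sqlite3` module, the following stores and
reads back a blog entry. The table and column names are invented for this
example, and an in-memory database is used so that nothing is written to disk.

```python
import sqlite3

# ":memory:" keeps the database in RAM; pass a file name to persist it.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entries (id INTEGER PRIMARY KEY, title TEXT, body TEXT)")

# "?" placeholders let the driver quote the values safely.
conn.execute("INSERT INTO entries (title, body) VALUES (?, ?)",
             ("First post", "Hello, web!"))
conn.commit()

# Read the stored rows back as a list of tuples.
rows = conn.execute("SELECT id, title FROM entries ORDER BY id").fetchall()
print(rows)
```

Passing values through placeholders instead of building the SQL string by hand
is what protects the query against SQL injection.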

Relational databases are *queried* using a language called `SQL
<http://en.wikipedia.org/wiki/SQL>`_. Many Python programmers would rather
work with objects than write SQL by hand. It is possible to save
Python objects into a database using a technology called `ORM
<http://en.wikipedia.org/wiki/Object-relational_mapping>`_ (Object Relational
Mapping). ORM translates all object-oriented access into SQL code under the
hood, so the developer does not need to think about it. Most `frameworks`_ use
ORMs, and this approach works quite well.
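
The idea can be sketched with a tiny hand-written mapper. This is not how any
particular ORM is implemented; ``Entry`` and ``EntryMapper`` are hypothetical
names, and real ORMs generate the SQL from class definitions instead of
spelling it out per class.

```python
import sqlite3


class Entry(object):
    """Plain Python object that the sketch maps to a row in 'entries'."""

    def __init__(self, title, body, id=None):
        self.id = id
        self.title = title
        self.body = body


class EntryMapper(object):
    """Hypothetical, minimal mapper: turns object access into SQL."""

    def __init__(self, conn):
        self.conn = conn
        conn.execute("CREATE TABLE IF NOT EXISTS entries "
                     "(id INTEGER PRIMARY KEY, title TEXT, body TEXT)")

    def save(self, entry):
        # Object attributes become column values of a new row.
        cur = self.conn.execute(
            "INSERT INTO entries (title, body) VALUES (?, ?)",
            (entry.title, entry.body))
        entry.id = cur.lastrowid
        self.conn.commit()
        return entry

    def get(self, entry_id):
        # A row becomes an object again; None if the id is unknown.
        row = self.conn.execute(
            "SELECT id, title, body FROM entries WHERE id = ?",
            (entry_id,)).fetchone()
        if row is None:
            return None
        return Entry(title=row[1], body=row[2], id=row[0])


mapper = EntryMapper(sqlite3.connect(":memory:"))
saved = mapper.save(Entry("First post", "Hello, web!"))
loaded = mapper.get(saved.id)
print(loaded.title)
```

The calling code only sees ``Entry`` objects; the SQL stays inside the mapper.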

A second possibility is storing data in normal, plain text files (sometimes
called "flat files"). This is very easy for simple sites,
but can be difficult to get right if the web site is performing many
updates to the stored data.
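
One of the pitfalls is a reader seeing a half-written file. A common
precaution, sketched here under the assumption of Python 3.3 or later (for
``os.replace``), is to write to a temporary file and rename it over the old
one. The function names and the JSON format are choices made for this example.

```python
import json
import os
import tempfile


def save_entries(path, entries):
    """Write entries to a temporary file, then swap it into place."""
    directory = os.path.dirname(os.path.abspath(path))
    # The temporary file must be on the same file system as the target.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as tmp:
            json.dump(entries, tmp)
        os.replace(tmp_path, path)  # rename over the old file in one step
    except Exception:
        os.remove(tmp_path)
        raise


def load_entries(path):
    """Return the stored entries, or an empty list if nothing was saved."""
    if not os.path.exists(path):
        return []
    with open(path) as f:
        return json.load(f)


data_file = os.path.join(tempfile.mkdtemp(), "entries.json")
save_entries(data_file, [{"title": "First post"}])
print(load_entries(data_file))
```

This only guards against partially written files. Two requests that load,
modify and save at the same time can still overwrite each other's changes,
which is the kind of problem a database handles for the application.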

A third possibility is an object-oriented database (also called an "object
database"). These databases store the object data in a form that closely
parallels the way the objects are structured in memory during program
execution. (By contrast, ORMs store the object data as rows of data in tables
and relations between those rows.) Storing the objects directly has the
advantage that nearly all objects can be saved in a straightforward way, unlike
in relational databases where some objects are very hard to represent.
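
The standard library's :mod:`shelve` module is not an object database in the
sense of ZODB or Durus (it has no transactions, for instance), but it gives a
feel for storing objects without designing tables first. The sketch below, with
made-up page data, saves a nested structure and reads it back unchanged.

```python
import datetime
import os
import shelve
import tempfile

db_path = os.path.join(tempfile.mkdtemp(), "pages")

# A nested structure that would need several tables in a relational schema.
page = {
    "title": "FrontPage",
    "tags": set(["start", "help"]),
    "edited": datetime.date(2010, 4, 5),
    "history": [("alice", "created"), ("bob", "fixed typo")],
}

# Store the object under a string key; shelve pickles it transparently.
db = shelve.open(db_path)
db["FrontPage"] = page
db.close()

# Reopen the shelf and load the object back with its structure intact.
db = shelve.open(db_path)
restored = db["FrontPage"]
db.close()
print(restored == page)
```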

`Frameworks`_ often give hints on which data storage method to choose. It is
usually a good idea to stick to the data store recommended by the framework
unless the application has special requirements better satisfied by an
alternate storage mechanism.

.. seealso::

   * `Persistence Tools <http://wiki.python.org/moin/PersistenceTools>`_ lists
     possibilities on how to save data in the file system. Some of these
     modules are part of the standard library.

   * `Database Programming <http://wiki.python.org/moin/DatabaseProgramming>`_
     helps with choosing a method for saving data.

   * `SQLAlchemy <http://www.sqlalchemy.org/>`_, the most powerful OR-Mapper
     for Python, and `Elixir <http://elixir.ematia.de/>`_, which makes
     SQLAlchemy easier to use.

   * `SQLObject <http://www.sqlobject.org/>`_, another popular OR-Mapper.

   * `ZODB <https://launchpad.net/zodb>`_ and `Durus
     <http://www.mems-exchange.org/software/durus/>`_, two object-oriented
     databases.


.. _framework:

Frameworks
==========

The process of creating code to run web sites involves writing code to provide
various services. The code to provide a particular service often works the
same way regardless of the complexity or purpose of the web site in question.
Abstracting these common solutions into reusable code produces what are called
"frameworks" for web development. Perhaps the most well-known framework for
web development is Ruby on Rails, but Python has its own frameworks. Some of
these were partly inspired by Rails, or borrowed ideas from Rails, but many
existed a long time before Rails.

Originally Python web frameworks tended to incorporate all of the services
needed to develop web sites as a giant, integrated set of tools. No two web
frameworks were interoperable: a program developed for one could not be
deployed on a different one without considerable re-engineering work. This led
to the development of "minimalist" web frameworks that provided just the tools
to communicate between the Python code and the HTTP protocol, with all other
services to be added on top via separate components. Some ad hoc standards
were developed that allowed for limited interoperability between frameworks,
such as a standard that allowed different template engines to be used
interchangeably.

Since the advent of WSGI, the Python web framework world has been evolving
toward interoperability based on the WSGI standard. Now many web frameworks,
whether "full stack" (providing all the tools one needs to deploy the most
complex web sites) or minimalist, or anything in between, are built from
collections of reusable components that can be used with more than one
framework.
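
What makes such components reusable is that a WSGI application is just a
callable with a fixed signature, so anything that wraps one application can
wrap any other. The sketch below assumes Python 3; ``application`` and
``add_header_middleware`` are illustrative names, and the request is faked
with :func:`wsgiref.util.setup_testing_defaults` so that no server is needed.

```python
from wsgiref.util import setup_testing_defaults


def application(environ, start_response):
    """Minimal WSGI application: returns a plain-text greeting."""
    body = b"Hello from WSGI\n"
    start_response("200 OK", [("Content-Type", "text/plain"),
                              ("Content-Length", str(len(body)))])
    return [body]


def add_header_middleware(app):
    """Hypothetical reusable component: wraps any WSGI app, adds a header."""
    def wrapped(environ, start_response):
        def custom_start_response(status, headers, exc_info=None):
            # Append one header, then hand over to the real start_response.
            headers = headers + [("X-Example", "middleware")]
            return start_response(status, headers, exc_info)
        return app(environ, custom_start_response)
    return wrapped


# Call the wrapped application directly with a fake request environment.
environ = {}
setup_testing_defaults(environ)
captured = {}


def fake_start_response(status, headers, exc_info=None):
    # Record what the application reports instead of sending it anywhere.
    captured["status"] = status
    captured["headers"] = headers


result = add_header_middleware(application)(environ, fake_start_response)
print(captured["status"], b"".join(result))
```

The middleware knows nothing about the application it wraps, so the same
wrapper could be placed around an application produced by any WSGI-compliant
framework.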

The majority of users will probably want to select a "full stack" framework
that has an active community. These frameworks tend to be well documented,
and provide the easiest path to producing a fully functional web site in
minimal time.


Some notable frameworks
-----------------------

There are a great many frameworks, so they cannot all be covered
here. Instead we will briefly touch on some of the most popular.


Django
^^^^^^

`Django <http://www.djangoproject.com/>`_ is a framework consisting of several
tightly coupled elements which were written from scratch and work together very
well. It includes an ORM which is quite powerful while being simple to use,
and has a great online administration interface which makes it possible to edit
the data in the database with a browser. The template engine is text-based and
is designed to be usable for page designers who cannot write Python. It
supports template inheritance and filters (which work like Unix pipes). Django
has many handy features bundled, such as creation of RSS feeds or generic views,
which make it possible to create web sites almost without writing any Python code.

It has a big, international community, the members of which have created many
web sites. There are also a lot of add-on projects which extend Django's normal
functionality. This is partly due to Django's well-written `online
documentation <http://docs.djangoproject.com/>`_ and the `Django book
<http://www.djangobook.com/>`_.


.. note::

   Although Django is an MVC-style framework, it names the elements
   differently, which is described in the `Django FAQ
   <http://docs.djangoproject.com/en/dev/faq/general/#django-appears-to-be-a-mvc-framework-but-you-call-the-controller-the-view-and-the-view-the-template-how-come-you-don-t-use-the-standard-names>`_.


TurboGears
^^^^^^^^^^

Another popular web framework for Python is `TurboGears
<http://www.turbogears.org/>`_. TurboGears takes the approach of using already
existing components and combining them with glue code to create a seamless
experience. TurboGears gives the user flexibility in choosing components. For
example, the ORM and template engine can be changed to use packages different
from those used by default.

The documentation can be found in the `TurboGears wiki
<http://docs.turbogears.org/>`_, where links to screencasts can be found.
TurboGears also has an active user community which can respond to most related
questions. There is also a `TurboGears book <http://turbogearsbook.com/>`_
published, which is a good starting point.

The newest version of TurboGears, version 2.0, moves even further in the
direction of WSGI support and a component-based architecture. TurboGears 2 is
based on the WSGI stack of another popular component-based web framework,
`Pylons <http://pylonshq.com/>`_.


Zope
^^^^

The Zope framework is one of the "old original" frameworks. Its current
incarnation in Zope2 is a tightly integrated full-stack framework. One of its
most interesting features is its tight integration with a powerful object
database called the `ZODB <https://launchpad.net/zodb>`_ (Zope Object Database).
Because of its highly integrated nature, Zope wound up in a somewhat isolated
ecosystem: code written for Zope wasn't very usable outside of Zope, and
vice-versa. To solve this problem the Zope 3 effort was started. Zope 3
re-engineers Zope as a set of more cleanly isolated components. This effort
was started before the advent of the WSGI standard, but there is WSGI support
for Zope 3 from the `Repoze <http://repoze.org/>`_ project. Zope components
have many years of production use behind them, and the Zope 3 project gives
access to these components to the wider Python community. There is even a
separate framework based on the Zope components: `Grok
<http://grok.zope.org/>`_.

Zope is also the infrastructure used by the `Plone <http://plone.org/>`_ content
management system, one of the most powerful and popular content management
systems available.


Other notable frameworks
^^^^^^^^^^^^^^^^^^^^^^^^

Of course these are not the only frameworks that are available. There are
many other frameworks worth mentioning.

Another framework that's already been mentioned is `Pylons`_. Pylons is much
like TurboGears, but with an even stronger emphasis on flexibility, which comes
at the cost of being more difficult to use. Nearly every component can be
exchanged, which makes it necessary to use the documentation of every single
component, of which there are many. Pylons builds upon `Paste
<http://pythonpaste.org/>`_, an extensive set of tools which are handy for WSGI.

And that's still not everything. The most up-to-date information can always be
found in the Python wiki.

.. seealso::

   The Python wiki contains an extensive list of `web frameworks
   <http://wiki.python.org/moin/WebFrameworks>`_.

   Most frameworks also have their own mailing lists and IRC channels; look
   for these on the projects' web sites. There is also a general "Python in the
   Web" IRC channel on freenode called `#python.web
   <http://wiki.python.org/moin/PoundPythonWeb>`_.