Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 1 | |
| 2 | :mod:`SocketServer` --- A framework for network servers |
| 3 | ======================================================= |
| 4 | |
| 5 | .. module:: SocketServer |
| 6 | :synopsis: A framework for network servers. |
| 7 | |
| 8 | |
| 9 | The :mod:`SocketServer` module simplifies the task of writing network servers. |
| 10 | |
| 11 | There are four basic server classes: :class:`TCPServer` uses the Internet TCP |
| 12 | protocol, which provides for continuous streams of data between the client and |
| 13 | server. :class:`UDPServer` uses datagrams, which are discrete packets of |
| 14 | information that may arrive out of order or be lost while in transit. The more |
| 15 | infrequently used :class:`UnixStreamServer` and :class:`UnixDatagramServer` |
| 16 | classes are similar, but use Unix domain sockets; they're not available on |
| 17 | non-Unix platforms. For more details on network programming, consult a book |
| 18 | such as |
| 19 | W. Richard Steven's UNIX Network Programming or Ralph Davis's Win32 Network |
| 20 | Programming. |
| 21 | |
| 22 | These four classes process requests :dfn:`synchronously`; each request must be |
| 23 | completed before the next request can be started. This isn't suitable if each |
| 24 | request takes a long time to complete, because it requires a lot of computation, |
| 25 | or because it returns a lot of data which the client is slow to process. The |
| 26 | solution is to create a separate process or thread to handle each request; the |
| 27 | :class:`ForkingMixIn` and :class:`ThreadingMixIn` mix-in classes can be used to |
| 28 | support asynchronous behaviour. |
| 29 | |
| 30 | Creating a server requires several steps. First, you must create a request |
| 31 | handler class by subclassing the :class:`BaseRequestHandler` class and |
| 32 | overriding its :meth:`handle` method; this method will process incoming |
| 33 | requests. Second, you must instantiate one of the server classes, passing it |
| 34 | the server's address and the request handler class. Finally, call the |
| 35 | :meth:`handle_request` or :meth:`serve_forever` method of the server object to |
| 36 | process one or many requests. |
| 37 | |
| 38 | When inheriting from :class:`ThreadingMixIn` for threaded connection behavior, |
| 39 | you should explicitly declare how you want your threads to behave on an abrupt |
| 40 | shutdown. The :class:`ThreadingMixIn` class defines an attribute |
| 41 | *daemon_threads*, which indicates whether or not the server should wait for |
| 42 | thread termination. You should set the flag explicitly if you would like threads |
| 43 | to behave autonomously; the default is :const:`False`, meaning that Python will |
| 44 | not exit until all threads created by :class:`ThreadingMixIn` have exited. |
| 45 | |
| 46 | Server classes have the same external methods and attributes, no matter what |
Andrew M. Kuchling | e45a77a | 2008-01-19 16:26:13 +0000 | [diff] [blame] | 47 | network protocol they use. |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 48 | |
| 49 | |
| 50 | Server Creation Notes |
| 51 | --------------------- |
| 52 | |
| 53 | There are five classes in an inheritance diagram, four of which represent |
| 54 | synchronous servers of four types:: |
| 55 | |
| 56 | +------------+ |
| 57 | | BaseServer | |
| 58 | +------------+ |
| 59 | | |
| 60 | v |
| 61 | +-----------+ +------------------+ |
| 62 | | TCPServer |------->| UnixStreamServer | |
| 63 | +-----------+ +------------------+ |
| 64 | | |
| 65 | v |
| 66 | +-----------+ +--------------------+ |
| 67 | | UDPServer |------->| UnixDatagramServer | |
| 68 | +-----------+ +--------------------+ |
| 69 | |
| 70 | Note that :class:`UnixDatagramServer` derives from :class:`UDPServer`, not from |
| 71 | :class:`UnixStreamServer` --- the only difference between an IP and a Unix |
| 72 | stream server is the address family, which is simply repeated in both Unix |
| 73 | server classes. |
| 74 | |
| 75 | Forking and threading versions of each type of server can be created using the |
| 76 | :class:`ForkingMixIn` and :class:`ThreadingMixIn` mix-in classes. For instance, |
| 77 | a threading UDP server class is created as follows:: |
| 78 | |
| 79 | class ThreadingUDPServer(ThreadingMixIn, UDPServer): pass |
| 80 | |
| 81 | The mix-in class must come first, since it overrides a method defined in |
| 82 | :class:`UDPServer`. Setting the various member variables also changes the |
| 83 | behavior of the underlying server mechanism. |
| 84 | |
| 85 | To implement a service, you must derive a class from :class:`BaseRequestHandler` |
| 86 | and redefine its :meth:`handle` method. You can then run various versions of |
| 87 | the service by combining one of the server classes with your request handler |
| 88 | class. The request handler class must be different for datagram or stream |
| 89 | services. This can be hidden by using the handler subclasses |
| 90 | :class:`StreamRequestHandler` or :class:`DatagramRequestHandler`. |
| 91 | |
| 92 | Of course, you still have to use your head! For instance, it makes no sense to |
| 93 | use a forking server if the service contains state in memory that can be |
| 94 | modified by different requests, since the modifications in the child process |
| 95 | would never reach the initial state kept in the parent process and passed to |
| 96 | each child. In this case, you can use a threading server, but you will probably |
| 97 | have to use locks to protect the integrity of the shared data. |
| 98 | |
| 99 | On the other hand, if you are building an HTTP server where all data is stored |
| 100 | externally (for instance, in the file system), a synchronous class will |
| 101 | essentially render the service "deaf" while one request is being handled -- |
| 102 | which may be for a very long time if a client is slow to receive all the data it |
| 103 | has requested. Here a threading or forking server is appropriate. |
| 104 | |
| 105 | In some cases, it may be appropriate to process part of a request synchronously, |
| 106 | but to finish processing in a forked child depending on the request data. This |
| 107 | can be implemented by using a synchronous server and doing an explicit fork in |
| 108 | the request handler class :meth:`handle` method. |
| 109 | |
| 110 | Another approach to handling multiple simultaneous requests in an environment |
| 111 | that supports neither threads nor :func:`fork` (or where these are too expensive |
| 112 | or inappropriate for the service) is to maintain an explicit table of partially |
| 113 | finished requests and to use :func:`select` to decide which request to work on |
| 114 | next (or whether to handle a new incoming request). This is particularly |
| 115 | important for stream services where each client can potentially be connected for |
| 116 | a long time (if threads or subprocesses cannot be used). |
| 117 | |
Georg Brandl | b19be57 | 2007-12-29 10:57:00 +0000 | [diff] [blame] | 118 | .. XXX should data and methods be intermingled, or separate? |
| 119 | how should the distinction between class and instance variables be drawn? |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 120 | |
| 121 | |
| 122 | Server Objects |
| 123 | -------------- |
| 124 | |
| 125 | |
| 126 | .. function:: fileno() |
| 127 | |
| 128 | Return an integer file descriptor for the socket on which the server is |
| 129 | listening. This function is most commonly passed to :func:`select.select`, to |
| 130 | allow monitoring multiple servers in the same process. |
| 131 | |
| 132 | |
| 133 | .. function:: handle_request() |
| 134 | |
| 135 | Process a single request. This function calls the following methods in order: |
| 136 | :meth:`get_request`, :meth:`verify_request`, and :meth:`process_request`. If |
| 137 | the user-provided :meth:`handle` method of the handler class raises an |
| 138 | exception, the server's :meth:`handle_error` method will be called. |
| 139 | |
| 140 | |
| 141 | .. function:: serve_forever() |
| 142 | |
| 143 | Handle an infinite number of requests. This simply calls :meth:`handle_request` |
| 144 | inside an infinite loop. |
| 145 | |
| 146 | |
| 147 | .. data:: address_family |
| 148 | |
| 149 | The family of protocols to which the server's socket belongs. |
| 150 | :const:`socket.AF_INET` and :const:`socket.AF_UNIX` are two possible values. |
| 151 | |
| 152 | |
| 153 | .. data:: RequestHandlerClass |
| 154 | |
| 155 | The user-provided request handler class; an instance of this class is created |
| 156 | for each request. |
| 157 | |
| 158 | |
| 159 | .. data:: server_address |
| 160 | |
| 161 | The address on which the server is listening. The format of addresses varies |
| 162 | depending on the protocol family; see the documentation for the socket module |
| 163 | for details. For Internet protocols, this is a tuple containing a string giving |
| 164 | the address, and an integer port number: ``('127.0.0.1', 80)``, for example. |
| 165 | |
| 166 | |
| 167 | .. data:: socket |
| 168 | |
| 169 | The socket object on which the server will listen for incoming requests. |
| 170 | |
| 171 | The server classes support the following class variables: |
| 172 | |
Georg Brandl | b19be57 | 2007-12-29 10:57:00 +0000 | [diff] [blame] | 173 | .. XXX should class variables be covered before instance variables, or vice versa? |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 174 | |
| 175 | |
| 176 | .. data:: allow_reuse_address |
| 177 | |
| 178 | Whether the server will allow the reuse of an address. This defaults to |
| 179 | :const:`False`, and can be set in subclasses to change the policy. |
| 180 | |
| 181 | |
| 182 | .. data:: request_queue_size |
| 183 | |
| 184 | The size of the request queue. If it takes a long time to process a single |
| 185 | request, any requests that arrive while the server is busy are placed into a |
| 186 | queue, up to :attr:`request_queue_size` requests. Once the queue is full, |
| 187 | further requests from clients will get a "Connection denied" error. The default |
| 188 | value is usually 5, but this can be overridden by subclasses. |
| 189 | |
| 190 | |
| 191 | .. data:: socket_type |
| 192 | |
| 193 | The type of socket used by the server; :const:`socket.SOCK_STREAM` and |
| 194 | :const:`socket.SOCK_DGRAM` are two possible values. |
| 195 | |
Andrew M. Kuchling | e45a77a | 2008-01-19 16:26:13 +0000 | [diff] [blame] | 196 | .. data:: timeout |
| 197 | |
| 198 | Timeout duration, measured in seconds, or :const:`None` if no timeout is desired. |
| 199 | If no incoming requests are received within the timeout period, |
| 200 | the :meth:`handle_timeout` method is called and then the server resumes waiting for |
| 201 | requests. |
| 202 | |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 203 | There are various server methods that can be overridden by subclasses of base |
| 204 | server classes like :class:`TCPServer`; these methods aren't useful to external |
| 205 | users of the server object. |
| 206 | |
Georg Brandl | b19be57 | 2007-12-29 10:57:00 +0000 | [diff] [blame] | 207 | .. XXX should the default implementations of these be documented, or should |
| 208 | it be assumed that the user will look at SocketServer.py? |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 209 | |
| 210 | |
| 211 | .. function:: finish_request() |
| 212 | |
| 213 | Actually processes the request by instantiating :attr:`RequestHandlerClass` and |
| 214 | calling its :meth:`handle` method. |
| 215 | |
| 216 | |
| 217 | .. function:: get_request() |
| 218 | |
| 219 | Must accept a request from the socket, and return a 2-tuple containing the *new* |
| 220 | socket object to be used to communicate with the client, and the client's |
| 221 | address. |
| 222 | |
| 223 | |
| 224 | .. function:: handle_error(request, client_address) |
| 225 | |
| 226 | This function is called if the :attr:`RequestHandlerClass`'s :meth:`handle` |
| 227 | method raises an exception. The default action is to print the traceback to |
| 228 | standard output and continue handling further requests. |
| 229 | |
Andrew M. Kuchling | e45a77a | 2008-01-19 16:26:13 +0000 | [diff] [blame] | 230 | .. function:: handle_timeout() |
| 231 | |
| 232 | This function is called when the :attr:`timeout` attribute has been set to a |
| 233 | value other than :const:`None` and the timeout period has passed with no |
| 234 | requests being received. The default action for forking servers is |
| 235 | to collect the status of any child processes that have exited, while |
| 236 | in threading servers this method does nothing. |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 237 | |
| 238 | .. function:: process_request(request, client_address) |
| 239 | |
| 240 | Calls :meth:`finish_request` to create an instance of the |
| 241 | :attr:`RequestHandlerClass`. If desired, this function can create a new process |
| 242 | or thread to handle the request; the :class:`ForkingMixIn` and |
| 243 | :class:`ThreadingMixIn` classes do this. |
| 244 | |
Georg Brandl | b19be57 | 2007-12-29 10:57:00 +0000 | [diff] [blame] | 245 | .. Is there any point in documenting the following two functions? |
| 246 | What would the purpose of overriding them be: initializing server |
| 247 | instance variables, adding new network families? |
Georg Brandl | 8ec7f65 | 2007-08-15 14:28:01 +0000 | [diff] [blame] | 248 | |
| 249 | |
| 250 | .. function:: server_activate() |
| 251 | |
| 252 | Called by the server's constructor to activate the server. The default behavior |
| 253 | just :meth:`listen`\ s to the server's socket. May be overridden. |
| 254 | |
| 255 | |
| 256 | .. function:: server_bind() |
| 257 | |
| 258 | Called by the server's constructor to bind the socket to the desired address. |
| 259 | May be overridden. |
| 260 | |
| 261 | |
| 262 | .. function:: verify_request(request, client_address) |
| 263 | |
| 264 | Must return a Boolean value; if the value is :const:`True`, the request will be |
| 265 | processed, and if it's :const:`False`, the request will be denied. This function |
| 266 | can be overridden to implement access controls for a server. The default |
| 267 | implementation always returns :const:`True`. |
| 268 | |
| 269 | |
| 270 | RequestHandler Objects |
| 271 | ---------------------- |
| 272 | |
| 273 | The request handler class must define a new :meth:`handle` method, and can |
| 274 | override any of the following methods. A new instance is created for each |
| 275 | request. |
| 276 | |
| 277 | |
| 278 | .. function:: finish() |
| 279 | |
| 280 | Called after the :meth:`handle` method to perform any clean-up actions required. |
| 281 | The default implementation does nothing. If :meth:`setup` or :meth:`handle` |
| 282 | raise an exception, this function will not be called. |
| 283 | |
| 284 | |
| 285 | .. function:: handle() |
| 286 | |
| 287 | This function must do all the work required to service a request. The default |
| 288 | implementation does nothing. Several instance attributes are available to it; |
| 289 | the request is available as :attr:`self.request`; the client address as |
| 290 | :attr:`self.client_address`; and the server instance as :attr:`self.server`, in |
| 291 | case it needs access to per-server information. |
| 292 | |
| 293 | The type of :attr:`self.request` is different for datagram or stream services. |
| 294 | For stream services, :attr:`self.request` is a socket object; for datagram |
| 295 | services, :attr:`self.request` is a string. However, this can be hidden by using |
| 296 | the request handler subclasses :class:`StreamRequestHandler` or |
| 297 | :class:`DatagramRequestHandler`, which override the :meth:`setup` and |
| 298 | :meth:`finish` methods, and provide :attr:`self.rfile` and :attr:`self.wfile` |
| 299 | attributes. :attr:`self.rfile` and :attr:`self.wfile` can be read or written, |
| 300 | respectively, to get the request data or return data to the client. |
| 301 | |
| 302 | |
| 303 | .. function:: setup() |
| 304 | |
| 305 | Called before the :meth:`handle` method to perform any initialization actions |
| 306 | required. The default implementation does nothing. |
| 307 | |