blob: 0689f44d2d22db750457e30cd950ca06b9730c67 [file] [log] [blame]
Guido van Rossum5fdeeea1994-01-02 01:22:07 +00001\section{Built-in Module \sectcode{socket}}
2
3\bimodindex{socket}
4This module provides access to the BSD {\em socket} interface.
5It is available on \UNIX{} systems that support this interface.
6
7For an introduction to socket programming (in C), see the following
8papers: \emph{An Introductory 4.3BSD Interprocess Communication
9Tutorial}, by Stuart Sechrest and \emph{An Advanced 4.3BSD Interprocess
10Communication Tutorial}, by Samuel J. Leffler et al, both in the
11\UNIX{} Programmer's Manual, Supplementary Documents 1 (sections PS1:7
12and PS1:8). The \UNIX{} manual pages for the various socket-related
13system calls also a valuable source of information on the details of
14socket semantics.
15
16The Python interface is a straightforward transliteration of the
17\UNIX{} system call and library interface for sockets to Python's
18object-oriented style: the \code{socket()} function returns a
19\dfn{socket object} whose methods implement the various socket system
20calls. Parameter types are somewhat higer-level than in the C
21interface: as with \code{read()} and \code{write()} operations on Python
22files, buffer allocation on receive operations is automatic, and
23buffer length is implicit on send operations.
24
25Socket addresses are represented as a single string for the
26\code{AF_UNIX} address family and as a pair
27\code{(\var{host}, \var{port})} for the \code{AF_INET} address family,
28where \var{host} is a string representing
29either a hostname in Internet domain notation like
30\code{'daring.cwi.nl'} or an IP address like \code{'100.50.200.5'},
31and \var{port} is an integral port number. Other address families are
32currently not supported. The address format required by a particular
33socket object is automatically selected based on the address family
34specified when the socket object was created.
35
36All errors raise exceptions. The normal exceptions for invalid
37argument types and out-of-memory conditions can be raised; errors
38related to socket or address semantics raise the error \code{socket.error}.
39
40Non-blocking and asynchronous mode are not supported; see module
41\code{select} for a way to do non-blocking socket I/O.
42
43The module \code{socket} exports the following constants and functions:
44
45\renewcommand{\indexsubitem}{(in module socket)}
46\begin{excdesc}{error}
47This exception is raised for socket- or address-related errors.
48The accompanying value is either a string telling what went wrong or a
49pair \code{(\var{errno}, \var{string})}
50representing an error returned by a system
51call, similar to the value accompanying \code{posix.error}.
52\end{excdesc}
53
54\begin{datadesc}{AF_UNIX}
55\dataline{AF_INET}
56These constants represent the address (and protocol) families,
Guido van Rossum781db5d1994-08-05 13:37:36 +000057used for the first argument to \code{socket()}. If the \code{AF_UNIX}
58constant is not defined then this protocol is unsupported.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000059\end{datadesc}
60
61\begin{datadesc}{SOCK_STREAM}
62\dataline{SOCK_DGRAM}
Guido van Rossum781db5d1994-08-05 13:37:36 +000063\dataline{SOCK_RAW}
64\dataline{SOCK_RDM}
65\dataline{SOCK_SEQPACKET}
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000066These constants represent the socket types,
67used for the second argument to \code{socket()}.
Guido van Rossum781db5d1994-08-05 13:37:36 +000068(Only \code{SOCK_STREAM} and
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000069\code{SOCK_DGRAM} appear to be generally useful.)
70\end{datadesc}
71
Guido van Rossumed2bad81995-02-16 16:29:18 +000072\begin{datadesc}{SO_*}
73\dataline{SOMAXCONN}
74\dataline{MSG_*}
75\dataline{SOL_*}
76\dataline{IPPROTO_*}
77\dataline{IPPORT_*}
78\dataline{INADDR_*}
79\dataline{IP_*}
80Many constants of these forms, documented in the Unix documentation on
81sockets and/or the IP protocol, are also defined in the socket module.
82They are generally used in arguments to the \code{setsockopt} and
83\code{getsockopt} methods of socket objects. In most cases, only
84those symbols that are defined in the Unix header files are defined;
85for a few symbols, default values are provided.
86\end{datadesc}
87
Guido van Rossum5fdeeea1994-01-02 01:22:07 +000088\begin{funcdesc}{gethostbyname}{hostname}
89Translate a host name to IP address format. The IP address is
90returned as a string, e.g., \code{'100.50.200.5'}. If the host name
91is an IP address itself it is returned unchanged.
92\end{funcdesc}
93
Guido van Rossum781db5d1994-08-05 13:37:36 +000094\begin{funcdesc}{gethostname}{}
Guido van Rossum16d6e711994-08-08 12:30:22 +000095Return a string containing the hostname of the machine where
96the Python interpreter is currently executing. If you want to know the
97current machine's IP address, use
Guido van Rossum31cce971995-01-04 19:17:34 +000098\code{socket.gethostbyname(socket.gethostname())} instead.
99\end{funcdesc}
100
101\begin{funcdesc}{gethostbyaddr}{ip_address}
102Return a triple \code{(hostname, aliaslist, ipaddrlist)} where
103\code{hostname} is the primary host name responding to the given
104\var{ip_address}, \code{aliaslist} is a (possibly empty) list of
105alternative host names for the same address, and \code{ipaddrlist} is
106a list of IP addresses for the same interface on the same
107host (most likely containing only a single address).
Guido van Rossum781db5d1994-08-05 13:37:36 +0000108\end{funcdesc}
109
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000110\begin{funcdesc}{getservbyname}{servicename\, protocolname}
111Translate an Internet service name and protocol name to a port number
112for that service. The protocol name should be \code{'tcp'} or
113\code{'udp'}.
114\end{funcdesc}
115
116\begin{funcdesc}{socket}{family\, type\, proto}
117Create a new socket using the given address family, socket type and
118protocol number. The address family should be \code{AF_INET} or
119\code{AF_UNIX}. The socket type should be \code{SOCK_STREAM},
120\code{SOCK_DGRAM} or perhaps one of the other \samp{SOCK_} constants.
121The protocol number is usually zero and may be omitted in that case.
122\end{funcdesc}
123
124\begin{funcdesc}{fromfd}{fd\, family\, type\, proto}
125Build a socket object from an existing file descriptor (an integer as
126returned by a file object's \code{fileno} method). Address family,
127socket type and protocol number are as for the \code{socket} function
128above. The file descriptor should refer to a socket, but this is not
129checked --- subsequent operations on the object may fail if the file
130descriptor is invalid. This function is rarely needed, but can be
131used to get or set socket options on a socket passed to a program as
132standard input or output (e.g. a server started by the \UNIX{} inet
133daemon).
134\end{funcdesc}
135
136\subsection{Socket Object Methods}
137
138\noindent
139Socket objects have the following methods. Except for
140\code{makefile()} these correspond to \UNIX{} system calls applicable to
141sockets.
142
143\renewcommand{\indexsubitem}{(socket method)}
144\begin{funcdesc}{accept}{}
145Accept a connection.
146The socket must be bound to an address and listening for connections.
147The return value is a pair \code{(\var{conn}, \var{address})}
148where \var{conn} is a \emph{new} socket object usable to send and
149receive data on the connection, and \var{address} is the address bound
150to the socket on the other end of the connection.
151\end{funcdesc}
152
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000153\begin{funcdesc}{bind}{address}
Guido van Rossuma84ec511994-06-23 12:13:52 +0000154Bind the socket to \var{address}. The socket must not already be bound.
155(The format of \var{address} depends on the address family -- see above.)
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000156\end{funcdesc}
157
158\begin{funcdesc}{close}{}
159Close the socket. All future operations on the socket object will fail.
160The remote end will receive no more data (after queued data is flushed).
161Sockets are automatically closed when they are garbage-collected.
162\end{funcdesc}
163
164\begin{funcdesc}{connect}{address}
Guido van Rossuma84ec511994-06-23 12:13:52 +0000165Connect to a remote socket at \var{address}.
166(The format of \var{address} depends on the address family -- see above.)
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000167\end{funcdesc}
168
169\begin{funcdesc}{fileno}{}
170Return the socket's file descriptor (a small integer). This is useful
171with \code{select}.
172\end{funcdesc}
173
174\begin{funcdesc}{getpeername}{}
175Return the remote address to which the socket is connected. This is
176useful to find out the port number of a remote IP socket, for instance.
Guido van Rossuma84ec511994-06-23 12:13:52 +0000177(The format of the address returned depends on the address family --
Guido van Rossum781db5d1994-08-05 13:37:36 +0000178see above.) On some systems this function is not supported.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000179\end{funcdesc}
180
181\begin{funcdesc}{getsockname}{}
182Return the socket's own address. This is useful to find out the port
183number of an IP socket, for instance.
Guido van Rossuma84ec511994-06-23 12:13:52 +0000184(The format of the address returned depends on the address family --
185see above.)
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000186\end{funcdesc}
187
188\begin{funcdesc}{getsockopt}{level\, optname\, buflen}
189Return the value of the given socket option (see the \UNIX{} man page
190{\it getsockopt}(2)). The needed symbolic constants are defined in module
191SOCKET. If the optional third argument is absent, an integer option
192is assumed and its integer value is returned by the function. If
193\var{buflen} is present, it specifies the maximum length of the buffer used
194to receive the option in, and this buffer is returned as a string.
195It's up to the caller to decode the contents of the buffer (see the
196optional built-in module \code{struct} for a way to decode C structures
197encoded as strings).
198\end{funcdesc}
199
200\begin{funcdesc}{listen}{backlog}
201Listen for connections made to the socket.
Guido van Rossum5da57551994-03-02 10:52:16 +0000202The argument specifies the maximum number of queued connections and
203should be at least 1; the maximum value is system-dependent.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000204\end{funcdesc}
205
206\begin{funcdesc}{makefile}{mode}
207Return a \dfn{file object} associated with the socket.
208(File objects were described earlier under Built-in Types.)
209The file object references a \code{dup}ped version of the socket file
210descriptor, so the file object and socket object may be closed or
211garbage-collected independently.
212\end{funcdesc}
213
214\begin{funcdesc}{recv}{bufsize\, flags}
215Receive data from the socket. The return value is a string representing
216the data received. The maximum amount of data to be received
217at once is specified by \var{bufsize}. See the \UNIX{} manual page
218for the meaning of the optional argument \var{flags}; it defaults to
219zero.
220\end{funcdesc}
221
222\begin{funcdesc}{recvfrom}{bufsize}
223Receive data from the socket. The return value is a pair
224\code{(\var{string}, \var{address})} where \var{string} is a string
225representing the data received and \var{address} is the address of the
226socket sending the data.
Guido van Rossuma84ec511994-06-23 12:13:52 +0000227(The format of \var{address} depends on the address family -- see above.)
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000228\end{funcdesc}
229
230\begin{funcdesc}{send}{string}
231Send data to the socket. The socket must be connected to a remote
Guido van Rossumab3a2501994-08-01 12:18:36 +0000232socket. Return the number of bytes sent.
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000233\end{funcdesc}
234
235\begin{funcdesc}{sendto}{string\, address}
236Send data to the socket. The socket should not be connected to a
237remote socket, since the destination socket is specified by
Guido van Rossumab3a2501994-08-01 12:18:36 +0000238\code{address}. Return the number of bytes sent.
Guido van Rossuma84ec511994-06-23 12:13:52 +0000239(The format of \var{address} depends on the address family -- see above.)
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000240\end{funcdesc}
241
Guido van Rossum91951481994-09-07 14:39:14 +0000242\begin{funcdesc}{setblocking}{flag}
243Set blocking or non-blocking mode of the socket: if \var{flag} is 0,
244the socket is set to non-blocking, else to blocking mode. Initially
245all sockets are in blocking mode. In non-blocking mode, if a
246\code{recv} call doesn't find any data, or if a \code{send} call can't
247immediately dispose of the data, a \code{socket.error} exception is
248raised; in blocking mode, the calls block until they can proceed.
249\end{funcdesc}
250
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000251\begin{funcdesc}{setsockopt}{level\, optname\, value}
252Set the value of the given socket option (see the \UNIX{} man page
253{\it setsockopt}(2)). The needed symbolic constants are defined in module
254\code{SOCKET}. The value can be an integer or a string representing a
255buffer. In the latter case it is up to the caller to ensure that the
256string contains the proper bits (see the optional built-in module
257\code{struct} for a way to encode C structures as strings).
258\end{funcdesc}
259
260\begin{funcdesc}{shutdown}{how}
261Shut down one or both halves of the connection. If \var{how} is \code{0},
262further receives are disallowed. If \var{how} is \code{1}, further sends are
263disallowed. If \var{how} is \code{2}, further sends and receives are
264disallowed.
265\end{funcdesc}
266
267Note that there are no methods \code{read()} or \code{write()}; use
268\code{recv()} and \code{send()} without \var{flags} argument instead.
269
270\subsection{Example}
271\nodename{Socket Example}
272
273Here are two minimal example programs using the TCP/IP protocol: a
274server that echoes all data that it receives back (servicing only one
275client), and a client using it. Note that a server must perform the
276sequence \code{socket}, \code{bind}, \code{listen}, \code{accept}
277(possibly repeating the \code{accept} to service more than one client),
278while a client only needs the sequence \code{socket}, \code{connect}.
279Also note that the server does not \code{send}/\code{receive} on the
280socket it is listening on but on the new socket returned by
281\code{accept}.
282
283\bcode\begin{verbatim}
284# Echo server program
285from socket import *
286HOST = '' # Symbolic name meaning the local host
287PORT = 50007 # Arbitrary non-privileged server
288s = socket(AF_INET, SOCK_STREAM)
289s.bind(HOST, PORT)
Guido van Rossum5da57551994-03-02 10:52:16 +0000290s.listen(1)
Guido van Rossum5fdeeea1994-01-02 01:22:07 +0000291conn, addr = s.accept()
292print 'Connected by', addr
293while 1:
294 data = conn.recv(1024)
295 if not data: break
296 conn.send(data)
297conn.close()
298\end{verbatim}\ecode
299
300\bcode\begin{verbatim}
301# Echo client program
302from socket import *
303HOST = 'daring.cwi.nl' # The remote host
304PORT = 50007 # The same port as used by the server
305s = socket(AF_INET, SOCK_STREAM)
306s.connect(HOST, PORT)
307s.send('Hello, world')
308data = s.recv(1024)
309s.close()
310print 'Received', `data`
311\end{verbatim}\ecode