blob: 13243593260c30e692a0dc8cbf3a89853ab1c8da [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001.. _tut-io:
2
3****************
4Input and Output
5****************
6
7There are several ways to present the output of a program; data can be printed
8in a human-readable form, or written to a file for future use. This chapter will
9discuss some of the possibilities.
10
11
12.. _tut-formatting:
13
14Fancier Output Formatting
15=========================
16
17So far we've encountered two ways of writing values: *expression statements* and
Guido van Rossum0616b792007-08-31 03:25:11 +000018the :func:`print` function. (A third way is using the :meth:`write` method
Georg Brandl116aa622007-08-15 14:28:22 +000019of file objects; the standard output file can be referenced as ``sys.stdout``.
20See the Library Reference for more information on this.)
21
Georg Brandl116aa622007-08-15 14:28:22 +000022Often you'll want more control over the formatting of your output than simply
23printing space-separated values. There are two ways to format your output; the
24first way is to do all the string handling yourself; using string slicing and
25concatenation operations you can create any layout you can imagine. The
Georg Brandl3640e182011-03-06 10:56:18 +010026string type has some methods that perform useful operations for padding
Georg Brandl116aa622007-08-15 14:28:22 +000027strings to a given column width; these will be discussed shortly. The second
Benjamin Petersone6f00632008-05-26 01:03:56 +000028way is to use the :meth:`str.format` method.
29
Georg Brandl3640e182011-03-06 10:56:18 +010030The :mod:`string` module contains a :class:`~string.Template` class which offers
31yet another way to substitute values into strings.
Georg Brandl116aa622007-08-15 14:28:22 +000032
33One question remains, of course: how do you convert values to strings? Luckily,
34Python has ways to convert any value to a string: pass it to the :func:`repr`
Georg Brandl1e3830a2008-08-08 06:45:01 +000035or :func:`str` functions.
Georg Brandl116aa622007-08-15 14:28:22 +000036
37The :func:`str` function is meant to return representations of values which are
38fairly human-readable, while :func:`repr` is meant to generate representations
39which can be read by the interpreter (or will force a :exc:`SyntaxError` if
Sandro Tosia17ef142012-08-14 19:51:43 +020040there is no equivalent syntax). For objects which don't have a particular
Georg Brandl116aa622007-08-15 14:28:22 +000041representation for human consumption, :func:`str` will return the same value as
42:func:`repr`. Many values, such as numbers or structures like lists and
Ezio Melotti0def5c62011-03-13 02:27:26 +020043dictionaries, have the same representation using either function. Strings, in
44particular, have two distinct representations.
Georg Brandl116aa622007-08-15 14:28:22 +000045
46Some examples::
47
48 >>> s = 'Hello, world.'
49 >>> str(s)
50 'Hello, world.'
51 >>> repr(s)
52 "'Hello, world.'"
Ezio Melotti0def5c62011-03-13 02:27:26 +020053 >>> str(1/7)
Mark Dickinson5a55b612009-06-28 20:59:42 +000054 '0.14285714285714285'
Georg Brandl116aa622007-08-15 14:28:22 +000055 >>> x = 10 * 3.25
56 >>> y = 200 * 200
57 >>> s = 'The value of x is ' + repr(x) + ', and y is ' + repr(y) + '...'
Guido van Rossum0616b792007-08-31 03:25:11 +000058 >>> print(s)
Georg Brandl116aa622007-08-15 14:28:22 +000059 The value of x is 32.5, and y is 40000...
60 >>> # The repr() of a string adds string quotes and backslashes:
61 ... hello = 'hello, world\n'
62 >>> hellos = repr(hello)
Guido van Rossum0616b792007-08-31 03:25:11 +000063 >>> print(hellos)
Georg Brandl116aa622007-08-15 14:28:22 +000064 'hello, world\n'
65 >>> # The argument to repr() may be any Python object:
66 ... repr((x, y, ('spam', 'eggs')))
67 "(32.5, 40000, ('spam', 'eggs'))"
Georg Brandl116aa622007-08-15 14:28:22 +000068
69Here are two ways to write a table of squares and cubes::
70
71 >>> for x in range(1, 11):
Georg Brandle4ac7502007-09-03 07:10:24 +000072 ... print(repr(x).rjust(2), repr(x*x).rjust(3), end=' ')
Guido van Rossum0616b792007-08-31 03:25:11 +000073 ... # Note use of 'end' on previous line
74 ... print(repr(x*x*x).rjust(4))
Georg Brandl116aa622007-08-15 14:28:22 +000075 ...
76 1 1 1
77 2 4 8
78 3 9 27
79 4 16 64
80 5 25 125
81 6 36 216
82 7 49 343
83 8 64 512
84 9 81 729
85 10 100 1000
86
Georg Brandle4ac7502007-09-03 07:10:24 +000087 >>> for x in range(1, 11):
Benjamin Petersone6f00632008-05-26 01:03:56 +000088 ... print('{0:2d} {1:3d} {2:4d}'.format(x, x*x, x*x*x))
Georg Brandl48310cd2009-01-03 21:18:54 +000089 ...
Georg Brandl116aa622007-08-15 14:28:22 +000090 1 1 1
91 2 4 8
92 3 9 27
93 4 16 64
94 5 25 125
95 6 36 216
96 7 49 343
97 8 64 512
98 9 81 729
99 10 100 1000
100
101(Note that in the first example, one space between each column was added by the
Guido van Rossum0616b792007-08-31 03:25:11 +0000102way :func:`print` works: it always adds spaces between its arguments.)
Georg Brandl116aa622007-08-15 14:28:22 +0000103
Ezio Melotti2b736602011-03-13 02:19:57 +0200104This example demonstrates the :meth:`str.rjust` method of string
105objects, which right-justifies a string in a field of a given width by padding
106it with spaces on the left. There are similar methods :meth:`str.ljust` and
107:meth:`str.center`. These methods do not write anything, they just return a
108new string. If the input string is too long, they don't truncate it, but
109return it unchanged; this will mess up your column lay-out but that's usually
110better than the alternative, which would be lying about a value. (If you
111really want truncation you can always add a slice operation, as in
112``x.ljust(n)[:n]``.)
Georg Brandl116aa622007-08-15 14:28:22 +0000113
Ezio Melotti2b736602011-03-13 02:19:57 +0200114There is another method, :meth:`str.zfill`, which pads a numeric string on the
115left with zeros. It understands about plus and minus signs::
Georg Brandl116aa622007-08-15 14:28:22 +0000116
117 >>> '12'.zfill(5)
118 '00012'
119 >>> '-3.14'.zfill(7)
120 '-003.14'
121 >>> '3.14159265359'.zfill(5)
122 '3.14159265359'
123
Benjamin Petersone6f00632008-05-26 01:03:56 +0000124Basic usage of the :meth:`str.format` method looks like this::
125
Georg Brandl2f3ed682009-09-01 07:42:40 +0000126 >>> print('We are the {} who say "{}!"'.format('knights', 'Ni'))
Benjamin Petersone6f00632008-05-26 01:03:56 +0000127 We are the knights who say "Ni!"
128
129The brackets and characters within them (called format fields) are replaced with
Ezio Melotti2b736602011-03-13 02:19:57 +0200130the objects passed into the :meth:`str.format` method. A number in the
Georg Brandl2f3ed682009-09-01 07:42:40 +0000131brackets can be used to refer to the position of the object passed into the
Ezio Melotti2b736602011-03-13 02:19:57 +0200132:meth:`str.format` method. ::
Benjamin Petersone6f00632008-05-26 01:03:56 +0000133
Benjamin Peterson0cea1572008-07-26 21:59:03 +0000134 >>> print('{0} and {1}'.format('spam', 'eggs'))
Benjamin Petersone6f00632008-05-26 01:03:56 +0000135 spam and eggs
Benjamin Peterson0cea1572008-07-26 21:59:03 +0000136 >>> print('{1} and {0}'.format('spam', 'eggs'))
Benjamin Petersone6f00632008-05-26 01:03:56 +0000137 eggs and spam
138
Ezio Melotti2b736602011-03-13 02:19:57 +0200139If keyword arguments are used in the :meth:`str.format` method, their values
Alexandre Vassalotti6d3dfc32009-07-29 19:54:39 +0000140are referred to by using the name of the argument. ::
Benjamin Petersone6f00632008-05-26 01:03:56 +0000141
Benjamin Peterson71141932008-07-26 22:27:04 +0000142 >>> print('This {food} is {adjective}.'.format(
143 ... food='spam', adjective='absolutely horrible'))
Benjamin Petersone6f00632008-05-26 01:03:56 +0000144 This spam is absolutely horrible.
145
146Positional and keyword arguments can be arbitrarily combined::
147
Benjamin Peterson71141932008-07-26 22:27:04 +0000148 >>> print('The story of {0}, {1}, and {other}.'.format('Bill', 'Manfred',
149 other='Georg'))
Benjamin Petersone6f00632008-05-26 01:03:56 +0000150 The story of Bill, Manfred, and Georg.
151
Georg Brandl2f3ed682009-09-01 07:42:40 +0000152``'!a'`` (apply :func:`ascii`), ``'!s'`` (apply :func:`str`) and ``'!r'``
153(apply :func:`repr`) can be used to convert the value before it is formatted::
154
155 >>> import math
156 >>> print('The value of PI is approximately {}.'.format(math.pi))
157 The value of PI is approximately 3.14159265359.
158 >>> print('The value of PI is approximately {!r}.'.format(math.pi))
159 The value of PI is approximately 3.141592653589793.
160
Alexandre Vassalottie223eb82009-07-29 20:12:15 +0000161An optional ``':'`` and format specifier can follow the field name. This allows
Benjamin Petersone6f00632008-05-26 01:03:56 +0000162greater control over how the value is formatted. The following example
Raymond Hettinger756fe262011-02-24 00:06:16 +0000163rounds Pi to three places after the decimal.
Georg Brandl116aa622007-08-15 14:28:22 +0000164
165 >>> import math
Benjamin Petersone6f00632008-05-26 01:03:56 +0000166 >>> print('The value of PI is approximately {0:.3f}.'.format(math.pi))
Georg Brandl116aa622007-08-15 14:28:22 +0000167 The value of PI is approximately 3.142.
168
Benjamin Petersone6f00632008-05-26 01:03:56 +0000169Passing an integer after the ``':'`` will cause that field to be a minimum
Alexandre Vassalotti6d3dfc32009-07-29 19:54:39 +0000170number of characters wide. This is useful for making tables pretty. ::
Georg Brandl116aa622007-08-15 14:28:22 +0000171
172 >>> table = {'Sjoerd': 4127, 'Jack': 4098, 'Dcab': 7678}
173 >>> for name, phone in table.items():
Benjamin Petersone6f00632008-05-26 01:03:56 +0000174 ... print('{0:10} ==> {1:10d}'.format(name, phone))
Georg Brandl48310cd2009-01-03 21:18:54 +0000175 ...
Georg Brandl116aa622007-08-15 14:28:22 +0000176 Jack ==> 4098
177 Dcab ==> 7678
178 Sjoerd ==> 4127
179
Georg Brandl116aa622007-08-15 14:28:22 +0000180If you have a really long format string that you don't want to split up, it
181would be nice if you could reference the variables to be formatted by name
Benjamin Petersone6f00632008-05-26 01:03:56 +0000182instead of by position. This can be done by simply passing the dict and using
183square brackets ``'[]'`` to access the keys ::
Georg Brandl116aa622007-08-15 14:28:22 +0000184
185 >>> table = {'Sjoerd': 4127, 'Jack': 4098, 'Dcab': 8637678}
Benjamin Peterson71141932008-07-26 22:27:04 +0000186 >>> print('Jack: {0[Jack]:d}; Sjoerd: {0[Sjoerd]:d}; '
Andrew Svetlove9cf97c2012-10-17 16:41:28 +0300187 ... 'Dcab: {0[Dcab]:d}'.format(table))
Benjamin Petersone6f00632008-05-26 01:03:56 +0000188 Jack: 4098; Sjoerd: 4127; Dcab: 8637678
189
190This could also be done by passing the table as keyword arguments with the '**'
Alexandre Vassalotti6d3dfc32009-07-29 19:54:39 +0000191notation. ::
Benjamin Petersone6f00632008-05-26 01:03:56 +0000192
193 >>> table = {'Sjoerd': 4127, 'Jack': 4098, 'Dcab': 8637678}
194 >>> print('Jack: {Jack:d}; Sjoerd: {Sjoerd:d}; Dcab: {Dcab:d}'.format(**table))
Georg Brandl116aa622007-08-15 14:28:22 +0000195 Jack: 4098; Sjoerd: 4127; Dcab: 8637678
196
Ezio Melotti2b736602011-03-13 02:19:57 +0200197This is particularly useful in combination with the built-in function
198:func:`vars`, which returns a dictionary containing all local variables.
Georg Brandl116aa622007-08-15 14:28:22 +0000199
Mark Dickinson934896d2009-02-21 20:59:32 +0000200For a complete overview of string formatting with :meth:`str.format`, see
Benjamin Petersone6f00632008-05-26 01:03:56 +0000201:ref:`formatstrings`.
202
203
204Old string formatting
205---------------------
206
207The ``%`` operator can also be used for string formatting. It interprets the
Georg Brandl60203b42010-10-06 10:11:56 +0000208left argument much like a :c:func:`sprintf`\ -style format string to be applied
Benjamin Petersone6f00632008-05-26 01:03:56 +0000209to the right argument, and returns the string resulting from this formatting
210operation. For example::
211
212 >>> import math
Georg Brandl11e18b02008-08-05 09:04:16 +0000213 >>> print('The value of PI is approximately %5.3f.' % math.pi)
Benjamin Petersone6f00632008-05-26 01:03:56 +0000214 The value of PI is approximately 3.142.
215
216Since :meth:`str.format` is quite new, a lot of Python code still uses the ``%``
Alexandre Vassalottie223eb82009-07-29 20:12:15 +0000217operator. However, because this old style of formatting will eventually be
218removed from the language, :meth:`str.format` should generally be used.
Benjamin Petersone6f00632008-05-26 01:03:56 +0000219
220More information can be found in the :ref:`old-string-formatting` section.
221
Georg Brandl116aa622007-08-15 14:28:22 +0000222
223.. _tut-files:
224
225Reading and Writing Files
226=========================
227
228.. index::
229 builtin: open
230 object: file
231
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000232:func:`open` returns a :term:`file object`, and is most commonly used with
233two arguments: ``open(filename, mode)``.
Georg Brandl116aa622007-08-15 14:28:22 +0000234
Georg Brandl116aa622007-08-15 14:28:22 +0000235::
236
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000237 >>> f = open('/tmp/workfile', 'w')
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000238
239.. XXX str(f) is <io.TextIOWrapper object at 0x82e8dc4>
240
Guido van Rossum0616b792007-08-31 03:25:11 +0000241 >>> print(f)
Georg Brandl116aa622007-08-15 14:28:22 +0000242 <open file '/tmp/workfile', mode 'w' at 80a0960>
243
244The first argument is a string containing the filename. The second argument is
245another string containing a few characters describing the way in which the file
246will be used. *mode* can be ``'r'`` when the file will only be read, ``'w'``
247for only writing (an existing file with the same name will be erased), and
248``'a'`` opens the file for appending; any data written to the file is
249automatically added to the end. ``'r+'`` opens the file for both reading and
250writing. The *mode* argument is optional; ``'r'`` will be assumed if it's
251omitted.
252
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000253Normally, files are opened in :dfn:`text mode`, that means, you read and write
254strings from and to the file, which are encoded in a specific encoding (the
255default being UTF-8). ``'b'`` appended to the mode opens the file in
256:dfn:`binary mode`: now the data is read and written in the form of bytes
257objects. This mode should be used for all files that don't contain text.
Skip Montanaro4e02c502007-09-26 01:10:12 +0000258
Chris Jerdonek5bf7f1f2012-10-17 20:17:41 -0700259In text mode, the default when reading is to convert platform-specific line
260endings (``\n`` on Unix, ``\r\n`` on Windows) to just ``\n``. When writing in
261text mode, the default is to convert occurrences of ``\n`` back to
262platform-specific line endings. This behind-the-scenes modification
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000263to file data is fine for text files, but will corrupt binary data like that in
264:file:`JPEG` or :file:`EXE` files. Be very careful to use binary mode when
265reading and writing such files.
Georg Brandl116aa622007-08-15 14:28:22 +0000266
267
268.. _tut-filemethods:
269
270Methods of File Objects
271-----------------------
272
273The rest of the examples in this section will assume that a file object called
274``f`` has already been created.
275
276To read a file's contents, call ``f.read(size)``, which reads some quantity of
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000277data and returns it as a string or bytes object. *size* is an optional numeric
278argument. When *size* is omitted or negative, the entire contents of the file
279will be read and returned; it's your problem if the file is twice as large as
280your machine's memory. Otherwise, at most *size* bytes are read and returned.
281If the end of the file has been reached, ``f.read()`` will return an empty
282string (``''``). ::
Georg Brandl116aa622007-08-15 14:28:22 +0000283
284 >>> f.read()
285 'This is the entire file.\n'
286 >>> f.read()
287 ''
288
289``f.readline()`` reads a single line from the file; a newline character (``\n``)
290is left at the end of the string, and is only omitted on the last line of the
291file if the file doesn't end in a newline. This makes the return value
292unambiguous; if ``f.readline()`` returns an empty string, the end of the file
293has been reached, while a blank line is represented by ``'\n'``, a string
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000294containing only a single newline. ::
Georg Brandl116aa622007-08-15 14:28:22 +0000295
296 >>> f.readline()
297 'This is the first line of the file.\n'
298 >>> f.readline()
299 'Second line of the file\n'
300 >>> f.readline()
301 ''
302
303``f.readlines()`` returns a list containing all the lines of data in the file.
304If given an optional parameter *sizehint*, it reads that many bytes from the
305file and enough more to complete a line, and returns the lines from that. This
306is often used to allow efficient reading of a large file by lines, but without
307having to load the entire file in memory. Only complete lines will be returned.
308::
309
310 >>> f.readlines()
311 ['This is the first line of the file.\n', 'Second line of the file\n']
312
Thomas Wouters8ce81f72007-09-20 18:22:40 +0000313An alternative approach to reading lines is to loop over the file object. This is
Georg Brandl116aa622007-08-15 14:28:22 +0000314memory efficient, fast, and leads to simpler code::
315
316 >>> for line in f:
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000317 ... print(line, end='')
318 ...
Georg Brandl116aa622007-08-15 14:28:22 +0000319 This is the first line of the file.
320 Second line of the file
321
322The alternative approach is simpler but does not provide as fine-grained
323control. Since the two approaches manage line buffering differently, they
324should not be mixed.
325
326``f.write(string)`` writes the contents of *string* to the file, returning
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000327the number of characters written. ::
Georg Brandl116aa622007-08-15 14:28:22 +0000328
329 >>> f.write('This is a test\n')
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000330 15
Georg Brandl116aa622007-08-15 14:28:22 +0000331
332To write something other than a string, it needs to be converted to a string
333first::
334
335 >>> value = ('the answer', 42)
336 >>> s = str(value)
337 >>> f.write(s)
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000338 18
Georg Brandl116aa622007-08-15 14:28:22 +0000339
340``f.tell()`` returns an integer giving the file object's current position in the
341file, measured in bytes from the beginning of the file. To change the file
342object's position, use ``f.seek(offset, from_what)``. The position is computed
343from adding *offset* to a reference point; the reference point is selected by
344the *from_what* argument. A *from_what* value of 0 measures from the beginning
345of the file, 1 uses the current file position, and 2 uses the end of the file as
346the reference point. *from_what* can be omitted and defaults to 0, using the
347beginning of the file as the reference point. ::
348
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000349 >>> f = open('/tmp/workfile', 'rb+')
350 >>> f.write(b'0123456789abcdef')
351 16
Georg Brandl116aa622007-08-15 14:28:22 +0000352 >>> f.seek(5) # Go to the 6th byte in the file
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000353 5
Georg Brandl48310cd2009-01-03 21:18:54 +0000354 >>> f.read(1)
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000355 b'5'
Georg Brandl116aa622007-08-15 14:28:22 +0000356 >>> f.seek(-3, 2) # Go to the 3rd byte before the end
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000357 13
Georg Brandl116aa622007-08-15 14:28:22 +0000358 >>> f.read(1)
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000359 b'd'
Georg Brandl116aa622007-08-15 14:28:22 +0000360
Georg Brandl0dcb7ac2008-08-08 07:04:38 +0000361In text files (those opened without a ``b`` in the mode string), only seeks
362relative to the beginning of the file are allowed (the exception being seeking
363to the very file end with ``seek(0, 2)``).
Georg Brandl48310cd2009-01-03 21:18:54 +0000364
Georg Brandl116aa622007-08-15 14:28:22 +0000365When you're done with a file, call ``f.close()`` to close it and free up any
366system resources taken up by the open file. After calling ``f.close()``,
367attempts to use the file object will automatically fail. ::
368
369 >>> f.close()
370 >>> f.read()
371 Traceback (most recent call last):
372 File "<stdin>", line 1, in ?
373 ValueError: I/O operation on closed file
374
Georg Brandl3dbca812008-07-23 16:10:53 +0000375It is good practice to use the :keyword:`with` keyword when dealing with file
376objects. This has the advantage that the file is properly closed after its
377suite finishes, even if an exception is raised on the way. It is also much
378shorter than writing equivalent :keyword:`try`\ -\ :keyword:`finally` blocks::
379
380 >>> with open('/tmp/workfile', 'r') as f:
381 ... read_data = f.read()
382 >>> f.closed
383 True
384
Alexandre Vassalotti6d3dfc32009-07-29 19:54:39 +0000385File objects have some additional methods, such as :meth:`~file.isatty` and
386:meth:`~file.truncate` which are less frequently used; consult the Library
387Reference for a complete guide to file objects.
Georg Brandl116aa622007-08-15 14:28:22 +0000388
389
390.. _tut-pickle:
391
392The :mod:`pickle` Module
393------------------------
394
395.. index:: module: pickle
396
397Strings can easily be written to and read from a file. Numbers take a bit more
398effort, since the :meth:`read` method only returns strings, which will have to
399be passed to a function like :func:`int`, which takes a string like ``'123'``
400and returns its numeric value 123. However, when you want to save more complex
401data types like lists, dictionaries, or class instances, things get a lot more
402complicated.
403
404Rather than have users be constantly writing and debugging code to save
405complicated data types, Python provides a standard module called :mod:`pickle`.
406This is an amazing module that can take almost any Python object (even some
407forms of Python code!), and convert it to a string representation; this process
408is called :dfn:`pickling`. Reconstructing the object from the string
409representation is called :dfn:`unpickling`. Between pickling and unpickling,
410the string representing the object may have been stored in a file or data, or
411sent over a network connection to some distant machine.
412
413If you have an object ``x``, and a file object ``f`` that's been opened for
414writing, the simplest way to pickle the object takes only one line of code::
415
416 pickle.dump(x, f)
417
418To unpickle the object again, if ``f`` is a file object which has been opened
419for reading::
420
421 x = pickle.load(f)
422
423(There are other variants of this, used when pickling many objects or when you
424don't want to write the pickled data to a file; consult the complete
425documentation for :mod:`pickle` in the Python Library Reference.)
426
427:mod:`pickle` is the standard way to make Python objects which can be stored and
428reused by other programs or by a future invocation of the same program; the
429technical term for this is a :dfn:`persistent` object. Because :mod:`pickle` is
430so widely used, many authors who write Python extensions take care to ensure
431that new data types such as matrices can be properly pickled and unpickled.
432
433