blob: d30c5fde7095a02cc075eebc5b91a522c2f1e986 [file] [log] [blame]
Georg Brandl014197c2008-04-09 18:40:51 +00001:mod:`io` --- Core tools for working with streams
2=================================================
3
4.. module:: io
5 :synopsis: Core tools for working with streams.
6.. moduleauthor:: Guido van Rossum <guido@python.org>
7.. moduleauthor:: Mike Verdone <mike.verdone@gmail.com>
8.. moduleauthor:: Mark Russell <mark.russell@zen.co.uk>
Benjamin Peterson4fa88fa2009-03-04 00:14:51 +00009.. moduleauthor:: Antoine Pitrou <solipsis@pitrou.net>
10.. moduleauthor:: Amaury Forgeot d'Arc <amauryfa@gmail.com>
Benjamin Petersonef9f2bd2009-05-01 20:45:43 +000011.. moduleauthor:: Benjamin Peterson <benjamin@python.org>
Benjamin Peterson058e31e2009-01-16 03:54:08 +000012.. sectionauthor:: Benjamin Peterson <benjamin@python.org>
Georg Brandl014197c2008-04-09 18:40:51 +000013
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000014.. _io-overview:
15
Antoine Pitroub530e142010-08-30 12:41:00 +000016Overview
17--------
Georg Brandl014197c2008-04-09 18:40:51 +000018
R David Murray9f0c9402012-08-17 20:33:54 -040019.. index::
20 single: file object; io module
21
22The :mod:`io` module provides Python's main facilities for dealing with various
23types of I/O. There are three main types of I/O: *text I/O*, *binary I/O*
24and *raw I/O*. These are generic categories, and various backing stores can
25be used for each of them. A concrete object belonging to any of these
26categories is called a :term:`file object`. Other common terms are *stream*
27and *file-like object*.
Georg Brandl014197c2008-04-09 18:40:51 +000028
Antoine Pitroub530e142010-08-30 12:41:00 +000029Independently of its category, each concrete stream object will also have
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000030various capabilities: it can be read-only, write-only, or read-write. It can
31also allow arbitrary random access (seeking forwards or backwards to any
32location), or only sequential access (for example in the case of a socket or
33pipe).
Georg Brandl014197c2008-04-09 18:40:51 +000034
Antoine Pitroub530e142010-08-30 12:41:00 +000035All streams are careful about the type of data you give to them. For example
36giving a :class:`str` object to the ``write()`` method of a binary stream
37will raise a ``TypeError``. So will giving a :class:`bytes` object to the
38``write()`` method of a text stream.
Georg Brandl014197c2008-04-09 18:40:51 +000039
Antoine Pitroua787b652011-10-12 19:02:52 +020040.. versionchanged:: 3.3
Eli Benderskyf877a7c2012-07-14 21:22:25 +030041 Operations that used to raise :exc:`IOError` now raise :exc:`OSError`, since
42 :exc:`IOError` is now an alias of :exc:`OSError`.
Antoine Pitroua787b652011-10-12 19:02:52 +020043
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000044
Antoine Pitroub530e142010-08-30 12:41:00 +000045Text I/O
46^^^^^^^^
Georg Brandl014197c2008-04-09 18:40:51 +000047
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000048Text I/O expects and produces :class:`str` objects. This means that whenever
49the backing store is natively made of bytes (such as in the case of a file),
50encoding and decoding of data is made transparently as well as optional
51translation of platform-specific newline characters.
Benjamin Peterson2c5f8282008-04-13 00:27:46 +000052
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000053The easiest way to create a text stream is with :meth:`open()`, optionally
54specifying an encoding::
Antoine Pitroub530e142010-08-30 12:41:00 +000055
56 f = open("myfile.txt", "r", encoding="utf-8")
57
58In-memory text streams are also available as :class:`StringIO` objects::
59
60 f = io.StringIO("some initial text data")
61
Eli Benderskyf877a7c2012-07-14 21:22:25 +030062The text stream API is described in detail in the documentation of
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000063:class:`TextIOBase`.
Antoine Pitroub530e142010-08-30 12:41:00 +000064
Antoine Pitroub530e142010-08-30 12:41:00 +000065
66Binary I/O
67^^^^^^^^^^
68
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000069Binary I/O (also called *buffered I/O*) expects and produces :class:`bytes`
70objects. No encoding, decoding, or newline translation is performed. This
71category of streams can be used for all kinds of non-text data, and also when
72manual control over the handling of text data is desired.
Antoine Pitroub530e142010-08-30 12:41:00 +000073
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000074The easiest way to create a binary stream is with :meth:`open()` with ``'b'`` in
75the mode string::
Antoine Pitroub530e142010-08-30 12:41:00 +000076
77 f = open("myfile.jpg", "rb")
78
79In-memory binary streams are also available as :class:`BytesIO` objects::
80
81 f = io.BytesIO(b"some initial binary data: \x00\x01")
82
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000083The binary stream API is described in detail in the docs of
84:class:`BufferedIOBase`.
Antoine Pitroub530e142010-08-30 12:41:00 +000085
86Other library modules may provide additional ways to create text or binary
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000087streams. See :meth:`socket.socket.makefile` for example.
88
Antoine Pitroub530e142010-08-30 12:41:00 +000089
90Raw I/O
91^^^^^^^
92
93Raw I/O (also called *unbuffered I/O*) is generally used as a low-level
94building-block for binary and text streams; it is rarely useful to directly
Benjamin Peterson6b4fa772010-08-30 13:19:53 +000095manipulate a raw stream from user code. Nevertheless, you can create a raw
96stream by opening a file in binary mode with buffering disabled::
Antoine Pitroub530e142010-08-30 12:41:00 +000097
98 f = open("myfile.jpg", "rb", buffering=0)
99
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000100The raw stream API is described in detail in the docs of :class:`RawIOBase`.
Benjamin Petersoncc12e1b2010-02-19 00:58:13 +0000101
Georg Brandl014197c2008-04-09 18:40:51 +0000102
Antoine Pitroub530e142010-08-30 12:41:00 +0000103High-level Module Interface
104---------------------------
Georg Brandl014197c2008-04-09 18:40:51 +0000105
106.. data:: DEFAULT_BUFFER_SIZE
107
108 An int containing the default buffer size used by the module's buffered I/O
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000109 classes. :func:`open` uses the file's blksize (as obtained by
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000110 :func:`os.stat`) if possible.
Georg Brandl014197c2008-04-09 18:40:51 +0000111
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000112
Andrew Svetlova60de4f2013-02-17 16:55:58 +0200113.. function:: open(file, mode='r', buffering=-1, encoding=None, errors=None, newline=None, closefd=True, opener=None)
Georg Brandl014197c2008-04-09 18:40:51 +0000114
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000115 This is an alias for the builtin :func:`open` function.
Georg Brandl014197c2008-04-09 18:40:51 +0000116
Georg Brandl014197c2008-04-09 18:40:51 +0000117
118.. exception:: BlockingIOError
119
Antoine Pitrouf55011f2011-10-12 18:57:23 +0200120 This is a compatibility alias for the builtin :exc:`BlockingIOError`
121 exception.
Georg Brandl014197c2008-04-09 18:40:51 +0000122
123
124.. exception:: UnsupportedOperation
125
Antoine Pitroua787b652011-10-12 19:02:52 +0200126 An exception inheriting :exc:`OSError` and :exc:`ValueError` that is raised
Georg Brandl014197c2008-04-09 18:40:51 +0000127 when an unsupported operation is called on a stream.
128
129
Antoine Pitroub530e142010-08-30 12:41:00 +0000130In-memory streams
131^^^^^^^^^^^^^^^^^
132
133It is also possible to use a :class:`str` or :class:`bytes`-like object as a
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000134file for both reading and writing. For strings :class:`StringIO` can be used
135like a file opened in text mode. :class:`BytesIO` can be used like a file
136opened in binary mode. Both provide full read-write capabilities with random
137access.
Antoine Pitroub530e142010-08-30 12:41:00 +0000138
139
140.. seealso::
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000141
Antoine Pitroub530e142010-08-30 12:41:00 +0000142 :mod:`sys`
143 contains the standard IO streams: :data:`sys.stdin`, :data:`sys.stdout`,
144 and :data:`sys.stderr`.
145
146
147Class hierarchy
148---------------
149
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000150The implementation of I/O streams is organized as a hierarchy of classes. First
151:term:`abstract base classes <abstract base class>` (ABCs), which are used to
152specify the various categories of streams, then concrete classes providing the
153standard stream implementations.
Antoine Pitroub530e142010-08-30 12:41:00 +0000154
155 .. note::
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000156
157 The abstract base classes also provide default implementations of some
158 methods in order to help implementation of concrete stream classes. For
159 example, :class:`BufferedIOBase` provides unoptimized implementations of
160 ``readinto()`` and ``readline()``.
Antoine Pitroub530e142010-08-30 12:41:00 +0000161
162At the top of the I/O hierarchy is the abstract base class :class:`IOBase`. It
163defines the basic interface to a stream. Note, however, that there is no
164separation between reading and writing to streams; implementations are allowed
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000165to raise :exc:`UnsupportedOperation` if they do not support a given operation.
Antoine Pitroub530e142010-08-30 12:41:00 +0000166
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000167The :class:`RawIOBase` ABC extends :class:`IOBase`. It deals with the reading
168and writing of bytes to a stream. :class:`FileIO` subclasses :class:`RawIOBase`
169to provide an interface to files in the machine's file system.
Antoine Pitroub530e142010-08-30 12:41:00 +0000170
171The :class:`BufferedIOBase` ABC deals with buffering on a raw byte stream
172(:class:`RawIOBase`). Its subclasses, :class:`BufferedWriter`,
173:class:`BufferedReader`, and :class:`BufferedRWPair` buffer streams that are
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000174readable, writable, and both readable and writable. :class:`BufferedRandom`
175provides a buffered interface to random access streams. Another
Georg Brandl682d7e02010-10-06 10:26:05 +0000176:class:`BufferedIOBase` subclass, :class:`BytesIO`, is a stream of in-memory
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000177bytes.
Antoine Pitroub530e142010-08-30 12:41:00 +0000178
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000179The :class:`TextIOBase` ABC, another subclass of :class:`IOBase`, deals with
180streams whose bytes represent text, and handles encoding and decoding to and
181from strings. :class:`TextIOWrapper`, which extends it, is a buffered text
182interface to a buffered raw stream (:class:`BufferedIOBase`). Finally,
183:class:`StringIO` is an in-memory stream for text.
Antoine Pitroub530e142010-08-30 12:41:00 +0000184
185Argument names are not part of the specification, and only the arguments of
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000186:func:`open` are intended to be used as keyword arguments.
Antoine Pitroub530e142010-08-30 12:41:00 +0000187
Andrew Svetloved636a82012-12-06 12:20:56 +0200188The following table summarizes the ABCs provided by the :mod:`io` module:
189
190========================= ================== ======================== ==================================================
191ABC Inherits Stub Methods Mixin Methods and Properties
192========================= ================== ======================== ==================================================
193:class:`IOBase` ``fileno``, ``seek``, ``close``, ``closed``, ``__enter__``,
194 and ``truncate`` ``__exit__``, ``flush``, ``isatty``, ``__iter__``,
195 ``__next__``, ``readable``, ``readline``,
196 ``readlines``, ``seekable``, ``tell``,
197 ``writable``, and ``writelines``
198:class:`RawIOBase` :class:`IOBase` ``readinto`` and Inherited :class:`IOBase` methods, ``read``,
199 ``write`` and ``readall``
200:class:`BufferedIOBase` :class:`IOBase` ``detach``, ``read``, Inherited :class:`IOBase` methods, ``readinto``
201 ``read1``, and ``write``
202:class:`TextIOBase` :class:`IOBase` ``detach``, ``read``, Inherited :class:`IOBase` methods, ``encoding``,
203 ``readline``, and ``errors``, and ``newlines``
204 ``write``
205========================= ================== ======================== ==================================================
206
Antoine Pitroub530e142010-08-30 12:41:00 +0000207
Georg Brandl014197c2008-04-09 18:40:51 +0000208I/O Base Classes
Antoine Pitroub530e142010-08-30 12:41:00 +0000209^^^^^^^^^^^^^^^^
Georg Brandl014197c2008-04-09 18:40:51 +0000210
211.. class:: IOBase
212
213 The abstract base class for all I/O classes, acting on streams of bytes.
214 There is no public constructor.
215
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000216 This class provides empty abstract implementations for many methods
217 that derived classes can override selectively; the default
218 implementations represent a file that cannot be read, written or
219 seeked.
Georg Brandl014197c2008-04-09 18:40:51 +0000220
221 Even though :class:`IOBase` does not declare :meth:`read`, :meth:`readinto`,
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000222 or :meth:`write` because their signatures will vary, implementations and
223 clients should consider those methods part of the interface. Also,
Antoine Pitroua787b652011-10-12 19:02:52 +0200224 implementations may raise a :exc:`ValueError` (or :exc:`UnsupportedOperation`)
225 when operations they do not support are called.
Georg Brandl014197c2008-04-09 18:40:51 +0000226
227 The basic type used for binary data read from or written to a file is
228 :class:`bytes`. :class:`bytearray`\s are accepted too, and in some cases
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000229 (such as :class:`readinto`) required. Text I/O classes work with
230 :class:`str` data.
Georg Brandl014197c2008-04-09 18:40:51 +0000231
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000232 Note that calling any method (even inquiries) on a closed stream is
Antoine Pitroua787b652011-10-12 19:02:52 +0200233 undefined. Implementations may raise :exc:`ValueError` in this case.
Georg Brandl014197c2008-04-09 18:40:51 +0000234
Éric Araujo3f7c0e42012-12-08 22:53:43 -0500235 :class:`IOBase` (and its subclasses) supports the iterator protocol, meaning
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300236 that an :class:`IOBase` object can be iterated over yielding the lines in a
237 stream. Lines are defined slightly differently depending on whether the
238 stream is a binary stream (yielding bytes), or a text stream (yielding
239 character strings). See :meth:`~IOBase.readline` below.
Georg Brandl014197c2008-04-09 18:40:51 +0000240
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300241 :class:`IOBase` is also a context manager and therefore supports the
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000242 :keyword:`with` statement. In this example, *file* is closed after the
243 :keyword:`with` statement's suite is finished---even if an exception occurs::
Georg Brandl014197c2008-04-09 18:40:51 +0000244
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000245 with open('spam.txt', 'w') as file:
246 file.write('Spam and eggs!')
Georg Brandl014197c2008-04-09 18:40:51 +0000247
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000248 :class:`IOBase` provides these data attributes and methods:
Georg Brandl014197c2008-04-09 18:40:51 +0000249
250 .. method:: close()
251
Christian Heimesecc42a22008-11-05 19:30:32 +0000252 Flush and close this stream. This method has no effect if the file is
Georg Brandl48310cd2009-01-03 21:18:54 +0000253 already closed. Once the file is closed, any operation on the file
Georg Brandl8569e582010-05-19 20:57:08 +0000254 (e.g. reading or writing) will raise a :exc:`ValueError`.
Antoine Pitrouf9fc08f2010-04-28 19:59:32 +0000255
256 As a convenience, it is allowed to call this method more than once;
257 only the first call, however, will have an effect.
Georg Brandl014197c2008-04-09 18:40:51 +0000258
259 .. attribute:: closed
260
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300261 ``True`` if the stream is closed.
Georg Brandl014197c2008-04-09 18:40:51 +0000262
263 .. method:: fileno()
264
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000265 Return the underlying file descriptor (an integer) of the stream if it
Antoine Pitroua787b652011-10-12 19:02:52 +0200266 exists. An :exc:`OSError` is raised if the IO object does not use a file
Georg Brandl014197c2008-04-09 18:40:51 +0000267 descriptor.
268
269 .. method:: flush()
270
Benjamin Petersonb85a5842008-04-13 21:39:58 +0000271 Flush the write buffers of the stream if applicable. This does nothing
272 for read-only and non-blocking streams.
Georg Brandl014197c2008-04-09 18:40:51 +0000273
274 .. method:: isatty()
275
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000276 Return ``True`` if the stream is interactive (i.e., connected to
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000277 a terminal/tty device).
Georg Brandl014197c2008-04-09 18:40:51 +0000278
279 .. method:: readable()
280
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000281 Return ``True`` if the stream can be read from. If False, :meth:`read`
Antoine Pitroua787b652011-10-12 19:02:52 +0200282 will raise :exc:`OSError`.
Georg Brandl014197c2008-04-09 18:40:51 +0000283
Georg Brandl3dd33882009-06-01 17:35:27 +0000284 .. method:: readline(limit=-1)
Georg Brandl014197c2008-04-09 18:40:51 +0000285
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000286 Read and return one line from the stream. If *limit* is specified, at
287 most *limit* bytes will be read.
Georg Brandl014197c2008-04-09 18:40:51 +0000288
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000289 The line terminator is always ``b'\n'`` for binary files; for text files,
Benjamin Peterson6b4fa772010-08-30 13:19:53 +0000290 the *newlines* argument to :func:`open` can be used to select the line
Georg Brandl014197c2008-04-09 18:40:51 +0000291 terminator(s) recognized.
292
Georg Brandl3dd33882009-06-01 17:35:27 +0000293 .. method:: readlines(hint=-1)
Georg Brandl014197c2008-04-09 18:40:51 +0000294
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000295 Read and return a list of lines from the stream. *hint* can be specified
296 to control the number of lines read: no more lines will be read if the
297 total size (in bytes/characters) of all lines so far exceeds *hint*.
Georg Brandl014197c2008-04-09 18:40:51 +0000298
Georg Brandl3dd33882009-06-01 17:35:27 +0000299 .. method:: seek(offset, whence=SEEK_SET)
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000300
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000301 Change the stream position to the given byte *offset*. *offset* is
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000302 interpreted relative to the position indicated by *whence*. Values for
303 *whence* are:
304
Benjamin Peterson0e4caf42009-04-01 21:22:20 +0000305 * :data:`SEEK_SET` or ``0`` -- start of the stream (the default);
306 *offset* should be zero or positive
307 * :data:`SEEK_CUR` or ``1`` -- current stream position; *offset* may
308 be negative
309 * :data:`SEEK_END` or ``2`` -- end of the stream; *offset* is usually
310 negative
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000311
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000312 Return the new absolute position.
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000313
Raymond Hettinger35a88362009-04-09 00:08:24 +0000314 .. versionadded:: 3.1
Georg Brandl67b21b72010-08-17 15:07:14 +0000315 The ``SEEK_*`` constants.
Benjamin Peterson0e4caf42009-04-01 21:22:20 +0000316
Jesus Cea94363612012-06-22 18:32:07 +0200317 .. versionadded:: 3.3
318 Some operating systems could support additional values, like
319 :data:`os.SEEK_HOLE` or :data:`os.SEEK_DATA`. The valid values
320 for a file could depend on it being open in text or binary mode.
321
Georg Brandl014197c2008-04-09 18:40:51 +0000322 .. method:: seekable()
323
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000324 Return ``True`` if the stream supports random access. If ``False``,
Antoine Pitroua787b652011-10-12 19:02:52 +0200325 :meth:`seek`, :meth:`tell` and :meth:`truncate` will raise :exc:`OSError`.
Georg Brandl014197c2008-04-09 18:40:51 +0000326
327 .. method:: tell()
328
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000329 Return the current stream position.
Georg Brandl014197c2008-04-09 18:40:51 +0000330
Georg Brandl3dd33882009-06-01 17:35:27 +0000331 .. method:: truncate(size=None)
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000332
Antoine Pitrou2016dc92010-05-29 12:08:25 +0000333 Resize the stream to the given *size* in bytes (or the current position
334 if *size* is not specified). The current stream position isn't changed.
335 This resizing can extend or reduce the current file size. In case of
336 extension, the contents of the new file area depend on the platform
337 (on most systems, additional bytes are zero-filled, on Windows they're
338 undetermined). The new file size is returned.
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000339
Georg Brandl014197c2008-04-09 18:40:51 +0000340 .. method:: writable()
341
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000342 Return ``True`` if the stream supports writing. If ``False``,
Antoine Pitroua787b652011-10-12 19:02:52 +0200343 :meth:`write` and :meth:`truncate` will raise :exc:`OSError`.
Georg Brandl014197c2008-04-09 18:40:51 +0000344
345 .. method:: writelines(lines)
346
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000347 Write a list of lines to the stream. Line separators are not added, so it
348 is usual for each of the lines provided to have a line separator at the
349 end.
Georg Brandl014197c2008-04-09 18:40:51 +0000350
351
352.. class:: RawIOBase
353
354 Base class for raw binary I/O. It inherits :class:`IOBase`. There is no
355 public constructor.
356
Antoine Pitrou497a7672009-09-17 17:18:01 +0000357 Raw binary I/O typically provides low-level access to an underlying OS
358 device or API, and does not try to encapsulate it in high-level primitives
359 (this is left to Buffered I/O and Text I/O, described later in this page).
360
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000361 In addition to the attributes and methods from :class:`IOBase`,
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300362 :class:`RawIOBase` provides the following methods:
Georg Brandl014197c2008-04-09 18:40:51 +0000363
Georg Brandl3dd33882009-06-01 17:35:27 +0000364 .. method:: read(n=-1)
Georg Brandl014197c2008-04-09 18:40:51 +0000365
Antoine Pitrou78ddbe62009-10-01 16:24:45 +0000366 Read up to *n* bytes from the object and return them. As a convenience,
367 if *n* is unspecified or -1, :meth:`readall` is called. Otherwise,
368 only one system call is ever made. Fewer than *n* bytes may be
369 returned if the operating system call returns fewer than *n* bytes.
370
371 If 0 bytes are returned, and *n* was not 0, this indicates end of file.
372 If the object is in non-blocking mode and no bytes are available,
373 ``None`` is returned.
Georg Brandl014197c2008-04-09 18:40:51 +0000374
Benjamin Petersonb47aace2008-04-09 21:38:38 +0000375 .. method:: readall()
Georg Brandl014197c2008-04-09 18:40:51 +0000376
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000377 Read and return all the bytes from the stream until EOF, using multiple
378 calls to the stream if necessary.
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000379
380 .. method:: readinto(b)
381
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300382 Read up to ``len(b)`` bytes into :class:`bytearray` *b* and return the
383 number of bytes read. If the object is in non-blocking mode and no
Daniel Stutzbachd01df462010-11-30 17:49:53 +0000384 bytes are available, ``None`` is returned.
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000385
386 .. method:: write(b)
387
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300388 Write the given :class:`bytes` or :class:`bytearray` object, *b*, to the
389 underlying raw stream and return the number of bytes written. This can
390 be less than ``len(b)``, depending on specifics of the underlying raw
391 stream, and especially if it is in non-blocking mode. ``None`` is
392 returned if the raw stream is set not to block and no single byte could
393 be readily written to it.
Georg Brandl014197c2008-04-09 18:40:51 +0000394
395
Georg Brandl014197c2008-04-09 18:40:51 +0000396.. class:: BufferedIOBase
397
Antoine Pitrou497a7672009-09-17 17:18:01 +0000398 Base class for binary streams that support some kind of buffering.
399 It inherits :class:`IOBase`. There is no public constructor.
Georg Brandl014197c2008-04-09 18:40:51 +0000400
Antoine Pitrou497a7672009-09-17 17:18:01 +0000401 The main difference with :class:`RawIOBase` is that methods :meth:`read`,
402 :meth:`readinto` and :meth:`write` will try (respectively) to read as much
403 input as requested or to consume all given output, at the expense of
404 making perhaps more than one system call.
405
406 In addition, those methods can raise :exc:`BlockingIOError` if the
407 underlying raw stream is in non-blocking mode and cannot take or give
408 enough data; unlike their :class:`RawIOBase` counterparts, they will
409 never return ``None``.
410
411 Besides, the :meth:`read` method does not have a default
Georg Brandl014197c2008-04-09 18:40:51 +0000412 implementation that defers to :meth:`readinto`.
413
Antoine Pitrou497a7672009-09-17 17:18:01 +0000414 A typical :class:`BufferedIOBase` implementation should not inherit from a
415 :class:`RawIOBase` implementation, but wrap one, like
416 :class:`BufferedWriter` and :class:`BufferedReader` do.
Georg Brandl014197c2008-04-09 18:40:51 +0000417
Senthil Kumarana6bac952011-07-04 11:28:30 -0700418 :class:`BufferedIOBase` provides or overrides these methods and attribute in
419 addition to those from :class:`IOBase`:
Georg Brandl014197c2008-04-09 18:40:51 +0000420
Benjamin Petersonc609b6b2009-06-28 17:32:20 +0000421 .. attribute:: raw
422
423 The underlying raw stream (a :class:`RawIOBase` instance) that
424 :class:`BufferedIOBase` deals with. This is not part of the
425 :class:`BufferedIOBase` API and may not exist on some implementations.
426
Benjamin Petersond2e0c792009-05-01 20:40:59 +0000427 .. method:: detach()
428
429 Separate the underlying raw stream from the buffer and return it.
430
431 After the raw stream has been detached, the buffer is in an unusable
432 state.
433
434 Some buffers, like :class:`BytesIO`, do not have the concept of a single
435 raw stream to return from this method. They raise
436 :exc:`UnsupportedOperation`.
437
Benjamin Petersonedc36472009-05-01 20:48:14 +0000438 .. versionadded:: 3.1
439
Georg Brandl3dd33882009-06-01 17:35:27 +0000440 .. method:: read(n=-1)
Georg Brandl014197c2008-04-09 18:40:51 +0000441
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000442 Read and return up to *n* bytes. If the argument is omitted, ``None``, or
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300443 negative, data is read and returned until EOF is reached. An empty
444 :class:`bytes` object is returned if the stream is already at EOF.
Georg Brandl014197c2008-04-09 18:40:51 +0000445
446 If the argument is positive, and the underlying raw stream is not
447 interactive, multiple raw reads may be issued to satisfy the byte count
448 (unless EOF is reached first). But for interactive raw streams, at most
449 one raw read will be issued, and a short result does not imply that EOF is
450 imminent.
451
Antoine Pitrou497a7672009-09-17 17:18:01 +0000452 A :exc:`BlockingIOError` is raised if the underlying raw stream is in
453 non blocking-mode, and has no data available at the moment.
Georg Brandl014197c2008-04-09 18:40:51 +0000454
Georg Brandl3dd33882009-06-01 17:35:27 +0000455 .. method:: read1(n=-1)
Benjamin Peterson4fa88fa2009-03-04 00:14:51 +0000456
457 Read and return up to *n* bytes, with at most one call to the underlying
Antoine Pitrou497a7672009-09-17 17:18:01 +0000458 raw stream's :meth:`~RawIOBase.read` method. This can be useful if you
459 are implementing your own buffering on top of a :class:`BufferedIOBase`
460 object.
Benjamin Peterson4fa88fa2009-03-04 00:14:51 +0000461
Georg Brandl014197c2008-04-09 18:40:51 +0000462 .. method:: readinto(b)
463
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300464 Read up to ``len(b)`` bytes into bytearray *b* and return the number of
465 bytes read.
Georg Brandl014197c2008-04-09 18:40:51 +0000466
467 Like :meth:`read`, multiple reads may be issued to the underlying raw
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300468 stream, unless the latter is interactive.
Georg Brandl014197c2008-04-09 18:40:51 +0000469
Antoine Pitrou497a7672009-09-17 17:18:01 +0000470 A :exc:`BlockingIOError` is raised if the underlying raw stream is in
471 non blocking-mode, and has no data available at the moment.
Georg Brandl014197c2008-04-09 18:40:51 +0000472
Georg Brandl014197c2008-04-09 18:40:51 +0000473 .. method:: write(b)
474
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300475 Write the given :class:`bytes` or :class:`bytearray` object, *b* and
476 return the number of bytes written (never less than ``len(b)``, since if
477 the write fails an :exc:`OSError` will be raised). Depending on the
478 actual implementation, these bytes may be readily written to the
479 underlying stream, or held in a buffer for performance and latency
480 reasons.
Georg Brandl014197c2008-04-09 18:40:51 +0000481
Antoine Pitrou497a7672009-09-17 17:18:01 +0000482 When in non-blocking mode, a :exc:`BlockingIOError` is raised if the
483 data needed to be written to the raw stream but it couldn't accept
484 all the data without blocking.
Georg Brandl014197c2008-04-09 18:40:51 +0000485
486
Benjamin Petersonaa069002009-01-23 03:26:36 +0000487Raw File I/O
Antoine Pitroub530e142010-08-30 12:41:00 +0000488^^^^^^^^^^^^
Benjamin Petersonaa069002009-01-23 03:26:36 +0000489
Ross Lagerwall59142db2011-10-31 20:34:46 +0200490.. class:: FileIO(name, mode='r', closefd=True, opener=None)
Benjamin Petersonaa069002009-01-23 03:26:36 +0000491
Antoine Pitrou497a7672009-09-17 17:18:01 +0000492 :class:`FileIO` represents an OS-level file containing bytes data.
493 It implements the :class:`RawIOBase` interface (and therefore the
494 :class:`IOBase` interface, too).
495
496 The *name* can be one of two things:
497
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300498 * a character string or :class:`bytes` object representing the path to the
499 file which will be opened;
Antoine Pitrou497a7672009-09-17 17:18:01 +0000500 * an integer representing the number of an existing OS-level file descriptor
501 to which the resulting :class:`FileIO` object will give access.
Benjamin Petersonaa069002009-01-23 03:26:36 +0000502
Charles-François Natalidc3044c2012-01-09 22:40:02 +0100503 The *mode* can be ``'r'``, ``'w'``, ``'x'`` or ``'a'`` for reading
Charles-François Natalid612de12012-01-14 11:51:00 +0100504 (default), writing, exclusive creation or appending. The file will be
505 created if it doesn't exist when opened for writing or appending; it will be
506 truncated when opened for writing. :exc:`FileExistsError` will be raised if
507 it already exists when opened for creating. Opening a file for creating
508 implies writing, so this mode behaves in a similar way to ``'w'``. Add a
509 ``'+'`` to the mode to allow simultaneous reading and writing.
Benjamin Petersonaa069002009-01-23 03:26:36 +0000510
Antoine Pitrou497a7672009-09-17 17:18:01 +0000511 The :meth:`read` (when called with a positive argument), :meth:`readinto`
512 and :meth:`write` methods on this class will only make one system call.
513
Ross Lagerwall59142db2011-10-31 20:34:46 +0200514 A custom opener can be used by passing a callable as *opener*. The underlying
515 file descriptor for the file object is then obtained by calling *opener* with
516 (*name*, *flags*). *opener* must return an open file descriptor (passing
517 :mod:`os.open` as *opener* results in functionality similar to passing
518 ``None``).
519
Éric Araujo8f423c92012-11-03 17:06:52 -0400520 See the :func:`open` built-in function for examples on using the *opener*
521 parameter.
522
Ross Lagerwall59142db2011-10-31 20:34:46 +0200523 .. versionchanged:: 3.3
524 The *opener* parameter was added.
Charles-François Natalidc3044c2012-01-09 22:40:02 +0100525 The ``'x'`` mode was added.
Ross Lagerwall59142db2011-10-31 20:34:46 +0200526
Benjamin Petersonaa069002009-01-23 03:26:36 +0000527 In addition to the attributes and methods from :class:`IOBase` and
528 :class:`RawIOBase`, :class:`FileIO` provides the following data
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300529 attributes:
Benjamin Petersonaa069002009-01-23 03:26:36 +0000530
531 .. attribute:: mode
532
533 The mode as given in the constructor.
534
535 .. attribute:: name
536
537 The file name. This is the file descriptor of the file when no name is
538 given in the constructor.
539
Benjamin Petersonaa069002009-01-23 03:26:36 +0000540
541Buffered Streams
Antoine Pitroub530e142010-08-30 12:41:00 +0000542^^^^^^^^^^^^^^^^
Benjamin Petersonaa069002009-01-23 03:26:36 +0000543
Antoine Pitroubed81c82010-12-03 19:14:17 +0000544Buffered I/O streams provide a higher-level interface to an I/O device
545than raw I/O does.
Antoine Pitrou497a7672009-09-17 17:18:01 +0000546
Georg Brandl014197c2008-04-09 18:40:51 +0000547.. class:: BytesIO([initial_bytes])
548
549 A stream implementation using an in-memory bytes buffer. It inherits
550 :class:`BufferedIOBase`.
551
Antoine Pitroub530e142010-08-30 12:41:00 +0000552 The argument *initial_bytes* contains optional initial :class:`bytes` data.
Georg Brandl014197c2008-04-09 18:40:51 +0000553
554 :class:`BytesIO` provides or overrides these methods in addition to those
555 from :class:`BufferedIOBase` and :class:`IOBase`:
556
Antoine Pitrou972ee132010-09-06 18:48:21 +0000557 .. method:: getbuffer()
558
559 Return a readable and writable view over the contents of the buffer
560 without copying them. Also, mutating the view will transparently
561 update the contents of the buffer::
562
563 >>> b = io.BytesIO(b"abcdef")
564 >>> view = b.getbuffer()
565 >>> view[2:4] = b"56"
566 >>> b.getvalue()
567 b'ab56ef'
568
569 .. note::
570 As long as the view exists, the :class:`BytesIO` object cannot be
571 resized.
572
573 .. versionadded:: 3.2
574
Georg Brandl014197c2008-04-09 18:40:51 +0000575 .. method:: getvalue()
576
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300577 Return :class:`bytes` containing the entire contents of the buffer.
Georg Brandl014197c2008-04-09 18:40:51 +0000578
579 .. method:: read1()
580
Benjamin Peterson9efcc4b2008-04-14 21:30:21 +0000581 In :class:`BytesIO`, this is the same as :meth:`read`.
Georg Brandl014197c2008-04-09 18:40:51 +0000582
Georg Brandl014197c2008-04-09 18:40:51 +0000583
Georg Brandl3dd33882009-06-01 17:35:27 +0000584.. class:: BufferedReader(raw, buffer_size=DEFAULT_BUFFER_SIZE)
Georg Brandl014197c2008-04-09 18:40:51 +0000585
Antoine Pitrou497a7672009-09-17 17:18:01 +0000586 A buffer providing higher-level access to a readable, sequential
587 :class:`RawIOBase` object. It inherits :class:`BufferedIOBase`.
588 When reading data from this object, a larger amount of data may be
589 requested from the underlying raw stream, and kept in an internal buffer.
590 The buffered data can then be returned directly on subsequent reads.
Georg Brandl014197c2008-04-09 18:40:51 +0000591
592 The constructor creates a :class:`BufferedReader` for the given readable
593 *raw* stream and *buffer_size*. If *buffer_size* is omitted,
594 :data:`DEFAULT_BUFFER_SIZE` is used.
595
596 :class:`BufferedReader` provides or overrides these methods in addition to
597 those from :class:`BufferedIOBase` and :class:`IOBase`:
598
599 .. method:: peek([n])
600
Benjamin Petersonc43a26d2009-06-16 23:09:24 +0000601 Return bytes from the stream without advancing the position. At most one
Benjamin Peterson2a8b54d2009-06-14 14:37:23 +0000602 single read on the raw stream is done to satisfy the call. The number of
603 bytes returned may be less or more than requested.
Georg Brandl014197c2008-04-09 18:40:51 +0000604
605 .. method:: read([n])
606
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000607 Read and return *n* bytes, or if *n* is not given or negative, until EOF
Georg Brandl014197c2008-04-09 18:40:51 +0000608 or if the read call would block in non-blocking mode.
609
610 .. method:: read1(n)
611
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000612 Read and return up to *n* bytes with only one call on the raw stream. If
Georg Brandl014197c2008-04-09 18:40:51 +0000613 at least one byte is buffered, only buffered bytes are returned.
614 Otherwise, one raw stream read call is made.
615
616
Georg Brandl3dd33882009-06-01 17:35:27 +0000617.. class:: BufferedWriter(raw, buffer_size=DEFAULT_BUFFER_SIZE)
Georg Brandl014197c2008-04-09 18:40:51 +0000618
Antoine Pitrou497a7672009-09-17 17:18:01 +0000619 A buffer providing higher-level access to a writeable, sequential
620 :class:`RawIOBase` object. It inherits :class:`BufferedIOBase`.
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300621 When writing to this object, data is normally placed into an internal
Antoine Pitrou497a7672009-09-17 17:18:01 +0000622 buffer. The buffer will be written out to the underlying :class:`RawIOBase`
623 object under various conditions, including:
624
625 * when the buffer gets too small for all pending data;
626 * when :meth:`flush()` is called;
627 * when a :meth:`seek()` is requested (for :class:`BufferedRandom` objects);
628 * when the :class:`BufferedWriter` object is closed or destroyed.
Georg Brandl014197c2008-04-09 18:40:51 +0000629
630 The constructor creates a :class:`BufferedWriter` for the given writeable
631 *raw* stream. If the *buffer_size* is not given, it defaults to
Benjamin Peterson394ee002009-03-05 22:33:59 +0000632 :data:`DEFAULT_BUFFER_SIZE`.
633
Georg Brandl014197c2008-04-09 18:40:51 +0000634 :class:`BufferedWriter` provides or overrides these methods in addition to
635 those from :class:`BufferedIOBase` and :class:`IOBase`:
636
637 .. method:: flush()
638
639 Force bytes held in the buffer into the raw stream. A
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000640 :exc:`BlockingIOError` should be raised if the raw stream blocks.
Georg Brandl014197c2008-04-09 18:40:51 +0000641
642 .. method:: write(b)
643
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300644 Write the :class:`bytes` or :class:`bytearray` object, *b* and return the
645 number of bytes written. When in non-blocking mode, a
646 :exc:`BlockingIOError` is raised if the buffer needs to be written out but
647 the raw stream blocks.
Georg Brandl014197c2008-04-09 18:40:51 +0000648
649
Georg Brandl3dd33882009-06-01 17:35:27 +0000650.. class:: BufferedRandom(raw, buffer_size=DEFAULT_BUFFER_SIZE)
Georg Brandl014197c2008-04-09 18:40:51 +0000651
652 A buffered interface to random access streams. It inherits
Antoine Pitrou497a7672009-09-17 17:18:01 +0000653 :class:`BufferedReader` and :class:`BufferedWriter`, and further supports
654 :meth:`seek` and :meth:`tell` functionality.
Georg Brandl014197c2008-04-09 18:40:51 +0000655
Benjamin Peterson2c5f8282008-04-13 00:27:46 +0000656 The constructor creates a reader and writer for a seekable raw stream, given
Georg Brandl014197c2008-04-09 18:40:51 +0000657 in the first argument. If the *buffer_size* is omitted it defaults to
Benjamin Peterson394ee002009-03-05 22:33:59 +0000658 :data:`DEFAULT_BUFFER_SIZE`.
659
Georg Brandl014197c2008-04-09 18:40:51 +0000660 :class:`BufferedRandom` is capable of anything :class:`BufferedReader` or
661 :class:`BufferedWriter` can do.
662
663
Antoine Pitrou13d28952011-08-20 19:48:43 +0200664.. class:: BufferedRWPair(reader, writer, buffer_size=DEFAULT_BUFFER_SIZE)
665
666 A buffered I/O object combining two unidirectional :class:`RawIOBase`
667 objects -- one readable, the other writeable -- into a single bidirectional
668 endpoint. It inherits :class:`BufferedIOBase`.
669
670 *reader* and *writer* are :class:`RawIOBase` objects that are readable and
671 writeable respectively. If the *buffer_size* is omitted it defaults to
672 :data:`DEFAULT_BUFFER_SIZE`.
673
Antoine Pitrou13d28952011-08-20 19:48:43 +0200674 :class:`BufferedRWPair` implements all of :class:`BufferedIOBase`\'s methods
675 except for :meth:`~BufferedIOBase.detach`, which raises
676 :exc:`UnsupportedOperation`.
677
678 .. warning::
679 :class:`BufferedRWPair` does not attempt to synchronize accesses to
680 its underlying raw streams. You should not pass it the same object
681 as reader and writer; use :class:`BufferedRandom` instead.
682
683
Georg Brandl014197c2008-04-09 18:40:51 +0000684Text I/O
Antoine Pitroub530e142010-08-30 12:41:00 +0000685^^^^^^^^
Georg Brandl014197c2008-04-09 18:40:51 +0000686
687.. class:: TextIOBase
688
689 Base class for text streams. This class provides a character and line based
690 interface to stream I/O. There is no :meth:`readinto` method because
691 Python's character strings are immutable. It inherits :class:`IOBase`.
692 There is no public constructor.
693
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000694 :class:`TextIOBase` provides or overrides these data attributes and
695 methods in addition to those from :class:`IOBase`:
Georg Brandl014197c2008-04-09 18:40:51 +0000696
697 .. attribute:: encoding
698
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000699 The name of the encoding used to decode the stream's bytes into
Georg Brandl014197c2008-04-09 18:40:51 +0000700 strings, and to encode strings into bytes.
701
Benjamin Peterson0926ad12009-06-06 18:02:12 +0000702 .. attribute:: errors
703
704 The error setting of the decoder or encoder.
705
Georg Brandl014197c2008-04-09 18:40:51 +0000706 .. attribute:: newlines
707
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000708 A string, a tuple of strings, or ``None``, indicating the newlines
Antoine Pitrou497a7672009-09-17 17:18:01 +0000709 translated so far. Depending on the implementation and the initial
710 constructor flags, this may not be available.
Georg Brandl014197c2008-04-09 18:40:51 +0000711
Benjamin Petersonc609b6b2009-06-28 17:32:20 +0000712 .. attribute:: buffer
713
714 The underlying binary buffer (a :class:`BufferedIOBase` instance) that
715 :class:`TextIOBase` deals with. This is not part of the
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300716 :class:`TextIOBase` API and may not exist in some implementations.
Benjamin Petersonc609b6b2009-06-28 17:32:20 +0000717
Benjamin Petersond2e0c792009-05-01 20:40:59 +0000718 .. method:: detach()
719
Antoine Pitrou497a7672009-09-17 17:18:01 +0000720 Separate the underlying binary buffer from the :class:`TextIOBase` and
721 return it.
Benjamin Petersond2e0c792009-05-01 20:40:59 +0000722
723 After the underlying buffer has been detached, the :class:`TextIOBase` is
724 in an unusable state.
725
726 Some :class:`TextIOBase` implementations, like :class:`StringIO`, may not
727 have the concept of an underlying buffer and calling this method will
728 raise :exc:`UnsupportedOperation`.
729
Benjamin Petersonedc36472009-05-01 20:48:14 +0000730 .. versionadded:: 3.1
731
Georg Brandl014197c2008-04-09 18:40:51 +0000732 .. method:: read(n)
733
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000734 Read and return at most *n* characters from the stream as a single
Antoine Pitrou497a7672009-09-17 17:18:01 +0000735 :class:`str`. If *n* is negative or ``None``, reads until EOF.
Georg Brandl014197c2008-04-09 18:40:51 +0000736
Antoine Pitrou707bd4e2012-07-25 22:38:33 +0200737 .. method:: readline(limit=-1)
Georg Brandl014197c2008-04-09 18:40:51 +0000738
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000739 Read until newline or EOF and return a single ``str``. If the stream is
740 already at EOF, an empty string is returned.
Georg Brandl014197c2008-04-09 18:40:51 +0000741
Antoine Pitrou707bd4e2012-07-25 22:38:33 +0200742 If *limit* is specified, at most *limit* characters will be read.
743
Antoine Pitrouf49d1522012-01-21 20:20:49 +0100744 .. method:: seek(offset, whence=SEEK_SET)
745
746 Change the stream position to the given *offset*. Behaviour depends
747 on the *whence* parameter:
748
749 * :data:`SEEK_SET` or ``0``: seek from the start of the stream
750 (the default); *offset* must either be a number returned by
751 :meth:`TextIOBase.tell`, or zero. Any other *offset* value
752 produces undefined behaviour.
753 * :data:`SEEK_CUR` or ``1``: "seek" to the current position;
754 *offset* must be zero, which is a no-operation (all other values
755 are unsupported).
756 * :data:`SEEK_END` or ``2``: seek to the end of the stream;
757 *offset* must be zero (all other values are unsupported).
758
759 Return the new absolute position as an opaque number.
760
761 .. versionadded:: 3.1
762 The ``SEEK_*`` constants.
763
764 .. method:: tell()
765
766 Return the current stream position as an opaque number. The number
767 does not usually represent a number of bytes in the underlying
768 binary storage.
769
Georg Brandl014197c2008-04-09 18:40:51 +0000770 .. method:: write(s)
771
Christian Heimes81ee3ef2008-05-04 22:42:01 +0000772 Write the string *s* to the stream and return the number of characters
773 written.
Georg Brandl014197c2008-04-09 18:40:51 +0000774
775
Antoine Pitrou664091b2011-07-23 22:00:03 +0200776.. class:: TextIOWrapper(buffer, encoding=None, errors=None, newline=None, \
777 line_buffering=False, write_through=False)
Georg Brandl014197c2008-04-09 18:40:51 +0000778
Antoine Pitrou497a7672009-09-17 17:18:01 +0000779 A buffered text stream over a :class:`BufferedIOBase` binary stream.
Georg Brandl014197c2008-04-09 18:40:51 +0000780 It inherits :class:`TextIOBase`.
781
782 *encoding* gives the name of the encoding that the stream will be decoded or
Andrew Svetlov4805fa82012-08-13 22:11:14 +0300783 encoded with. It defaults to
784 :func:`locale.getpreferredencoding(False) <locale.getpreferredencoding>`.
Georg Brandl014197c2008-04-09 18:40:51 +0000785
Benjamin Petersonb85a5842008-04-13 21:39:58 +0000786 *errors* is an optional string that specifies how encoding and decoding
787 errors are to be handled. Pass ``'strict'`` to raise a :exc:`ValueError`
788 exception if there is an encoding error (the default of ``None`` has the same
789 effect), or pass ``'ignore'`` to ignore errors. (Note that ignoring encoding
790 errors can lead to data loss.) ``'replace'`` causes a replacement marker
Christian Heimesa342c012008-04-20 21:01:16 +0000791 (such as ``'?'``) to be inserted where there is malformed data. When
792 writing, ``'xmlcharrefreplace'`` (replace with the appropriate XML character
793 reference) or ``'backslashreplace'`` (replace with backslashed escape
794 sequences) can be used. Any other error handling name that has been
795 registered with :func:`codecs.register_error` is also valid.
Georg Brandl014197c2008-04-09 18:40:51 +0000796
R David Murray1b00f252012-08-15 10:43:58 -0400797 .. index::
798 single: universal newlines; io.TextIOWrapper class
799
Antoine Pitrou0c1c0d42012-08-04 00:55:38 +0200800 *newline* controls how line endings are handled. It can be ``None``,
801 ``''``, ``'\n'``, ``'\r'``, and ``'\r\n'``. It works as follows:
802
R David Murray1b00f252012-08-15 10:43:58 -0400803 * When reading input from the stream, if *newline* is ``None``,
R David Murrayee0a9452012-08-15 11:05:36 -0400804 :term:`universal newlines` mode is enabled. Lines in the input can end in
805 ``'\n'``, ``'\r'``, or ``'\r\n'``, and these are translated into ``'\n'``
806 before being returned to the caller. If it is ``''``, universal newlines
807 mode is enabled, but line endings are returned to the caller untranslated.
808 If it has any of the other legal values, input lines are only terminated
809 by the given string, and the line ending is returned to the caller
810 untranslated.
Antoine Pitrou0c1c0d42012-08-04 00:55:38 +0200811
Georg Brandl296d1be2012-08-14 09:39:07 +0200812 * When writing output to the stream, if *newline* is ``None``, any ``'\n'``
813 characters written are translated to the system default line separator,
814 :data:`os.linesep`. If *newline* is ``''`` or ``'\n'``, no translation
815 takes place. If *newline* is any of the other legal values, any ``'\n'``
816 characters written are translated to the given string.
Georg Brandl014197c2008-04-09 18:40:51 +0000817
818 If *line_buffering* is ``True``, :meth:`flush` is implied when a call to
819 write contains a newline character.
820
Antoine Pitrou664091b2011-07-23 22:00:03 +0200821 If *write_through* is ``True``, calls to :meth:`write` are guaranteed
822 not to be buffered: any data written on the :class:`TextIOWrapper`
823 object is immediately handled to its underlying binary *buffer*.
824
825 .. versionchanged:: 3.3
826 The *write_through* argument has been added.
827
Victor Stinnerf86a5e82012-06-05 13:43:22 +0200828 .. versionchanged:: 3.3
829 The default *encoding* is now ``locale.getpreferredencoding(False)``
830 instead of ``locale.getpreferredencoding()``. Don't change temporary the
831 locale encoding using :func:`locale.setlocale`, use the current locale
832 encoding instead of the user preferred encoding.
833
Benjamin Peterson0926ad12009-06-06 18:02:12 +0000834 :class:`TextIOWrapper` provides one attribute in addition to those of
Georg Brandl014197c2008-04-09 18:40:51 +0000835 :class:`TextIOBase` and its parents:
836
Georg Brandl014197c2008-04-09 18:40:51 +0000837 .. attribute:: line_buffering
838
839 Whether line buffering is enabled.
Georg Brandl48310cd2009-01-03 21:18:54 +0000840
Georg Brandl014197c2008-04-09 18:40:51 +0000841
Georg Brandl3dd33882009-06-01 17:35:27 +0000842.. class:: StringIO(initial_value='', newline=None)
Georg Brandl014197c2008-04-09 18:40:51 +0000843
Antoine Pitroub530e142010-08-30 12:41:00 +0000844 An in-memory stream for text I/O.
Georg Brandl014197c2008-04-09 18:40:51 +0000845
Benjamin Petersonaa1c8d82009-03-09 02:02:23 +0000846 The initial value of the buffer (an empty string by default) can be set by
847 providing *initial_value*. The *newline* argument works like that of
848 :class:`TextIOWrapper`. The default is to do no newline translation.
Georg Brandl014197c2008-04-09 18:40:51 +0000849
Mark Summerfielde6d5f302008-04-21 10:29:45 +0000850 :class:`StringIO` provides this method in addition to those from
Antoine Pitroub530e142010-08-30 12:41:00 +0000851 :class:`TextIOBase` and its parents:
Georg Brandl014197c2008-04-09 18:40:51 +0000852
853 .. method:: getvalue()
854
Georg Brandl2932d932008-05-30 06:27:09 +0000855 Return a ``str`` containing the entire contents of the buffer at any
856 time before the :class:`StringIO` object's :meth:`close` method is
857 called.
Georg Brandl014197c2008-04-09 18:40:51 +0000858
Georg Brandl2932d932008-05-30 06:27:09 +0000859 Example usage::
860
861 import io
862
863 output = io.StringIO()
864 output.write('First line.\n')
865 print('Second line.', file=output)
866
867 # Retrieve file contents -- this will be
868 # 'First line.\nSecond line.\n'
869 contents = output.getvalue()
870
Georg Brandl48310cd2009-01-03 21:18:54 +0000871 # Close object and discard memory buffer --
Georg Brandl2932d932008-05-30 06:27:09 +0000872 # .getvalue() will now raise an exception.
873 output.close()
Georg Brandl014197c2008-04-09 18:40:51 +0000874
Antoine Pitroub530e142010-08-30 12:41:00 +0000875
R David Murray1b00f252012-08-15 10:43:58 -0400876.. index::
877 single: universal newlines; io.IncrementalNewlineDecoder class
878
Georg Brandl014197c2008-04-09 18:40:51 +0000879.. class:: IncrementalNewlineDecoder
880
R David Murray1b00f252012-08-15 10:43:58 -0400881 A helper codec that decodes newlines for :term:`universal newlines` mode.
882 It inherits :class:`codecs.IncrementalDecoder`.
Georg Brandl014197c2008-04-09 18:40:51 +0000883
Antoine Pitroubed81c82010-12-03 19:14:17 +0000884
Antoine Pitroubed81c82010-12-03 19:14:17 +0000885Performance
Benjamin Petersonedf51322011-02-24 03:03:46 +0000886-----------
887
888This section discusses the performance of the provided concrete I/O
889implementations.
Antoine Pitroubed81c82010-12-03 19:14:17 +0000890
891Binary I/O
Benjamin Petersonedf51322011-02-24 03:03:46 +0000892^^^^^^^^^^
Antoine Pitroubed81c82010-12-03 19:14:17 +0000893
Benjamin Petersonedf51322011-02-24 03:03:46 +0000894By reading and writing only large chunks of data even when the user asks for a
895single byte, buffered I/O hides any inefficiency in calling and executing the
896operating system's unbuffered I/O routines. The gain depends on the OS and the
897kind of I/O which is performed. For example, on some modern OSes such as Linux,
898unbuffered disk I/O can be as fast as buffered I/O. The bottom line, however,
899is that buffered I/O offers predictable performance regardless of the platform
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300900and the backing device. Therefore, it is almost always preferable to use
901buffered I/O rather than unbuffered I/O for binary data.
Antoine Pitroubed81c82010-12-03 19:14:17 +0000902
903Text I/O
Benjamin Petersonedf51322011-02-24 03:03:46 +0000904^^^^^^^^
Antoine Pitroubed81c82010-12-03 19:14:17 +0000905
906Text I/O over a binary storage (such as a file) is significantly slower than
Benjamin Petersonedf51322011-02-24 03:03:46 +0000907binary I/O over the same storage, because it requires conversions between
908unicode and binary data using a character codec. This can become noticeable
909handling huge amounts of text data like large log files. Also,
910:meth:`TextIOWrapper.tell` and :meth:`TextIOWrapper.seek` are both quite slow
911due to the reconstruction algorithm used.
Antoine Pitroubed81c82010-12-03 19:14:17 +0000912
913:class:`StringIO`, however, is a native in-memory unicode container and will
914exhibit similar speed to :class:`BytesIO`.
915
916Multi-threading
917^^^^^^^^^^^^^^^
918
Benjamin Petersonedf51322011-02-24 03:03:46 +0000919:class:`FileIO` objects are thread-safe to the extent that the operating system
920calls (such as ``read(2)`` under Unix) they wrap are thread-safe too.
Antoine Pitroubed81c82010-12-03 19:14:17 +0000921
922Binary buffered objects (instances of :class:`BufferedReader`,
923:class:`BufferedWriter`, :class:`BufferedRandom` and :class:`BufferedRWPair`)
924protect their internal structures using a lock; it is therefore safe to call
925them from multiple threads at once.
926
927:class:`TextIOWrapper` objects are not thread-safe.
928
929Reentrancy
930^^^^^^^^^^
931
932Binary buffered objects (instances of :class:`BufferedReader`,
933:class:`BufferedWriter`, :class:`BufferedRandom` and :class:`BufferedRWPair`)
934are not reentrant. While reentrant calls will not happen in normal situations,
Benjamin Petersonedf51322011-02-24 03:03:46 +0000935they can arise from doing I/O in a :mod:`signal` handler. If a thread tries to
Eli Benderskyf877a7c2012-07-14 21:22:25 +0300936re-enter a buffered object which it is already accessing, a :exc:`RuntimeError`
937is raised. Note this doesn't prohibit a different thread from entering the
Benjamin Petersonedf51322011-02-24 03:03:46 +0000938buffered object.
Antoine Pitroubed81c82010-12-03 19:14:17 +0000939
Benjamin Petersonedf51322011-02-24 03:03:46 +0000940The above implicitly extends to text files, since the :func:`open()` function
941will wrap a buffered object inside a :class:`TextIOWrapper`. This includes
942standard streams and therefore affects the built-in function :func:`print()` as
943well.
Antoine Pitroubed81c82010-12-03 19:14:17 +0000944