blob: 9174830fc139936e04169400dbd4aa777fdf9bac [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`fileinput` --- Iterate over lines from multiple input streams
2===================================================================
3
4.. module:: fileinput
5 :synopsis: Loop over standard input or a list of files.
6.. moduleauthor:: Guido van Rossum <guido@python.org>
7.. sectionauthor:: Fred L. Drake, Jr. <fdrake@acm.org>
8
9
Thomas Wouters1b7f8912007-09-19 03:06:30 +000010This module implements a helper class and functions to quickly write a
11loop over standard input or a list of files. If you just want to read or
12write one file see :func:`open`.
Georg Brandl116aa622007-08-15 14:28:22 +000013
14The typical use is::
15
16 import fileinput
17 for line in fileinput.input():
18 process(line)
19
20This iterates over the lines of all files listed in ``sys.argv[1:]``, defaulting
21to ``sys.stdin`` if the list is empty. If a filename is ``'-'``, it is also
22replaced by ``sys.stdin``. To specify an alternative list of filenames, pass it
Georg Brandl96593ed2007-09-07 14:15:41 +000023as the first argument to :func:`.input`. A single file name is also allowed.
Georg Brandl116aa622007-08-15 14:28:22 +000024
25All files are opened in text mode by default, but you can override this by
Georg Brandl96593ed2007-09-07 14:15:41 +000026specifying the *mode* parameter in the call to :func:`.input` or
Georg Brandl6cb7b652010-07-31 20:08:15 +000027:class:`FileInput`. If an I/O error occurs during opening or reading a file,
Georg Brandl116aa622007-08-15 14:28:22 +000028:exc:`IOError` is raised.
29
30If ``sys.stdin`` is used more than once, the second and further use will return
31no lines, except perhaps for interactive use, or if it has been explicitly reset
32(e.g. using ``sys.stdin.seek(0)``).
33
34Empty files are opened and immediately closed; the only time their presence in
35the list of filenames is noticeable at all is when the last file opened is
36empty.
37
38Lines are returned with any newlines intact, which means that the last line in
39a file may not have one.
40
41You can control how files are opened by providing an opening hook via the
42*openhook* parameter to :func:`fileinput.input` or :class:`FileInput()`. The
43hook must be a function that takes two arguments, *filename* and *mode*, and
44returns an accordingly opened file-like object. Two useful hooks are already
45provided by this module.
46
Éric Araujo6e6cb8e2010-11-16 19:13:50 +000047.. seealso::
48
49 Latest version of the :source:`fileinput Python source code
50 <Lib/fileinput.py>`
51
Georg Brandl116aa622007-08-15 14:28:22 +000052The following function is the primary interface of this module:
53
54
Georg Brandl71515ca2009-05-17 12:29:12 +000055.. function:: input(files=None, inplace=False, backup='', bufsize=0, mode='r', openhook=None)
Georg Brandl116aa622007-08-15 14:28:22 +000056
57 Create an instance of the :class:`FileInput` class. The instance will be used
58 as global state for the functions of this module, and is also returned to use
59 during iteration. The parameters to this function will be passed along to the
60 constructor of the :class:`FileInput` class.
61
Georg Brandl6cb7b652010-07-31 20:08:15 +000062 The :class:`FileInput` instance can be used as a context manager in the
63 :keyword:`with` statement. In this example, *input* is closed after the
64 :keyword:`with` statement is exited, even if an exception occurs::
65
Raymond Hettinger7fefaff2010-09-05 23:50:32 +000066 with fileinput.input(files=('spam.txt', 'eggs.txt')) as f:
67 for line in f:
68 process(line)
Georg Brandl6cb7b652010-07-31 20:08:15 +000069
70 .. versionchanged:: 3.2
71 Can be used as a context manager.
72
Georg Brandl116aa622007-08-15 14:28:22 +000073
74The following functions use the global state created by :func:`fileinput.input`;
75if there is no active state, :exc:`RuntimeError` is raised.
76
77
78.. function:: filename()
79
80 Return the name of the file currently being read. Before the first line has
81 been read, returns ``None``.
82
83
84.. function:: fileno()
85
86 Return the integer "file descriptor" for the current file. When no file is
87 opened (before the first line and between files), returns ``-1``.
88
Georg Brandl116aa622007-08-15 14:28:22 +000089
90.. function:: lineno()
91
92 Return the cumulative line number of the line that has just been read. Before
93 the first line has been read, returns ``0``. After the last line of the last
94 file has been read, returns the line number of that line.
95
96
97.. function:: filelineno()
98
99 Return the line number in the current file. Before the first line has been
100 read, returns ``0``. After the last line of the last file has been read,
101 returns the line number of that line within the file.
102
103
104.. function:: isfirstline()
105
106 Returns true if the line just read is the first line of its file, otherwise
107 returns false.
108
109
110.. function:: isstdin()
111
112 Returns true if the last line was read from ``sys.stdin``, otherwise returns
113 false.
114
115
116.. function:: nextfile()
117
118 Close the current file so that the next iteration will read the first line from
119 the next file (if any); lines not read from the file will not count towards the
120 cumulative line count. The filename is not changed until after the first line
121 of the next file has been read. Before the first line has been read, this
122 function has no effect; it cannot be used to skip the first file. After the
123 last line of the last file has been read, this function has no effect.
124
125
126.. function:: close()
127
128 Close the sequence.
129
130The class which implements the sequence behavior provided by the module is
131available for subclassing as well:
132
133
Georg Brandl71515ca2009-05-17 12:29:12 +0000134.. class:: FileInput(files=None, inplace=False, backup='', bufsize=0, mode='r', openhook=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000135
136 Class :class:`FileInput` is the implementation; its methods :meth:`filename`,
137 :meth:`fileno`, :meth:`lineno`, :meth:`filelineno`, :meth:`isfirstline`,
138 :meth:`isstdin`, :meth:`nextfile` and :meth:`close` correspond to the functions
139 of the same name in the module. In addition it has a :meth:`readline` method
140 which returns the next input line, and a :meth:`__getitem__` method which
141 implements the sequence behavior. The sequence must be accessed in strictly
142 sequential order; random access and :meth:`readline` cannot be mixed.
143
144 With *mode* you can specify which file mode will be passed to :func:`open`. It
145 must be one of ``'r'``, ``'rU'``, ``'U'`` and ``'rb'``.
146
147 The *openhook*, when given, must be a function that takes two arguments,
148 *filename* and *mode*, and returns an accordingly opened file-like object. You
149 cannot use *inplace* and *openhook* together.
150
Georg Brandl6cb7b652010-07-31 20:08:15 +0000151 A :class:`FileInput` instance can be used as a context manager in the
152 :keyword:`with` statement. In this example, *input* is closed after the
153 :keyword:`with` statement is exited, even if an exception occurs::
Georg Brandl116aa622007-08-15 14:28:22 +0000154
Georg Brandl6cb7b652010-07-31 20:08:15 +0000155 with FileInput(files=('spam.txt', 'eggs.txt')) as input:
156 process(input)
157
158 .. versionchanged:: 3.2
159 Can be used as a context manager.
160
161
162**Optional in-place filtering:** if the keyword argument ``inplace=True`` is
163passed to :func:`fileinput.input` or to the :class:`FileInput` constructor, the
164file is moved to a backup file and standard output is directed to the input file
165(if a file of the same name as the backup file already exists, it will be
166replaced silently). This makes it possible to write a filter that rewrites its
167input file in place. If the *backup* parameter is given (typically as
Georg Brandl116aa622007-08-15 14:28:22 +0000168``backup='.<some extension>'``), it specifies the extension for the backup file,
169and the backup file remains around; by default, the extension is ``'.bak'`` and
170it is deleted when the output file is closed. In-place filtering is disabled
171when standard input is read.
172
Georg Brandle720c0a2009-04-27 16:20:50 +0000173.. note::
Georg Brandl48310cd2009-01-03 21:18:54 +0000174
Guido van Rossumda27fd22007-08-17 00:24:54 +0000175 The current implementation does not work for MS-DOS 8+3 filesystems.
176
Georg Brandl116aa622007-08-15 14:28:22 +0000177
178The two following opening hooks are provided by this module:
179
Georg Brandl116aa622007-08-15 14:28:22 +0000180.. function:: hook_compressed(filename, mode)
181
182 Transparently opens files compressed with gzip and bzip2 (recognized by the
183 extensions ``'.gz'`` and ``'.bz2'``) using the :mod:`gzip` and :mod:`bz2`
184 modules. If the filename extension is not ``'.gz'`` or ``'.bz2'``, the file is
185 opened normally (ie, using :func:`open` without any decompression).
186
187 Usage example: ``fi = fileinput.FileInput(openhook=fileinput.hook_compressed)``
188
Georg Brandl116aa622007-08-15 14:28:22 +0000189
190.. function:: hook_encoded(encoding)
191
192 Returns a hook which opens each file with :func:`codecs.open`, using the given
193 *encoding* to read the file.
194
195 Usage example: ``fi =
196 fileinput.FileInput(openhook=fileinput.hook_encoded("iso-8859-1"))``