blob: 9fe7a3589a7ba1618442c729a671aa6307785e30 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`textwrap` --- Text wrapping and filling
2=============================================
3
4.. module:: textwrap
5 :synopsis: Text wrapping and filling
6.. moduleauthor:: Greg Ward <gward@python.net>
7.. sectionauthor:: Greg Ward <gward@python.net>
8
Raymond Hettinger10480942011-01-10 03:26:08 +00009**Source code:** :source:`Lib/textwrap.py`
Georg Brandl116aa622007-08-15 14:28:22 +000010
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000011--------------
12
Antoine Pitrou389dec82013-08-12 22:39:09 +020013The :mod:`textwrap` module provides some convenience functions,
14as well as :class:`TextWrapper`, the class that does all the work.
15If you're just wrapping or filling one or two text strings, the convenience
16functions should be good enough; otherwise, you should use an instance of
17:class:`TextWrapper` for efficiency.
Georg Brandl116aa622007-08-15 14:28:22 +000018
Georg Brandl7f01a132009-09-16 15:58:14 +000019.. function:: wrap(text, width=70, **kwargs)
Georg Brandl116aa622007-08-15 14:28:22 +000020
Georg Brandl7f01a132009-09-16 15:58:14 +000021 Wraps the single paragraph in *text* (a string) so every line is at most
22 *width* characters long. Returns a list of output lines, without final
23 newlines.
Georg Brandl116aa622007-08-15 14:28:22 +000024
25 Optional keyword arguments correspond to the instance attributes of
26 :class:`TextWrapper`, documented below. *width* defaults to ``70``.
27
R David Murray1585b702012-09-08 13:13:25 -040028 See the :meth:`TextWrapper.wrap` method for additional details on how
29 :func:`wrap` behaves.
30
Georg Brandl116aa622007-08-15 14:28:22 +000031
Georg Brandl7f01a132009-09-16 15:58:14 +000032.. function:: fill(text, width=70, **kwargs)
Georg Brandl116aa622007-08-15 14:28:22 +000033
34 Wraps the single paragraph in *text*, and returns a single string containing the
35 wrapped paragraph. :func:`fill` is shorthand for ::
36
37 "\n".join(wrap(text, ...))
38
39 In particular, :func:`fill` accepts exactly the same keyword arguments as
40 :func:`wrap`.
41
Georg Brandl116aa622007-08-15 14:28:22 +000042
Larry Hastings3732ed22014-03-15 21:13:56 -070043.. function:: shorten(text, width, **kwargs)
Alexandre Vassalotti5f8ced22008-05-16 00:03:33 +000044
Larry Hastings3732ed22014-03-15 21:13:56 -070045 Collapse and truncate the given *text* to fit in the given *width*.
Antoine Pitrou389dec82013-08-12 22:39:09 +020046
Larry Hastings3732ed22014-03-15 21:13:56 -070047 First the whitespace in *text* is collapsed (all whitespace is replaced by
48 single spaces). If the result fits in the *width*, it is returned.
49 Otherwise, enough words are dropped from the end so that the remaining words
50 plus the :attr:`placeholder` fit within :attr:`width`::
Antoine Pitrou389dec82013-08-12 22:39:09 +020051
52 >>> textwrap.shorten("Hello world!", width=12)
53 'Hello world!'
54 >>> textwrap.shorten("Hello world!", width=11)
Antoine Pitrouc5930562013-08-16 22:31:12 +020055 'Hello [...]'
Antoine Pitrou389dec82013-08-12 22:39:09 +020056 >>> textwrap.shorten("Hello world", width=10, placeholder="...")
57 'Hello...'
58
Larry Hastings3732ed22014-03-15 21:13:56 -070059 Optional keyword arguments correspond to the instance attributes of
60 :class:`TextWrapper`, documented below. Note that the whitespace is
61 collapsed before the text is passed to the :class:`TextWrapper` :meth:`fill`
62 function, so changing the value of :attr:`.tabsize`, :attr:`.expand_tabs`,
63 :attr:`.drop_whitespace`, and :attr:`.replace_whitespace` will have no effect.
64
Antoine Pitrou389dec82013-08-12 22:39:09 +020065 .. versionadded:: 3.4
66
Georg Brandl116aa622007-08-15 14:28:22 +000067
68.. function:: dedent(text)
69
70 Remove any common leading whitespace from every line in *text*.
71
72 This can be used to make triple-quoted strings line up with the left edge of the
73 display, while still presenting them in the source code in indented form.
74
75 Note that tabs and spaces are both treated as whitespace, but they are not
76 equal: the lines ``" hello"`` and ``"\thello"`` are considered to have no
Georg Brandle6bcc912008-05-12 18:05:20 +000077 common leading whitespace.
Georg Brandl116aa622007-08-15 14:28:22 +000078
79 For example::
80
81 def test():
82 # end first line with \ to avoid the empty line!
83 s = '''\
84 hello
85 world
86 '''
Collin Winterc79461b2007-09-01 23:34:30 +000087 print(repr(s)) # prints ' hello\n world\n '
88 print(repr(dedent(s))) # prints 'hello\n world\n'
Georg Brandl116aa622007-08-15 14:28:22 +000089
90
Nick Coghlan4fae8cd2012-06-11 23:07:51 +100091.. function:: indent(text, prefix, predicate=None)
92
93 Add *prefix* to the beginning of selected lines in *text*.
94
95 Lines are separated by calling ``text.splitlines(True)``.
96
97 By default, *prefix* is added to all lines that do not consist
98 solely of whitespace (including any line endings).
99
100 For example::
101
102 >>> s = 'hello\n\n \nworld'
103 >>> indent(s, ' ')
104 ' hello\n\n \n world'
105
106 The optional *predicate* argument can be used to control which lines
107 are indented. For example, it is easy to add *prefix* to even empty
108 and whitespace-only lines::
109
110 >>> print(indent(s, '+ ', lambda line: True))
111 + hello
112 +
113 +
114 + world
115
Raymond Hettingerdc69e032014-11-05 21:27:56 -0800116 .. versionadded:: 3.3
117
Nick Coghlan4fae8cd2012-06-11 23:07:51 +1000118
Antoine Pitrou389dec82013-08-12 22:39:09 +0200119:func:`wrap`, :func:`fill` and :func:`shorten` work by creating a
120:class:`TextWrapper` instance and calling a single method on it. That
121instance is not reused, so for applications that process many text
Larry Hastings3732ed22014-03-15 21:13:56 -0700122strings using :func:`wrap` and/or :func:`fill`, it may be more efficient to
123create your own :class:`TextWrapper` object.
Antoine Pitrou389dec82013-08-12 22:39:09 +0200124
125Text is preferably wrapped on whitespaces and right after the hyphens in
126hyphenated words; only then will long words be broken if necessary, unless
127:attr:`TextWrapper.break_long_words` is set to false.
128
Georg Brandl7f01a132009-09-16 15:58:14 +0000129.. class:: TextWrapper(**kwargs)
Georg Brandl116aa622007-08-15 14:28:22 +0000130
131 The :class:`TextWrapper` constructor accepts a number of optional keyword
Georg Brandl7f01a132009-09-16 15:58:14 +0000132 arguments. Each keyword argument corresponds to an instance attribute, so
133 for example ::
Georg Brandl116aa622007-08-15 14:28:22 +0000134
135 wrapper = TextWrapper(initial_indent="* ")
136
137 is the same as ::
138
139 wrapper = TextWrapper()
140 wrapper.initial_indent = "* "
141
142 You can re-use the same :class:`TextWrapper` object many times, and you can
143 change any of its options through direct assignment to instance attributes
144 between uses.
145
Benjamin Petersone41251e2008-04-25 01:59:09 +0000146 The :class:`TextWrapper` instance attributes (and keyword arguments to the
147 constructor) are as follows:
Georg Brandl116aa622007-08-15 14:28:22 +0000148
149
Benjamin Petersone41251e2008-04-25 01:59:09 +0000150 .. attribute:: width
Georg Brandl116aa622007-08-15 14:28:22 +0000151
Benjamin Petersone41251e2008-04-25 01:59:09 +0000152 (default: ``70``) The maximum length of wrapped lines. As long as there
153 are no individual words in the input text longer than :attr:`width`,
154 :class:`TextWrapper` guarantees that no output line will be longer than
155 :attr:`width` characters.
Georg Brandl116aa622007-08-15 14:28:22 +0000156
157
Benjamin Petersone41251e2008-04-25 01:59:09 +0000158 .. attribute:: expand_tabs
Georg Brandl116aa622007-08-15 14:28:22 +0000159
Benjamin Petersone41251e2008-04-25 01:59:09 +0000160 (default: ``True``) If true, then all tab characters in *text* will be
161 expanded to spaces using the :meth:`expandtabs` method of *text*.
Georg Brandl116aa622007-08-15 14:28:22 +0000162
163
Hynek Schlawackd5272592012-05-19 13:33:11 +0200164 .. attribute:: tabsize
165
166 (default: ``8``) If :attr:`expand_tabs` is true, then all tab characters
167 in *text* will be expanded to zero or more spaces, depending on the
168 current column and the given tab size.
169
170 .. versionadded:: 3.3
171
172
Benjamin Petersone41251e2008-04-25 01:59:09 +0000173 .. attribute:: replace_whitespace
Georg Brandl116aa622007-08-15 14:28:22 +0000174
Andrew Svetlov59db4012012-08-13 23:22:23 +0300175 (default: ``True``) If true, after tab expansion but before wrapping,
176 the :meth:`wrap` method will replace each whitespace character
177 with a single space. The whitespace characters replaced are
178 as follows: tab, newline, vertical tab, formfeed, and carriage
179 return (``'\t\n\v\f\r'``).
Georg Brandl116aa622007-08-15 14:28:22 +0000180
Benjamin Petersone41251e2008-04-25 01:59:09 +0000181 .. note::
Georg Brandl116aa622007-08-15 14:28:22 +0000182
Benjamin Petersone41251e2008-04-25 01:59:09 +0000183 If :attr:`expand_tabs` is false and :attr:`replace_whitespace` is true,
184 each tab character will be replaced by a single space, which is *not*
185 the same as tab expansion.
Georg Brandl116aa622007-08-15 14:28:22 +0000186
Terry Reedy6d2ab712010-11-23 20:17:24 +0000187 .. note::
188
189 If :attr:`replace_whitespace` is false, newlines may appear in the
190 middle of a line and cause strange output. For this reason, text should
191 be split into paragraphs (using :meth:`str.splitlines` or similar)
192 which are wrapped separately.
193
Georg Brandl116aa622007-08-15 14:28:22 +0000194
Benjamin Petersone41251e2008-04-25 01:59:09 +0000195 .. attribute:: drop_whitespace
Georg Brandl116aa622007-08-15 14:28:22 +0000196
R David Murray1585b702012-09-08 13:13:25 -0400197 (default: ``True``) If true, whitespace at the beginning and ending of
198 every line (after wrapping but before indenting) is dropped.
199 Whitespace at the beginning of the paragraph, however, is not dropped
200 if non-whitespace follows it. If whitespace being dropped takes up an
201 entire line, the whole line is dropped.
Georg Brandl116aa622007-08-15 14:28:22 +0000202
Georg Brandl116aa622007-08-15 14:28:22 +0000203
Benjamin Petersone41251e2008-04-25 01:59:09 +0000204 .. attribute:: initial_indent
Georg Brandl116aa622007-08-15 14:28:22 +0000205
Benjamin Petersone41251e2008-04-25 01:59:09 +0000206 (default: ``''``) String that will be prepended to the first line of
R David Murray1585b702012-09-08 13:13:25 -0400207 wrapped output. Counts towards the length of the first line. The empty
208 string is not indented.
Georg Brandl116aa622007-08-15 14:28:22 +0000209
210
Benjamin Petersone41251e2008-04-25 01:59:09 +0000211 .. attribute:: subsequent_indent
Georg Brandl116aa622007-08-15 14:28:22 +0000212
Benjamin Petersone41251e2008-04-25 01:59:09 +0000213 (default: ``''``) String that will be prepended to all lines of wrapped
214 output except the first. Counts towards the length of each line except
215 the first.
Georg Brandl116aa622007-08-15 14:28:22 +0000216
217
Benjamin Petersone41251e2008-04-25 01:59:09 +0000218 .. attribute:: fix_sentence_endings
Georg Brandl116aa622007-08-15 14:28:22 +0000219
Benjamin Petersone41251e2008-04-25 01:59:09 +0000220 (default: ``False``) If true, :class:`TextWrapper` attempts to detect
221 sentence endings and ensure that sentences are always separated by exactly
222 two spaces. This is generally desired for text in a monospaced font.
223 However, the sentence detection algorithm is imperfect: it assumes that a
224 sentence ending consists of a lowercase letter followed by one of ``'.'``,
225 ``'!'``, or ``'?'``, possibly followed by one of ``'"'`` or ``"'"``,
226 followed by a space. One problem with this is algorithm is that it is
227 unable to detect the difference between "Dr." in ::
Georg Brandl116aa622007-08-15 14:28:22 +0000228
Benjamin Petersone41251e2008-04-25 01:59:09 +0000229 [...] Dr. Frankenstein's monster [...]
Georg Brandl116aa622007-08-15 14:28:22 +0000230
Benjamin Petersone41251e2008-04-25 01:59:09 +0000231 and "Spot." in ::
Georg Brandl116aa622007-08-15 14:28:22 +0000232
Benjamin Petersone41251e2008-04-25 01:59:09 +0000233 [...] See Spot. See Spot run [...]
Georg Brandl116aa622007-08-15 14:28:22 +0000234
Benjamin Petersone41251e2008-04-25 01:59:09 +0000235 :attr:`fix_sentence_endings` is false by default.
Georg Brandl116aa622007-08-15 14:28:22 +0000236
Benjamin Petersone41251e2008-04-25 01:59:09 +0000237 Since the sentence detection algorithm relies on ``string.lowercase`` for
238 the definition of "lowercase letter," and a convention of using two spaces
239 after a period to separate sentences on the same line, it is specific to
240 English-language texts.
Georg Brandl116aa622007-08-15 14:28:22 +0000241
242
Benjamin Petersone41251e2008-04-25 01:59:09 +0000243 .. attribute:: break_long_words
Georg Brandl116aa622007-08-15 14:28:22 +0000244
Benjamin Petersone41251e2008-04-25 01:59:09 +0000245 (default: ``True``) If true, then words longer than :attr:`width` will be
246 broken in order to ensure that no lines are longer than :attr:`width`. If
247 it is false, long words will not be broken, and some lines may be longer
248 than :attr:`width`. (Long words will be put on a line by themselves, in
249 order to minimize the amount by which :attr:`width` is exceeded.)
Georg Brandl116aa622007-08-15 14:28:22 +0000250
Alexandre Vassalotti5f8ced22008-05-16 00:03:33 +0000251
252 .. attribute:: break_on_hyphens
253
254 (default: ``True``) If true, wrapping will occur preferably on whitespaces
255 and right after hyphens in compound words, as it is customary in English.
256 If false, only whitespaces will be considered as potentially good places
257 for line breaks, but you need to set :attr:`break_long_words` to false if
258 you want truly insecable words. Default behaviour in previous versions
259 was to always allow breaking hyphenated words.
260
Alexandre Vassalotti5f8ced22008-05-16 00:03:33 +0000261
Serhiy Storchakaacc9f3f2013-10-15 21:22:54 +0300262 .. attribute:: max_lines
263
Larry Hastings3732ed22014-03-15 21:13:56 -0700264 (default: ``None``) If not ``None``, then the output will contain at most
265 *max_lines* lines, with *placeholder* appearing at the end of the output.
Serhiy Storchakaacc9f3f2013-10-15 21:22:54 +0300266
267 .. versionadded:: 3.4
268
269
270 .. attribute:: placeholder
271
Larry Hastings3732ed22014-03-15 21:13:56 -0700272 (default: ``' [...]'``) String that will appear at the end of the output
273 text if it has been truncated.
Serhiy Storchakaacc9f3f2013-10-15 21:22:54 +0300274
275 .. versionadded:: 3.4
276
277
Antoine Pitrou389dec82013-08-12 22:39:09 +0200278 :class:`TextWrapper` also provides some public methods, analogous to the
Benjamin Petersone41251e2008-04-25 01:59:09 +0000279 module-level convenience functions:
Georg Brandl116aa622007-08-15 14:28:22 +0000280
Benjamin Petersone41251e2008-04-25 01:59:09 +0000281 .. method:: wrap(text)
Georg Brandl116aa622007-08-15 14:28:22 +0000282
Benjamin Petersone41251e2008-04-25 01:59:09 +0000283 Wraps the single paragraph in *text* (a string) so every line is at most
284 :attr:`width` characters long. All wrapping options are taken from
R David Murray1585b702012-09-08 13:13:25 -0400285 instance attributes of the :class:`TextWrapper` instance. Returns a list
286 of output lines, without final newlines. If the wrapped output has no
287 content, the returned list is empty.
Georg Brandl116aa622007-08-15 14:28:22 +0000288
289
Benjamin Petersone41251e2008-04-25 01:59:09 +0000290 .. method:: fill(text)
Georg Brandl116aa622007-08-15 14:28:22 +0000291
Benjamin Petersone41251e2008-04-25 01:59:09 +0000292 Wraps the single paragraph in *text*, and returns a single string
293 containing the wrapped paragraph.