| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 1 | :mod:`string` --- Common string operations | 
|  | 2 | ========================================== | 
|  | 3 |  | 
|  | 4 | .. module:: string | 
|  | 5 | :synopsis: Common string operations. | 
|  | 6 |  | 
| Éric Araujo | 19f9b71 | 2011-08-19 00:49:18 +0200 | [diff] [blame] | 7 | **Source code:** :source:`Lib/string.py` | 
|  | 8 |  | 
|  | 9 | -------------- | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 10 |  | 
| Éric Araujo | 6e6cb8e | 2010-11-16 19:13:50 +0000 | [diff] [blame] | 11 | .. seealso:: | 
|  | 12 |  | 
| Ezio Melotti | a6229e6 | 2012-10-12 10:59:14 +0300 | [diff] [blame] | 13 | :ref:`textseq` | 
| Georg Brandl | b30f330 | 2011-01-06 09:23:56 +0000 | [diff] [blame] | 14 |  | 
|  | 15 | :ref:`string-methods` | 
|  | 16 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 17 | String constants | 
|  | 18 | ---------------- | 
|  | 19 |  | 
|  | 20 | The constants defined in this module are: | 
|  | 21 |  | 
|  | 22 |  | 
|  | 23 | .. data:: ascii_letters | 
|  | 24 |  | 
|  | 25 | The concatenation of the :const:`ascii_lowercase` and :const:`ascii_uppercase` | 
|  | 26 | constants described below.  This value is not locale-dependent. | 
|  | 27 |  | 
|  | 28 |  | 
|  | 29 | .. data:: ascii_lowercase | 
|  | 30 |  | 
|  | 31 | The lowercase letters ``'abcdefghijklmnopqrstuvwxyz'``.  This value is not | 
|  | 32 | locale-dependent and will not change. | 
|  | 33 |  | 
|  | 34 |  | 
|  | 35 | .. data:: ascii_uppercase | 
|  | 36 |  | 
|  | 37 | The uppercase letters ``'ABCDEFGHIJKLMNOPQRSTUVWXYZ'``.  This value is not | 
|  | 38 | locale-dependent and will not change. | 
|  | 39 |  | 
|  | 40 |  | 
|  | 41 | .. data:: digits | 
|  | 42 |  | 
|  | 43 | The string ``'0123456789'``. | 
|  | 44 |  | 
|  | 45 |  | 
|  | 46 | .. data:: hexdigits | 
|  | 47 |  | 
|  | 48 | The string ``'0123456789abcdefABCDEF'``. | 
|  | 49 |  | 
|  | 50 |  | 
|  | 51 | .. data:: octdigits | 
|  | 52 |  | 
|  | 53 | The string ``'01234567'``. | 
|  | 54 |  | 
|  | 55 |  | 
|  | 56 | .. data:: punctuation | 
|  | 57 |  | 
|  | 58 | String of ASCII characters which are considered punctuation characters | 
|  | 59 | in the ``C`` locale. | 
|  | 60 |  | 
|  | 61 |  | 
|  | 62 | .. data:: printable | 
|  | 63 |  | 
|  | 64 | String of ASCII characters which are considered printable.  This is a | 
|  | 65 | combination of :const:`digits`, :const:`ascii_letters`, :const:`punctuation`, | 
|  | 66 | and :const:`whitespace`. | 
|  | 67 |  | 
|  | 68 |  | 
|  | 69 | .. data:: whitespace | 
|  | 70 |  | 
| Georg Brandl | 5076740 | 2008-11-22 08:31:09 +0000 | [diff] [blame] | 71 | A string containing all ASCII characters that are considered whitespace. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 72 | This includes the characters space, tab, linefeed, return, formfeed, and | 
|  | 73 | vertical tab. | 
|  | 74 |  | 
|  | 75 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 76 | .. _string-formatting: | 
|  | 77 |  | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 78 | Custom String Formatting | 
|  | 79 | ------------------------ | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 80 |  | 
| Benjamin Peterson | 50923f9 | 2008-05-25 19:45:17 +0000 | [diff] [blame] | 81 | The built-in string class provides the ability to do complex variable | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 82 | substitutions and value formatting via the :meth:`~str.format` method described in | 
| Benjamin Peterson | 50923f9 | 2008-05-25 19:45:17 +0000 | [diff] [blame] | 83 | :pep:`3101`.  The :class:`Formatter` class in the :mod:`string` module allows | 
|  | 84 | you to create and customize your own string formatting behaviors using the same | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 85 | implementation as the built-in :meth:`~str.format` method. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 86 |  | 
| Benjamin Peterson | 1baf465 | 2009-12-31 03:11:23 +0000 | [diff] [blame] | 87 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 88 | .. class:: Formatter | 
|  | 89 |  | 
|  | 90 | The :class:`Formatter` class has the following public methods: | 
|  | 91 |  | 
| Georg Brandl | 8e490de | 2011-01-24 19:53:18 +0000 | [diff] [blame] | 92 | .. method:: format(format_string, *args, **kwargs) | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 93 |  | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 94 | The primary API method.  It takes a format string and | 
| R David Murray | e56bf97 | 2012-08-19 17:26:34 -0400 | [diff] [blame] | 95 | an arbitrary set of positional and keyword arguments. | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 96 | It is just a wrapper that calls :meth:`vformat`. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 97 |  | 
| Serhiy Storchaka | 009b0a1 | 2017-01-13 09:10:51 +0200 | [diff] [blame] | 98 | .. versionchanged:: 3.7 | 
|  | 99 | A format string argument is now :ref:`positional-only | 
|  | 100 | <positional-only_parameter>`. | 
| Serhiy Storchaka | b876df4 | 2015-03-24 22:30:46 +0200 | [diff] [blame] | 101 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 102 | .. method:: vformat(format_string, args, kwargs) | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 103 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 104 | This function does the actual work of formatting.  It is exposed as a | 
|  | 105 | separate function for cases where you want to pass in a predefined | 
|  | 106 | dictionary of arguments, rather than unpacking and repacking the | 
| Ezio Melotti | 28c88f4 | 2012-11-27 19:17:57 +0200 | [diff] [blame] | 107 | dictionary as individual arguments using the ``*args`` and ``**kwargs`` | 
| R David Murray | e56bf97 | 2012-08-19 17:26:34 -0400 | [diff] [blame] | 108 | syntax.  :meth:`vformat` does the work of breaking up the format string | 
|  | 109 | into character data and replacement fields.  It calls the various | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 110 | methods described below. | 
|  | 111 |  | 
|  | 112 | In addition, the :class:`Formatter` defines a number of methods that are | 
|  | 113 | intended to be replaced by subclasses: | 
|  | 114 |  | 
|  | 115 | .. method:: parse(format_string) | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 116 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 117 | Loop over the format_string and return an iterable of tuples | 
|  | 118 | (*literal_text*, *field_name*, *format_spec*, *conversion*).  This is used | 
| Georg Brandl | 70cd7bc | 2010-10-26 19:31:06 +0000 | [diff] [blame] | 119 | by :meth:`vformat` to break the string into either literal text, or | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 120 | replacement fields. | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 121 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 122 | The values in the tuple conceptually represent a span of literal text | 
|  | 123 | followed by a single replacement field.  If there is no literal text | 
|  | 124 | (which can happen if two replacement fields occur consecutively), then | 
|  | 125 | *literal_text* will be a zero-length string.  If there is no replacement | 
|  | 126 | field, then the values of *field_name*, *format_spec* and *conversion* | 
|  | 127 | will be ``None``. | 
|  | 128 |  | 
| Eric Smith | 9d4ba39 | 2007-09-02 15:33:26 +0000 | [diff] [blame] | 129 | .. method:: get_field(field_name, args, kwargs) | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 130 |  | 
|  | 131 | Given *field_name* as returned by :meth:`parse` (see above), convert it to | 
| Georg Brandl | 7f13e6b | 2007-08-31 10:37:15 +0000 | [diff] [blame] | 132 | an object to be formatted.  Returns a tuple (obj, used_key).  The default | 
|  | 133 | version takes strings of the form defined in :pep:`3101`, such as | 
|  | 134 | "0[name]" or "label.title".  *args* and *kwargs* are as passed in to | 
|  | 135 | :meth:`vformat`.  The return value *used_key* has the same meaning as the | 
|  | 136 | *key* parameter to :meth:`get_value`. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 137 |  | 
|  | 138 | .. method:: get_value(key, args, kwargs) | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 139 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 140 | Retrieve a given field value.  The *key* argument will be either an | 
|  | 141 | integer or a string.  If it is an integer, it represents the index of the | 
|  | 142 | positional argument in *args*; if it is a string, then it represents a | 
|  | 143 | named argument in *kwargs*. | 
|  | 144 |  | 
|  | 145 | The *args* parameter is set to the list of positional arguments to | 
|  | 146 | :meth:`vformat`, and the *kwargs* parameter is set to the dictionary of | 
|  | 147 | keyword arguments. | 
|  | 148 |  | 
|  | 149 | For compound field names, these functions are only called for the first | 
|  | 150 | component of the field name; Subsequent components are handled through | 
|  | 151 | normal attribute and indexing operations. | 
|  | 152 |  | 
|  | 153 | So for example, the field expression '0.name' would cause | 
|  | 154 | :meth:`get_value` to be called with a *key* argument of 0.  The ``name`` | 
|  | 155 | attribute will be looked up after :meth:`get_value` returns by calling the | 
|  | 156 | built-in :func:`getattr` function. | 
|  | 157 |  | 
|  | 158 | If the index or keyword refers to an item that does not exist, then an | 
|  | 159 | :exc:`IndexError` or :exc:`KeyError` should be raised. | 
|  | 160 |  | 
|  | 161 | .. method:: check_unused_args(used_args, args, kwargs) | 
|  | 162 |  | 
|  | 163 | Implement checking for unused arguments if desired.  The arguments to this | 
|  | 164 | function is the set of all argument keys that were actually referred to in | 
|  | 165 | the format string (integers for positional arguments, and strings for | 
|  | 166 | named arguments), and a reference to the *args* and *kwargs* that was | 
|  | 167 | passed to vformat.  The set of unused args can be calculated from these | 
| Georg Brandl | 7cb1319 | 2010-08-03 12:06:29 +0000 | [diff] [blame] | 168 | parameters.  :meth:`check_unused_args` is assumed to raise an exception if | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 169 | the check fails. | 
|  | 170 |  | 
|  | 171 | .. method:: format_field(value, format_spec) | 
|  | 172 |  | 
|  | 173 | :meth:`format_field` simply calls the global :func:`format` built-in.  The | 
|  | 174 | method is provided so that subclasses can override it. | 
|  | 175 |  | 
|  | 176 | .. method:: convert_field(value, conversion) | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 177 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 178 | Converts the value (returned by :meth:`get_field`) given a conversion type | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 179 | (as in the tuple returned by the :meth:`parse` method).  The default | 
| R David Murray | e56bf97 | 2012-08-19 17:26:34 -0400 | [diff] [blame] | 180 | version understands 's' (str), 'r' (repr) and 'a' (ascii) conversion | 
|  | 181 | types. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 182 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 183 |  | 
|  | 184 | .. _formatstrings: | 
|  | 185 |  | 
|  | 186 | Format String Syntax | 
|  | 187 | -------------------- | 
|  | 188 |  | 
|  | 189 | The :meth:`str.format` method and the :class:`Formatter` class share the same | 
|  | 190 | syntax for format strings (although in the case of :class:`Formatter`, | 
| Martin Panter | bc1ee46 | 2016-02-13 00:41:37 +0000 | [diff] [blame] | 191 | subclasses can define their own format string syntax).  The syntax is | 
|  | 192 | related to that of :ref:`formatted string literals <f-strings>`, but | 
|  | 193 | there are differences. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 194 |  | 
|  | 195 | Format strings contain "replacement fields" surrounded by curly braces ``{}``. | 
|  | 196 | Anything that is not contained in braces is considered literal text, which is | 
|  | 197 | copied unchanged to the output.  If you need to include a brace character in the | 
|  | 198 | literal text, it can be escaped by doubling: ``{{`` and ``}}``. | 
|  | 199 |  | 
|  | 200 | The grammar for a replacement field is as follows: | 
|  | 201 |  | 
|  | 202 | .. productionlist:: sf | 
| Georg Brandl | 2f3ed68 | 2009-09-01 07:42:40 +0000 | [diff] [blame] | 203 | replacement_field: "{" [`field_name`] ["!" `conversion`] [":" `format_spec`] "}" | 
| Eric Smith | c4cae32 | 2009-04-22 00:53:01 +0000 | [diff] [blame] | 204 | field_name: arg_name ("." `attribute_name` | "[" `element_index` "]")* | 
| Benjamin Peterson | d7c3ed5 | 2010-06-27 22:32:30 +0000 | [diff] [blame] | 205 | arg_name: [`identifier` | `integer`] | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 206 | attribute_name: `identifier` | 
| Eric Smith | 2e9f202 | 2010-02-25 14:58:13 +0000 | [diff] [blame] | 207 | element_index: `integer` | `index_string` | 
|  | 208 | index_string: <any source character except "]"> + | 
| Benjamin Peterson | 065ba70 | 2008-11-09 01:43:02 +0000 | [diff] [blame] | 209 | conversion: "r" | "s" | "a" | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 210 | format_spec: <described in the next section> | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 211 |  | 
| Georg Brandl | 2f3ed68 | 2009-09-01 07:42:40 +0000 | [diff] [blame] | 212 | In less formal terms, the replacement field can start with a *field_name* that specifies | 
| Eric Smith | c4cae32 | 2009-04-22 00:53:01 +0000 | [diff] [blame] | 213 | the object whose value is to be formatted and inserted | 
|  | 214 | into the output instead of the replacement field. | 
|  | 215 | The *field_name* is optionally followed by a  *conversion* field, which is | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 216 | preceded by an exclamation point ``'!'``, and a *format_spec*, which is preceded | 
| Eric Smith | c4cae32 | 2009-04-22 00:53:01 +0000 | [diff] [blame] | 217 | by a colon ``':'``.  These specify a non-default format for the replacement value. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 218 |  | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 219 | See also the :ref:`formatspec` section. | 
|  | 220 |  | 
| Ezio Melotti | e130a52 | 2011-10-19 10:58:56 +0300 | [diff] [blame] | 221 | The *field_name* itself begins with an *arg_name* that is either a number or a | 
| Eric Smith | c4cae32 | 2009-04-22 00:53:01 +0000 | [diff] [blame] | 222 | keyword.  If it's a number, it refers to a positional argument, and if it's a keyword, | 
|  | 223 | it refers to a named keyword argument.  If the numerical arg_names in a format string | 
|  | 224 | are 0, 1, 2, ... in sequence, they can all be omitted (not just some) | 
|  | 225 | and the numbers 0, 1, 2, ... will be automatically inserted in that order. | 
| Éric Araujo | 29cf58c | 2011-09-01 18:59:06 +0200 | [diff] [blame] | 226 | Because *arg_name* is not quote-delimited, it is not possible to specify arbitrary | 
|  | 227 | dictionary keys (e.g., the strings ``'10'`` or ``':-]'``) within a format string. | 
| Eric Smith | c4cae32 | 2009-04-22 00:53:01 +0000 | [diff] [blame] | 228 | The *arg_name* can be followed by any number of index or | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 229 | attribute expressions. An expression of the form ``'.name'`` selects the named | 
|  | 230 | attribute using :func:`getattr`, while an expression of the form ``'[index]'`` | 
|  | 231 | does an index lookup using :func:`__getitem__`. | 
|  | 232 |  | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 233 | .. versionchanged:: 3.1 | 
|  | 234 | The positional argument specifiers can be omitted, so ``'{} {}'`` is | 
|  | 235 | equivalent to ``'{0} {1}'``. | 
|  | 236 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 237 | Some simple format string examples:: | 
|  | 238 |  | 
| Serhiy Storchaka | dba9039 | 2016-05-10 12:01:23 +0300 | [diff] [blame] | 239 | "First, thou shalt count to {0}"  # References first positional argument | 
|  | 240 | "Bring me a {}"                   # Implicitly references the first positional argument | 
|  | 241 | "From {} to {}"                   # Same as "From {0} to {1}" | 
|  | 242 | "My quest is {name}"              # References keyword argument 'name' | 
|  | 243 | "Weight in tons {0.weight}"       # 'weight' attribute of first positional arg | 
|  | 244 | "Units destroyed: {players[0]}"   # First element of keyword argument 'players'. | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 245 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 246 | The *conversion* field causes a type coercion before formatting.  Normally, the | 
|  | 247 | job of formatting a value is done by the :meth:`__format__` method of the value | 
|  | 248 | itself.  However, in some cases it is desirable to force a type to be formatted | 
|  | 249 | as a string, overriding its own definition of formatting.  By converting the | 
|  | 250 | value to a string before calling :meth:`__format__`, the normal formatting logic | 
|  | 251 | is bypassed. | 
|  | 252 |  | 
| Georg Brandl | 559e5d7 | 2008-06-11 18:37:52 +0000 | [diff] [blame] | 253 | Three conversion flags are currently supported: ``'!s'`` which calls :func:`str` | 
|  | 254 | on the value, ``'!r'`` which calls :func:`repr` and ``'!a'`` which calls | 
|  | 255 | :func:`ascii`. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 256 |  | 
|  | 257 | Some examples:: | 
|  | 258 |  | 
|  | 259 | "Harold's a clever {0!s}"        # Calls str() on the argument first | 
|  | 260 | "Bring out the holy {name!r}"    # Calls repr() on the argument first | 
| Georg Brandl | 2f3ed68 | 2009-09-01 07:42:40 +0000 | [diff] [blame] | 261 | "More {!a}"                      # Calls ascii() on the argument first | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 262 |  | 
|  | 263 | The *format_spec* field contains a specification of how the value should be | 
|  | 264 | presented, including such details as field width, alignment, padding, decimal | 
| Eric Smith | 0f7affe | 2010-02-15 11:57:31 +0000 | [diff] [blame] | 265 | precision and so on.  Each value type can define its own "formatting | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 266 | mini-language" or interpretation of the *format_spec*. | 
|  | 267 |  | 
|  | 268 | Most built-in types support a common formatting mini-language, which is | 
|  | 269 | described in the next section. | 
|  | 270 |  | 
|  | 271 | A *format_spec* field can also include nested replacement fields within it. | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 272 | These nested replacement fields may contain a field name, conversion flag | 
|  | 273 | and format specification, but deeper nesting is | 
|  | 274 | not allowed.  The replacement fields within the | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 275 | format_spec are substituted before the *format_spec* string is interpreted. | 
|  | 276 | This allows the formatting of a value to be dynamically specified. | 
|  | 277 |  | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 278 | See the :ref:`formatexamples` section for some examples. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 279 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 280 |  | 
|  | 281 | .. _formatspec: | 
|  | 282 |  | 
|  | 283 | Format Specification Mini-Language | 
|  | 284 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
|  | 285 |  | 
|  | 286 | "Format specifications" are used within replacement fields contained within a | 
|  | 287 | format string to define how individual values are presented (see | 
| Martin Panter | bc1ee46 | 2016-02-13 00:41:37 +0000 | [diff] [blame] | 288 | :ref:`formatstrings` and :ref:`f-strings`). | 
|  | 289 | They can also be passed directly to the built-in | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 290 | :func:`format` function.  Each formattable type may define how the format | 
|  | 291 | specification is to be interpreted. | 
|  | 292 |  | 
|  | 293 | Most built-in types implement the following options for format specifications, | 
|  | 294 | although some of the formatting options are only supported by the numeric types. | 
|  | 295 |  | 
| Eric Smith | 05c0774 | 2010-02-25 14:18:57 +0000 | [diff] [blame] | 296 | A general convention is that an empty format string (``""``) produces | 
|  | 297 | the same result as if you had called :func:`str` on the value. A | 
|  | 298 | non-empty format string typically modifies the result. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 299 |  | 
|  | 300 | The general form of a *standard format specifier* is: | 
|  | 301 |  | 
|  | 302 | .. productionlist:: sf | 
| Eric V. Smith | d7665ca | 2016-09-09 23:13:01 -0400 | [diff] [blame] | 303 | format_spec: [[`fill`]`align`][`sign`][#][0][`width`][`grouping_option`][.`precision`][`type`] | 
| Ezio Melotti | c318442 | 2013-10-21 02:53:07 +0300 | [diff] [blame] | 304 | fill: <any character> | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 305 | align: "<" | ">" | "=" | "^" | 
|  | 306 | sign: "+" | "-" | " " | 
|  | 307 | width: `integer` | 
| Eric V. Smith | d7665ca | 2016-09-09 23:13:01 -0400 | [diff] [blame] | 308 | grouping_option: "_" | "," | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 309 | precision: `integer` | 
| Eric Smith | 05c0774 | 2010-02-25 14:18:57 +0000 | [diff] [blame] | 310 | type: "b" | "c" | "d" | "e" | "E" | "f" | "F" | "g" | "G" | "n" | "o" | "s" | "x" | "X" | "%" | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 311 |  | 
| Ezio Melotti | 2bbdfe7 | 2013-11-17 02:47:12 +0200 | [diff] [blame] | 312 | If a valid *align* value is specified, it can be preceded by a *fill* | 
| Ezio Melotti | c318442 | 2013-10-21 02:53:07 +0300 | [diff] [blame] | 313 | character that can be any character and defaults to a space if omitted. | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 314 | It is not possible to use a literal curly brace ("``{``" or "``}``") as | 
| Martin Panter | bc1ee46 | 2016-02-13 00:41:37 +0000 | [diff] [blame] | 315 | the *fill* character in a :ref:`formatted string literal | 
|  | 316 | <f-strings>` or when using the :meth:`str.format` | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 317 | method.  However, it is possible to insert a curly brace | 
|  | 318 | with a nested replacement field.  This limitation doesn't | 
| Ezio Melotti | c318442 | 2013-10-21 02:53:07 +0300 | [diff] [blame] | 319 | affect the :func:`format` function. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 320 |  | 
|  | 321 | The meaning of the various alignment options is as follows: | 
|  | 322 |  | 
|  | 323 | +---------+----------------------------------------------------------+ | 
|  | 324 | | Option  | Meaning                                                  | | 
|  | 325 | +=========+==========================================================+ | 
|  | 326 | | ``'<'`` | Forces the field to be left-aligned within the available | | 
| Georg Brandl | ca583b6 | 2011-02-07 12:13:58 +0000 | [diff] [blame] | 327 | |         | space (this is the default for most objects).            | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 328 | +---------+----------------------------------------------------------+ | 
|  | 329 | | ``'>'`` | Forces the field to be right-aligned within the          | | 
| Georg Brandl | ca583b6 | 2011-02-07 12:13:58 +0000 | [diff] [blame] | 330 | |         | available space (this is the default for numbers).       | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 331 | +---------+----------------------------------------------------------+ | 
|  | 332 | | ``'='`` | Forces the padding to be placed after the sign (if any)  | | 
|  | 333 | |         | but before the digits.  This is used for printing fields | | 
|  | 334 | |         | in the form '+000000120'. This alignment option is only  | | 
| Terry Jan Reedy | 4902c46 | 2016-03-20 21:05:57 -0400 | [diff] [blame] | 335 | |         | valid for numeric types.  It becomes the default when '0'| | 
|  | 336 | |         | immediately precedes the field width.                    | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 337 | +---------+----------------------------------------------------------+ | 
|  | 338 | | ``'^'`` | Forces the field to be centered within the available     | | 
|  | 339 | |         | space.                                                   | | 
|  | 340 | +---------+----------------------------------------------------------+ | 
|  | 341 |  | 
|  | 342 | Note that unless a minimum field width is defined, the field width will always | 
|  | 343 | be the same size as the data to fill it, so that the alignment option has no | 
|  | 344 | meaning in this case. | 
|  | 345 |  | 
|  | 346 | The *sign* option is only valid for number types, and can be one of the | 
|  | 347 | following: | 
|  | 348 |  | 
|  | 349 | +---------+----------------------------------------------------------+ | 
|  | 350 | | Option  | Meaning                                                  | | 
|  | 351 | +=========+==========================================================+ | 
|  | 352 | | ``'+'`` | indicates that a sign should be used for both            | | 
|  | 353 | |         | positive as well as negative numbers.                    | | 
|  | 354 | +---------+----------------------------------------------------------+ | 
|  | 355 | | ``'-'`` | indicates that a sign should be used only for negative   | | 
|  | 356 | |         | numbers (this is the default behavior).                  | | 
|  | 357 | +---------+----------------------------------------------------------+ | 
|  | 358 | | space   | indicates that a leading space should be used on         | | 
|  | 359 | |         | positive numbers, and a minus sign on negative numbers.  | | 
|  | 360 | +---------+----------------------------------------------------------+ | 
|  | 361 |  | 
| Eric Smith | 984bb58 | 2010-11-25 16:08:06 +0000 | [diff] [blame] | 362 |  | 
|  | 363 | The ``'#'`` option causes the "alternate form" to be used for the | 
|  | 364 | conversion.  The alternate form is defined differently for different | 
|  | 365 | types.  This option is only valid for integer, float, complex and | 
|  | 366 | Decimal types. For integers, when binary, octal, or hexadecimal output | 
|  | 367 | is used, this option adds the prefix respective ``'0b'``, ``'0o'``, or | 
|  | 368 | ``'0x'`` to the output value. For floats, complex and Decimal the | 
|  | 369 | alternate form causes the result of the conversion to always contain a | 
|  | 370 | decimal-point character, even if no digits follow it. Normally, a | 
|  | 371 | decimal-point character appears in the result of these conversions | 
|  | 372 | only if a digit follows it. In addition, for ``'g'`` and ``'G'`` | 
|  | 373 | conversions, trailing zeros are not removed from the result. | 
| Eric Smith | d68af8f | 2008-07-16 00:15:35 +0000 | [diff] [blame] | 374 |  | 
| Raymond Hettinger | 6db9470 | 2009-07-12 20:49:21 +0000 | [diff] [blame] | 375 | The ``','`` option signals the use of a comma for a thousands separator. | 
|  | 376 | For a locale aware separator, use the ``'n'`` integer presentation type | 
|  | 377 | instead. | 
|  | 378 |  | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 379 | .. versionchanged:: 3.1 | 
|  | 380 | Added the ``','`` option (see also :pep:`378`). | 
|  | 381 |  | 
| Eric V. Smith | 89e1b1a | 2016-09-09 23:06:47 -0400 | [diff] [blame] | 382 | The ``'_'`` option signals the use of an underscore for a thousands | 
|  | 383 | separator for floating point presentation types and for integer | 
|  | 384 | presentation type ``'d'``.  For integer presentation types ``'b'``, | 
|  | 385 | ``'o'``, ``'x'``, and ``'X'``, underscores will be inserted every 4 | 
|  | 386 | digits.  For other presentation types, specifying this option is an | 
|  | 387 | error. | 
|  | 388 |  | 
|  | 389 | .. versionchanged:: 3.6 | 
|  | 390 | Added the ``'_'`` option (see also :pep:`515`). | 
|  | 391 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 392 | *width* is a decimal integer defining the minimum field width.  If not | 
|  | 393 | specified, then the field width will be determined by the content. | 
|  | 394 |  | 
| Terry Jan Reedy | 4902c46 | 2016-03-20 21:05:57 -0400 | [diff] [blame] | 395 | When no explicit alignment is given, preceding the *width* field by a zero | 
|  | 396 | (``'0'``) character enables | 
| Terry Jan Reedy | f6190c1 | 2012-08-17 15:40:46 -0400 | [diff] [blame] | 397 | sign-aware zero-padding for numeric types.  This is equivalent to a *fill* | 
|  | 398 | character of ``'0'`` with an *alignment* type of ``'='``. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 399 |  | 
|  | 400 | The *precision* is a decimal number indicating how many digits should be | 
| Georg Brandl | 3dbca81 | 2008-07-23 16:10:53 +0000 | [diff] [blame] | 401 | displayed after the decimal point for a floating point value formatted with | 
|  | 402 | ``'f'`` and ``'F'``, or before and after the decimal point for a floating point | 
|  | 403 | value formatted with ``'g'`` or ``'G'``.  For non-number types the field | 
|  | 404 | indicates the maximum field size - in other words, how many characters will be | 
| Eric Smith | e5fffc7 | 2009-05-07 19:38:09 +0000 | [diff] [blame] | 405 | used from the field content. The *precision* is not allowed for integer values. | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 406 |  | 
|  | 407 | Finally, the *type* determines how the data should be presented. | 
|  | 408 |  | 
| Eric Smith | 05c0774 | 2010-02-25 14:18:57 +0000 | [diff] [blame] | 409 | The available string presentation types are: | 
|  | 410 |  | 
|  | 411 | +---------+----------------------------------------------------------+ | 
|  | 412 | | Type    | Meaning                                                  | | 
|  | 413 | +=========+==========================================================+ | 
|  | 414 | | ``'s'`` | String format. This is the default type for strings and  | | 
|  | 415 | |         | may be omitted.                                          | | 
|  | 416 | +---------+----------------------------------------------------------+ | 
|  | 417 | | None    | The same as ``'s'``.                                     | | 
|  | 418 | +---------+----------------------------------------------------------+ | 
|  | 419 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 420 | The available integer presentation types are: | 
|  | 421 |  | 
|  | 422 | +---------+----------------------------------------------------------+ | 
|  | 423 | | Type    | Meaning                                                  | | 
|  | 424 | +=========+==========================================================+ | 
| Eric Smith | d68af8f | 2008-07-16 00:15:35 +0000 | [diff] [blame] | 425 | | ``'b'`` | Binary format. Outputs the number in base 2.             | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 426 | +---------+----------------------------------------------------------+ | 
|  | 427 | | ``'c'`` | Character. Converts the integer to the corresponding     | | 
|  | 428 | |         | unicode character before printing.                       | | 
|  | 429 | +---------+----------------------------------------------------------+ | 
|  | 430 | | ``'d'`` | Decimal Integer. Outputs the number in base 10.          | | 
|  | 431 | +---------+----------------------------------------------------------+ | 
|  | 432 | | ``'o'`` | Octal format. Outputs the number in base 8.              | | 
|  | 433 | +---------+----------------------------------------------------------+ | 
|  | 434 | | ``'x'`` | Hex format. Outputs the number in base 16, using lower-  | | 
|  | 435 | |         | case letters for the digits above 9.                     | | 
|  | 436 | +---------+----------------------------------------------------------+ | 
|  | 437 | | ``'X'`` | Hex format. Outputs the number in base 16, using upper-  | | 
|  | 438 | |         | case letters for the digits above 9.                     | | 
|  | 439 | +---------+----------------------------------------------------------+ | 
| Eric Smith | 5e18a20 | 2008-05-12 10:01:24 +0000 | [diff] [blame] | 440 | | ``'n'`` | Number. This is the same as ``'d'``, except that it uses | | 
|  | 441 | |         | the current locale setting to insert the appropriate     | | 
|  | 442 | |         | number separator characters.                             | | 
|  | 443 | +---------+----------------------------------------------------------+ | 
| Georg Brandl | 3dbca81 | 2008-07-23 16:10:53 +0000 | [diff] [blame] | 444 | | None    | The same as ``'d'``.                                     | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 445 | +---------+----------------------------------------------------------+ | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 446 |  | 
| Eric Smith | 05c0774 | 2010-02-25 14:18:57 +0000 | [diff] [blame] | 447 | In addition to the above presentation types, integers can be formatted | 
|  | 448 | with the floating point presentation types listed below (except | 
| Serhiy Storchaka | ecf41da | 2016-10-19 16:29:26 +0300 | [diff] [blame] | 449 | ``'n'`` and ``None``). When doing so, :func:`float` is used to convert the | 
| Eric Smith | 05c0774 | 2010-02-25 14:18:57 +0000 | [diff] [blame] | 450 | integer to a floating point number before formatting. | 
|  | 451 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 452 | The available presentation types for floating point and decimal values are: | 
| Georg Brandl | 48310cd | 2009-01-03 21:18:54 +0000 | [diff] [blame] | 453 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 454 | +---------+----------------------------------------------------------+ | 
|  | 455 | | Type    | Meaning                                                  | | 
|  | 456 | +=========+==========================================================+ | 
|  | 457 | | ``'e'`` | Exponent notation. Prints the number in scientific       | | 
|  | 458 | |         | notation using the letter 'e' to indicate the exponent.  | | 
| Eric V. Smith | 45fe62d | 2013-04-15 09:51:54 -0400 | [diff] [blame] | 459 | |         | The default precision is ``6``.                          | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 460 | +---------+----------------------------------------------------------+ | 
| Eric Smith | 22b85b3 | 2008-07-17 19:18:29 +0000 | [diff] [blame] | 461 | | ``'E'`` | Exponent notation. Same as ``'e'`` except it uses an     | | 
|  | 462 | |         | upper case 'E' as the separator character.               | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 463 | +---------+----------------------------------------------------------+ | 
|  | 464 | | ``'f'`` | Fixed point. Displays the number as a fixed-point        | | 
| Eric V. Smith | 45fe62d | 2013-04-15 09:51:54 -0400 | [diff] [blame] | 465 | |         | number.  The default precision is ``6``.                 | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 466 | +---------+----------------------------------------------------------+ | 
| Eric Smith | 741191f | 2009-05-06 13:08:15 +0000 | [diff] [blame] | 467 | | ``'F'`` | Fixed point. Same as ``'f'``, but converts ``nan`` to    | | 
|  | 468 | |         | ``NAN`` and ``inf`` to ``INF``.                          | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 469 | +---------+----------------------------------------------------------+ | 
| Mark Dickinson | c70614f | 2009-10-08 20:05:48 +0000 | [diff] [blame] | 470 | | ``'g'`` | General format.  For a given precision ``p >= 1``,       | | 
|  | 471 | |         | this rounds the number to ``p`` significant digits and   | | 
|  | 472 | |         | then formats the result in either fixed-point format     | | 
|  | 473 | |         | or in scientific notation, depending on its magnitude.   | | 
|  | 474 | |         |                                                          | | 
|  | 475 | |         | The precise rules are as follows: suppose that the       | | 
|  | 476 | |         | result formatted with presentation type ``'e'`` and      | | 
|  | 477 | |         | precision ``p-1`` would have exponent ``exp``.  Then     | | 
|  | 478 | |         | if ``-4 <= exp < p``, the number is formatted            | | 
|  | 479 | |         | with presentation type ``'f'`` and precision             | | 
|  | 480 | |         | ``p-1-exp``.  Otherwise, the number is formatted         | | 
|  | 481 | |         | with presentation type ``'e'`` and precision ``p-1``.    | | 
|  | 482 | |         | In both cases insignificant trailing zeros are removed   | | 
|  | 483 | |         | from the significand, and the decimal point is also      | | 
|  | 484 | |         | removed if there are no remaining digits following it.   | | 
|  | 485 | |         |                                                          | | 
| Benjamin Peterson | 73a3f2d | 2010-10-12 23:07:13 +0000 | [diff] [blame] | 486 | |         | Positive and negative infinity, positive and negative    | | 
| Mark Dickinson | c70614f | 2009-10-08 20:05:48 +0000 | [diff] [blame] | 487 | |         | zero, and nans, are formatted as ``inf``, ``-inf``,      | | 
|  | 488 | |         | ``0``, ``-0`` and ``nan`` respectively, regardless of    | | 
|  | 489 | |         | the precision.                                           | | 
|  | 490 | |         |                                                          | | 
|  | 491 | |         | A precision of ``0`` is treated as equivalent to a       | | 
| Eric V. Smith | 45fe62d | 2013-04-15 09:51:54 -0400 | [diff] [blame] | 492 | |         | precision of ``1``.  The default precision is ``6``.     | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 493 | +---------+----------------------------------------------------------+ | 
|  | 494 | | ``'G'`` | General format. Same as ``'g'`` except switches to       | | 
| Mark Dickinson | c70614f | 2009-10-08 20:05:48 +0000 | [diff] [blame] | 495 | |         | ``'E'`` if the number gets too large. The                | | 
|  | 496 | |         | representations of infinity and NaN are uppercased, too. | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 497 | +---------+----------------------------------------------------------+ | 
|  | 498 | | ``'n'`` | Number. This is the same as ``'g'``, except that it uses | | 
|  | 499 | |         | the current locale setting to insert the appropriate     | | 
|  | 500 | |         | number separator characters.                             | | 
|  | 501 | +---------+----------------------------------------------------------+ | 
|  | 502 | | ``'%'`` | Percentage. Multiplies the number by 100 and displays    | | 
|  | 503 | |         | in fixed (``'f'``) format, followed by a percent sign.   | | 
|  | 504 | +---------+----------------------------------------------------------+ | 
| Terry Jan Reedy | c6ad576 | 2014-10-06 02:04:33 -0400 | [diff] [blame] | 505 | | None    | Similar to ``'g'``, except that fixed-point notation,    | | 
|  | 506 | |         | when used, has at least one digit past the decimal point.| | 
|  | 507 | |         | The default precision is as high as needed to represent  | | 
|  | 508 | |         | the particular value. The overall effect is to match the | | 
|  | 509 | |         | output of :func:`str` as altered by the other format     | | 
|  | 510 | |         | modifiers.                                               | | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 511 | +---------+----------------------------------------------------------+ | 
|  | 512 |  | 
|  | 513 |  | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 514 | .. _formatexamples: | 
|  | 515 |  | 
|  | 516 | Format examples | 
|  | 517 | ^^^^^^^^^^^^^^^ | 
|  | 518 |  | 
| Martin Panter | d5db147 | 2016-02-08 01:34:09 +0000 | [diff] [blame] | 519 | This section contains examples of the :meth:`str.format` syntax and | 
|  | 520 | comparison with the old ``%``-formatting. | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 521 |  | 
|  | 522 | In most of the cases the syntax is similar to the old ``%``-formatting, with the | 
|  | 523 | addition of the ``{}`` and with ``:`` used instead of ``%``. | 
|  | 524 | For example, ``'%03.2f'`` can be translated to ``'{:03.2f}'``. | 
|  | 525 |  | 
|  | 526 | The new format syntax also supports new and different options, shown in the | 
|  | 527 | follow examples. | 
|  | 528 |  | 
|  | 529 | Accessing arguments by position:: | 
|  | 530 |  | 
|  | 531 | >>> '{0}, {1}, {2}'.format('a', 'b', 'c') | 
|  | 532 | 'a, b, c' | 
|  | 533 | >>> '{}, {}, {}'.format('a', 'b', 'c')  # 3.1+ only | 
|  | 534 | 'a, b, c' | 
|  | 535 | >>> '{2}, {1}, {0}'.format('a', 'b', 'c') | 
|  | 536 | 'c, b, a' | 
|  | 537 | >>> '{2}, {1}, {0}'.format(*'abc')      # unpacking argument sequence | 
|  | 538 | 'c, b, a' | 
|  | 539 | >>> '{0}{1}{0}'.format('abra', 'cad')   # arguments' indices can be repeated | 
|  | 540 | 'abracadabra' | 
|  | 541 |  | 
|  | 542 | Accessing arguments by name:: | 
|  | 543 |  | 
|  | 544 | >>> 'Coordinates: {latitude}, {longitude}'.format(latitude='37.24N', longitude='-115.81W') | 
|  | 545 | 'Coordinates: 37.24N, -115.81W' | 
|  | 546 | >>> coord = {'latitude': '37.24N', 'longitude': '-115.81W'} | 
|  | 547 | >>> 'Coordinates: {latitude}, {longitude}'.format(**coord) | 
|  | 548 | 'Coordinates: 37.24N, -115.81W' | 
|  | 549 |  | 
|  | 550 | Accessing arguments' attributes:: | 
|  | 551 |  | 
|  | 552 | >>> c = 3-5j | 
|  | 553 | >>> ('The complex number {0} is formed from the real part {0.real} ' | 
|  | 554 | ...  'and the imaginary part {0.imag}.').format(c) | 
|  | 555 | 'The complex number (3-5j) is formed from the real part 3.0 and the imaginary part -5.0.' | 
|  | 556 | >>> class Point: | 
|  | 557 | ...     def __init__(self, x, y): | 
|  | 558 | ...         self.x, self.y = x, y | 
|  | 559 | ...     def __str__(self): | 
|  | 560 | ...         return 'Point({self.x}, {self.y})'.format(self=self) | 
|  | 561 | ... | 
|  | 562 | >>> str(Point(4, 2)) | 
|  | 563 | 'Point(4, 2)' | 
|  | 564 |  | 
|  | 565 | Accessing arguments' items:: | 
|  | 566 |  | 
|  | 567 | >>> coord = (3, 5) | 
|  | 568 | >>> 'X: {0[0]};  Y: {0[1]}'.format(coord) | 
|  | 569 | 'X: 3;  Y: 5' | 
|  | 570 |  | 
|  | 571 | Replacing ``%s`` and ``%r``:: | 
|  | 572 |  | 
|  | 573 | >>> "repr() shows quotes: {!r}; str() doesn't: {!s}".format('test1', 'test2') | 
|  | 574 | "repr() shows quotes: 'test1'; str() doesn't: test2" | 
|  | 575 |  | 
|  | 576 | Aligning the text and specifying a width:: | 
|  | 577 |  | 
|  | 578 | >>> '{:<30}'.format('left aligned') | 
|  | 579 | 'left aligned                  ' | 
|  | 580 | >>> '{:>30}'.format('right aligned') | 
|  | 581 | '                 right aligned' | 
|  | 582 | >>> '{:^30}'.format('centered') | 
|  | 583 | '           centered           ' | 
|  | 584 | >>> '{:*^30}'.format('centered')  # use '*' as a fill char | 
|  | 585 | '***********centered***********' | 
|  | 586 |  | 
|  | 587 | Replacing ``%+f``, ``%-f``, and ``% f`` and specifying a sign:: | 
|  | 588 |  | 
|  | 589 | >>> '{:+f}; {:+f}'.format(3.14, -3.14)  # show it always | 
|  | 590 | '+3.140000; -3.140000' | 
|  | 591 | >>> '{: f}; {: f}'.format(3.14, -3.14)  # show a space for positive numbers | 
|  | 592 | ' 3.140000; -3.140000' | 
|  | 593 | >>> '{:-f}; {:-f}'.format(3.14, -3.14)  # show only the minus -- same as '{:f}; {:f}' | 
|  | 594 | '3.140000; -3.140000' | 
|  | 595 |  | 
|  | 596 | Replacing ``%x`` and ``%o`` and converting the value to different bases:: | 
|  | 597 |  | 
|  | 598 | >>> # format also supports binary numbers | 
|  | 599 | >>> "int: {0:d};  hex: {0:x};  oct: {0:o};  bin: {0:b}".format(42) | 
|  | 600 | 'int: 42;  hex: 2a;  oct: 52;  bin: 101010' | 
|  | 601 | >>> # with 0x, 0o, or 0b as prefix: | 
|  | 602 | >>> "int: {0:d};  hex: {0:#x};  oct: {0:#o};  bin: {0:#b}".format(42) | 
|  | 603 | 'int: 42;  hex: 0x2a;  oct: 0o52;  bin: 0b101010' | 
|  | 604 |  | 
|  | 605 | Using the comma as a thousands separator:: | 
|  | 606 |  | 
|  | 607 | >>> '{:,}'.format(1234567890) | 
|  | 608 | '1,234,567,890' | 
|  | 609 |  | 
|  | 610 | Expressing a percentage:: | 
|  | 611 |  | 
|  | 612 | >>> points = 19 | 
|  | 613 | >>> total = 22 | 
| Sandro Tosi | baf30da | 2011-12-24 15:53:35 +0100 | [diff] [blame] | 614 | >>> 'Correct answers: {:.2%}'.format(points/total) | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 615 | 'Correct answers: 86.36%' | 
|  | 616 |  | 
|  | 617 | Using type-specific formatting:: | 
|  | 618 |  | 
|  | 619 | >>> import datetime | 
|  | 620 | >>> d = datetime.datetime(2010, 7, 4, 12, 15, 58) | 
|  | 621 | >>> '{:%Y-%m-%d %H:%M:%S}'.format(d) | 
|  | 622 | '2010-07-04 12:15:58' | 
|  | 623 |  | 
|  | 624 | Nesting arguments and more complex examples:: | 
|  | 625 |  | 
|  | 626 | >>> for align, text in zip('<^>', ['left', 'center', 'right']): | 
| Georg Brandl | a5770aa | 2011-02-07 12:10:46 +0000 | [diff] [blame] | 627 | ...     '{0:{fill}{align}16}'.format(text, fill=align, align=align) | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 628 | ... | 
|  | 629 | 'left<<<<<<<<<<<<' | 
|  | 630 | '^^^^^center^^^^^' | 
|  | 631 | '>>>>>>>>>>>right' | 
|  | 632 | >>> | 
|  | 633 | >>> octets = [192, 168, 0, 1] | 
|  | 634 | >>> '{:02X}{:02X}{:02X}{:02X}'.format(*octets) | 
|  | 635 | 'C0A80001' | 
|  | 636 | >>> int(_, 16) | 
|  | 637 | 3232235521 | 
|  | 638 | >>> | 
|  | 639 | >>> width = 5 | 
| Ezio Melotti | 4050792 | 2013-01-11 09:09:07 +0200 | [diff] [blame] | 640 | >>> for num in range(5,12): #doctest: +NORMALIZE_WHITESPACE | 
| Ezio Melotti | d2191e0 | 2010-07-02 23:18:51 +0000 | [diff] [blame] | 641 | ...     for base in 'dXob': | 
|  | 642 | ...         print('{0:{width}{base}}'.format(num, base=base, width=width), end=' ') | 
|  | 643 | ...     print() | 
|  | 644 | ... | 
|  | 645 | 5     5     5   101 | 
|  | 646 | 6     6     6   110 | 
|  | 647 | 7     7     7   111 | 
|  | 648 | 8     8    10  1000 | 
|  | 649 | 9     9    11  1001 | 
|  | 650 | 10     A    12  1010 | 
|  | 651 | 11     B    13  1011 | 
|  | 652 |  | 
|  | 653 |  | 
|  | 654 |  | 
| Georg Brandl | 4b49131 | 2007-08-31 09:22:56 +0000 | [diff] [blame] | 655 | .. _template-strings: | 
|  | 656 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 657 | Template strings | 
|  | 658 | ---------------- | 
|  | 659 |  | 
| Barry Warsaw | 9f74deb | 2017-03-28 10:02:07 -0400 | [diff] [blame] | 660 | Template strings provide simpler string substitutions as described in | 
|  | 661 | :pep:`292`.  A primary use case for template strings is for | 
|  | 662 | internationalization (i18n) since in that context, the simpler syntax and | 
|  | 663 | functionality makes it easier to translate than other built-in string | 
|  | 664 | formatting facilities in Python.  As an example of a library built on template | 
|  | 665 | strings for i18n, see the | 
|  | 666 | `flufl.i18n <http://flufli18n.readthedocs.io/en/latest/>`_ package. | 
|  | 667 |  | 
|  | 668 | Template strings support ``$``-based substitutions, using the following rules: | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 669 |  | 
|  | 670 | * ``$$`` is an escape; it is replaced with a single ``$``. | 
|  | 671 |  | 
|  | 672 | * ``$identifier`` names a substitution placeholder matching a mapping key of | 
| Barry Warsaw | 17d5f47 | 2015-06-09 14:20:31 -0400 | [diff] [blame] | 673 | ``"identifier"``.  By default, ``"identifier"`` is restricted to any | 
|  | 674 | case-insensitive ASCII alphanumeric string (including underscores) that | 
|  | 675 | starts with an underscore or ASCII letter.  The first non-identifier | 
|  | 676 | character after the ``$`` character terminates this placeholder | 
|  | 677 | specification. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 678 |  | 
| Barry Warsaw | 17d5f47 | 2015-06-09 14:20:31 -0400 | [diff] [blame] | 679 | * ``${identifier}`` is equivalent to ``$identifier``.  It is required when | 
|  | 680 | valid identifier characters follow the placeholder but are not part of the | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 681 | placeholder, such as ``"${noun}ification"``. | 
|  | 682 |  | 
|  | 683 | Any other appearance of ``$`` in the string will result in a :exc:`ValueError` | 
|  | 684 | being raised. | 
|  | 685 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 686 | The :mod:`string` module provides a :class:`Template` class that implements | 
|  | 687 | these rules.  The methods of :class:`Template` are: | 
|  | 688 |  | 
|  | 689 |  | 
|  | 690 | .. class:: Template(template) | 
|  | 691 |  | 
|  | 692 | The constructor takes a single argument which is the template string. | 
|  | 693 |  | 
|  | 694 |  | 
| Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 695 | .. method:: substitute(mapping, **kwds) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 696 |  | 
| Benjamin Peterson | e41251e | 2008-04-25 01:59:09 +0000 | [diff] [blame] | 697 | Performs the template substitution, returning a new string.  *mapping* is | 
|  | 698 | any dictionary-like object with keys that match the placeholders in the | 
|  | 699 | template.  Alternatively, you can provide keyword arguments, where the | 
| Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 700 | keywords are the placeholders.  When both *mapping* and *kwds* are given | 
|  | 701 | and there are duplicates, the placeholders from *kwds* take precedence. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 702 |  | 
|  | 703 |  | 
| Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 704 | .. method:: safe_substitute(mapping, **kwds) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 705 |  | 
| Benjamin Peterson | e41251e | 2008-04-25 01:59:09 +0000 | [diff] [blame] | 706 | Like :meth:`substitute`, except that if placeholders are missing from | 
| Georg Brandl | 7f01a13 | 2009-09-16 15:58:14 +0000 | [diff] [blame] | 707 | *mapping* and *kwds*, instead of raising a :exc:`KeyError` exception, the | 
| Benjamin Peterson | e41251e | 2008-04-25 01:59:09 +0000 | [diff] [blame] | 708 | original placeholder will appear in the resulting string intact.  Also, | 
|  | 709 | unlike with :meth:`substitute`, any other appearances of the ``$`` will | 
|  | 710 | simply return ``$`` instead of raising :exc:`ValueError`. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 711 |  | 
| Benjamin Peterson | e41251e | 2008-04-25 01:59:09 +0000 | [diff] [blame] | 712 | While other exceptions may still occur, this method is called "safe" | 
|  | 713 | because substitutions always tries to return a usable string instead of | 
|  | 714 | raising an exception.  In another sense, :meth:`safe_substitute` may be | 
|  | 715 | anything other than safe, since it will silently ignore malformed | 
|  | 716 | templates containing dangling delimiters, unmatched braces, or | 
|  | 717 | placeholders that are not valid Python identifiers. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 718 |  | 
| Benjamin Peterson | 2021100 | 2009-11-25 18:34:42 +0000 | [diff] [blame] | 719 | :class:`Template` instances also provide one public data attribute: | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 720 |  | 
| Benjamin Peterson | 2021100 | 2009-11-25 18:34:42 +0000 | [diff] [blame] | 721 | .. attribute:: template | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 722 |  | 
| Benjamin Peterson | 2021100 | 2009-11-25 18:34:42 +0000 | [diff] [blame] | 723 | This is the object passed to the constructor's *template* argument.  In | 
|  | 724 | general, you shouldn't change it, but read-only access is not enforced. | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 725 |  | 
| Ezio Melotti | bcbc567 | 2013-02-21 12:30:32 +0200 | [diff] [blame] | 726 | Here is an example of how to use a Template:: | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 727 |  | 
|  | 728 | >>> from string import Template | 
|  | 729 | >>> s = Template('$who likes $what') | 
|  | 730 | >>> s.substitute(who='tim', what='kung pao') | 
|  | 731 | 'tim likes kung pao' | 
|  | 732 | >>> d = dict(who='tim') | 
|  | 733 | >>> Template('Give $who $100').substitute(d) | 
|  | 734 | Traceback (most recent call last): | 
| Ezio Melotti | bcbc567 | 2013-02-21 12:30:32 +0200 | [diff] [blame] | 735 | ... | 
| Ezio Melotti | 4050792 | 2013-01-11 09:09:07 +0200 | [diff] [blame] | 736 | ValueError: Invalid placeholder in string: line 1, col 11 | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 737 | >>> Template('$who likes $what').substitute(d) | 
|  | 738 | Traceback (most recent call last): | 
| Ezio Melotti | bcbc567 | 2013-02-21 12:30:32 +0200 | [diff] [blame] | 739 | ... | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 740 | KeyError: 'what' | 
|  | 741 | >>> Template('$who likes $what').safe_substitute(d) | 
|  | 742 | 'tim likes $what' | 
|  | 743 |  | 
| Barry Warsaw | 9f74deb | 2017-03-28 10:02:07 -0400 | [diff] [blame] | 744 | Advanced usage: you can derive subclasses of :class:`Template` to customize | 
|  | 745 | the placeholder syntax, delimiter character, or the entire regular expression | 
|  | 746 | used to parse template strings.  To do this, you can override these class | 
|  | 747 | attributes: | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 748 |  | 
| Barry Warsaw | 9f74deb | 2017-03-28 10:02:07 -0400 | [diff] [blame] | 749 | * *delimiter* -- This is the literal string describing a placeholder | 
|  | 750 | introducing delimiter.  The default value is ``$``.  Note that this should | 
|  | 751 | *not* be a regular expression, as the implementation will call | 
|  | 752 | :meth:`re.escape` on this string as needed.  Note further that you cannot | 
|  | 753 | change the delimiter after class creation (i.e. a different delimiter must | 
|  | 754 | be set in the subclass's class namespace). | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 755 |  | 
|  | 756 | * *idpattern* -- This is the regular expression describing the pattern for | 
| Barry Warsaw | ba42796 | 2017-09-04 16:32:10 -0400 | [diff] [blame] | 757 | non-braced placeholders.  The default value is the regular expression | 
| INADA Naoki | b22273e | 2017-10-13 16:02:23 +0900 | [diff] [blame^] | 758 | ``(?-i:[_a-zA-Z][_a-zA-Z0-9]*)``.  If this is given and *braceidpattern* is | 
|  | 759 | ``None`` this pattern will also apply to braced placeholders. | 
|  | 760 |  | 
|  | 761 | .. note:: | 
|  | 762 |  | 
|  | 763 | Since default *flags* is ``re.IGNORECASE``, pattern ``[a-z]`` can match | 
|  | 764 | with some non-ASCII characters. That's why we use local ``-i`` flag here. | 
|  | 765 |  | 
|  | 766 | While *flags* is kept to ``re.IGNORECASE`` for backward compatibility, | 
|  | 767 | you can override it to ``0`` or ``re.IGNORECASE | re.ASCII`` when | 
|  | 768 | subclassing.  It's simple way to avoid unexpected match like above example. | 
| Barry Warsaw | ba42796 | 2017-09-04 16:32:10 -0400 | [diff] [blame] | 769 |  | 
|  | 770 | .. versionchanged:: 3.7 | 
|  | 771 | *braceidpattern* can be used to define separate patterns used inside and | 
|  | 772 | outside the braces. | 
|  | 773 |  | 
|  | 774 | * *braceidpattern* -- This is like *idpattern* but describes the pattern for | 
|  | 775 | braced placeholders.  Defaults to ``None`` which means to fall back to | 
|  | 776 | *idpattern* (i.e. the same pattern is used both inside and outside braces). | 
|  | 777 | If given, this allows you to define different patterns for braced and | 
|  | 778 | unbraced placeholders. | 
|  | 779 |  | 
|  | 780 | .. versionadded:: 3.7 | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 781 |  | 
| Georg Brandl | 056cb93 | 2010-07-29 17:16:10 +0000 | [diff] [blame] | 782 | * *flags* -- The regular expression flags that will be applied when compiling | 
|  | 783 | the regular expression used for recognizing substitutions.  The default value | 
|  | 784 | is ``re.IGNORECASE``.  Note that ``re.VERBOSE`` will always be added to the | 
|  | 785 | flags, so custom *idpattern*\ s must follow conventions for verbose regular | 
|  | 786 | expressions. | 
|  | 787 |  | 
|  | 788 | .. versionadded:: 3.2 | 
|  | 789 |  | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 790 | Alternatively, you can provide the entire regular expression pattern by | 
|  | 791 | overriding the class attribute *pattern*.  If you do this, the value must be a | 
|  | 792 | regular expression object with four named capturing groups.  The capturing | 
|  | 793 | groups correspond to the rules given above, along with the invalid placeholder | 
|  | 794 | rule: | 
|  | 795 |  | 
|  | 796 | * *escaped* -- This group matches the escape sequence, e.g. ``$$``, in the | 
|  | 797 | default pattern. | 
|  | 798 |  | 
|  | 799 | * *named* -- This group matches the unbraced placeholder name; it should not | 
|  | 800 | include the delimiter in capturing group. | 
|  | 801 |  | 
|  | 802 | * *braced* -- This group matches the brace enclosed placeholder name; it should | 
|  | 803 | not include either the delimiter or braces in the capturing group. | 
|  | 804 |  | 
|  | 805 | * *invalid* -- This group matches any other delimiter pattern (usually a single | 
|  | 806 | delimiter), and it should appear last in the regular expression. | 
|  | 807 |  | 
|  | 808 |  | 
| Georg Brandl | abc3877 | 2009-04-12 15:51:51 +0000 | [diff] [blame] | 809 | Helper functions | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 810 | ---------------- | 
|  | 811 |  | 
| Georg Brandl | 10430ad | 2009-09-26 20:59:11 +0000 | [diff] [blame] | 812 | .. function:: capwords(s, sep=None) | 
| Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 813 |  | 
| Ezio Melotti | a40bdda | 2009-09-26 12:33:22 +0000 | [diff] [blame] | 814 | Split the argument into words using :meth:`str.split`, capitalize each word | 
|  | 815 | using :meth:`str.capitalize`, and join the capitalized words using | 
|  | 816 | :meth:`str.join`.  If the optional second argument *sep* is absent | 
|  | 817 | or ``None``, runs of whitespace characters are replaced by a single space | 
|  | 818 | and leading and trailing whitespace are removed, otherwise *sep* is used to | 
|  | 819 | split and join the words. |