blob: 1e718382589c48acd10ea8e48366ee66e86228ec [file] [log] [blame]
Alexander Belopolskyf0a0d142010-10-27 03:06:43 +00001:mod:`ast` --- Abstract Syntax Trees
2====================================
Georg Brandl0c77a822008-06-10 16:37:50 +00003
4.. module:: ast
5 :synopsis: Abstract Syntax Tree classes and manipulation.
6
7.. sectionauthor:: Martin v. Lรถwis <martin@v.loewis.de>
8.. sectionauthor:: Georg Brandl <georg@python.org>
9
Raymond Hettinger10480942011-01-10 03:26:08 +000010**Source code:** :source:`Lib/ast.py`
Georg Brandl0c77a822008-06-10 16:37:50 +000011
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000012--------------
13
Georg Brandl0c77a822008-06-10 16:37:50 +000014The :mod:`ast` module helps Python applications to process trees of the Python
15abstract syntax grammar. The abstract syntax itself might change with each
16Python release; this module helps to find out programmatically what the current
17grammar looks like.
18
Benjamin Petersonec9199b2008-11-08 17:05:00 +000019An abstract syntax tree can be generated by passing :data:`ast.PyCF_ONLY_AST` as
Georg Brandl22b34312009-07-26 14:54:51 +000020a flag to the :func:`compile` built-in function, or using the :func:`parse`
Georg Brandl0c77a822008-06-10 16:37:50 +000021helper provided in this module. The result will be a tree of objects whose
Benjamin Petersonec9199b2008-11-08 17:05:00 +000022classes all inherit from :class:`ast.AST`. An abstract syntax tree can be
23compiled into a Python code object using the built-in :func:`compile` function.
Georg Brandl0c77a822008-06-10 16:37:50 +000024
Georg Brandl0c77a822008-06-10 16:37:50 +000025
26Node classes
27------------
28
29.. class:: AST
30
31 This is the base of all AST node classes. The actual node classes are
32 derived from the :file:`Parser/Python.asdl` file, which is reproduced
33 :ref:`below <abstract-grammar>`. They are defined in the :mod:`_ast` C
34 module and re-exported in :mod:`ast`.
35
36 There is one class defined for each left-hand side symbol in the abstract
37 grammar (for example, :class:`ast.stmt` or :class:`ast.expr`). In addition,
38 there is one class defined for each constructor on the right-hand side; these
39 classes inherit from the classes for the left-hand side trees. For example,
40 :class:`ast.BinOp` inherits from :class:`ast.expr`. For production rules
41 with alternatives (aka "sums"), the left-hand side class is abstract: only
42 instances of specific constructor nodes are ever created.
43
Serhiy Storchaka913876d2018-10-28 13:41:26 +020044 .. index:: single: ? (question mark); in AST grammar
45 .. index:: single: * (asterisk); in AST grammar
46
Georg Brandl0c77a822008-06-10 16:37:50 +000047 .. attribute:: _fields
48
49 Each concrete class has an attribute :attr:`_fields` which gives the names
50 of all child nodes.
51
52 Each instance of a concrete class has one attribute for each child node,
53 of the type as defined in the grammar. For example, :class:`ast.BinOp`
54 instances have an attribute :attr:`left` of type :class:`ast.expr`.
55
56 If these attributes are marked as optional in the grammar (using a
57 question mark), the value might be ``None``. If the attributes can have
58 zero-or-more values (marked with an asterisk), the values are represented
59 as Python lists. All possible attributes must be present and have valid
60 values when compiling an AST with :func:`compile`.
61
62 .. attribute:: lineno
63 col_offset
Ivan Levkivskyi9932a222019-01-22 11:18:22 +000064 end_lineno
65 end_col_offset
Georg Brandl0c77a822008-06-10 16:37:50 +000066
67 Instances of :class:`ast.expr` and :class:`ast.stmt` subclasses have
Ivan Levkivskyi9932a222019-01-22 11:18:22 +000068 :attr:`lineno`, :attr:`col_offset`, :attr:`lineno`, and :attr:`col_offset`
69 attributes. The :attr:`lineno` and :attr:`end_lineno` are the first and
70 last line numbers of source text span (1-indexed so the first line is line 1)
71 and the :attr:`col_offset` and :attr:`end_col_offset` are the corresponding
72 UTF-8 byte offsets of the first and last tokens that generated the node.
73 The UTF-8 offset is recorded because the parser uses UTF-8 internally.
74
75 Note that the end positions are not required by the compiler and are
76 therefore optional. The end offset is *after* the last symbol, for example
77 one can get the source segment of a one-line expression node using
78 ``source_line[node.col_offset : node.end_col_offset]``.
Georg Brandl0c77a822008-06-10 16:37:50 +000079
80 The constructor of a class :class:`ast.T` parses its arguments as follows:
81
82 * If there are positional arguments, there must be as many as there are items
83 in :attr:`T._fields`; they will be assigned as attributes of these names.
84 * If there are keyword arguments, they will set the attributes of the same
85 names to the given values.
86
87 For example, to create and populate an :class:`ast.UnaryOp` node, you could
88 use ::
89
90 node = ast.UnaryOp()
91 node.op = ast.USub()
Serhiy Storchaka3f228112018-09-27 17:42:37 +030092 node.operand = ast.Constant()
93 node.operand.value = 5
Georg Brandl0c77a822008-06-10 16:37:50 +000094 node.operand.lineno = 0
95 node.operand.col_offset = 0
96 node.lineno = 0
97 node.col_offset = 0
98
99 or the more compact ::
100
Serhiy Storchaka3f228112018-09-27 17:42:37 +0300101 node = ast.UnaryOp(ast.USub(), ast.Constant(5, lineno=0, col_offset=0),
Georg Brandl0c77a822008-06-10 16:37:50 +0000102 lineno=0, col_offset=0)
103
Serhiy Storchaka3f228112018-09-27 17:42:37 +0300104.. deprecated:: 3.8
105
106 Class :class:`ast.Constant` is now used for all constants. Old classes
107 :class:`ast.Num`, :class:`ast.Str`, :class:`ast.Bytes`,
108 :class:`ast.NameConstant` and :class:`ast.Ellipsis` are still available,
109 but they will be removed in future Python releases.
110
Georg Brandl0c77a822008-06-10 16:37:50 +0000111
112.. _abstract-grammar:
113
114Abstract Grammar
115----------------
116
Georg Brandl0c77a822008-06-10 16:37:50 +0000117The abstract grammar is currently defined as follows:
118
119.. literalinclude:: ../../Parser/Python.asdl
Martin Panter1050d2d2016-07-26 11:18:21 +0200120 :language: none
Georg Brandl0c77a822008-06-10 16:37:50 +0000121
122
123:mod:`ast` Helpers
124------------------
125
Martin Panter2e4571a2015-11-14 01:07:43 +0000126Apart from the node classes, the :mod:`ast` module defines these utility functions
Georg Brandl0c77a822008-06-10 16:37:50 +0000127and classes for traversing abstract syntax trees:
128
Guido van Rossum10b55c12019-06-11 17:23:12 -0700129.. function:: parse(source, filename='<unknown>', mode='exec', *, type_comments=False, feature_version=None)
Georg Brandl0c77a822008-06-10 16:37:50 +0000130
Terry Reedyfeac6242011-01-24 21:36:03 +0000131 Parse the source into an AST node. Equivalent to ``compile(source,
Benjamin Petersonec9199b2008-11-08 17:05:00 +0000132 filename, mode, ast.PyCF_ONLY_AST)``.
Georg Brandl0c77a822008-06-10 16:37:50 +0000133
Guido van Rossumdcfcd142019-01-31 03:40:27 -0800134 If ``type_comments=True`` is given, the parser is modified to check
135 and return type comments as specified by :pep:`484` and :pep:`526`.
136 This is equivalent to adding :data:`ast.PyCF_TYPE_COMMENTS` to the
137 flags passed to :func:`compile()`. This will report syntax errors
138 for misplaced type comments. Without this flag, type comments will
139 be ignored, and the ``type_comment`` field on selected AST nodes
140 will always be ``None``. In addition, the locations of ``# type:
141 ignore`` comments will be returned as the ``type_ignores``
142 attribute of :class:`Module` (otherwise it is always an empty list).
143
144 In addition, if ``mode`` is ``'func_type'``, the input syntax is
145 modified to correspond to :pep:`484` "signature type comments",
146 e.g. ``(str, int) -> List[str]``.
147
Guido van Rossum10b55c12019-06-11 17:23:12 -0700148 Also, setting ``feature_version`` to a tuple ``(major, minor)``
149 will attempt to parse using that Python version's grammar.
150 Currently ``major`` must equal to ``3``. For example, setting
151 ``feature_version=(3, 4)`` will allow the use of ``async`` and
152 ``await`` as variable names. The lowest supported version is
153 ``(3, 4)``; the highest is ``sys.version_info[0:2]``.
Guido van Rossum495da292019-03-07 12:38:08 -0800154
Brett Cannon7a7f1002018-03-09 12:03:22 -0800155 .. warning::
156 It is possible to crash the Python interpreter with a
157 sufficiently large/complex string due to stack depth limitations
158 in Python's AST compiler.
159
Guido van Rossumdcfcd142019-01-31 03:40:27 -0800160 .. versionchanged:: 3.8
Guido van Rossum495da292019-03-07 12:38:08 -0800161 Added ``type_comments``, ``mode='func_type'`` and ``feature_version``.
Guido van Rossumdcfcd142019-01-31 03:40:27 -0800162
Georg Brandl48310cd2009-01-03 21:18:54 +0000163
Georg Brandl0c77a822008-06-10 16:37:50 +0000164.. function:: literal_eval(node_or_string)
165
Georg Brandlb9b389e2014-11-05 20:20:28 +0100166 Safely evaluate an expression node or a string containing a Python literal or
167 container display. The string or node provided may only consist of the
168 following Python literal structures: strings, bytes, numbers, tuples, lists,
169 dicts, sets, booleans, and ``None``.
Georg Brandl0c77a822008-06-10 16:37:50 +0000170
Georg Brandlb9b389e2014-11-05 20:20:28 +0100171 This can be used for safely evaluating strings containing Python values from
172 untrusted sources without the need to parse the values oneself. It is not
173 capable of evaluating arbitrarily complex expressions, for example involving
174 operators or indexing.
Georg Brandl0c77a822008-06-10 16:37:50 +0000175
Brett Cannon7a7f1002018-03-09 12:03:22 -0800176 .. warning::
177 It is possible to crash the Python interpreter with a
178 sufficiently large/complex string due to stack depth limitations
179 in Python's AST compiler.
180
Georg Brandl492f3fc2010-07-11 09:41:21 +0000181 .. versionchanged:: 3.2
Georg Brandl85f21772010-07-13 06:38:10 +0000182 Now allows bytes and set literals.
Georg Brandl492f3fc2010-07-11 09:41:21 +0000183
Georg Brandl0c77a822008-06-10 16:37:50 +0000184
Amaury Forgeot d'Arcfdfe62d2008-06-17 20:36:03 +0000185.. function:: get_docstring(node, clean=True)
Georg Brandl0c77a822008-06-10 16:37:50 +0000186
187 Return the docstring of the given *node* (which must be a
INADA Naokicb41b272017-02-23 00:31:59 +0900188 :class:`FunctionDef`, :class:`AsyncFunctionDef`, :class:`ClassDef`,
189 or :class:`Module` node), or ``None`` if it has no docstring.
190 If *clean* is true, clean up the docstring's indentation with
191 :func:`inspect.cleandoc`.
192
193 .. versionchanged:: 3.5
194 :class:`AsyncFunctionDef` is now supported.
195
Georg Brandl0c77a822008-06-10 16:37:50 +0000196
Ivan Levkivskyi9932a222019-01-22 11:18:22 +0000197.. function:: get_source_segment(source, node, *, padded=False)
198
199 Get source code segment of the *source* that generated *node*.
200 If some location information (:attr:`lineno`, :attr:`end_lineno`,
201 :attr:`col_offset`, or :attr:`end_col_offset`) is missing, return ``None``.
202
203 If *padded* is ``True``, the first line of a multi-line statement will
204 be padded with spaces to match its original position.
205
206 .. versionadded:: 3.8
207
208
Georg Brandl0c77a822008-06-10 16:37:50 +0000209.. function:: fix_missing_locations(node)
210
211 When you compile a node tree with :func:`compile`, the compiler expects
212 :attr:`lineno` and :attr:`col_offset` attributes for every node that supports
213 them. This is rather tedious to fill in for generated nodes, so this helper
214 adds these attributes recursively where not already set, by setting them to
215 the values of the parent node. It works recursively starting at *node*.
216
217
218.. function:: increment_lineno(node, n=1)
219
Ivan Levkivskyi9932a222019-01-22 11:18:22 +0000220 Increment the line number and end line number of each node in the tree
221 starting at *node* by *n*. This is useful to "move code" to a different
222 location in a file.
Georg Brandl0c77a822008-06-10 16:37:50 +0000223
224
225.. function:: copy_location(new_node, old_node)
226
Ivan Levkivskyi9932a222019-01-22 11:18:22 +0000227 Copy source location (:attr:`lineno`, :attr:`col_offset`, :attr:`end_lineno`,
228 and :attr:`end_col_offset`) from *old_node* to *new_node* if possible,
229 and return *new_node*.
Georg Brandl0c77a822008-06-10 16:37:50 +0000230
231
232.. function:: iter_fields(node)
233
234 Yield a tuple of ``(fieldname, value)`` for each field in ``node._fields``
235 that is present on *node*.
236
237
238.. function:: iter_child_nodes(node)
239
240 Yield all direct child nodes of *node*, that is, all fields that are nodes
241 and all items of fields that are lists of nodes.
242
243
244.. function:: walk(node)
245
Georg Brandl619e7ba2011-01-09 07:38:51 +0000246 Recursively yield all descendant nodes in the tree starting at *node*
247 (including *node* itself), in no specified order. This is useful if you only
248 want to modify nodes in place and don't care about the context.
Georg Brandl0c77a822008-06-10 16:37:50 +0000249
250
251.. class:: NodeVisitor()
252
253 A node visitor base class that walks the abstract syntax tree and calls a
254 visitor function for every node found. This function may return a value
Georg Brandl36ab1ef2009-01-03 21:17:04 +0000255 which is forwarded by the :meth:`visit` method.
Georg Brandl0c77a822008-06-10 16:37:50 +0000256
257 This class is meant to be subclassed, with the subclass adding visitor
258 methods.
259
260 .. method:: visit(node)
261
262 Visit a node. The default implementation calls the method called
263 :samp:`self.visit_{classname}` where *classname* is the name of the node
264 class, or :meth:`generic_visit` if that method doesn't exist.
265
266 .. method:: generic_visit(node)
267
268 This visitor calls :meth:`visit` on all children of the node.
Georg Brandl48310cd2009-01-03 21:18:54 +0000269
Georg Brandl0c77a822008-06-10 16:37:50 +0000270 Note that child nodes of nodes that have a custom visitor method won't be
271 visited unless the visitor calls :meth:`generic_visit` or visits them
272 itself.
273
274 Don't use the :class:`NodeVisitor` if you want to apply changes to nodes
275 during traversal. For this a special visitor exists
276 (:class:`NodeTransformer`) that allows modifications.
277
278
279.. class:: NodeTransformer()
280
281 A :class:`NodeVisitor` subclass that walks the abstract syntax tree and
282 allows modification of nodes.
283
Georg Brandl36ab1ef2009-01-03 21:17:04 +0000284 The :class:`NodeTransformer` will walk the AST and use the return value of
285 the visitor methods to replace or remove the old node. If the return value
286 of the visitor method is ``None``, the node will be removed from its
287 location, otherwise it is replaced with the return value. The return value
288 may be the original node in which case no replacement takes place.
Georg Brandl0c77a822008-06-10 16:37:50 +0000289
290 Here is an example transformer that rewrites all occurrences of name lookups
291 (``foo``) to ``data['foo']``::
292
293 class RewriteName(NodeTransformer):
294
295 def visit_Name(self, node):
296 return copy_location(Subscript(
297 value=Name(id='data', ctx=Load()),
Serhiy Storchaka3f228112018-09-27 17:42:37 +0300298 slice=Index(value=Constant(value=node.id)),
Georg Brandl0c77a822008-06-10 16:37:50 +0000299 ctx=node.ctx
300 ), node)
301
302 Keep in mind that if the node you're operating on has child nodes you must
303 either transform the child nodes yourself or call the :meth:`generic_visit`
304 method for the node first.
305
306 For nodes that were part of a collection of statements (that applies to all
307 statement nodes), the visitor may also return a list of nodes rather than
308 just a single node.
309
310 Usually you use the transformer like this::
311
312 node = YourTransformer().visit(node)
313
314
315.. function:: dump(node, annotate_fields=True, include_attributes=False)
316
317 Return a formatted dump of the tree in *node*. This is mainly useful for
318 debugging purposes. The returned string will show the names and the values
319 for fields. This makes the code impossible to evaluate, so if evaluation is
Serhiy Storchakafbc1c262013-11-29 12:17:13 +0200320 wanted *annotate_fields* must be set to ``False``. Attributes such as line
Benjamin Petersondcf97b92008-07-02 17:30:14 +0000321 numbers and column offsets are not dumped by default. If this is wanted,
Georg Brandl0c77a822008-06-10 16:37:50 +0000322 *include_attributes* can be set to ``True``.
Senthil Kumaranf3695bf2016-01-06 21:26:53 -0800323
324.. seealso::
325
Sanyam Khurana338cd832018-01-20 05:55:37 +0530326 `Green Tree Snakes <https://greentreesnakes.readthedocs.io/>`_, an external documentation resource, has good
Senthil Kumaranf3695bf2016-01-06 21:26:53 -0800327 details on working with Python ASTs.