Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 1 | r"""Utilities to compile possibly incomplete Python source code. |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 2 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 3 | This module provides two interfaces, broadly similar to the builtin |
Walter Dörwald | 4df3068 | 2003-11-20 13:38:01 +0000 | [diff] [blame] | 4 | function compile(), which take program text, a filename and a 'mode' |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 5 | and: |
Skip Montanaro | e99d5ea | 2001-01-20 19:54:20 +0000 | [diff] [blame] | 6 | |
Walter Dörwald | 4df3068 | 2003-11-20 13:38:01 +0000 | [diff] [blame] | 7 | - Return code object if the command is complete and valid |
| 8 | - Return None if the command is incomplete |
| 9 | - Raise SyntaxError, ValueError or OverflowError if the command is a |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 10 | syntax error (OverflowError and ValueError can be produced by |
| 11 | malformed literals). |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 12 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 13 | Approach: |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 14 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 15 | First, check if the source consists entirely of blank lines and |
| 16 | comments; if so, replace it with 'pass', because the built-in |
| 17 | parser doesn't always do the right thing for these. |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 18 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 19 | Compile three times: as is, with \n, and with \n\n appended. If it |
| 20 | compiles as is, it's complete. If it compiles with one \n appended, |
| 21 | we expect more. If it doesn't compile either way, we compare the |
| 22 | error we get when compiling with \n or \n\n appended. If the errors |
| 23 | are the same, the code is broken. But if the errors are different, we |
| 24 | expect more. Not intuitive; not even guaranteed to hold in future |
| 25 | releases; but this matches the compiler's behavior from Python 1.4 |
| 26 | through 2.2, at least. |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 27 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 28 | Caveat: |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 29 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 30 | It is possible (but not likely) that the parser stops parsing with a |
| 31 | successful outcome before reaching the end of the source; in this |
| 32 | case, trailing symbols may be ignored instead of causing an error. |
| 33 | For example, a backslash followed by two newlines may be followed by |
| 34 | arbitrary garbage. This will be fixed once the API for the parser is |
| 35 | better. |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 36 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 37 | The two interfaces are: |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 38 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 39 | compile_command(source, filename, symbol): |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 40 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 41 | Compiles a single command in the manner described above. |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 42 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 43 | CommandCompiler(): |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 44 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 45 | Instances of this class have __call__ methods identical in |
| 46 | signature to compile_command; the difference is that if the |
| 47 | instance compiles program text containing a __future__ statement, |
| 48 | the instance 'remembers' and compiles all subsequent program texts |
| 49 | with the statement in force. |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 50 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 51 | The module also provides another class: |
| 52 | |
| 53 | Compile(): |
| 54 | |
| 55 | Instances of this class act like the built-in function compile, |
| 56 | but with 'memory' in the sense described above. |
| 57 | """ |
| 58 | |
| 59 | import __future__ |
| 60 | |
| 61 | _features = [getattr(__future__, fname) |
| 62 | for fname in __future__.all_feature_names] |
| 63 | |
| 64 | __all__ = ["compile_command", "Compile", "CommandCompiler"] |
| 65 | |
Guido van Rossum | 4b499dd3 | 2003-02-13 22:07:59 +0000 | [diff] [blame] | 66 | PyCF_DONT_IMPLY_DEDENT = 0x200 # Matches pythonrun.h |
| 67 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 68 | def _maybe_compile(compiler, source, filename, symbol): |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 69 | # Check for source consisting of only blank lines and comments |
Eric S. Raymond | 6b71e74 | 2001-02-09 08:56:30 +0000 | [diff] [blame] | 70 | for line in source.split("\n"): |
| 71 | line = line.strip() |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 72 | if line and line[0] != '#': |
| 73 | break # Leave it alone |
| 74 | else: |
Guido van Rossum | 993bc3a | 2003-05-16 01:24:30 +0000 | [diff] [blame] | 75 | if symbol != "eval": |
| 76 | source = "pass" # Replace it with a 'pass' statement |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 77 | |
| 78 | err = err1 = err2 = None |
| 79 | code = code1 = code2 = None |
| 80 | |
| 81 | try: |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 82 | code = compiler(source, filename, symbol) |
Guido van Rossum | b940e11 | 2007-01-10 16:19:56 +0000 | [diff] [blame] | 83 | except SyntaxError as err: |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 84 | pass |
| 85 | |
| 86 | try: |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 87 | code1 = compiler(source + "\n", filename, symbol) |
Guido van Rossum | b940e11 | 2007-01-10 16:19:56 +0000 | [diff] [blame] | 88 | except SyntaxError as e: |
| 89 | err1 = e |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 90 | |
| 91 | try: |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 92 | code2 = compiler(source + "\n\n", filename, symbol) |
Guido van Rossum | b940e11 | 2007-01-10 16:19:56 +0000 | [diff] [blame] | 93 | except SyntaxError as e: |
| 94 | err2 = e |
Guido van Rossum | c41c1a9 | 1998-10-22 21:56:15 +0000 | [diff] [blame] | 95 | |
| 96 | if code: |
| 97 | return code |
Thomas Wouters | 477c8d5 | 2006-05-27 19:21:47 +0000 | [diff] [blame] | 98 | if not code1 and repr(err1) == repr(err2): |
Benjamin Peterson | 1580ece | 2009-10-18 00:34:08 +0000 | [diff] [blame] | 99 | raise err1 |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 100 | |
Guido van Rossum | 4b499dd3 | 2003-02-13 22:07:59 +0000 | [diff] [blame] | 101 | def _compile(source, filename, symbol): |
| 102 | return compile(source, filename, symbol, PyCF_DONT_IMPLY_DEDENT) |
| 103 | |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 104 | def compile_command(source, filename="<input>", symbol="single"): |
| 105 | r"""Compile a command and determine whether it is incomplete. |
| 106 | |
| 107 | Arguments: |
| 108 | |
| 109 | source -- the source string; may contain \n characters |
| 110 | filename -- optional filename from which source was read; default |
| 111 | "<input>" |
| 112 | symbol -- optional grammar start symbol; "single" (default) or "eval" |
| 113 | |
| 114 | Return value / exceptions raised: |
| 115 | |
| 116 | - Return a code object if the command is complete and valid |
| 117 | - Return None if the command is incomplete |
| 118 | - Raise SyntaxError, ValueError or OverflowError if the command is a |
| 119 | syntax error (OverflowError and ValueError can be produced by |
| 120 | malformed literals). |
| 121 | """ |
Guido van Rossum | 4b499dd3 | 2003-02-13 22:07:59 +0000 | [diff] [blame] | 122 | return _maybe_compile(_compile, source, filename, symbol) |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 123 | |
| 124 | class Compile: |
| 125 | """Instances of this class behave much like the built-in compile |
| 126 | function, but if one is used to compile text containing a future |
| 127 | statement, it "remembers" and compiles all subsequent program texts |
| 128 | with the statement in force.""" |
| 129 | def __init__(self): |
Guido van Rossum | 4b499dd3 | 2003-02-13 22:07:59 +0000 | [diff] [blame] | 130 | self.flags = PyCF_DONT_IMPLY_DEDENT |
Tim Peters | 6cd6a82 | 2001-08-17 22:11:27 +0000 | [diff] [blame] | 131 | |
| 132 | def __call__(self, source, filename, symbol): |
| 133 | codeob = compile(source, filename, symbol, self.flags, 1) |
| 134 | for feature in _features: |
| 135 | if codeob.co_flags & feature.compiler_flag: |
| 136 | self.flags |= feature.compiler_flag |
| 137 | return codeob |
| 138 | |
| 139 | class CommandCompiler: |
| 140 | """Instances of this class have __call__ methods identical in |
| 141 | signature to compile_command; the difference is that if the |
| 142 | instance compiles program text containing a __future__ statement, |
| 143 | the instance 'remembers' and compiles all subsequent program texts |
| 144 | with the statement in force.""" |
| 145 | |
| 146 | def __init__(self,): |
| 147 | self.compiler = Compile() |
| 148 | |
| 149 | def __call__(self, source, filename="<input>", symbol="single"): |
| 150 | r"""Compile a command and determine whether it is incomplete. |
| 151 | |
| 152 | Arguments: |
| 153 | |
| 154 | source -- the source string; may contain \n characters |
| 155 | filename -- optional filename from which source was read; |
| 156 | default "<input>" |
| 157 | symbol -- optional grammar start symbol; "single" (default) or |
| 158 | "eval" |
| 159 | |
| 160 | Return value / exceptions raised: |
| 161 | |
| 162 | - Return a code object if the command is complete and valid |
| 163 | - Return None if the command is incomplete |
| 164 | - Raise SyntaxError, ValueError or OverflowError if the command is a |
| 165 | syntax error (OverflowError and ValueError can be produced by |
| 166 | malformed literals). |
| 167 | """ |
| 168 | return _maybe_compile(self.compiler, source, filename, symbol) |