blob: ddc54d2a13d138863e6766defd0a26c2a2c546f2 [file] [log] [blame]
Mikhail Glushenkov270cae32008-05-30 06:25:24 +00001===================================
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +00002Customizing LLVMC: Reference Manual
3===================================
Mikhail Glushenkov23f522a2008-12-13 17:51:47 +00004..
5 This file was automatically generated by rst2html.
6 Please do not edit directly!
7 The ReST source lives in the directory 'tools/llvmc/doc'.
Mikhail Glushenkovd5652032008-12-13 02:28:58 +00008
9.. contents::
10
11.. raw:: html
12
13 <div class="doc_author">
14 <p>Written by <a href="mailto:foldr@codedgers.com">Mikhail Glushenkov</a></p>
15 </div>
16
17Introduction
18============
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000019
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +000020LLVMC is a generic compiler driver, designed to be customizable and
21extensible. It plays the same role for LLVM as the ``gcc`` program
22does for GCC - LLVMC's job is essentially to transform a set of input
23files into a set of targets depending on configuration rules and user
24options. What makes LLVMC different is that these transformation rules
25are completely customizable - in fact, LLVMC knows nothing about the
26specifics of transformation (even the command-line options are mostly
27not hard-coded) and regards the transformation structure as an
Mikhail Glushenkov83237482008-10-15 09:29:13 +000028abstract graph. The structure of this graph is completely determined
29by plugins, which can be either statically or dynamically linked. This
30makes it possible to easily adapt LLVMC for other purposes - for
31example, as a build tool for game resources.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000032
Mikhail Glushenkovd5652032008-12-13 02:28:58 +000033Because LLVMC employs TableGen_ as its configuration language, you
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +000034need to be familiar with it to customize LLVMC.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000035
Mikhail Glushenkov4aecec12009-06-17 02:56:08 +000036.. _TableGen: http://llvm.org/docs/TableGenFundamentals.html
Mikhail Glushenkov270cae32008-05-30 06:25:24 +000037
38
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +000039Compiling with LLVMC
Mikhail Glushenkov270cae32008-05-30 06:25:24 +000040====================
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000041
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +000042LLVMC tries hard to be as compatible with ``gcc`` as possible,
43although there are some small differences. Most of the time, however,
44you shouldn't be able to notice them::
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000045
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +000046 $ # This works as expected:
Mikhail Glushenkov113ec352008-11-25 21:38:12 +000047 $ llvmc -O3 -Wall hello.cpp
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +000048 $ ./a.out
49 hello
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000050
Mikhail Glushenkov4aecec12009-06-17 02:56:08 +000051One nice feature of LLVMC is that one doesn't have to distinguish between
52different compilers for different languages (think ``g++`` vs. ``gcc``) - the
53right toolchain is chosen automatically based on input language names (which
54are, in turn, determined from file extensions). If you want to force files
55ending with ".c" to compile as C++, use the ``-x`` option, just like you would
56do it with ``gcc``::
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000057
Mikhail Glushenkovebdeca72008-11-25 21:34:29 +000058 $ # hello.c is really a C++ file
Mikhail Glushenkov113ec352008-11-25 21:38:12 +000059 $ llvmc -x c++ hello.c
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +000060 $ ./a.out
61 hello
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000062
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +000063On the other hand, when using LLVMC as a linker to combine several C++
64object files you should provide the ``--linker`` option since it's
65impossible for LLVMC to choose the right linker in that case::
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000066
Mikhail Glushenkov113ec352008-11-25 21:38:12 +000067 $ llvmc -c hello.cpp
68 $ llvmc hello.o
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +000069 [A lot of link-time errors skipped]
Mikhail Glushenkov113ec352008-11-25 21:38:12 +000070 $ llvmc --linker=c++ hello.o
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +000071 $ ./a.out
72 hello
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +000073
Mikhail Glushenkov4410e322008-12-07 16:47:42 +000074By default, LLVMC uses ``llvm-gcc`` to compile the source code. It is
75also possible to choose the work-in-progress ``clang`` compiler with
76the ``-clang`` option.
77
Mikhail Glushenkov83237482008-10-15 09:29:13 +000078
Mikhail Glushenkov270cae32008-05-30 06:25:24 +000079Predefined options
80==================
81
82LLVMC has some built-in options that can't be overridden in the
Mikhail Glushenkov7e6d70a2008-11-26 22:59:45 +000083configuration libraries:
Mikhail Glushenkov270cae32008-05-30 06:25:24 +000084
85* ``-o FILE`` - Output file name.
86
87* ``-x LANGUAGE`` - Specify the language of the following input files
88 until the next -x option.
89
Mikhail Glushenkov83237482008-10-15 09:29:13 +000090* ``-load PLUGIN_NAME`` - Load the specified plugin DLL. Example:
91 ``-load $LLVM_DIR/Release/lib/LLVMCSimple.so``.
92
Mikhail Glushenkov270cae32008-05-30 06:25:24 +000093* ``-v`` - Enable verbose mode, i.e. print out all executed commands.
94
Mikhail Glushenkov294f5072009-06-25 18:20:44 +000095* ``--save-temps`` - Write temporary files to the current directory and do not
96 delete them on exit. This option can also take an argument: the
97 ``--save-temps=obj`` switch will write files into the directory specified with
98 the ``-o`` option. The ``--save-temps=cwd`` and ``--save-temps`` switches are
99 both synonyms for the default behaviour.
100
Mikhail Glushenkov4ad0d572009-03-27 12:58:29 +0000101* ``--check-graph`` - Check the compilation for common errors like mismatched
102 output/input language names, multiple default edges and cycles. Because of
Mikhail Glushenkov530f3992009-06-16 00:13:52 +0000103 plugins, these checks can't be performed at compile-time. Exit with code zero
104 if no errors were found, and return the number of found errors
105 otherwise. Hidden option, useful for debugging LLVMC plugins.
Mikhail Glushenkovf8c430b2009-01-09 16:16:27 +0000106
Mikhail Glushenkov4ad0d572009-03-27 12:58:29 +0000107* ``--view-graph`` - Show a graphical representation of the compilation graph
108 and exit. Requires that you have ``dot`` and ``gv`` programs installed. Hidden
109 option, useful for debugging LLVMC plugins.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000110
Mikhail Glushenkov4ad0d572009-03-27 12:58:29 +0000111* ``--write-graph`` - Write a ``compilation-graph.dot`` file in the current
112 directory with the compilation graph description in Graphviz format (identical
Mikhail Glushenkov530f3992009-06-16 00:13:52 +0000113 to the file used by the ``--view-graph`` option). The ``-o`` option can be
114 used to set the output file name. Hidden option, useful for debugging LLVMC
115 plugins.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000116
Mikhail Glushenkov73296102008-05-30 06:29:17 +0000117* ``--help``, ``--help-hidden``, ``--version`` - These options have
118 their standard meaning.
119
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000120Compiling LLVMC plugins
121=======================
122
123It's easiest to start working on your own LLVMC plugin by copying the
124skeleton project which lives under ``$LLVMC_DIR/plugins/Simple``::
125
126 $ cd $LLVMC_DIR/plugins
127 $ cp -r Simple MyPlugin
128 $ cd MyPlugin
129 $ ls
130 Makefile PluginMain.cpp Simple.td
131
132As you can see, our basic plugin consists of only two files (not
133counting the build script). ``Simple.td`` contains TableGen
134description of the compilation graph; its format is documented in the
135following sections. ``PluginMain.cpp`` is just a helper file used to
136compile the auto-generated C++ code produced from TableGen source. It
137can also contain hook definitions (see `below`__).
138
139__ hooks_
140
141The first thing that you should do is to change the ``LLVMC_PLUGIN``
142variable in the ``Makefile`` to avoid conflicts (since this variable
143is used to name the resulting library)::
144
145 LLVMC_PLUGIN=MyPlugin
146
147It is also a good idea to rename ``Simple.td`` to something less
148generic::
149
150 $ mv Simple.td MyPlugin.td
151
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000152To build your plugin as a dynamic library, just ``cd`` to its source
153directory and run ``make``. The resulting file will be called
Mikhail Glushenkov4aecec12009-06-17 02:56:08 +0000154``plugin_llvmc_$(LLVMC_PLUGIN).$(DLL_EXTENSION)`` (in our case,
155``plugin_llvmc_MyPlugin.so``). This library can be then loaded in with the
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000156``-load`` option. Example::
157
158 $ cd $LLVMC_DIR/plugins/Simple
159 $ make
Mikhail Glushenkov4aecec12009-06-17 02:56:08 +0000160 $ llvmc -load $LLVM_DIR/Release/lib/plugin_llvmc_Simple.so
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000161
Mikhail Glushenkov530f3992009-06-16 00:13:52 +0000162Compiling standalone LLVMC-based drivers
163========================================
164
165By default, the ``llvmc`` executable consists of a driver core plus several
166statically linked plugins (``Base`` and ``Clang`` at the moment). You can
167produce a standalone LLVMC-based driver executable by linking the core with your
168own plugins. The recommended way to do this is by starting with the provided
169``Skeleton`` example (``$LLVMC_DIR/example/Skeleton``)::
170
171 $ cd $LLVMC_DIR/example/
172 $ cp -r Skeleton mydriver
173 $ cd mydriver
174 $ vim Makefile
175 [...]
176 $ make
177
178If you're compiling LLVM with different source and object directories, then you
179must perform the following additional steps before running ``make``::
180
181 # LLVMC_SRC_DIR = $LLVM_SRC_DIR/tools/llvmc/
182 # LLVMC_OBJ_DIR = $LLVM_OBJ_DIR/tools/llvmc/
183 $ cp $LLVMC_SRC_DIR/example/mydriver/Makefile \
184 $LLVMC_OBJ_DIR/example/mydriver/
185 $ cd $LLVMC_OBJ_DIR/example/mydriver
186 $ make
187
188Another way to do the same thing is by using the following command::
189
190 $ cd $LLVMC_DIR
191 $ make LLVMC_BUILTIN_PLUGINS=MyPlugin LLVMC_BASED_DRIVER_NAME=mydriver
192
Mikhail Glushenkov4aecec12009-06-17 02:56:08 +0000193This works with both srcdir == objdir and srcdir != objdir, but assumes that the
Mikhail Glushenkov530f3992009-06-16 00:13:52 +0000194plugin source directory was placed under ``$LLVMC_DIR/plugins``.
195
Mikhail Glushenkovf80f0aa2008-11-25 21:34:01 +0000196Sometimes, you will want a 'bare-bones' version of LLVMC that has no
197built-in plugins. It can be compiled with the following command::
198
199 $ cd $LLVMC_DIR
Mikhail Glushenkov530f3992009-06-16 00:13:52 +0000200 $ make LLVMC_BUILTIN_PLUGINS=""
Mikhail Glushenkovf80f0aa2008-11-25 21:34:01 +0000201
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000202
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000203Customizing LLVMC: the compilation graph
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000204========================================
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000205
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000206Each TableGen configuration file should include the common
207definitions::
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000208
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000209 include "llvm/CompilerDriver/Common.td"
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +0000210
211Internally, LLVMC stores information about possible source
212transformations in form of a graph. Nodes in this graph represent
213tools, and edges between two nodes represent a transformation path. A
214special "root" node is used to mark entry points for the
215transformations. LLVMC also assigns a weight to each edge (more on
216this later) to choose between several alternative edges.
217
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000218The definition of the compilation graph (see file
219``plugins/Base/Base.td`` for an example) is just a list of edges::
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000220
221 def CompilationGraph : CompilationGraph<[
Mikhail Glushenkov01088772008-11-17 17:29:18 +0000222 Edge<"root", "llvm_gcc_c">,
223 Edge<"root", "llvm_gcc_assembler">,
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000224 ...
225
Mikhail Glushenkov01088772008-11-17 17:29:18 +0000226 Edge<"llvm_gcc_c", "llc">,
227 Edge<"llvm_gcc_cpp", "llc">,
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000228 ...
229
Mikhail Glushenkov536637f2008-11-25 21:34:53 +0000230 OptionalEdge<"llvm_gcc_c", "opt", (case (switch_on "opt"),
231 (inc_weight))>,
232 OptionalEdge<"llvm_gcc_cpp", "opt", (case (switch_on "opt"),
233 (inc_weight))>,
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000234 ...
235
Mikhail Glushenkov01088772008-11-17 17:29:18 +0000236 OptionalEdge<"llvm_gcc_assembler", "llvm_gcc_cpp_linker",
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +0000237 (case (input_languages_contain "c++"), (inc_weight),
238 (or (parameter_equals "linker", "g++"),
239 (parameter_equals "linker", "c++")), (inc_weight))>,
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000240 ...
241
242 ]>;
243
244As you can see, the edges can be either default or optional, where
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000245optional edges are differentiated by an additional ``case`` expression
Mikhail Glushenkov01088772008-11-17 17:29:18 +0000246used to calculate the weight of this edge. Notice also that we refer
Mikhail Glushenkovf80f0aa2008-11-25 21:34:01 +0000247to tools via their names (as strings). This makes it possible to add
248edges to an existing compilation graph in plugins without having to
249know about all tool definitions used in the graph.
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000250
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +0000251The default edges are assigned a weight of 1, and optional edges get a
252weight of 0 + 2*N where N is the number of tests that evaluated to
253true in the ``case`` expression. It is also possible to provide an
254integer parameter to ``inc_weight`` and ``dec_weight`` - in this case,
255the weight is increased (or decreased) by the provided value instead
Mikhail Glushenkov7e6d70a2008-11-26 22:59:45 +0000256of the default 2. It is also possible to change the default weight of
257an optional edge by using the ``default`` clause of the ``case``
258construct.
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +0000259
260When passing an input file through the graph, LLVMC picks the edge
261with the maximum weight. To avoid ambiguity, there should be only one
262default edge between two nodes (with the exception of the root node,
263which gets a special treatment - there you are allowed to specify one
264default edge *per language*).
265
Mikhail Glushenkov7e6d70a2008-11-26 22:59:45 +0000266When multiple plugins are loaded, their compilation graphs are merged
Mikhail Glushenkov3321b0f2008-11-28 00:12:09 +0000267together. Since multiple edges that have the same end nodes are not
268allowed (i.e. the graph is not a multigraph), an edge defined in
Mikhail Glushenkov7e6d70a2008-11-26 22:59:45 +0000269several plugins will be replaced by the definition from the plugin
270that was loaded last. Plugin load order can be controlled by using the
271plugin priority feature described above.
272
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +0000273To get a visual representation of the compilation graph (useful for
Mikhail Glushenkov113ec352008-11-25 21:38:12 +0000274debugging), run ``llvmc --view-graph``. You will need ``dot`` and
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +0000275``gsview`` installed for this to work properly.
276
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000277Describing options
278==================
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +0000279
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000280Command-line options that the plugin supports are defined by using an
281``OptionList``::
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000282
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000283 def Options : OptionList<[
284 (switch_option "E", (help "Help string")),
285 (alias_option "quiet", "q")
286 ...
287 ]>;
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000288
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000289As you can see, the option list is just a list of DAGs, where each DAG
290is an option description consisting of the option name and some
291properties. A plugin can define more than one option list (they are
292all merged together in the end), which can be handy if one wants to
293separate option groups syntactically.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000294
295* Possible option types:
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000296
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000297 - ``switch_option`` - a simple boolean switch without arguments, for example
298 ``-O2`` or ``-time``. At most one occurrence is allowed.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000299
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000300 - ``parameter_option`` - option that takes one argument, for example
301 ``-std=c99``. It is also allowed to use spaces instead of the equality
302 sign: ``-std c99``. At most one occurrence is allowed.
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000303
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000304 - ``parameter_list_option`` - same as the above, but more than one option
305 occurence is allowed.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000306
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000307 - ``prefix_option`` - same as the parameter_option, but the option name and
308 argument do not have to be separated. Example: ``-ofile``. This can be also
309 specified as ``-o file``; however, ``-o=file`` will be parsed incorrectly
310 (``=file`` will be interpreted as option value). At most one occurrence is
311 allowed.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000312
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000313 - ``prefix_list_option`` - same as the above, but more than one occurence of
314 the option is allowed; example: ``-lm -lpthread``.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000315
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000316 - ``alias_option`` - a special option type for creating aliases. Unlike other
317 option types, aliases are not allowed to have any properties besides the
318 aliased option name. Usage example: ``(alias_option "preprocess", "E")``
Mikhail Glushenkov0ab8ac32008-05-30 06:28:00 +0000319
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000320
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000321* Possible option properties:
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000322
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000323 - ``help`` - help string associated with this option. Used for ``--help``
324 output.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000325
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000326 - ``required`` - this option must be specified exactly once (or, in case of
327 the list options without the ``multi_val`` property, at least
328 once). Incompatible with ``zero_or_one`` and ``one_or_more``.
329
330 - ``one_or_more`` - the option must be specified at least one time. Useful
331 only for list options in conjunction with ``multi_val``; for ordinary lists
332 it is synonymous with ``required``. Incompatible with ``required`` and
333 ``zero_or_one``.
334
335 - ``zero_or_one`` - the option can be specified zero or one times. Useful
336 only for list options in conjunction with ``multi_val``. Incompatible with
337 ``required`` and ``one_or_more``.
Mikhail Glushenkov77ddce92008-05-06 18:17:19 +0000338
Mikhail Glushenkovf9b1d792009-01-15 02:42:40 +0000339 - ``hidden`` - the description of this option will not appear in
340 the ``--help`` output (but will appear in the ``--help-hidden``
341 output).
Mikhail Glushenkov739c7202008-11-28 00:13:25 +0000342
Mikhail Glushenkovf9b1d792009-01-15 02:42:40 +0000343 - ``really_hidden`` - the option will not be mentioned in any help
Mikhail Glushenkov739c7202008-11-28 00:13:25 +0000344 output.
345
Mikhail Glushenkov19d3e822009-01-28 03:47:20 +0000346 - ``multi_val n`` - this option takes *n* arguments (can be useful in some
347 special cases). Usage example: ``(parameter_list_option "foo", (multi_val
348 3))``. Only list options can have this attribute; you can, however, use
349 the ``one_or_more`` and ``zero_or_one`` properties.
350
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000351 - ``extern`` - this option is defined in some other plugin, see below.
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000352
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000353External options
354----------------
Mikhail Glushenkov0ab8ac32008-05-30 06:28:00 +0000355
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000356Sometimes, when linking several plugins together, one plugin needs to
357access options defined in some other plugin. Because of the way
Mikhail Glushenkovf9b1d792009-01-15 02:42:40 +0000358options are implemented, such options must be marked as
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000359``extern``. This is what the ``extern`` option property is
360for. Example::
Mikhail Glushenkov0ab8ac32008-05-30 06:28:00 +0000361
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000362 ...
363 (switch_option "E", (extern))
364 ...
Mikhail Glushenkov0ab8ac32008-05-30 06:28:00 +0000365
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000366See also the section on plugin `priorities`__.
Mikhail Glushenkov0ab8ac32008-05-30 06:28:00 +0000367
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000368__ priorities_
Mikhail Glushenkov0ab8ac32008-05-30 06:28:00 +0000369
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000370.. _case:
Mikhail Glushenkov83237482008-10-15 09:29:13 +0000371
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000372Conditional evaluation
373======================
Mikhail Glushenkov0ab8ac32008-05-30 06:28:00 +0000374
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000375The 'case' construct is the main means by which programmability is
376achieved in LLVMC. It can be used to calculate edge weights, program
377actions and modify the shell commands to be executed. The 'case'
378expression is designed after the similarly-named construct in
379functional languages and takes the form ``(case (test_1), statement_1,
380(test_2), statement_2, ... (test_N), statement_N)``. The statements
381are evaluated only if the corresponding tests evaluate to true.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000382
383Examples::
384
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000385 // Edge weight calculation
386
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000387 // Increases edge weight by 5 if "-A" is provided on the
388 // command-line, and by 5 more if "-B" is also provided.
389 (case
390 (switch_on "A"), (inc_weight 5),
391 (switch_on "B"), (inc_weight 5))
392
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000393
394 // Tool command line specification
395
396 // Evaluates to "cmdline1" if the option "-A" is provided on the
397 // command line; to "cmdline2" if "-B" is provided;
398 // otherwise to "cmdline3".
399
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000400 (case
401 (switch_on "A"), "cmdline1",
402 (switch_on "B"), "cmdline2",
403 (default), "cmdline3")
404
405Note the slight difference in 'case' expression handling in contexts
406of edge weights and command line specification - in the second example
407the value of the ``"B"`` switch is never checked when switch ``"A"`` is
408enabled, and the whole expression always evaluates to ``"cmdline1"`` in
409that case.
410
411Case expressions can also be nested, i.e. the following is legal::
412
413 (case (switch_on "E"), (case (switch_on "o"), ..., (default), ...)
414 (default), ...)
415
416You should, however, try to avoid doing that because it hurts
417readability. It is usually better to split tool descriptions and/or
418use TableGen inheritance instead.
419
420* Possible tests are:
421
Mikhail Glushenkov536637f2008-11-25 21:34:53 +0000422 - ``switch_on`` - Returns true if a given command-line switch is
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000423 provided by the user. Example: ``(switch_on "opt")``.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000424
425 - ``parameter_equals`` - Returns true if a command-line parameter equals
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000426 a given value.
427 Example: ``(parameter_equals "W", "all")``.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000428
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000429 - ``element_in_list`` - Returns true if a command-line parameter
430 list contains a given value.
431 Example: ``(parameter_in_list "l", "pthread")``.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000432
433 - ``input_languages_contain`` - Returns true if a given language
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000434 belongs to the current input language set.
435 Example: ``(input_languages_contain "c++")``.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000436
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000437 - ``in_language`` - Evaluates to true if the input file language
438 equals to the argument. At the moment works only with ``cmd_line``
439 and ``actions`` (on non-join nodes).
440 Example: ``(in_language "c++")``.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000441
442 - ``not_empty`` - Returns true if a given option (which should be
443 either a parameter or a parameter list) is set by the
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000444 user.
445 Example: ``(not_empty "o")``.
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000446
Mikhail Glushenkov5c2b6b22008-12-17 02:47:01 +0000447 - ``empty`` - The opposite of ``not_empty``. Equivalent to ``(not (not_empty
448 X))``. Provided for convenience.
449
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000450 - ``default`` - Always evaluates to true. Should always be the last
451 test in the ``case`` expression.
452
453 - ``and`` - A standard logical combinator that returns true iff all
454 of its arguments return true. Used like this: ``(and (test1),
455 (test2), ... (testN))``. Nesting of ``and`` and ``or`` is allowed,
456 but not encouraged.
457
458 - ``or`` - Another logical combinator that returns true only if any
459 one of its arguments returns true. Example: ``(or (test1),
460 (test2), ... (testN))``.
461
Mikhail Glushenkovcd0858e2008-05-30 06:14:42 +0000462
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000463Writing a tool description
464==========================
465
466As was said earlier, nodes in the compilation graph represent tools,
467which are described separately. A tool definition looks like this
468(taken from the ``include/llvm/CompilerDriver/Tools.td`` file)::
469
470 def llvm_gcc_cpp : Tool<[
471 (in_language "c++"),
472 (out_language "llvm-assembler"),
473 (output_suffix "bc"),
474 (cmd_line "llvm-g++ -c $INFILE -o $OUTFILE -emit-llvm"),
475 (sink)
476 ]>;
477
478This defines a new tool called ``llvm_gcc_cpp``, which is an alias for
479``llvm-g++``. As you can see, a tool definition is just a list of
480properties; most of them should be self-explanatory. The ``sink``
481property means that this tool should be passed all command-line
482options that aren't mentioned in the option list.
483
484The complete list of all currently implemented tool properties follows.
485
486* Possible tool properties:
487
488 - ``in_language`` - input language name. Can be either a string or a
489 list, in case the tool supports multiple input languages.
490
491 - ``out_language`` - output language name. Tools are not allowed to
492 have multiple output languages.
493
494 - ``output_suffix`` - output file suffix. Can also be changed
495 dynamically, see documentation on actions.
496
497 - ``cmd_line`` - the actual command used to run the tool. You can
498 use ``$INFILE`` and ``$OUTFILE`` variables, output redirection
499 with ``>``, hook invocations (``$CALL``), environment variables
500 (via ``$ENV``) and the ``case`` construct.
501
502 - ``join`` - this tool is a "join node" in the graph, i.e. it gets a
503 list of input files and joins them together. Used for linkers.
504
505 - ``sink`` - all command-line options that are not handled by other
506 tools are passed to this tool.
507
508 - ``actions`` - A single big ``case`` expression that specifies how
509 this tool reacts on command-line options (described in more detail
510 below).
511
512Actions
513-------
514
515A tool often needs to react to command-line options, and this is
516precisely what the ``actions`` property is for. The next example
517illustrates this feature::
518
519 def llvm_gcc_linker : Tool<[
520 (in_language "object-code"),
521 (out_language "executable"),
522 (output_suffix "out"),
523 (cmd_line "llvm-gcc $INFILE -o $OUTFILE"),
524 (join),
525 (actions (case (not_empty "L"), (forward "L"),
526 (not_empty "l"), (forward "l"),
527 (not_empty "dummy"),
528 [(append_cmd "-dummy1"), (append_cmd "-dummy2")])
529 ]>;
530
531The ``actions`` tool property is implemented on top of the omnipresent
532``case`` expression. It associates one or more different *actions*
533with given conditions - in the example, the actions are ``forward``,
534which forwards a given option unchanged, and ``append_cmd``, which
535appends a given string to the tool execution command. Multiple actions
536can be associated with a single condition by using a list of actions
537(used in the example to append some dummy options). The same ``case``
538construct can also be used in the ``cmd_line`` property to modify the
539tool command line.
540
541The "join" property used in the example means that this tool behaves
542like a linker.
543
544The list of all possible actions follows.
545
546* Possible actions:
547
548 - ``append_cmd`` - append a string to the tool invocation
549 command.
Mikhail Glushenkov5c2b6b22008-12-17 02:47:01 +0000550 Example: ``(case (switch_on "pthread"), (append_cmd
551 "-lpthread"))``
552
553 - ``error` - exit with error.
554 Example: ``(error "Mixing -c and -S is not allowed!")``.
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000555
556 - ``forward`` - forward an option unchanged.
557 Example: ``(forward "Wall")``.
558
559 - ``forward_as`` - Change the name of an option, but forward the
560 argument unchanged.
Mikhail Glushenkove89331b2009-05-06 01:41:19 +0000561 Example: ``(forward_as "O0", "--disable-optimization")``.
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000562
563 - ``output_suffix`` - modify the output suffix of this
564 tool.
565 Example: ``(output_suffix "i")``.
566
567 - ``stop_compilation`` - stop compilation after this tool processes
568 its input. Used without arguments.
569
570 - ``unpack_values`` - used for for splitting and forwarding
571 comma-separated lists of options, e.g. ``-Wa,-foo=bar,-baz`` is
572 converted to ``-foo=bar -baz`` and appended to the tool invocation
573 command.
574 Example: ``(unpack_values "Wa,")``.
575
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000576Language map
Mikhail Glushenkov270cae32008-05-30 06:25:24 +0000577============
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000578
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000579If you are adding support for a new language to LLVMC, you'll need to
580modify the language map, which defines mappings from file extensions
581to language names. It is used to choose the proper toolchain(s) for a
582given input file set. Language map definition looks like this::
Anton Korobeynikovac67b7e2008-03-23 08:57:20 +0000583
584 def LanguageMap : LanguageMap<
585 [LangToSuffixes<"c++", ["cc", "cp", "cxx", "cpp", "CPP", "c++", "C"]>,
586 LangToSuffixes<"c", ["c"]>,
587 ...
588 ]>;
589
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000590For example, without those definitions the following command wouldn't work::
591
592 $ llvmc hello.cpp
593 llvmc: Unknown suffix: cpp
594
595The language map entries should be added only for tools that are
596linked with the root node. Since tools are not allowed to have
597multiple output languages, for nodes "inside" the graph the input and
598output languages should match. This is enforced at compile-time.
599
600
601More advanced topics
602====================
603
604.. _hooks:
605
606Hooks and environment variables
607-------------------------------
608
609Normally, LLVMC executes programs from the system ``PATH``. Sometimes,
Mikhail Glushenkova298bb72009-01-21 13:04:00 +0000610this is not sufficient: for example, we may want to specify tool paths
611or names in the configuration file. This can be easily achieved via
612the hooks mechanism. To write your own hooks, just add their
613definitions to the ``PluginMain.cpp`` or drop a ``.cpp`` file into the
614your plugin directory. Hooks should live in the ``hooks`` namespace
Mikhail Glushenkovb6b51412009-01-21 13:04:33 +0000615and have the signature ``std::string hooks::MyHookName ([const char*
Mikhail Glushenkova298bb72009-01-21 13:04:00 +0000616Arg0 [ const char* Arg2 [, ...]]])``. They can be used from the
617``cmd_line`` tool property::
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000618
619 (cmd_line "$CALL(MyHook)/path/to/file -o $CALL(AnotherHook)")
620
Mikhail Glushenkova298bb72009-01-21 13:04:00 +0000621To pass arguments to hooks, use the following syntax::
622
623 (cmd_line "$CALL(MyHook, 'Arg1', 'Arg2', 'Arg # 3')/path/to/file -o1 -o2")
624
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000625It is also possible to use environment variables in the same manner::
626
627 (cmd_line "$ENV(VAR1)/path/to/file -o $ENV(VAR2)")
628
629To change the command line string based on user-provided options use
630the ``case`` expression (documented `above`__)::
631
632 (cmd_line
633 (case
634 (switch_on "E"),
635 "llvm-g++ -E -x c $INFILE -o $OUTFILE",
636 (default),
637 "llvm-g++ -c -x c $INFILE -o $OUTFILE -emit-llvm"))
638
639__ case_
640
641.. _priorities:
642
643How plugins are loaded
644----------------------
645
646It is possible for LLVMC plugins to depend on each other. For example,
647one can create edges between nodes defined in some other plugin. To
648make this work, however, that plugin should be loaded first. To
649achieve this, the concept of plugin priority was introduced. By
650default, every plugin has priority zero; to specify the priority
651explicitly, put the following line in your plugin's TableGen file::
652
653 def Priority : PluginPriority<$PRIORITY_VALUE>;
654 # Where PRIORITY_VALUE is some integer > 0
655
656Plugins are loaded in order of their (increasing) priority, starting
657with 0. Therefore, the plugin with the highest priority value will be
658loaded last.
659
Mikhail Glushenkov9ecd30c2008-09-22 20:48:48 +0000660Debugging
Mikhail Glushenkov4410e322008-12-07 16:47:42 +0000661---------
Mikhail Glushenkov9ecd30c2008-09-22 20:48:48 +0000662
663When writing LLVMC plugins, it can be useful to get a visual view of
664the resulting compilation graph. This can be achieved via the command
Mikhail Glushenkovd5652032008-12-13 02:28:58 +0000665line option ``--view-graph``. This command assumes that Graphviz_ and
Mikhail Glushenkov4ad0d572009-03-27 12:58:29 +0000666Ghostview_ are installed. There is also a ``--write-graph`` option that
Mikhail Glushenkovd5652032008-12-13 02:28:58 +0000667creates a Graphviz source file (``compilation-graph.dot``) in the
Mikhail Glushenkov9ecd30c2008-09-22 20:48:48 +0000668current directory.
669
Mikhail Glushenkovf9b1d792009-01-15 02:42:40 +0000670Another useful ``llvmc`` option is ``--check-graph``. It checks the
671compilation graph for common errors like mismatched output/input
672language names, multiple default edges and cycles. These checks can't
673be performed at compile-time because the plugins can load code
674dynamically. When invoked with ``--check-graph``, ``llvmc`` doesn't
675perform any compilation tasks and returns the number of encountered
676errors as its status code.
Mikhail Glushenkovf8c430b2009-01-09 16:16:27 +0000677
Mikhail Glushenkovd5652032008-12-13 02:28:58 +0000678.. _Graphviz: http://www.graphviz.org/
679.. _Ghostview: http://pages.cs.wisc.edu/~ghost/
Mikhail Glushenkov68319f82008-12-11 23:24:40 +0000680
Mikhail Glushenkov875ace52009-06-30 00:16:00 +0000681Conditioning on the executable name
682-----------------------------------
683
684For now, the executable name (the value passed to the driver in ``argv[0]``) is
685accessible only in the C++ code (i.e. hooks). Use the following code::
686
687 namespace llvmc {
688 extern const char* ProgramName;
689 }
690
691 std::string MyHook() {
692 //...
693 if (strcmp(ProgramName, "mydriver") == 0) {
694 //...
695
696 }
697
698In general, you're encouraged not to make the behaviour dependent on the
699executable file name, and use command-line switches instead. See for example how
700the ``Base`` plugin behaves when it needs to choose the correct linker options
701(think ``g++`` vs. ``gcc``).
702
Mikhail Glushenkov68319f82008-12-11 23:24:40 +0000703.. raw:: html
Mikhail Glushenkovd5652032008-12-13 02:28:58 +0000704
705 <hr />
706 <address>
707 <a href="http://jigsaw.w3.org/css-validator/check/referer">
708 <img src="http://jigsaw.w3.org/css-validator/images/vcss-blue"
709 alt="Valid CSS" /></a>
710 <a href="http://validator.w3.org/check?uri=referer">
711 <img src="http://www.w3.org/Icons/valid-xhtml10-blue"
712 alt="Valid XHTML 1.0 Transitional"/></a>
713
714 <a href="mailto:foldr@codedgers.com">Mikhail Glushenkov</a><br />
715 <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br />
716
717 Last modified: $Date: 2008-12-11 11:34:48 -0600 (Thu, 11 Dec 2008) $
718 </address>