Blame - docs/tutorial/JITTutorial1.html - platform/external/llvm

blob: 1de1a9d8061c023b3b6cec854b416c53d5ed2ab6 [file] [log] [blame]

Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	1	<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
				2	"http://www.w3.org/TR/html4/strict.dtd">
				3
				4	<html>
				5	<head>
				6	<title>LLVM Tutorial 1: A First Function</title>
				7	<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
				8	<meta name="author" content="Owen Anderson">
				9	<meta name="description"
				10	content="LLVM Tutorial 1: A First Function.">
				11	<link rel="stylesheet" href="../llvm.css" type="text/css">
				12	</head>
				13
				14	<body>
				15
				16	<div class="doc_title"> LLVM Tutorial 1: A First Function </div>
				17
				18	<div class="doc_author">
				19	<p>Written by <a href="mailto:owen@apple.com">Owen Anderson</a></p>
				20	</div>
				21
				22	<div class="doc_text">
				23
				24	<p>For starters, lets consider a relatively straightforward function that takes three integer parameters and returns an arithmetic combination of them. This is nice and simple, especially since it involves no control flow:</p>
				25
				26	<div class="doc_code">
				27	<pre>
Owen Anderson	7d69c95	2007-10-20 05:41:39 +0000	[diff] [blame]	28	int mul_add(int x, int y, int z) {
				29	return x * y + z;
				30	}
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	31	</pre>
				32	</div>
				33
				34	<p>As a preview, the LLVM IR we’re going to end up generating for this function will look like:</p>
				35
				36	<div class="doc_code">
				37	<pre>
Owen Anderson	7d69c95	2007-10-20 05:41:39 +0000	[diff] [blame]	38	define i32 @mul_add(i32 %x, i32 %y, i32 %z) {
				39	entry:
				40	%tmp = mul i32 %x, %y
				41	%tmp2 = add i32 %tmp, %z
				42	ret i32 %tmp2
				43	}
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	44	</pre>
				45	</div>
				46
Owen Anderson	0ec16ee	2007-10-20 06:12:33 +0000	[diff] [blame]	47	<p>If you're unsure what the above code says, skim through the <a href="../LangRef.html">LLVM Language Reference Manual</a> and convince yourself that the above LLVM IR is actually equivalent to the original function. Once you’re satisfied with that, let’s move on to actually generating it programmatically!</p>
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	48
				49	<p>... STUFF ABOUT HEADERS ... </p>
				50
				51	<p>Now, let’s get started on our real program. Here’s what our basic <code>main()</code> will look like:</p>
				52
				53	<div class="doc_code">
				54	<pre>
				55	using namespace llvm;
				56
				57	Module* makeLLVMModule();
				58
				59	int main(int argc, char**argv) {
				60	Module* Mod = makeLLVMModule();
				61
				62	verifyModule(*Mod, PrintMessageAction);
				63
				64	PassManager PM;
				65	PM.add(new PrintModulePass(&llvm::cout));
				66	PM.run(*Mod);
				67
				68	return 0;
				69	}
				70	</pre>
				71	</div>
				72
				73	<p>The first segment is pretty simple: it creates an LLVM “module.” In LLVM, a module represents a single unit of code that is to be processed together. A module contains things like global variables and function declarations and implementations. Here, we’ve declared a <code>makeLLVMModule()</code> function to do the real work of creating the module. Don’t worry, we’ll be looking at that one next!</p>
				74
				75	<p>The second segment runs the LLVM module verifier on our newly created module. While this probably isn’t really necessary for a simple module like this one, it’s always a good idea, especially if you’re generating LLVM IR based on some input. The verifier will print an error message if your LLVM module is malformed in any way.</p>
				76
				77	<p>Finally, we instantiate an LLVM <code>PassManager</code> and run the <code>PrintModulePass</code> on our module. LLVM uses an explicit pass infrastructure to manage optimizations and various other things. A <code>PassManager</code>, as should be obvious from its name, manages passes: it is responsible for scheduling them, invoking them, and insuring the proper disposal after we’re done with them. For this example, we’re just using a trivial pass that prints out our module in textual form.</p>
				78
Owen Anderson	0ec16ee	2007-10-20 06:12:33 +0000	[diff] [blame]	79	<p>Now onto the interesting part: creating a populating a module. Here’s the first chunk of our <code>makeLLVMModule()</code>:</p>
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	80
				81	<div class="doc_code">
				82	<pre>
				83	Module* makeLLVMModule() {
				84	// Module Construction
				85	Module* mod = new Module("test");
				86	</pre>
				87	</div>
				88
				89	<p>Exciting, isn’t it!? All we’re doing here is instantiating a module and giving it a name. The name isn’t particularly important unless you’re going to be dealing with multiple modules at once.</p>
				90
				91	<div class="doc_code">
				92	<pre>
Owen Anderson	d2ae9a9	2007-10-22 06:29:31 +0000	[diff] [blame^]	93	Constant* c = mod->getOrInsertFunction("mul_add",
				94	/ret type/ IntegerType::get(32),
				95	/args/ IntegerType::get(32),
				96	IntegerType::get(32),
				97	IntegerType::get(32),
				98	/varargs terminated with null/ NULL);
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	99
Owen Anderson	2d279f8	2007-10-20 05:40:47 +0000	[diff] [blame]	100	Function* mul_add = cast<Function>(c);
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	101	mul_add->setCallingConv(CallingConv::C);
				102	</pre>
				103	</div>
				104
Owen Anderson	2d279f8	2007-10-20 05:40:47 +0000	[diff] [blame]	105	<p>We construct our <code>Function</code> by calling <code>getOrInsertFunction()</code> on our module, passing in the name, return type, and argument types of the function. In the case of our <code>mul_add</code> function, that means one 32-bit integer for the return value, and three 32-bit integers for the arguments.</p>
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	106
Owen Anderson	2d279f8	2007-10-20 05:40:47 +0000	[diff] [blame]	107	<p>You'll notice that <code>getOrInsertFunction</code> doesn't actually return a <code>Function</code>. This is because, if the function already existed, but with a different prototype, <code>getOrInsertFunction</code> will return a cast of the existing function to the desired prototype. Since we know that there's not already a <code>mul_add</code> function, we can safely just cast <code>c</code> to a <code>Function</code>.
				108
				109	<p>In addition, we set the calling convention for our new function to be the C calling convention. This isn’t strictly necessary, but it insures that our new function will interoperate properly with C code, which is a good thing.</p>
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	110
				111	<div class="doc_code">
				112	<pre>
				113	Function::arg_iterator args = mul_add->arg_begin();
				114	Value* x = args++;
				115	x->setName("x");
				116	Value* y = args++;
				117	y->setName("y");
				118	Value* z = args++;
				119	z->setName("z");
				120	</pre>
				121	</div>
				122
				123	<p>While we’re setting up our function, let’s also give names to the parameters. This also isn’t strictly necessary (LLVM will generate names for them if you don’t specify them), but it’ll make looking at our output somewhat more pleasant. To name the parameters, we iterator over the arguments of our function, and call <code>setName()</code> on them. We’ll also keep the pointer to <code>x</code>, <code>y</code>, and <code>z</code> around, since we’ll need them when we get around to creating instructions.</p>
				124
				125	<p>Great! We have a function now. But what good is a function if it has no body? Before we start working on a body for our new function, we need to recall some details of the LLVM IR. The IR, being an abstract assembly language, represents control flow using jumps (we call them branches), both conditional and unconditional. The straight-line sequences of code between branches are called basic blocks, or just blocks. To create a body for our function, we fill it with blocks!</p>
				126
				127	<div class="doc_code">
				128	<pre>
				129	BasicBlock* block = new BasicBlock("entry", mul_add);
				130	LLVMBuilder builder(block);
				131	</pre>
				132	</div>
				133
				134	<p>We create a new basic block, as you might expect, by calling its constructor. All we need to tell it is its name and the function to which it belongs. In addition, we’re creating an <code>LLVMBuilder</code> object, which is a convenience interface for creating instructions and appending them to the end of a block. Instructions can be created through their constructors as well, but some of their interfaces are quite complicated. Unless you need a lot of control, using <code>LLVMBuilder</code> will make your life simpler.</p>
				135
				136	<div class="doc_code">
				137	<pre>
				138	Value* tmp = builder.CreateBinOp(Instruction::Mul,
				139	x, y, "tmp");
				140	Value* tmp2 = builder.CreateBinOp(Instruction::Add,
				141	tmp, z, "tmp2");
				142
				143	builder.CreateRet(tmp2);
Owen Anderson	d2ae9a9	2007-10-22 06:29:31 +0000	[diff] [blame^]	144
				145	return mod;
Owen Anderson	6f1fd94	2007-10-20 05:23:06 +0000	[diff] [blame]	146	}
				147	</pre>
				148	</div>
				149
				150	<p>The final step in creating our function is to create the instructions that make it up. Our <code>mul_add</code> function is composed of just three instructions: a multiply, an add, and a return. <code>LLVMBuilder</code> gives us a simple interface for constructing these instructions and appending them to the “entry” block. Each of the calls to <code>LLVMBuilder</code> returns a <code>Value</code> that represents the value yielded by the instruction. You’ll also notice that, above, <code>x</code>, <code>y</code>, and <code>z</code> are also <code>Value</code>’s, so it’s clear that instructions operate on <code>Value*</code>’s.</p>
				151
				152	<p>And that’s it! Now you can compile and run your code, and get a wonder textual print out of the LLVM IR we saw at the beginning.</p>
				153
				154	<p> ... SECTION ABOUT USING llvm-config TO GET THE NECESSARY COMPILER FLAGS TO COMPILE YOUR CODE ... </p>
				155
				156	</div>
				157
				158	</body>
				159	</html>