Blame - llvm/docs/LangRef.rst - toolchain/llvm-project

Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1	==============================
				2	LLVM Language Reference Manual
				3	==============================
				4
				5	.. contents::
				6	:local:
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	7	:depth: 4
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9	Abstract
				10	========
				11
				12	This document is a reference manual for the LLVM assembly language. LLVM
				13	is a Static Single Assignment (SSA) based representation that provides
				14	type safety, low-level operations, flexibility, and the capability of
				15	representing 'all' high-level languages cleanly. It is the common code
				16	representation used throughout all phases of the LLVM compilation
				17	strategy.
				18
				19	Introduction
				20	============
				21
				22	The LLVM code representation is designed to be used in three different
				23	forms: as an in-memory compiler IR, as an on-disk bitcode representation
				24	(suitable for fast loading by a Just-In-Time compiler), and as a human
				25	readable assembly language representation. This allows LLVM to provide a
				26	powerful intermediate representation for efficient compiler
				27	transformations and analysis, while providing a natural means to debug
				28	and visualize the transformations. The three different forms of LLVM are
				29	all equivalent. This document describes the human readable
				30	representation and notation.
				31
				32	The LLVM representation aims to be light-weight and low-level while
				33	being expressive, typed, and extensible at the same time. It aims to be
				34	a "universal IR" of sorts, by being at a low enough level that
				35	high-level ideas may be cleanly mapped to it (similar to how
				36	microprocessors are "universal IRs", allowing many source languages to
				37	be mapped to them). By providing type information, LLVM can be used as
				38	the target of optimizations: for example, through pointer analysis, it
				39	can be proven that a C automatic variable is never accessed outside of
				40	the current function, allowing it to be promoted to a simple SSA value
				41	instead of a memory location.
				42
				43	.. _wellformed:
				44
				45	Well-Formedness
				46	---------------
				47
				48	It is important to note that this document describes 'well formed' LLVM
				49	assembly language. There is a difference between what the parser accepts
				50	and what is considered 'well formed'. For example, the following
				51	instruction is syntactically okay, but not well formed:
				52
				53	.. code-block:: llvm
				54
				55	%x = add i32 1, %x
				56
				57	because the definition of ``%x`` does not dominate all of its uses. The
				58	LLVM infrastructure provides a verification pass that may be used to
				59	verify that an LLVM module is well formed. This pass is automatically
				60	run by the parser after parsing input assembly and by the optimizer
				61	before it outputs bitcode. The violations pointed out by the verifier
				62	pass indicate bugs in transformation passes or input to the parser.
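
As a contrast to the ill-formed instruction above, the following is a
minimal sketch of a well-formed loop in which a value feeds its own
update through a ``phi`` node. The function name ``@count_to`` and all
value names are made up for illustration.

.. code-block:: llvm

   define i32 @count_to(i32 %n) {
   entry:
     br label %loop

   loop:
     ; %x.next is defined further down in this block, but the phi only
     ; reads it along the back edge from %loop, after it has been computed.
     %x = phi i32 [ 0, %entry ], [ %x.next, %loop ]
     %x.next = add i32 %x, 1
     %done = icmp eq i32 %x.next, %n
     br i1 %done, label %exit, label %loop

   exit:
     ret i32 %x.next
   }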
				63
				64	.. _identifiers:
				65
				66	Identifiers
				67	===========
				68
				69	LLVM identifiers come in two basic types: global and local. Global
				70	identifiers (functions, global variables) begin with the ``'@'``
				71	character. Local identifiers (register names, types) begin with the
				72	``'%'`` character. Additionally, there are three different formats for
				73	identifiers, for different purposes:
				74
				75	#. Named values are represented as a string of characters with their
				76	prefix. For example, ``%foo``, ``@DivisionByZero``,
				77	``%a.really.long.identifier``. The actual regular expression used is
Sean Silva	9d01a5b	2015-01-07 21:35:14 +0000	[diff] [blame]	78	'``[%@][-a-zA-Z$._][-a-zA-Z$._0-9]*``'. Identifiers that require other
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	79	characters in their names can be surrounded with quotes. Special
				80	characters may be escaped using ``"\xx"`` where ``xx`` is the ASCII
				81	code for the character in hexadecimal. In this way, any character can
Hans Wennborg	85e0653	2014-07-30 20:02:08 +0000	[diff] [blame]	82	be used in a name value, even quotes themselves. The ``"\01"`` prefix
Hans Wennborg	2cfcc01	2018-05-22 10:14:07 +0000	[diff] [blame]	83	can be used on global values to suppress mangling.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	84	#. Unnamed values are represented as an unsigned numeric value with
				85	their prefix. For example, ``%12``, ``@2``, ``%44``.
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	86	#. Constants, which are described in the section Constants_ below.
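
The three identifier formats above might look as follows in a small
module. This is only an illustrative sketch; every name in it is
hypothetical.

.. code-block:: llvm

   @counter = global i32 0          ; named global
   @"two words" = global i32 1      ; quoted name containing a space
   @"say\22hi\22" = global i32 2    ; \22 is the hex escape for a double quote
   @0 = private global i32 3        ; unnamed global

   define i32 @example(i32 %x) {
   entry:
     %0 = add i32 %x, 1             ; unnamed local
     %a.long.name = add i32 %0, 42  ; named local; 42 is a constant
     ret i32 %a.long.name
   }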
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	87
				88	LLVM requires that values start with a prefix for two reasons: Compilers
				89	don't need to worry about name clashes with reserved words, and the set
				90	of reserved words may be expanded in the future without penalty.
				91	Additionally, unnamed identifiers allow a compiler to quickly come up
				92	with a temporary variable without having to avoid symbol table
				93	conflicts.
				94
				95	Reserved words in LLVM are very similar to reserved words in other
				96	languages. There are keywords for different opcodes ('``add``',
				97	'``bitcast``', '``ret``', etc...), for primitive type names ('``void``',
				98	'``i32``', etc...), and others. These reserved words cannot conflict
				99	with variable names, because none of them start with a prefix character
				100	(``'%'`` or ``'@'``).
				101
				102	Here is an example of LLVM code to multiply the integer variable
				103	'``%X``' by 8:
				104
				105	The easy way:
				106
				107	.. code-block:: llvm
				108
				109	%result = mul i32 %X, 8
				110
				111	After strength reduction:
				112
				113	.. code-block:: llvm
				114
Dmitri Gribenko	675911d	2013-01-26 13:30:13 +0000	[diff] [blame]	115	%result = shl i32 %X, 3
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	116
				117	And the hard way:
				118
				119	.. code-block:: llvm
				120
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	121	%0 = add i32 %X, %X ; yields i32:%0
				122	%1 = add i32 %0, %0 ; yields i32:%1
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	123	%result = add i32 %1, %1
				124
				125	This last way of multiplying ``%X`` by 8 illustrates several important
				126	lexical features of LLVM:
				127
				128	#. Comments are delimited with a '``;``' and go until the end of line.
				129	#. Unnamed temporaries are created when the result of a computation is
				130	not assigned to a named value.
Sean Silva	8ca1178	2013-05-20 23:31:12 +0000	[diff] [blame]	131	#. Unnamed temporaries are numbered sequentially (using a per-function
Dan Liew	2661dfc	2014-08-20 15:06:30 +0000	[diff] [blame]	132	incrementing counter, starting with 0). Note that basic blocks and unnamed
				133	function parameters are included in this numbering. For example, if the
				134	entry basic block is not given a label name and all function parameters are
				135	named, then it will get number 0.
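
The numbering rule in the last item can be sketched with two small
functions (the names ``@f`` and ``@g`` are hypothetical). In both, the
entry block has no label, so it consumes a number of its own.

.. code-block:: llvm

   define i32 @f(i32 %a) {
     ; The parameter is named, so the unlabeled entry block is number 0
     ; and the first unnamed instruction is %1.
     %1 = add i32 %a, 1
     ret i32 %1
   }

   define i32 @g(i32) {
     ; The unnamed parameter is %0, the unlabeled entry block is number 1,
     ; and the first unnamed instruction is %2.
     %2 = add i32 %0, 1
     ret i32 %2
   }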
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	136
				137	It also shows a convention that we follow in this document. When
				138	demonstrating instructions, we will follow an instruction with a comment
				139	that defines the type and name of the value produced.
				140
				141	High Level Structure
				142	====================
				143
				144	Module Structure
				145	----------------
				146
				147	LLVM programs are composed of ``Module``'s, each of which is a
				148	translation unit of the input programs. Each module consists of
				149	functions, global variables, and symbol table entries. Modules may be
				150	combined together with the LLVM linker, which merges function (and
				151	global variable) definitions, resolves forward declarations, and merges
				152	symbol table entries. Here is an example of the "hello world" module:
				153
				154	.. code-block:: llvm
				155
Michael Liao	a769908	2013-03-06 18:24:34 +0000	[diff] [blame]	156	; Declare the string constant as a global constant.
				157	@.str = private unnamed_addr constant [13 x i8] c"hello world\0A\00"
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	158
Michael Liao	a769908	2013-03-06 18:24:34 +0000	[diff] [blame]	159	; External declaration of the puts function
				160	declare i32 @puts(i8* nocapture) nounwind
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	161
				162	; Definition of main function
Michael Liao	a769908	2013-03-06 18:24:34 +0000	[diff] [blame]	163	define i32 @main() { ; i32()*
George Burgess IV	fbc3498	2017-05-20 04:52:29 +0000	[diff] [blame]	164	; Convert [13 x i8]* to i8*...
David Blaikie	16a97eb	2015-03-04 22:02:58 +0000	[diff] [blame]	165	%cast210 = getelementptr [13 x i8], [13 x i8]* @.str, i64 0, i64 0
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	166
Michael Liao	a769908	2013-03-06 18:24:34 +0000	[diff] [blame]	167	; Call puts function to write out the string to stdout.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	168	call i32 @puts(i8* %cast210)
Michael Liao	a769908	2013-03-06 18:24:34 +0000	[diff] [blame]	169	ret i32 0
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	170	}
				171
				172	; Named metadata
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	173	!0 = !{i32 42, null, !"string"}
Nick Lewycky	a0de40a	2014-08-13 04:54:05 +0000	[diff] [blame]	174	!foo = !{!0}
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	175
				176	This example is made up of a :ref:`global variable <globalvars>` named
				177	"``.str``", an external declaration of the "``puts``" function, a
				178	:ref:`function definition <functionstructure>` for "``main``" and
				179	:ref:`named metadata <namedmetadatastructure>` "``foo``".
				180
				181	In general, a module is made up of a list of global values (where both
				182	functions and global variables are global values). Global values are
				183	represented by a pointer to a memory location (in this case, a pointer
				184	to an array of char, and a pointer to a function), and have one of the
				185	following :ref:`linkage types <linkage>`.
				186
				187	.. _linkage:
				188
				189	Linkage Types
				190	-------------
				191
				192	All Global Variables and Functions have one of the following types of
				193	linkage:
				194
				195	``private``
				196	Global values with "``private``" linkage are only directly
				197	accessible by objects in the current module. In particular, linking
Sylvestre Ledru	0604c5c	2017-03-04 14:01:38 +0000	[diff] [blame]	198	code into a module with a private global value may cause the
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	199	private to be renamed as necessary to avoid collisions. Because the
				200	symbol is private to the module, all references can be updated. This
				201	doesn't show up in any symbol table in the object file.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	202	``internal``
				203	Similar to private, but the value shows as a local symbol
				204	(``STB_LOCAL`` in the case of ELF) in the object file. This
				205	corresponds to the notion of the '``static``' keyword in C.
				206	``available_externally``
Peter Collingbourne	45cd0c3	2015-12-14 19:22:37 +0000	[diff] [blame]	207	Globals with "``available_externally``" linkage are never emitted into
				208	the object file corresponding to the LLVM module. From the linker's
				209	perspective, an ``available_externally`` global is equivalent to
				210	an external declaration. They exist to allow inlining and other
				211	optimizations to take place given knowledge of the definition of the
				212	global, which is known to be somewhere outside the module. Globals
				213	with ``available_externally`` linkage are allowed to be discarded at
				214	will, and allow inlining and other optimizations. This linkage type is
				215	only allowed on definitions, not declarations.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	216	``linkonce``
				217	Globals with "``linkonce``" linkage are merged with other globals of
				218	the same name when linkage occurs. This can be used to implement
				219	some forms of inline functions, templates, or other code which must
				220	be generated in each translation unit that uses it, but where the
				221	body may be overridden with a more definitive definition later.
				222	Unreferenced ``linkonce`` globals are allowed to be discarded. Note
				223	that ``linkonce`` linkage does not actually allow the optimizer to
				224	inline the body of this function into callers because it doesn't
				225	know if this definition of the function is the definitive definition
				226	within the program or whether it will be overridden by a stronger
				227	definition. To enable inlining and other optimizations, use
				228	"``linkonce_odr``" linkage.
				229	``weak``
				230	"``weak``" linkage has the same merging semantics as ``linkonce``
				231	linkage, except that unreferenced globals with ``weak`` linkage may
				232	not be discarded. This is used for globals that are declared "weak"
				233	in C source code.
				234	``common``
				235	"``common``" linkage is most similar to "``weak``" linkage, but they
				236	are used for tentative definitions in C, such as "``int X;``" at
				237	global scope. Symbols with "``common``" linkage are merged in the
				238	same way as ``weak`` symbols, and they may not be deleted if
				239	unreferenced. ``common`` symbols may not have an explicit section,
				240	must have a zero initializer, and may not be marked
				241	':ref:`constant <globalvars>`'. Functions and aliases may not have
				242	common linkage.
				243
				244	.. _linkage_appending:
				245
				246	``appending``
				247	"``appending``" linkage may only be applied to global variables of
				248	pointer to array type. When two global variables with appending
				249	linkage are linked together, the two global arrays are appended
				250	together. This is the LLVM, typesafe, equivalent of having the
				251	system linker append together "sections" with identical names when
				252	.o files are linked.
Rafael Espindola	e64619c	2016-05-16 21:14:24 +0000	[diff] [blame]	253
				254	Unfortunately this doesn't correspond to any feature in .o files, so it
				255	can only be used for variables like ``llvm.global_ctors`` which LLVM
				256	interprets specially.
				257
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	258	``extern_weak``
				259	The semantics of this linkage follow the ELF object file model: the
				260	symbol is weak until linked; if it is not linked, the symbol becomes null
				261	instead of being an undefined reference.
				262	``linkonce_odr``, ``weak_odr``
				263	Some languages allow differing globals to be merged, such as two
				264	functions with different semantics. Other languages, such as
				265	``C++``, ensure that only equivalent globals are ever merged (the
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	266	"one definition rule" --- "ODR"). Such languages can use the
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	267	``linkonce_odr`` and ``weak_odr`` linkage types to indicate that the
				268	global will only be merged with equivalent globals. These linkage
				269	types are otherwise the same as their non-``odr`` versions.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	270	``external``
				271	If none of the above identifiers are used, the global is externally
				272	visible, meaning that it participates in linkage and can be used to
				273	resolve external symbol references.
				274
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	275	It is illegal for a function declaration to have any linkage type
Nico Rieck	7157bb7	2014-01-14 15:22:47 +0000	[diff] [blame]	276	other than ``external`` or ``extern_weak``.
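
As a rough illustration of how several of these linkage types are
written, consider the following sketch. All symbol names are
hypothetical, and the module is not meant to do anything useful.

.. code-block:: llvm

   @.msg = private unnamed_addr constant [3 x i8] c"ok\00"
   @file_static = internal global i32 0     ; like a C 'static' variable
   @tentative = common global i32 0         ; like 'int tentative;' in C
   @overridable = weak global i32 7
   @maybe_missing = extern_weak global i32  ; declaration; null if never linked

   define linkonce_odr i32 @inline_candidate() {
   entry:
     ret i32 1
   }

   define available_externally i32 @defined_elsewhere() {
   entry:
     ret i32 2
   }

   ; Function declarations may only be external or extern_weak.
   declare extern_weak void @optional_hook()
   declare void @plain_external()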
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	277
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	278	.. _callingconv:
				279
				280	Calling Conventions
				281	-------------------
				282
				283	LLVM :ref:`functions <functionstructure>`, :ref:`calls <i_call>` and
				284	:ref:`invokes <i_invoke>` can all have an optional calling convention
				285	specified for the call. The calling convention of any pair of dynamic
				286	caller/callee must match, or the behavior of the program is undefined.
				287	The following calling conventions are supported by LLVM, and more may be
				288	added in the future:
				289
				290	"``ccc``" - The C calling convention
				291	This calling convention (the default if no other calling convention
				292	is specified) matches the target C calling conventions. This calling
				293	convention supports varargs function calls and tolerates some
				294	mismatch in the declared prototype and implemented declaration of
				295	the function (as does normal C).
				296	"``fastcc``" - The fast calling convention
				297	This calling convention attempts to make calls as fast as possible
				298	(e.g. by passing things in registers). This calling convention
				299	allows the target to use whatever tricks it wants to produce fast
				300	code for the target, without having to conform to an externally
				301	specified ABI (Application Binary Interface). `Tail calls can only
				302	be optimized when this, the GHC or the HiPE convention is
				303	used. <CodeGenerator.html#id80>`_ This calling convention does not
				304	support varargs and requires the prototype of all callees to exactly
				305	match the prototype of the function definition.
				306	"``coldcc``" - The cold calling convention
				307	This calling convention attempts to make code in the caller as
				308	efficient as possible under the assumption that the call is not
				309	commonly executed. As such, these calls often preserve all registers
				310	so that the call does not break any live ranges in the caller side.
				311	This calling convention does not support varargs and requires the
				312	prototype of all callees to exactly match the prototype of the
Juergen Ributzka	5d05ed1	2014-01-17 22:24:35 +0000	[diff] [blame]	313	function definition. Furthermore the inliner doesn't consider such function
				314	calls for inlining.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	315	"``cc 10``" - GHC convention
				316	This calling convention has been implemented specifically for use by
				317	the `Glasgow Haskell Compiler (GHC) <http://www.haskell.org/ghc>`_.
				318	It passes everything in registers, going to extremes to achieve this
				319	by disabling callee save registers. This calling convention should
				320	not be used lightly but only for specific situations such as an
				321	alternative to the register pinning performance technique often
				322	used when implementing functional programming languages. At the
				323	moment only X86 supports this convention and it has the following
				324	limitations:
				325
				326	- On X86-32, only up to 4 bit type parameters are supported. No
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	327	floating-point types are supported.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	328	- On X86-64, only up to 10 bit type parameters and 6
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	329	floating-point parameters are supported.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	330
				331	This calling convention supports `tail call
				332	optimization <CodeGenerator.html#id80>`_ but requires that both the
				333	caller and callee use it.
				334	"``cc 11``" - The HiPE calling convention
				335	This calling convention has been implemented specifically for use by
				336	the `High-Performance Erlang
				337	(HiPE) <http://www.it.uu.se/research/group/hipe/>`_ compiler, the
				338	native code compiler of the `Ericsson's Open Source Erlang/OTP
				339	system <http://www.erlang.org/download.shtml>`_. It uses more
				340	registers for argument passing than the ordinary C calling
				341	convention and defines no callee-saved registers. The calling
				342	convention properly supports `tail call
				343	optimization <CodeGenerator.html#id80>`_ but requires that both the
				344	caller and the callee use it. It uses a register pinning
				345	mechanism, similar to GHC's convention, for keeping frequently
				346	accessed runtime components pinned to specific hardware registers.
				347	At the moment only X86 supports this convention (both 32 and 64
				348	bit).
Andrew Trick	5e029ce	2013-12-24 02:57:25 +0000	[diff] [blame]	349	"``webkit_jscc``" - WebKit's JavaScript calling convention
				350	This calling convention has been implemented for `WebKit FTL JIT
				351	<https://trac.webkit.org/wiki/FTLJIT>`_. It passes arguments on the
				352	stack right to left (as cdecl does), and returns a value in the
				353	platform's customary return register.
				354	"``anyregcc``" - Dynamic calling convention for code patching
				355	This is a special convention that supports patching an arbitrary code
				356	sequence in place of a call site. This convention forces the call
Eli Bendersky	45324ce	2015-04-02 15:20:04 +0000	[diff] [blame]	357	arguments into registers but allows them to be dynamically
Andrew Trick	5e029ce	2013-12-24 02:57:25 +0000	[diff] [blame]	358	allocated. This can currently only be used with calls to
				359	llvm.experimental.patchpoint because only this intrinsic records
				360	the location of its arguments in a side table. See :doc:`StackMaps`.
Juergen Ributzka	e625013	2014-01-17 19:47:03 +0000	[diff] [blame]	361	"``preserve_mostcc``" - The `PreserveMost` calling convention
Eli Bendersky	45324ce	2015-04-02 15:20:04 +0000	[diff] [blame]	362	This calling convention attempts to make the code in the caller as
				363	unintrusive as possible. This convention behaves identically to the `C`
Juergen Ributzka	e625013	2014-01-17 19:47:03 +0000	[diff] [blame]	364	calling convention on how arguments and return values are passed, but it
				365	uses a different set of caller/callee-saved registers. This alleviates the
				366	burden of saving and recovering a large register set before and after the
Juergen Ributzka	980f2dc	2014-01-30 02:39:00 +0000	[diff] [blame]	367	call in the caller. If the arguments are passed in callee-saved registers,
				368	then they will be preserved by the callee across the call. This doesn't
				369	apply for values returned in callee-saved registers.
Juergen Ributzka	e625013	2014-01-17 19:47:03 +0000	[diff] [blame]	370
				371	- On X86-64 the callee preserves all general purpose registers, except for
				372	R11. R11 can be used as a scratch register. Floating-point registers
				373	(XMMs/YMMs) are not preserved and need to be saved by the caller.
				374
				375	The idea behind this convention is to support calls to runtime functions
				376	that have a hot path and a cold path. The hot path is usually a small piece
Eric Christopher	1e61ffd	2015-02-19 18:46:25 +0000	[diff] [blame]	377	of code that doesn't use many registers. The cold path might need to call out to
Juergen Ributzka	e625013	2014-01-17 19:47:03 +0000	[diff] [blame]	378	another function and therefore only needs to preserve the caller-saved
Juergen Ributzka	5d05ed1	2014-01-17 22:24:35 +0000	[diff] [blame]	379	registers, which haven't already been saved by the caller. The
				380	`PreserveMost` calling convention is very similar to the `cold` calling
				381	convention in terms of caller/callee-saved registers, but they are used for
				382	different types of function calls. `coldcc` is for function calls that are
				383	rarely executed, whereas `preserve_mostcc` function calls are intended to be
				384	on the hot path and definitely executed a lot. Furthermore `preserve_mostcc`
				385	doesn't prevent the inliner from inlining the function call.
Juergen Ributzka	e625013	2014-01-17 19:47:03 +0000	[diff] [blame]	386
				387	This calling convention will be used by a future version of the Objective-C
				388	runtime and should therefore still be considered experimental at this time.
				389	Although this convention was created to optimize certain runtime calls to
				390	the Objective-C runtime, it is not limited to this runtime and might be used
				391	by other runtimes in the future too. The current implementation only
				392	supports X86-64, but the intention is to support more architectures in the
				393	future.
				394	"``preserve_allcc``" - The `PreserveAll` calling convention
				395	This calling convention attempts to make the code in the caller even less
				396	intrusive than the `PreserveMost` calling convention. This calling
				397	convention also behaves identically to the `C` calling convention on how
				398	arguments and return values are passed, but it uses a different set of
				399	caller/callee-saved registers. This removes the burden of saving and
Juergen Ributzka	980f2dc	2014-01-30 02:39:00 +0000	[diff] [blame]	400	recovering a large register set before and after the call in the caller. If
				401	the arguments are passed in callee-saved registers, then they will be
				402	preserved by the callee across the call. This doesn't apply for values
				403	returned in callee-saved registers.
Juergen Ributzka	e625013	2014-01-17 19:47:03 +0000	[diff] [blame]	404
				405	- On X86-64 the callee preserves all general purpose registers, except for
				406	R11. R11 can be used as a scratch register. Furthermore it also preserves
				407	all floating-point registers (XMMs/YMMs).
				408
				409	The idea behind this convention is to support calls to runtime functions
				410	that don't need to call out to any other functions.
				411
				412	This calling convention, like the `PreserveMost` calling convention, will be
				413	used by a future version of the Objective-C runtime and should be considered
				414	experimental at this time.
Manman Ren	19c7bbe	2015-12-04 17:40:13 +0000	[diff] [blame]	415	"``cxx_fast_tlscc``" - The `CXX_FAST_TLS` calling convention for access functions
Manman Ren	17567d2	2015-12-07 21:40:09 +0000	[diff] [blame]	416	Clang generates an access function to access C++-style TLS. The access
				417	function generally has an entry block, an exit block and an initialization
				418	block that is run the first time. The entry and exit blocks can access
				419	a few TLS IR variables; each access will be lowered to a platform-specific
				420	sequence.
				421
Manman Ren	19c7bbe	2015-12-04 17:40:13 +0000	[diff] [blame]	422	This calling convention aims to minimize overhead in the caller by
Manman Ren	17567d2	2015-12-07 21:40:09 +0000	[diff] [blame]	423	preserving as many registers as possible (all the registers that are
				424	preserved on the fast path, composed of the entry and exit blocks).
				425
				426	This calling convention behaves identically to the `C` calling convention on
				427	how arguments and return values are passed, but it uses a different set of
				428	caller/callee-saved registers.
				429
				430	Given that each platform has its own lowering sequence, hence its own set
				431	of preserved registers, we can't use the existing `PreserveMost`.
Manman Ren	19c7bbe	2015-12-04 17:40:13 +0000	[diff] [blame]	432
				433	- On X86-64 the callee preserves all general purpose registers, except for
				434	RDI and RAX.
Manman Ren	f8bdd88	2016-04-05 22:41:47 +0000	[diff] [blame]	435	"``swiftcc``" - This calling convention is used for the Swift language.
				436	- On X86-64 RCX and R8 are available for additional integer returns, and
				437	XMM2 and XMM3 are available for additional FP/vector returns.
Manman Ren	802cd6f	2016-04-05 22:44:44 +0000	[diff] [blame]	438	- On iOS platforms, we use AAPCS-VFP calling convention.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	439	"``cc <n>``" - Numbered convention
				440	Any calling convention may be specified by number, allowing
				441	target-specific calling conventions to be used. Target-specific
				442	calling conventions start at 64.
				443
				444	More calling conventions can be added/defined on an as-needed basis, to
				445	support Pascal conventions or any other well-known target-independent
				446	convention.
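
A minimal sketch of how a calling convention is attached to definitions,
declarations, and call sites is shown below. The function names are
hypothetical. Note that each call site repeats the convention of its
callee, since a mismatch is undefined behavior.

.. code-block:: llvm

   define fastcc i32 @add_one(i32 %x) {
   entry:
     %r = add i32 %x, 1
     ret i32 %r
   }

   declare coldcc void @report_error(i32)

   ; No convention given, so @caller itself uses the default ccc.
   define i32 @caller(i32 %v) {
   entry:
     %r = call fastcc i32 @add_one(i32 %v)
     call coldcc void @report_error(i32 %r)
     ret i32 %r
   }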
				447
Eli Bendersky	fdc529a	2013-06-07 19:40:08 +0000	[diff] [blame]	448	.. _visibilitystyles:
				449
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	450	Visibility Styles
				451	-----------------
				452
				453	All Global Variables and Functions have one of the following visibility
				454	styles:
				455
				456	"``default``" - Default style
				457	On targets that use the ELF object file format, default visibility
				458	means that the declaration is visible to other modules and, in
				459	shared libraries, means that the declared entity may be overridden.
				460	On Darwin, default visibility means that the declaration is visible
				461	to other modules. Default visibility corresponds to "external
				462	linkage" in the language.
				463	"``hidden``" - Hidden style
				464	Two declarations of an object with hidden visibility refer to the
				465	same object if they are in the same shared object. Usually, hidden
				466	visibility indicates that the symbol will not be placed into the
				467	dynamic symbol table, so no other module (executable or shared
				468	library) can reference it directly.
				469	"``protected``" - Protected style
				470	On ELF, protected visibility indicates that the symbol will be
				471	placed in the dynamic symbol table, but that references within the
				472	defining module will bind to the local symbol. That is, the symbol
				473	cannot be overridden by another module.
				474
Duncan P. N. Exon Smith	b80de10	2014-05-07 22:57:20 +0000	[diff] [blame]	475	A symbol with ``internal`` or ``private`` linkage must have ``default``
				476	visibility.
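
The visibility styles are written directly in the global's declaration
or definition. The following sketch uses hypothetical names:

.. code-block:: llvm

   @api_version = global i32 1         ; default visibility (implicit)
   @impl_detail = hidden global i32 0  ; kept out of the dynamic symbol table

   define protected i32 @get_version() {
   entry:
     ret i32 1
   }

   define hidden void @helper() {
   entry:
     ret void
   }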
				477
Rafael Espindola	3bc64d5	2014-05-26 21:30:40 +0000	[diff] [blame]	478	.. _dllstorageclass:
Eli Bendersky	fdc529a	2013-06-07 19:40:08 +0000	[diff] [blame]	479
Nico Rieck	7157bb7	2014-01-14 15:22:47 +0000	[diff] [blame]	480	DLL Storage Classes
				481	-------------------
				482
				483	All Global Variables, Functions and Aliases can have one of the following
				484	DLL storage classes:
				485
				486	``dllimport``
				487	"``dllimport``" causes the compiler to reference a function or variable via
				488	a global pointer to a pointer that is set up by the DLL exporting the
				489	symbol. On Microsoft Windows targets, the pointer name is formed by
				490	combining ``__imp_`` and the function or variable name.
				491	``dllexport``
				492	"``dllexport``" causes the compiler to provide a global pointer to a pointer
				493	in a DLL, so that it can be referenced with the ``dllimport`` attribute. On
				494	Microsoft Windows targets, the pointer name is formed by combining
				495	``__imp_`` and the function or variable name. Since this storage class
				496	exists for defining a dll interface, the compiler, assembler and linker know
				497	it is externally referenced and must refrain from deleting the symbol.
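
As an illustrative sketch with hypothetical symbol names, a module that
imports some symbols from a DLL and exports others could be written as:

.. code-block:: llvm

   ; Symbols provided by another DLL.
   @imported_counter = external dllimport global i32
   declare dllimport i32 @imported_fn(i32)

   ; Symbols this module makes available to others.
   @exported_flag = dllexport global i32 0

   define dllexport i32 @exported_fn(i32 %x) {
   entry:
     %r = call i32 @imported_fn(i32 %x)
     ret i32 %r
   }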
				498
Rafael Espindola	59f7eba	2014-05-28 18:15:43 +0000	[diff] [blame]	499	.. _tls_model:
				500
				501	Thread Local Storage Models
				502	---------------------------
				503
				504	A variable may be defined as ``thread_local``, which means that it will
				505	not be shared by threads (each thread will have a separate copy of the
				506	variable). Not all targets support thread-local variables. Optionally, a
				507	TLS model may be specified:
				508
				509	``localdynamic``
				510	For variables that are only used within the current shared library.
				511	``initialexec``
				512	For variables in modules that will not be loaded dynamically.
				513	``localexec``
				514	For variables defined in the executable and only used within it.
				515
				516	If no explicit model is given, the "general dynamic" model is used.
				517
				518	The models correspond to the ELF TLS models; see `ELF Handling For
				519	Thread-Local Storage <http://people.redhat.com/drepper/tls.pdf>`_ for
				520	more information on under which circumstances the different models may
				521	be used. The target may choose a different TLS model if the specified
				522	model is not supported, or if a better choice of model can be made.
				523
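For illustration, the four models could be requested as shown in this sketch.
The variable names are hypothetical, and the target may still pick a different
model as described above:

.. code-block:: llvm

    @tls_gd = thread_local global i32 0                ; general dynamic (default)
    @tls_ld = thread_local(localdynamic) global i32 0  ; used only within this shared library
    @tls_ie = thread_local(initialexec) global i32 0   ; module is not loaded dynamically
    @tls_le = thread_local(localexec) global i32 0     ; defined and used in the executable
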
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	524	A model can also be specified in an alias, but then it only governs how
the alias is accessed. It will not have any effect on the aliasee.
				526
For platforms without linker support for the ELF TLS model, the
``-femulated-tls`` flag can be used to generate GCC-compatible emulated TLS code.
				529
Sean Fertile	c70d28b	2017-10-26 15:00:26 +0000	[diff] [blame]	530	.. _runtime_preemption_model:
				531
				532	Runtime Preemption Specifiers
				533	-----------------------------
				534
				535	Global variables, functions and aliases may have an optional runtime preemption
				536	specifier. If a preemption specifier isn't given explicitly, then a
				537	symbol is assumed to be ``dso_preemptable``.
				538
				539	``dso_preemptable``
				540	Indicates that the function or variable may be replaced by a symbol from
				541	outside the linkage unit at runtime.
				542
				543	``dso_local``
				544	The compiler may assume that a function or variable marked as ``dso_local``
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	545	will resolve to a symbol within the same linkage unit. Direct access will
Sean Fertile	c70d28b	2017-10-26 15:00:26 +0000	[diff] [blame]	546	be generated even if the definition is not within this compilation unit.
				547
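The following sketch shows both specifiers on globals and functions. The names
are hypothetical; the point is only where the specifier is written and what it
lets the compiler assume:

.. code-block:: llvm

    ; May be replaced at runtime by a symbol from another linkage unit.
    ; This is also what is assumed when no specifier is written.
    @hook = dso_preemptable global i32 0

    ; Assumed to resolve within this linkage unit, so direct access is
    ; generated even though only declarations are visible here.
    @state = external dso_local global i32
    declare dso_local void @helper()

    define dso_local i32 @read_state() {
      call void @helper()
      %v = load i32, i32* @state
      ret i32 %v
    }
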
Rafael Espindola	3bc64d5	2014-05-26 21:30:40 +0000	[diff] [blame]	548	.. _namedtypes:
				549
Reid Kleckner	7c84d1d	2014-03-05 02:21:50 +0000	[diff] [blame]	550	Structure Types
				551	---------------
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	552
Reid Kleckner	7c84d1d	2014-03-05 02:21:50 +0000	[diff] [blame]	553	LLVM IR allows you to specify both "identified" and "literal" :ref:`structure
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	554	types <t_struct>`. Literal types are uniqued structurally, but identified types
				555	are never uniqued. An :ref:`opaque structural type <t_opaque>` can also be used
Richard Smith	32dbdf6	2014-07-31 04:25:36 +0000	[diff] [blame]	556	to forward declare a type that is not yet available.
Reid Kleckner	7c84d1d	2014-03-05 02:21:50 +0000	[diff] [blame]	557
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	558	An example of an identified structure specification is:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	559
				560	.. code-block:: llvm
				561
				562	%mytype = type { %mytype*, i32 }
				563
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	564	Prior to the LLVM 3.0 release, identified types were structurally uniqued. Only
Reid Kleckner	7c84d1d	2014-03-05 02:21:50 +0000	[diff] [blame]	565	literal types are uniqued in recent versions of LLVM.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	566
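To make the distinction concrete, here is a small sketch with hypothetical type
and variable names. The two identified types have the same body but remain
distinct types, while every occurrence of the literal type ``{ i32, i32 }``
denotes the same type:

.. code-block:: llvm

    ; Identified types: never uniqued, even with identical bodies.
    %point = type { i32, i32 }
    %size = type { i32, i32 }

    ; Opaque type: forward declares a structure whose body is not yet available.
    %handle = type opaque

    @origin = global %point { i32 0, i32 0 }
    @extent = global %size { i32 640, i32 480 }

    ; Literal type: uniqued structurally.
    @pair = global { i32, i32 } { i32 1, i32 2 }

    ; Pointers to an opaque type can be used before the body is known.
    @current = global %handle* null
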
Sanjoy Das	c6af5ea	2016-07-28 23:43:38 +0000	[diff] [blame]	567	.. _nointptrtype:
				568
				569	Non-Integral Pointer Type
				570	-------------------------
				571
				572	Note: non-integral pointer types are a work in progress, and they should be
				573	considered experimental at this time.
				574
				575	LLVM IR optionally allows the frontend to denote pointers in certain address
Sanjoy Das	63752e6	2016-08-10 21:48:24 +0000	[diff] [blame]	576	spaces as "non-integral" via the :ref:`datalayout string<langref_datalayout>`.
				577	Non-integral pointer types represent pointers that have an unspecified bitwise
				578	representation; that is, the integral representation may be target dependent or
				579	unstable (not backed by a fixed integer).
Sanjoy Das	c6af5ea	2016-07-28 23:43:38 +0000	[diff] [blame]	580
				581	``inttoptr`` instructions converting integers to non-integral pointer types are
				582	ill-typed, and so are ``ptrtoint`` instructions converting values of
				583	non-integral pointer types to integers. Vector versions of said instructions
				584	are ill-typed as well.
				585
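A minimal sketch, assuming a hypothetical data layout in which address space 1
is declared non-integral with the ``ni`` specification:

.. code-block:: llvm

    ; Assumption for this example: address space 1 is non-integral.
    target datalayout = "e-ni:1"

    define i8 addrspace(1)* @advance(i8 addrspace(1)* %p) {
      ; Pointer arithmetic through getelementptr needs no integer representation.
      %q = getelementptr i8, i8 addrspace(1)* %p, i64 8
      ret i8 addrspace(1)* %q
    }

    ; By contrast, each of the following would be ill-typed:
    ;   %i = ptrtoint i8 addrspace(1)* %p to i64
    ;   %r = inttoptr i64 %i to i8 addrspace(1)*
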
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	586	.. _globalvars:
				587
				588	Global Variables
				589	----------------
				590
				591	Global variables define regions of memory allocated at compilation time
Rafael Espindola	5d1b745	2013-10-29 13:44:11 +0000	[diff] [blame]	592	instead of run-time.
				593
Eric Christopher	1e61ffd	2015-02-19 18:46:25 +0000	[diff] [blame]	594	Global variable definitions must be initialized.
Rafael Espindola	5d1b745	2013-10-29 13:44:11 +0000	[diff] [blame]	595
				596	Global variables in other translation units can also be declared, in which
				597	case they don't have an initializer.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	598
Bob Wilson	85b24f2	2014-06-12 20:40:33 +0000	[diff] [blame]	599	Either global variable definitions or declarations may have an explicit section
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	600	to be placed in and may have an optional explicit alignment specified. If there
				601	is a mismatch between the explicit or inferred section information for the
variable declaration and its definition, the resulting behavior is undefined.
Bob Wilson	85b24f2	2014-06-12 20:40:33 +0000	[diff] [blame]	603
Michael Gottesman	006039c	2013-01-31 05:48:48 +0000	[diff] [blame]	604	A variable may be defined as a global ``constant``, which indicates that
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	605	the contents of the variable will never be modified (enabling better
				606	optimization, allowing the global data to be placed in the read-only
				607	section of an executable, etc). Note that variables that need runtime
Michael Gottesman	1cffcf74	2013-01-31 05:44:04 +0000	[diff] [blame]	608	initialization cannot be marked ``constant`` as there is a store to the
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	609	variable.
				610
				611	LLVM explicitly allows declarations of global variables to be marked
				612	constant, even if the final definition of the global is not. This
				613	capability can be used to enable slightly better optimization of the
				614	program, but requires the language definition to guarantee that
				615	optimizations based on the 'constantness' are valid for the translation
				616	units that do not include the definition.
				617
				618	As SSA values, global variables define pointer values that are in scope
				619	(i.e. they dominate) all basic blocks in the program. Global variables
				620	always define a pointer to their "content" type because they describe a
				621	region of memory, and all memory objects in LLVM are accessed through
				622	pointers.
				623
				624	Global variables can be marked with ``unnamed_addr`` which indicates
				625	that the address is not significant, only the content. Constants marked
				626	like this can be merged with other constants if they have the same
				627	initializer. Note that a constant with significant address can be
merged with an ``unnamed_addr`` constant, the result being a constant
				629	whose address is significant.
				630
Peter Collingbourne	96efdd6	2016-06-14 21:01:22 +0000	[diff] [blame]	631	If the ``local_unnamed_addr`` attribute is given, the address is known to
				632	not be significant within the module.
				633
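As an illustrative sketch (the global names are hypothetical), the address
significance markers are written after the linkage and before ``global`` or
``constant``:

.. code-block:: llvm

    ; Only the contents matter, so these two constants may be merged.
    @.str.a = private unnamed_addr constant [3 x i8] c"ok\00"
    @.str.b = private unnamed_addr constant [3 x i8] c"ok\00"

    ; The address is not significant within this module only.
    @table = local_unnamed_addr constant [2 x i32] [i32 1, i32 2]

    ; No marker: the address of this constant is significant.
    @tag = constant [3 x i8] c"ok\00"
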
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	634	A global variable may be declared to reside in a target-specific
				635	numbered address space. For targets that support them, address spaces
				636	may affect how optimizations are performed and/or what target
				637	instructions are used to access the variable. The default address space
				638	is zero. The address space qualifier must precede any other attributes.
				639
				640	LLVM allows an explicit section to be specified for globals. If the
				641	target supports it, it will emit globals to the section specified.
Additionally, the global can be placed in a comdat if the target has the necessary
				643	support.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	644
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	645	External declarations may have an explicit section specified. Section
				646	information is retained in LLVM IR for targets that make use of this
				647	information. Attaching section information to an external declaration is an
				648	assertion that its definition is located in the specified section. If the
				649	definition is located in a different section, the behavior is undefined.
Erich Keane	0343ef8	2017-08-22 15:30:43 +0000	[diff] [blame]	650
Michael Gottesman	e743a30	2013-02-04 03:22:00 +0000	[diff] [blame]	651	By default, global initializers are optimized by assuming that global
Michael Gottesman	ef2bc77	2013-02-03 09:57:15 +0000	[diff] [blame]	652	variables defined within the module are not modified from their
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	653	initial values before the start of the global initializer. This is
Michael Gottesman	ef2bc77	2013-02-03 09:57:15 +0000	[diff] [blame]	654	true even for variables potentially accessible from outside the
				655	module, including those with external linkage or appearing in
Yunzhong Gao	f5b769e	2013-12-05 18:37:54 +0000	[diff] [blame]	656	``@llvm.used`` or dllexported variables. This assumption may be suppressed
				657	by marking the variable with ``externally_initialized``.
Michael Gottesman	ef2bc77	2013-02-03 09:57:15 +0000	[diff] [blame]	658
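For instance, a variable whose startup value may be written by something other
than its initializer could be declared as in the sketch below. The name
``@config`` and the accessor are hypothetical:

.. code-block:: llvm

    ; The value present at startup may differ from the initializer, so
    ; optimizations must not assume that @config still holds 0.
    @config = externally_initialized global i32 0

    define i32 @get_config() {
      %v = load i32, i32* @config
      ret i32 %v
    }
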
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	659	An explicit alignment may be specified for a global, which must be a
				660	power of 2. If not present, or if the alignment is set to zero, the
				661	alignment of the global is set by the target to whatever it feels
				662	convenient. If an explicit alignment is specified, the global is forced
				663	to have exactly that alignment. Targets and optimizers are not allowed
				664	to over-align the global if the global has an assigned section. In this
				665	case, the extra alignment could be observable: for example, code could
				666	assume that the globals are densely packed in their section and try to
iterate over them as an array; alignment padding would break this
Reid Kleckner	15fe7a5	2014-07-15 01:16:09 +0000	[diff] [blame]	668	iteration. The maximum alignment is ``1 << 29``.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	669
Javed Absar	f3d7904	2017-05-11 12:28:08 +0000	[diff] [blame]	670	Globals can also have a :ref:`DLL storage class <dllstorageclass>`,
Sean Fertile	c70d28b	2017-10-26 15:00:26 +0000	[diff] [blame]	671	an optional :ref:`runtime preemption specifier <runtime_preemption_model>`,
optional :ref:`global attributes <glattrs>` and
				673	an optional list of attached :ref:`metadata <metadata>`.
Nico Rieck	7157bb7	2014-01-14 15:22:47 +0000	[diff] [blame]	674
Peter Collingbourne	69ba016	2015-02-04 00:42:45 +0000	[diff] [blame]	675	Variables and aliases can have a
Rafael Espindola	59f7eba	2014-05-28 18:15:43 +0000	[diff] [blame]	676	:ref:`Thread Local Storage Model <tls_model>`.
				677
Nico Rieck	7157bb7	2014-01-14 15:22:47 +0000	[diff] [blame]	678	Syntax::
				679
Sean Fertile	c70d28b	2017-10-26 15:00:26 +0000	[diff] [blame]	680	@<GlobalVarName> = [Linkage] [PreemptionSpecifier] [Visibility]
				681	[DLLStorageClass] [ThreadLocal]
Peter Collingbourne	96efdd6	2016-06-14 21:01:22 +0000	[diff] [blame]	682	[(unnamed_addr\|local_unnamed_addr)] [AddrSpace]
				683	[ExternallyInitialized]
Bob Wilson	85b24f2	2014-06-12 20:40:33 +0000	[diff] [blame]	684	<global \| constant> <Type> [<InitializerConstant>]
Rafael Espindola	83a362c	2015-01-06 22:55:16 +0000	[diff] [blame]	685	[, section "name"] [, comdat [($name)]]
Peter Collingbourne	cceae7f	2016-05-31 23:01:54 +0000	[diff] [blame]	686	[, align <Alignment>] (, !name !N)*
Nico Rieck	7157bb7	2014-01-14 15:22:47 +0000	[diff] [blame]	687
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	688	For example, the following defines a global in a numbered address space
				689	with an initializer, section, and alignment:
				690
				691	.. code-block:: llvm
				692
				693	@G = addrspace(5) constant float 1.0, section "foo", align 4
				694
The following example just declares a global variable:
				696
				697	.. code-block:: llvm
				698
				699	@G = external global i32
				700
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	701	The following example defines a thread-local global with the
				702	``initialexec`` TLS model:
				703
				704	.. code-block:: llvm
				705
				706	@G = thread_local(initialexec) global i32 0, align 4
				707
				708	.. _functionstructure:
				709
				710	Functions
				711	---------
				712
				713	LLVM function definitions consist of the "``define``" keyword, an
Sean Fertile	c70d28b	2017-10-26 15:00:26 +0000	[diff] [blame]	714	optional :ref:`linkage type <linkage>`, an optional :ref:`runtime preemption
				715	specifier <runtime_preemption_model>`, an optional :ref:`visibility
Nico Rieck	7157bb7	2014-01-14 15:22:47 +0000	[diff] [blame]	716	style <visibility>`, an optional :ref:`DLL storage class <dllstorageclass>`,
				717	an optional :ref:`calling convention <callingconv>`,
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	718	an optional ``unnamed_addr`` attribute, a return type, an optional
				719	:ref:`parameter attribute <paramattrs>` for the return type, a function
				720	name, a (possibly empty) argument list (each with optional :ref:`parameter
				721	attributes <paramattrs>`), optional :ref:`function attributes <fnattrs>`,
Alexander Richardson	6bcf2ba	2018-08-23 09:25:17 +0000	[diff] [blame]	722	an optional address space, an optional section, an optional alignment,
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	723	an optional :ref:`comdat <langref_comdats>`,
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	724	an optional :ref:`garbage collector name <gc>`, an optional :ref:`prefix <prefixdata>`,
David Majnemer	7fddecc	2015-06-17 20:52:32 +0000	[diff] [blame]	725	an optional :ref:`prologue <prologuedata>`,
				726	an optional :ref:`personality <personalityfn>`,
Peter Collingbourne	5010868	2015-11-06 02:41:02 +0000	[diff] [blame]	727	an optional list of attached :ref:`metadata <metadata>`,
David Majnemer	7fddecc	2015-06-17 20:52:32 +0000	[diff] [blame]	728	an opening curly brace, a list of basic blocks, and a closing curly brace.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	729
				730	LLVM function declarations consist of the "``declare``" keyword, an
Peter Collingbourne	96efdd6	2016-06-14 21:01:22 +0000	[diff] [blame]	731	optional :ref:`linkage type <linkage>`, an optional :ref:`visibility style
				732	<visibility>`, an optional :ref:`DLL storage class <dllstorageclass>`, an
				733	optional :ref:`calling convention <callingconv>`, an optional ``unnamed_addr``
Alexander Richardson	6bcf2ba	2018-08-23 09:25:17 +0000	[diff] [blame]	734	or ``local_unnamed_addr`` attribute, an optional address space, a return type,
				735	an optional :ref:`parameter attribute <paramattrs>` for the return type, a function name, a possibly
Peter Collingbourne	96efdd6	2016-06-14 21:01:22 +0000	[diff] [blame]	736	empty list of arguments, an optional alignment, an optional :ref:`garbage
				737	collector name <gc>`, an optional :ref:`prefix <prefixdata>`, and an optional
				738	:ref:`prologue <prologuedata>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	739
Bill Wendling	6822ecb	2013-10-27 05:09:12 +0000	[diff] [blame]	740	A function definition contains a list of basic blocks, forming the CFG (Control
				741	Flow Graph) for the function. Each basic block may optionally start with a label
				742	(giving the basic block a symbol table entry), contains a list of instructions,
				743	and ends with a :ref:`terminator <terminators>` instruction (such as a branch or
				744	function return). If an explicit label is not provided, a block is assigned an
				745	implicit numbered label, using the next value from the same counter as used for
				746	unnamed temporaries (:ref:`see above<identifiers>`). For example, if a function
				747	entry block does not have an explicit label, it will be assigned label "%0",
				748	then the first unnamed temporary in that block will be "%1", etc.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	749
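The implicit numbering can be seen in the following sketch of a hypothetical
function whose entry block has no explicit label:

.. code-block:: llvm

    define i32 @twice(i32 %x) {
      ; No label is given, so the entry block is implicitly %0.
      %1 = add i32 %x, %x         ; first unnamed temporary
      %2 = icmp sgt i32 %1, 100   ; next value from the same counter
      br i1 %2, label %big, label %small

    big:
      ret i32 100

    small:
      ret i32 %1
    }
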
				750	The first basic block in a function is special in two ways: it is
				751	immediately executed on entrance to the function, and it is not allowed
				752	to have predecessor basic blocks (i.e. there can not be any branches to
				753	the entry block of a function). Because the block can have no
				754	predecessors, it also cannot have any :ref:`PHI nodes <i_phi>`.
				755
				756	LLVM allows an explicit section to be specified for functions. If the
				757	target supports it, it will emit functions to the section specified.
Eric Christopher	1e61ffd	2015-02-19 18:46:25 +0000	[diff] [blame]	758	Additionally, the function can be placed in a COMDAT.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	759
				760	An explicit alignment may be specified for a function. If not present,
				761	or if the alignment is set to zero, the alignment of the function is set
				762	by the target to whatever it feels convenient. If an explicit alignment
				763	is specified, the function is forced to have at least that much
				764	alignment. All alignments must be a power of 2.
				765
Eric Christopher	1e61ffd	2015-02-19 18:46:25 +0000	[diff] [blame]	766	If the ``unnamed_addr`` attribute is given, the address is known to not
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	767	be significant and two identical functions can be merged.
				768
Peter Collingbourne	96efdd6	2016-06-14 21:01:22 +0000	[diff] [blame]	769	If the ``local_unnamed_addr`` attribute is given, the address is known to
				770	not be significant within the module.
				771
Alexander Richardson	6bcf2ba	2018-08-23 09:25:17 +0000	[diff] [blame]	772	If an explicit address space is not given, it will default to the program
				773	address space from the :ref:`datalayout string<langref_datalayout>`.
				774
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	775	Syntax::
				776
Sean Fertile	c70d28b	2017-10-26 15:00:26 +0000	[diff] [blame]	777	define [linkage] [PreemptionSpecifier] [visibility] [DLLStorageClass]
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	778	[cconv] [ret attrs]
				779	<ResultType> @<FunctionName> ([argument list])
Alexander Richardson	6bcf2ba	2018-08-23 09:25:17 +0000	[diff] [blame]	780	[(unnamed_addr\|local_unnamed_addr)] [AddrSpace] [fn Attrs]
				781	[section "name"] [comdat [($name)]] [align N] [gc] [prefix Constant]
Peter Collingbourne	96efdd6	2016-06-14 21:01:22 +0000	[diff] [blame]	782	[prologue Constant] [personality Constant] (!name !N)* { ... }
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	783
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	784	The argument list is a comma separated sequence of arguments where each
				785	argument is of the following form:
Dan Liew	2661dfc	2014-08-20 15:06:30 +0000	[diff] [blame]	786
				787	Syntax::
				788
				789	<type> [parameter Attrs] [name]
				790
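Putting several of the optional pieces together, a definition and a declaration
might look like the sketch below. The function ``@add_one`` and the section name
are hypothetical; the order of the components follows the syntax above:

.. code-block:: llvm

    ; linkage, calling convention, return type, name, argument list,
    ; unnamed_addr, function attribute, section, alignment, body
    define internal fastcc i32 @add_one(i32 %x) unnamed_addr nounwind section ".text.hot" align 16 {
    entry:
      %r = add i32 %x, 1
      ret i32 %r
    }

    ; A declaration has no body; the argument carries parameter attributes.
    declare i32 @puts(i8* nocapture readonly)
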
				791
Eli Bendersky	fdc529a	2013-06-07 19:40:08 +0000	[diff] [blame]	792	.. _langref_aliases:
				793
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	794	Aliases
				795	-------
				796
Aliases, unlike functions or variables, don't create any new data. They
				798	are just a new symbol and metadata for an existing position.
				799
				800	Aliases have a name and an aliasee that is either a global value or a
				801	constant expression.
				802
Nico Rieck	7157bb7	2014-01-14 15:22:47 +0000	[diff] [blame]	803	Aliases may have an optional :ref:`linkage type <linkage>`, an optional
Sean Fertile	c70d28b	2017-10-26 15:00:26 +0000	[diff] [blame]	804	:ref:`runtime preemption specifier <runtime_preemption_model>`, an optional
Rafael Espindola	64c1e18	2014-06-03 02:41:57 +0000	[diff] [blame]	805	:ref:`visibility style <visibility>`, an optional :ref:`DLL storage class
				806	<dllstorageclass>` and an optional :ref:`tls model <tls_model>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	807
				808	Syntax::
				809
Sean Fertile	c70d28b	2017-10-26 15:00:26 +0000	[diff] [blame]	810	@<Name> = [Linkage] [PreemptionSpecifier] [Visibility] [DLLStorageClass] [ThreadLocal] [(unnamed_addr\|local_unnamed_addr)] alias <AliaseeTy>, <AliaseeTy>* @<Aliasee>
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	811
Rafael Espindola	2fb5bc3	2014-03-13 23:18:37 +0000	[diff] [blame]	812	The linkage must be one of ``private``, ``internal``, ``linkonce``, ``weak``,
Rafael Espindola	716e740	2013-11-01 17:09:14 +0000	[diff] [blame]	813	``linkonce_odr``, ``weak_odr``, ``external``. Note that some system linkers
Rafael Espindola	64c1e18	2014-06-03 02:41:57 +0000	[diff] [blame]	814	might not correctly handle dropping a weak symbol that is aliased.
Rafael Espindola	7852705	2013-10-06 15:10:43 +0000	[diff] [blame]	815
Eric Christopher	1e61ffd	2015-02-19 18:46:25 +0000	[diff] [blame]	816	Aliases that are not ``unnamed_addr`` are guaranteed to have the same address as
Rafael Espindola	42a4c9f	2014-06-06 01:20:28 +0000	[diff] [blame]	817	the aliasee expression. ``unnamed_addr`` ones are only guaranteed to point
				818	to the same content.
Rafael Espindola	f3336bc	2014-03-12 20:15:49 +0000	[diff] [blame]	819
Peter Collingbourne	96efdd6	2016-06-14 21:01:22 +0000	[diff] [blame]	820	If the ``local_unnamed_addr`` attribute is given, the address is known to
				821	not be significant within the module.
				822
Rafael Espindola	64c1e18	2014-06-03 02:41:57 +0000	[diff] [blame]	823	Since aliases are only a second name, some restrictions apply, of which
				824	some can only be checked when producing an object file:
Rafael Espindola	f3336bc	2014-03-12 20:15:49 +0000	[diff] [blame]	825
Rafael Espindola	64c1e18	2014-06-03 02:41:57 +0000	[diff] [blame]	826	* The expression defining the aliasee must be computable at assembly
				827	time. Since it is just a name, no relocations can be used.
				828
				829	* No alias in the expression can be weak as the possibility of the
				830	intermediate alias being overridden cannot be represented in an
				831	object file.
				832
				833	* No global value in the expression can be a declaration, since that
				834	would require a relocation, which is not possible.
Rafael Espindola	24a669d	2014-03-27 15:26:56 +0000	[diff] [blame]	835
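The following sketch, using hypothetical names, shows aliases of a variable and
of a function, plus an alias whose aliasee is a constant expression. Every
aliasee is a definition in the same module, as the restrictions above require:

.. code-block:: llvm

    @value = global i32 7
    @words = global [4 x i32] zeroinitializer

    define i32 @get() {
      %v = load i32, i32* @value
      ret i32 %v
    }

    ; A second name for the variable and a weak second name for the function.
    @value_alias = alias i32, i32* @value
    @get_alias = weak alias i32 (), i32 ()* @get

    ; The aliasee is a constant expression pointing at the third element.
    @third = alias i32, getelementptr inbounds ([4 x i32], [4 x i32]* @words, i64 0, i64 2)
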
Dmitry Polukhin	a1feff7	2016-04-07 12:32:19 +0000	[diff] [blame]	836	.. _langref_ifunc:
				837
				838	IFuncs
				839	-------
				840
IFuncs, like aliases, don't create any new data or functions. They are just a new
symbol that the dynamic linker resolves at runtime by calling a resolver function.

IFuncs have a name and a resolver, which is a function called by the dynamic
linker that returns the address of another function associated with the name.

IFuncs may have an optional :ref:`linkage type <linkage>` and an optional
				848	:ref:`visibility style <visibility>`.
				849
				850	Syntax::
				851
				852	@<Name> = [Linkage] [Visibility] ifunc <IFuncTy>, <ResolverTy>* @<Resolver>
				853
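A minimal sketch of an IFunc with hypothetical names follows. A real resolver
would typically choose between several implementations, but here it returns the
only one available:

.. code-block:: llvm

    define i32 @square_generic(i32 %x) {
      %r = mul i32 %x, %x
      ret i32 %r
    }

    ; Called by the dynamic linker; returns the implementation to bind to @square.
    define i32 (i32)* @resolve_square() {
      ret i32 (i32)* @square_generic
    }

    @square = ifunc i32 (i32), i32 (i32)* ()* @resolve_square
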
				854
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	855	.. _langref_comdats:
				856
				857	Comdats
				858	-------
				859
				860	Comdat IR provides access to COFF and ELF object file COMDAT functionality.
				861
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	862	Comdats have a name which represents the COMDAT key. All global objects that
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	863	specify this key will only end up in the final object file if the linker chooses
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	864	that key over some other key. Aliases are placed in the same COMDAT that their
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	865	aliasee computes to, if any.
				866
				867	Comdats have a selection kind to provide input on how the linker should
				868	choose between keys in two different object files.
				869
				870	Syntax::
				871
				872	$<Name> = comdat SelectionKind
				873
				874	The selection kind must be one of the following:
				875
				876	``any``
The linker may choose any COMDAT key; the choice is arbitrary.
				878	``exactmatch``
				879	The linker may choose any COMDAT key but the sections must contain the
				880	same data.
				881	``largest``
				882	The linker will choose the section containing the largest COMDAT key.
				883	``noduplicates``
The linker requires that only one section with this COMDAT key exist.
				885	``samesize``
				886	The linker may choose any COMDAT key but the sections must contain the
				887	same amount of data.
				888
Sam Clegg	ea7cace	2018-01-09 23:43:14 +0000	[diff] [blame]	889	Note that the Mach-O platform doesn't support COMDATs, and ELF and WebAssembly
				890	only support ``any`` as a selection kind.
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	891
				892	Here is an example of a COMDAT group where a function will only be selected if
				893	the COMDAT key's section is the largest:
				894
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	895	.. code-block:: text
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	896
				897	$foo = comdat largest
Rafael Espindola	83a362c	2015-01-06 22:55:16 +0000	[diff] [blame]	898	@foo = global i32 2, comdat($foo)
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	899
Rafael Espindola	83a362c	2015-01-06 22:55:16 +0000	[diff] [blame]	900	define void @bar() comdat($foo) {
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	901	ret void
				902	}
				903
As syntactic sugar, the ``$name`` can be omitted if the name is the same as
				905	the global name:
				906
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	907	.. code-block:: text
Rafael Espindola	83a362c	2015-01-06 22:55:16 +0000	[diff] [blame]	908
				909	$foo = comdat any
				910	@foo = global i32 2, comdat
				911
				912
In a COFF object file, the ``comdat largest`` example above will create a COMDAT section with selection kind
				914	``IMAGE_COMDAT_SELECT_LARGEST`` containing the contents of the ``@foo`` symbol
				915	and another COMDAT section with selection kind
				916	``IMAGE_COMDAT_SELECT_ASSOCIATIVE`` which is associated with the first COMDAT
Hans Wennborg	0def066	2014-09-10 17:05:08 +0000	[diff] [blame]	917	section and contains the contents of the ``@bar`` symbol.
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	918
				919	There are some restrictions on the properties of the global object.
				920	It, or an alias to it, must have the same name as the COMDAT group when
				921	targeting COFF.
				922	The contents and size of this object may be used during link-time to determine
				923	which COMDAT groups get selected depending on the selection kind.
				924	Because the name of the object must match the name of the COMDAT group, the
				925	linkage of the global object must not be local; local symbols can get renamed
				926	if a collision occurs in the symbol table.
				927
The combined use of COMDATs and section attributes may yield surprising results.
				929	For example:
				930
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	931	.. code-block:: text
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	932
				933	$foo = comdat any
				934	$bar = comdat any
Rafael Espindola	83a362c	2015-01-06 22:55:16 +0000	[diff] [blame]	935	@g1 = global i32 42, section "sec", comdat($foo)
				936	@g2 = global i32 42, section "sec", comdat($bar)
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	937
				938	From the object file perspective, this requires the creation of two sections
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	939	with the same name. This is necessary because both globals belong to different
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	940	COMDAT groups and COMDATs, at the object file level, are represented by
				941	sections.
				942
Peter Collingbourne	1feef2e	2015-06-30 19:10:31 +0000	[diff] [blame]	943	Note that certain IR constructs like global variables and functions may
				944	create COMDATs in the object file in addition to any which are specified using
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	945	COMDAT IR. This arises when the code generator is configured to emit globals
in individual sections (e.g. when ``-data-sections`` or ``-function-sections``
is supplied to ``llc``).
David Majnemer	dad0a64	2014-06-27 18:19:56 +0000	[diff] [blame]	948
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	949	.. _namedmetadatastructure:
				950
				951	Named Metadata
				952	--------------
				953
				954	Named metadata is a collection of metadata. :ref:`Metadata
				955	nodes <metadata>` (but not metadata strings) are the only valid
				956	operands for a named metadata.
				957
Filipe Cabecinhas	62431b1	2015-06-02 21:25:08 +0000	[diff] [blame]	958	#. Named metadata are represented as a string of characters with the
				959	metadata prefix. The rules for metadata names are the same as for
				960	identifiers, but quoted names are not allowed. ``"\xx"`` type escapes
				961	are still valid, which allows any character to be part of a name.
				962
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	963	Syntax::
				964
				965	; Some unnamed metadata nodes, which are referenced by the named metadata.
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	966	!0 = !{!"zero"}
				967	!1 = !{!"one"}
				968	!2 = !{!"two"}
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	969	; A named metadata.
				970	!name = !{!0, !1, !2}
				971
				972	.. _paramattrs:
				973
				974	Parameter Attributes
				975	--------------------
				976
				977	The return type and each parameter of a function type may have a set of
				978	parameter attributes associated with them. Parameter attributes are
				979	used to communicate additional information about the result or
				980	parameters of a function. Parameter attributes are considered to be part
				981	of the function, not of the function type, so functions with different
				982	parameter attributes can have the same function type.
				983
				984	Parameter attributes are simple keywords that follow the type specified.
				985	If multiple parameter attributes are needed, they are space separated.
				986	For example:
				987
				988	.. code-block:: llvm
				989
				990	declare i32 @printf(i8* noalias nocapture, ...)
				991	declare i32 @atoi(i8 zeroext)
				992	declare signext i8 @returns_signed_char()
				993
				994	Note that function attributes (such as ``nounwind`` and ``readonly``) come
				995	immediately after the argument list, while attributes for the function result (such as ``signext``) come before the return type.
				996
				997	Currently, only the following parameter attributes are defined:
				998
				999	``zeroext``
				1000	This indicates to the code generator that the parameter or return
				1001	value should be zero-extended to the extent required by the target's
Hans Wennborg	850ec6c	2016-02-08 19:34:30 +0000	[diff] [blame]	1002	ABI by the caller (for a parameter) or the callee (for a return value).
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1003	``signext``
				1004	This indicates to the code generator that the parameter or return
				1005	value should be sign-extended to the extent required by the target's
				1006	ABI (which is usually 32 bits) by the caller (for a parameter) or
				1007	the callee (for a return value).
				1008	``inreg``
				1009	This indicates that this parameter or return value should be treated
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	1010	in a special target-dependent fashion while emitting code for
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1011	a function call or return (usually, by putting it in a register as
				1012	opposed to memory, though some targets use it to distinguish between
				1013	two different kinds of registers). Use of this attribute is
				1014	target-specific.
				1015	``byval``
				1016	This indicates that the pointer parameter should really be passed by
				1017	value to the function. The attribute implies that a hidden copy of
				1018	the pointee is made between the caller and the callee, so the callee
				1019	is unable to modify the value in the caller. This attribute is only
				1020	valid on LLVM pointer arguments. It is generally used to pass
				1021	structs and arrays by value, but is also valid on pointers to
				1022	scalars. The copy is considered to belong to the caller, not the
				1023	callee (for example, ``readonly`` functions should not write to
				1024	``byval`` parameters). This is not a valid attribute for return
				1025	values.
				1026
				1027	The byval attribute also supports specifying an alignment with the
				1028	align attribute. It indicates the alignment of the stack slot to
				1029	form and the known alignment of the pointer specified to the call
				1030	site. If the alignment is not specified, then the code generator
				1031	makes a target-specific assumption.
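
As a minimal sketch of the ``byval`` description above, the following passes a
small struct by value with an explicit stack-slot alignment. The names
``%struct.Point``, ``@consume`` and ``@caller`` are hypothetical and are only
used for illustration; the syntax assumes the typed-pointer form used elsewhere
in this document.

.. code-block:: llvm

   %struct.Point = type { i32, i32 }

   ; The callee receives its own copy of the pointee, in a 4-byte aligned slot.
   declare void @consume(%struct.Point* byval align 4)

   define void @caller(%struct.Point* %p) {
     ; Writes made by @consume through its parameter are not visible via %p.
     call void @consume(%struct.Point* byval align 4 %p)
     ret void
   }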
				1032
Reid Kleckner	a534a38	2013-12-19 02:14:12 +0000	[diff] [blame]	1033	.. _attr_inalloca:
				1034
				1035	``inalloca``
				1036
Reid Kleckner	60d3a83	2014-01-16 22:59:24 +0000	[diff] [blame]	1037	The ``inalloca`` argument attribute allows the caller to take the
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1038	address of outgoing stack arguments. An ``inalloca`` argument must
Reid Kleckner	436c42e	2014-01-17 23:58:17 +0000	[diff] [blame]	1039	be a pointer to stack memory produced by an ``alloca`` instruction.
				1040	The alloca, or argument allocation, must also be tagged with the
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1041	inalloca keyword. Only the last argument may have the ``inalloca``
Reid Kleckner	436c42e	2014-01-17 23:58:17 +0000	[diff] [blame]	1042	attribute, and that argument is guaranteed to be passed in memory.
Reid Kleckner	a534a38	2013-12-19 02:14:12 +0000	[diff] [blame]	1043
Reid Kleckner	436c42e	2014-01-17 23:58:17 +0000	[diff] [blame]	1044	An argument allocation may be used by a call at most once because
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1045	the call may deallocate it. The ``inalloca`` attribute cannot be
Reid Kleckner	436c42e	2014-01-17 23:58:17 +0000	[diff] [blame]	1046	used in conjunction with other attributes that affect argument
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1047	storage, like ``inreg``, ``nest``, ``sret``, or ``byval``. The
Reid Kleckner	f5b7651	2014-01-31 23:50:57 +0000	[diff] [blame]	1048	``inalloca`` attribute also disables LLVM's implicit lowering of
				1049	large aggregate return values, which means that frontend authors
				1050	must lower them with ``sret`` pointers.
Reid Kleckner	a534a38	2013-12-19 02:14:12 +0000	[diff] [blame]	1051
Reid Kleckner	60d3a83	2014-01-16 22:59:24 +0000	[diff] [blame]	1052	When the call site is reached, the argument allocation must have
				1053	been the most recent stack allocation that is still live, or the
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1054	behavior is undefined. It is possible to allocate additional stack
Reid Kleckner	60d3a83	2014-01-16 22:59:24 +0000	[diff] [blame]	1055	space after an argument allocation and before its call site, but it
				1056	must be cleared off with :ref:`llvm.stackrestore
				1057	<int_stackrestore>`.
Reid Kleckner	a534a38	2013-12-19 02:14:12 +0000	[diff] [blame]	1058
				1059	See :doc:`InAlloca` for more information on how to use this
				1060	attribute.
				1061
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1062	``sret``
				1063	This indicates that the pointer parameter specifies the address of a
				1064	structure that is the return value of the function in the source
				1065	program. This pointer must be guaranteed by the caller to be valid:
Reid Kleckner	1361c0c	2016-09-08 15:45:27 +0000	[diff] [blame]	1066	loads and stores to the structure may be assumed by the callee not
				1067	to trap and to be properly aligned. This is not a valid attribute
				1068	for return values.
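
The ``sret`` convention described above might be used as in the following
sketch, where the caller allocates the result structure and passes its address.
``%struct.Pair``, ``@make_pair`` and ``@first`` are hypothetical names chosen
for this example.

.. code-block:: llvm

   %struct.Pair = type { i64, i64 }

   ; The callee writes its result through the sret pointer and returns void.
   declare void @make_pair(%struct.Pair* sret)

   define i64 @first() {
   entry:
     ; The caller provides valid, properly aligned storage for the result.
     %result = alloca %struct.Pair
     call void @make_pair(%struct.Pair* sret %result)
     %field = getelementptr inbounds %struct.Pair, %struct.Pair* %result, i32 0, i32 0
     %value = load i64, i64* %field
     ret i64 %value
   }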
Sean Silva	1703e70	2014-04-08 21:06:22 +0000	[diff] [blame]	1069
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	1070	.. _attr_align:
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	1071
Hal Finkel	ccc7090	2014-07-22 16:58:55 +0000	[diff] [blame]	1072	``align <n>``
				1073	This indicates that the pointer value may be assumed by the optimizer to
				1074	have the specified alignment.
				1075
				1076	Note that this attribute has additional semantics when combined with the
				1077	``byval`` attribute.
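
For illustration only, a parameter marked ``align 16`` lets the optimizer
assume that alignment for accesses through the pointer. The function name
``@load_aligned`` is hypothetical.

.. code-block:: llvm

   define i32 @load_aligned(i32* align 16 %p) {
     ; The load may state the same alignment that the attribute promises.
     %v = load i32, i32* %p, align 16
     ret i32 %v
   }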
				1078
Sean Silva	1703e70	2014-04-08 21:06:22 +0000	[diff] [blame]	1079	.. _noalias:
				1080
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1081	``noalias``
Hal Finkel	12d3630	2014-11-21 02:22:46 +0000	[diff] [blame]	1082	This indicates that objects accessed via pointer values
				1083	:ref:`based <pointeraliasing>` on the argument or return value are not also
				1084	accessed, during the execution of the function, via pointer values not
				1085	based on the argument or return value. The attribute on a return value
				1086	also has additional semantics described below. The caller shares the
				1087	responsibility with the callee for ensuring that these requirements are met.
				1088	For further details, please see the discussion of the NoAlias response in
				1089	:ref:`alias analysis <Must, May, or No>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1090
				1091	Note that this definition of ``noalias`` is intentionally similar
Hal Finkel	12d3630	2014-11-21 02:22:46 +0000	[diff] [blame]	1092	to the definition of ``restrict`` in C99 for function arguments.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1093
				1094	For function return values, C99's ``restrict`` is not meaningful,
Hal Finkel	12d3630	2014-11-21 02:22:46 +0000	[diff] [blame]	1095	while LLVM's ``noalias`` is. Furthermore, the semantics of the ``noalias``
				1096	attribute on return values are stronger than the semantics of the attribute
				1097	when used on function arguments. On function return values, the ``noalias``
				1098	attribute indicates that the function acts like a system memory allocation
				1099	function, returning a pointer to allocated storage disjoint from the
				1100	storage for any other object accessible to the caller.
				1101
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1102	``nocapture``
				1103	This indicates that the callee does not make any copies of the
				1104	pointer that outlive the callee itself. This is not a valid
David Majnemer	7f32420	2016-05-26 17:36:22 +0000	[diff] [blame]	1105	attribute for return values. Addresses used in volatile operations
				1106	are considered to be captured.
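
The following sketch shows ``noalias`` and ``nocapture`` in both positions
discussed above: on a return value, where it models an allocator, and on
arguments, where it resembles C99 ``restrict``. ``@allocate`` and ``@copy_word``
are hypothetical names.

.. code-block:: llvm

   ; On a return value: the result points to storage disjoint from any other
   ; object accessible to the caller.
   declare noalias i8* @allocate(i64)

   ; On arguments: memory accessed through %dst is not also accessed through
   ; %src during the call, and neither pointer outlives the callee.
   define void @copy_word(i32* noalias nocapture %dst, i32* noalias nocapture %src) {
     %v = load i32, i32* %src
     store i32 %v, i32* %dst
     ret void
   }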
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1107
				1108	.. _nest:
				1109
				1110	``nest``
				1111	This indicates that the pointer parameter can be excised using the
				1112	:ref:`trampoline intrinsics <int_trampoline>`. This is not a valid
Stephen Lin	b8bd232	2013-04-20 05:14:40 +0000	[diff] [blame]	1113	attribute for return values and can only be applied to one parameter.
				1114
				1115	``returned``
Stephen Lin	fec5b0b	2013-06-20 21:55:10 +0000	[diff] [blame]	1116	This indicates that the function always returns the argument as its return
Hal Finkel	3b66caa	2016-07-10 21:52:39 +0000	[diff] [blame]	1117	value. This is a hint to the optimizer and code generator used when
				1118	generating the caller, allowing value propagation, tail call optimization,
				1119	and omission of register saves and restores in some cases; it is not
				1120	checked or enforced when generating the callee. The parameter and the
				1121	function return type must be valid operands for the
				1122	:ref:`bitcast instruction <i_bitcast>`. This is not a valid attribute for
				1123	return values and can only be applied to one parameter.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1124
Nick Lewycky	d52b152	2014-05-20 01:23:40 +0000	[diff] [blame]	1125	``nonnull``
				1126	This indicates that the parameter or return pointer is not null. This
				1127	attribute may only be applied to pointer typed parameters. This is not
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1128	checked or enforced by LLVM; if the parameter or return pointer is null,
				1129	the behavior is undefined.
Nick Lewycky	d52b152	2014-05-20 01:23:40 +0000	[diff] [blame]	1130
Hal Finkel	b0407ba	2014-07-18 15:51:28 +0000	[diff] [blame]	1131	``dereferenceable(<n>)``
				1132	This indicates that the parameter or return pointer is dereferenceable. This
				1133	attribute may only be applied to pointer typed parameters. A pointer that
				1134	is dereferenceable can be loaded from speculatively without a risk of
				1135	trapping. The number of bytes known to be dereferenceable must be provided
				1136	in parentheses. It is legal for the number of bytes to be less than the
				1137	size of the pointee type. The ``nonnull`` attribute does not imply
				1138	dereferenceability (consider a pointer to one element past the end of an
				1139	array); however, ``dereferenceable(<n>)`` does imply ``nonnull`` in
				1140	``addrspace(0)`` (which is the default address space).
				1141
Sanjoy Das	31ea6d1	2015-04-16 20:29:50 +0000	[diff] [blame]	1142	``dereferenceable_or_null(<n>)``
				1143	This indicates that the parameter or return value isn't both
				1144	non-null and non-dereferenceable (up to ``<n>`` bytes) at the same
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1145	time. All non-null pointers tagged with
Sanjoy Das	31ea6d1	2015-04-16 20:29:50 +0000	[diff] [blame]	1146	``dereferenceable_or_null(<n>)`` are ``dereferenceable(<n>)``.
				1147	For address space 0 ``dereferenceable_or_null(<n>)`` implies that
				1148	a pointer is exactly one of ``dereferenceable(<n>)`` or ``null``,
				1149	and in other address spaces ``dereferenceable_or_null(<n>)``
				1150	implies that a pointer is at least one of ``dereferenceable(<n>)``
				1151	or ``null`` (i.e. it may be both ``null`` and
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1152	``dereferenceable(<n>)``). This attribute may only be applied to
Sanjoy Das	31ea6d1	2015-04-16 20:29:50 +0000	[diff] [blame]	1153	pointer typed parameters.
				1154
Manman Ren	f46262e	2016-03-29 17:37:21 +0000	[diff] [blame]	1155	``swiftself``
				1156	This indicates that the parameter is the self/context parameter. This is not
				1157	a valid attribute for return values and can only be applied to one
				1158	parameter.
				1159
Manman Ren	9bfd0d0	2016-04-01 21:41:15 +0000	[diff] [blame]	1160	``swifterror``
				1161	This attribute exists to model and optimize Swift error handling. It
				1162	can be applied to a parameter with pointer to pointer type or a
				1163	pointer-sized alloca. At the call site, the actual argument that corresponds
Arnold Schwaighofer	6c57f4f	2016-09-10 19:42:53 +0000	[diff] [blame]	1164	to a ``swifterror`` parameter has to come from a ``swifterror`` alloca or
				1165	the ``swifterror`` parameter of the caller. A ``swifterror`` value (either
				1166	the parameter or the alloca) can only be loaded from and stored to, or used as
				1167	a ``swifterror`` argument. This is not a valid attribute for return values
				1168	and can only be applied to one parameter.
Manman Ren	9bfd0d0	2016-04-01 21:41:15 +0000	[diff] [blame]	1169
				1170	These constraints allow the calling convention to optimize access to
				1171	``swifterror`` variables by associating them with a specific register at
				1172	call boundaries rather than placing them in memory. Since this does change
				1173	the calling convention, a function which uses the ``swifterror`` attribute
				1174	on a parameter is not ABI-compatible with one which does not.
				1175
				1176	These constraints also allow LLVM to assume that a ``swifterror`` argument
				1177	does not alias any other memory visible within a function and that a
				1178	``swifterror`` alloca passed as an argument does not escape.
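
A minimal sketch of the ``swifterror`` rules above: the caller creates a
``swifterror`` alloca, passes it as the ``swifterror`` argument, and only ever
loads from or stores to it. The type ``%swift.error`` and the functions
``@may_fail`` and ``@caller`` are hypothetical, and the use of ``swiftcc`` is an
assumption of this example.

.. code-block:: llvm

   %swift.error = type opaque

   declare swiftcc void @may_fail(%swift.error** swifterror)

   define swiftcc i1 @caller() {
   entry:
     ; A pointer-sized alloca tagged with swifterror.
     %err.slot = alloca swifterror %swift.error*
     store %swift.error* null, %swift.error** %err.slot
     ; The actual argument comes from a swifterror alloca, as required.
     call swiftcc void @may_fail(%swift.error** swifterror %err.slot)
     %err = load %swift.error*, %swift.error** %err.slot
     %failed = icmp ne %swift.error* %err, null
     ret i1 %failed
   }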
				1179
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1180	.. _gc:
				1181
Philip Reames	f80bbff	2015-02-25 23:45:20 +0000	[diff] [blame]	1182	Garbage Collector Strategy Names
				1183	--------------------------------
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1184
Philip Reames	f80bbff	2015-02-25 23:45:20 +0000	[diff] [blame]	1185	Each function may specify a garbage collector strategy name, which is simply a
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1186	string:
				1187
				1188	.. code-block:: llvm
				1189
				1190	define void @f() gc "name" { ... }
				1191
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	1192	The supported values of name include those :ref:`built in to LLVM
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1193	<builtin-gc-strategies>` and any provided by loaded plugins. Specifying a GC
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	1194	strategy will cause the compiler to alter its output in order to support the
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1195	named garbage collection algorithm. Note that LLVM itself does not contain a
Philip Reames	f80bbff	2015-02-25 23:45:20 +0000	[diff] [blame]	1196	garbage collector; this functionality is restricted to generating machine code
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	1197	which can interoperate with a collector provided externally.
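
As a concrete illustration, a function could name one of the built-in
strategies. This sketch assumes the ``shadow-stack`` strategy is available in
the LLVM build being used.

.. code-block:: llvm

   define void @f() gc "shadow-stack" {
     ret void
   }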
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1198
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1199	.. _prefixdata:
				1200
				1201	Prefix Data
				1202	-----------
				1203
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1204	Prefix data is data associated with a function which the code
				1205	generator will emit immediately before the function's entrypoint.
				1206	The purpose of this feature is to allow frontends to associate
				1207	language-specific runtime metadata with specific functions and make it
				1208	available through the function pointer while still allowing the
				1209	function pointer to be called.
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1210
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1211	To access the data for a given function, a program may bitcast the
				1212	function pointer to a pointer to the constant's type and dereference
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1213	index -1. This implies that the IR symbol points just past the end of
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1214	the prefix data. For instance, take the example of a function annotated
				1215	with a single ``i32``,
				1216
				1217	.. code-block:: llvm
				1218
				1219	define void @f() prefix i32 123 { ... }
				1220
				1221	The prefix data can be referenced as,
				1222
				1223	.. code-block:: llvm
				1224
David Blaikie	16a97eb	2015-03-04 22:02:58 +0000	[diff] [blame]	1225	%0 = bitcast void ()* @f to i32*
				1226	%a = getelementptr inbounds i32, i32* %0, i32 -1
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	1227	%b = load i32, i32* %a
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1228
				1229	Prefix data is laid out as if it were an initializer for a global variable
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1230	of the prefix data's type. The function will be placed such that the
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1231	beginning of the prefix data is aligned. This means that if the size
				1232	of the prefix data is not a multiple of the alignment size, the
				1233	function's entrypoint will not be aligned. If alignment of the
				1234	function's entrypoint is desired, padding must be added to the prefix
				1235	data.
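
The padding rule above can be sketched as follows, assuming an 8-byte function
alignment is desired. The prefix data is widened to 8 bytes, with the padding
placed first so that the payload is still found at ``i32`` index -1. The
chosen alignment and layout are assumptions of this example.

.. code-block:: llvm

   ; The first i32 is padding; the second is the payload read at index -1.
   ; With 8 bytes of prefix data, the entrypoint also falls on an 8-byte
   ; boundary according to the layout rule described above.
   define void @f() align 8 prefix <{ i32, i32 }> <{ i32 0, i32 123 }> {
     ret void
   }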
				1236
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1237	A function may have prefix data but no body. This has similar semantics
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1238	to the ``available_externally`` linkage in that the data may be used by the
				1239	optimizers but will not be emitted in the object file.
				1240
				1241	.. _prologuedata:
				1242
				1243	Prologue Data
				1244	-------------
				1245
				1246	The ``prologue`` attribute allows arbitrary code (encoded as bytes) to
				1247	be inserted prior to the function body. This can be used for enabling
				1248	function hot-patching and instrumentation.
				1249
				1250	To maintain the semantics of ordinary function calls, the prologue data must
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1251	have a particular format. Specifically, it must begin with a sequence of
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1252	bytes which decode to a sequence of machine instructions, valid for the
				1253	module's target, which transfer control to the point immediately succeeding
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1254	the prologue data, without performing any other visible action. This allows
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1255	the inliner and other passes to reason about the semantics of the function
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1256	definition without needing to reason about the prologue data. Obviously this
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1257	makes the format of the prologue data highly target dependent.
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1258
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1259	A trivial example of valid prologue data for the x86 architecture is ``i8 144``,
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1260	which encodes the ``nop`` instruction:
				1261
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	1262	.. code-block:: text
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1263
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1264	define void @f() prologue i8 144 { ... }
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1265
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1266	Generally prologue data can be formed by encoding a relative branch instruction
				1267	which skips the metadata, as in this example of valid prologue data for the
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1268	x86_64 architecture, where the first two bytes encode ``jmp .+10``:
				1269
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	1270	.. code-block:: text
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1271
				1272	%0 = type <{ i8, i8, i8* }>
				1273
Peter Collingbourne	51d2de7	2014-12-03 02:08:38 +0000	[diff] [blame]	1274	define void @f() prologue %0 <{ i8 235, i8 8, i8* @md}> { ... }
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1275
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1276	A function may have prologue data but no body. This has similar semantics
Peter Collingbourne	3fa50f9	2013-09-16 01:08:15 +0000	[diff] [blame]	1277	to the ``available_externally`` linkage in that the data may be used by the
				1278	optimizers but will not be emitted in the object file.
				1279
David Majnemer	7fddecc	2015-06-17 20:52:32 +0000	[diff] [blame]	1280	.. _personalityfn:
				1281
				1282	Personality Function
David Majnemer	c5ad8a9	2015-06-17 21:21:16 +0000	[diff] [blame]	1283	--------------------
David Majnemer	7fddecc	2015-06-17 20:52:32 +0000	[diff] [blame]	1284
				1285	The ``personality`` attribute permits functions to specify what function
				1286	to use for exception handling.
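
For example, a function that uses the Itanium C++ personality routine might
look like the following sketch. ``@may_throw`` is a hypothetical callee, and
the landing pad only performs cleanup before resuming unwinding.

.. code-block:: llvm

   declare i32 @__gxx_personality_v0(...)
   declare void @may_throw()

   define void @f() personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) {
   entry:
     invoke void @may_throw()
             to label %done unwind label %lpad

   lpad:
     %info = landingpad { i8*, i32 }
             cleanup
     resume { i8*, i32 } %info

   done:
     ret void
   }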
				1287
Bill Wendling	63b8819	2013-02-06 06:52:58 +0000	[diff] [blame]	1288	.. _attrgrp:
				1289
				1290	Attribute Groups
				1291	----------------
				1292
				1293	Attribute groups are groups of attributes that are referenced by objects within
				1294	the IR. They are important for keeping ``.ll`` files readable, because a lot of
				1295	functions will use the same set of attributes. In the degenerate case of a
				1296	``.ll`` file that corresponds to a single ``.c`` file, the single attribute
				1297	group will capture the important command line flags used to build that file.
				1298
				1299	An attribute group is a module-level object. To use an attribute group, an
				1300	object references the attribute group's ID (e.g. ``#37``). An object may refer
				1301	to more than one attribute group. In that situation, the attributes from the
				1302	different groups are merged.
				1303
				1304	Here is an example of attribute groups for a function that should always be
				1305	inlined, has a stack alignment of 4, and which shouldn't use SSE instructions:
				1306
				1307	.. code-block:: llvm
				1308
				1309	; Target-independent attributes:
Eli Bendersky	97ad924	2013-04-18 16:11:44 +0000	[diff] [blame]	1310	attributes #0 = { alwaysinline alignstack=4 }
Bill Wendling	63b8819	2013-02-06 06:52:58 +0000	[diff] [blame]	1311
				1312	; Target-dependent attributes:
Eli Bendersky	97ad924	2013-04-18 16:11:44 +0000	[diff] [blame]	1313	attributes #1 = { "no-sse" }
Bill Wendling	63b8819	2013-02-06 06:52:58 +0000	[diff] [blame]	1314
				1315	; Function @f has attributes: alwaysinline, alignstack=4, and "no-sse".
				1316	define void @f() #0 #1 { ... }
				1317
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1318	.. _fnattrs:
				1319
				1320	Function Attributes
				1321	-------------------
				1322
				1323	Function attributes are set to communicate additional information about
				1324	a function. Function attributes are considered to be part of the
				1325	function, not of the function type, so functions with different function
				1326	attributes can have the same function type.
				1327
				1328	Function attributes are simple keywords that follow the function's argument list.
				1329	If multiple attributes are needed, they are space separated. For
				1330	example:
				1331
				1332	.. code-block:: llvm
				1333
				1334	define void @f() noinline { ... }
				1335	define void @f() alwaysinline { ... }
				1336	define void @f() alwaysinline optsize { ... }
				1337	define void @f() optsize { ... }
				1338
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1339	``alignstack(<n>)``
				1340	This attribute indicates that, when emitting the prologue and
				1341	epilogue, the backend should forcibly align the stack pointer.
				1342	Specify the desired alignment, which must be a power of two, in
				1343	parentheses.
George Burgess IV	278199f	2016-04-12 01:05:35 +0000	[diff] [blame]	1344	``allocsize(<EltSizeParam>[, <NumEltsParam>])``
				1345	This attribute indicates that the annotated function will always return at
				1346	least a given number of bytes (or null). Its arguments are zero-indexed
				1347	parameter numbers; if one argument is provided, then it's assumed that at
				1348	least ``CallSite.Args[EltSizeParam]`` bytes will be available at the
				1349	returned pointer. If two are provided, then it's assumed that
				1350	``CallSite.Args[EltSizeParam] * CallSite.Args[NumEltsParam]`` bytes are
				1351	available. The referenced parameters must be integer types. No assumptions
				1352	are made about the contents of the returned block of memory.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1353	``alwaysinline``
				1354	This attribute indicates that the inliner should attempt to inline
				1355	this function into callers whenever possible, ignoring any active
				1356	inlining size threshold for this caller.
Michael Gottesman	41748d7	2013-06-27 00:25:01 +0000	[diff] [blame]	1357	``builtin``
				1358	This indicates that the callee function at a call site should be
				1359	recognized as a built-in function, even though the function's declaration
Michael Gottesman	3a6a967	2013-07-02 21:32:56 +0000	[diff] [blame]	1360	uses the ``nobuiltin`` attribute. This is only valid at call sites for
Richard Smith	32dbdf6	2014-07-31 04:25:36 +0000	[diff] [blame]	1361	direct calls to functions that are declared with the ``nobuiltin``
Michael Gottesman	41748d7	2013-06-27 00:25:01 +0000	[diff] [blame]	1362	attribute.
Michael Gottesman	296adb8	2013-06-27 22:48:08 +0000	[diff] [blame]	1363	``cold``
				1364	This attribute indicates that this function is rarely called. When
				1365	computing edge weights, basic blocks post-dominated by a cold
				1366	function call are also considered to be cold and are thus given low
				1367	weight.
Owen Anderson	85fa7d5	2015-05-26 23:48:40 +0000	[diff] [blame]	1368	``convergent``
Justin Lebar	d5fb695	2016-02-09 23:03:17 +0000	[diff] [blame]	1369	In some parallel execution models, there exist operations that cannot be
				1370	made control-dependent on any additional values. We call such operations
Justin Lebar	58535b1	2016-02-17 17:46:41 +0000	[diff] [blame]	1371	``convergent``, and mark them with this attribute.
Justin Lebar	d5fb695	2016-02-09 23:03:17 +0000	[diff] [blame]	1372
Justin Lebar	58535b1	2016-02-17 17:46:41 +0000	[diff] [blame]	1373	The ``convergent`` attribute may appear on functions or call/invoke
				1374	instructions. When it appears on a function, it indicates that calls to
				1375	this function should not be made control-dependent on additional values.
Justin Bogner	a463537	2016-07-06 20:02:45 +0000	[diff] [blame]	1376	For example, the intrinsic ``llvm.nvvm.barrier0`` is ``convergent``, so
Justin Lebar	d5fb695	2016-02-09 23:03:17 +0000	[diff] [blame]	1377	calls to this intrinsic cannot be made control-dependent on additional
Justin Lebar	58535b1	2016-02-17 17:46:41 +0000	[diff] [blame]	1378	values.
Justin Lebar	d5fb695	2016-02-09 23:03:17 +0000	[diff] [blame]	1379
Justin Lebar	58535b1	2016-02-17 17:46:41 +0000	[diff] [blame]	1380	When it appears on a call/invoke, the ``convergent`` attribute indicates
				1381	that we should treat the call as though we're calling a convergent
				1382	function. This is particularly useful on indirect calls; without this we
				1383	may treat such calls as though the target is non-convergent.
				1384
				1385	The optimizer may remove the ``convergent`` attribute on functions when it
				1386	can prove that the function does not execute any convergent operations.
				1387	Similarly, the optimizer may remove ``convergent`` on calls/invokes when it
				1388	can prove that the call/invoke cannot call a convergent function.
Vaivaswatha Nagaraj	fb3f490	2015-12-16 16:16:19 +0000	[diff] [blame]	1389	``inaccessiblememonly``
				1390	This attribute indicates that the function may only access memory that
				1391	is not accessible by the module being compiled. This is a weaker form
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1392	of ``readnone``. If the function reads or writes other memory, the
				1393	behavior is undefined.
Vaivaswatha Nagaraj	fb3f490	2015-12-16 16:16:19 +0000	[diff] [blame]	1394	``inaccessiblemem_or_argmemonly``
				1395	This attribute indicates that the function may only access memory that is
				1396	either not accessible by the module being compiled, or is pointed to
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1397	by its pointer arguments. This is a weaker form of ``argmemonly``. If the
				1398	function reads or writes other memory, the behavior is undefined.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1399	``inlinehint``
				1400	This attribute indicates that the source code contained a hint that
				1401	inlining this function is desirable (such as the "inline" keyword in
				1402	C/C++). It is just a hint; it imposes no requirements on the
				1403	inliner.
Tom Roeder	44cb65f	2014-06-05 19:29:43 +0000	[diff] [blame]	1404	``jumptable``
				1405	This attribute indicates that the function should be added to a
				1406	jump-instruction table at code-generation time, and that all address-taken
				1407	references to this function should be replaced with a reference to the
				1408	appropriate jump-instruction-table function pointer. Note that this creates
				1409	a new pointer for the original function, which means that code that depends
				1410	on function-pointer identity can break. So, any function annotated with
				1411	``jumptable`` must also be ``unnamed_addr``.
Andrea Di Biagio	9b5d23b	2013-08-09 18:42:18 +0000	[diff] [blame]	1412	``minsize``
				1413	This attribute suggests that optimization passes and code generator
				1414	passes make choices that keep the code size of this function as small
Andrew Trick	d4d1d9c	2013-10-31 17:18:07 +0000	[diff] [blame]	1415	as possible and perform optimizations that may sacrifice runtime
Andrea Di Biagio	9b5d23b	2013-08-09 18:42:18 +0000	[diff] [blame]	1416	performance in order to minimize the size of the generated code.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1417	``naked``
				1418	This attribute disables prologue / epilogue emission for the
				1419	function. This can have very system-specific consequences.
Sumanth Gundapaneni	6af104e	2017-07-28 22:26:22 +0000	[diff] [blame]	1420	``no-jump-tables``
				1421	When this attribute is set to true, the jump tables and lookup tables that
				1422	can be generated from a switch case lowering are disabled.
Eli Bendersky	97ad924	2013-04-18 16:11:44 +0000	[diff] [blame]	1423	``nobuiltin``
Michael Gottesman	41748d7	2013-06-27 00:25:01 +0000	[diff] [blame]	1424	This indicates that the callee function at a call site is not recognized as
				1425	a built-in function. LLVM will retain the original call and not replace it
				1426	with equivalent code based on the semantics of the built-in function, unless
				1427	the call site uses the ``builtin`` attribute. This is valid at call sites
				1428	and on function declarations and definitions.
Bill Wendling	bf902f1	2013-02-06 06:22:58 +0000	[diff] [blame]	1429	``noduplicate``
				1430	This attribute indicates that calls to the function cannot be
				1431	duplicated. A call to a ``noduplicate`` function may be moved
				1432	within its parent function, but may not be duplicated within
				1433	its parent function.
				1434
				1435	A function containing a ``noduplicate`` call may still
				1436	be an inlining candidate, provided that the call is not
				1437	duplicated by inlining. That implies that the function has
				1438	internal linkage and only has one call site, so the original
				1439	call is dead after inlining.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1440	``noimplicitfloat``
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	1441	This attribute disables implicit floating-point instructions.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1442	``noinline``
				1443	This attribute indicates that the inliner should never inline this
				1444	function in any situation. This attribute may not be used together
				1445	with the ``alwaysinline`` attribute.
Sean Silva	1cbbcf1	2013-08-06 19:34:37 +0000	[diff] [blame]	1446	``nonlazybind``
				1447	This attribute suppresses lazy symbol binding for the function. This
				1448	may make calls to the function faster, at the cost of extra program
				1449	startup time if the function is not called during program startup.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1450	``noredzone``
				1451	This attribute indicates that the code generator should not use a
				1452	red zone, even if the target-specific ABI normally permits it.
				1453	``noreturn``
				1454	This function attribute indicates that the function never returns
				1455	normally. This produces undefined behavior at runtime if the
				1456	function ever does dynamically return.
James Molloy	e6f87ca	2015-11-06 10:32:53 +0000	[diff] [blame]	1457	``norecurse``
				1458	This function attribute indicates that the function does not call itself
				1459	either directly or indirectly down any possible call path. This produces
				1460	undefined behavior at runtime if the function ever does recurse.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1461	``nounwind``
Reid Kleckner	96d0113	2015-02-11 01:23:16 +0000	[diff] [blame]	1462	This function attribute indicates that the function never raises an
				1463	exception. If the function does raise an exception, its runtime
				1464	behavior is undefined. However, functions marked nounwind may still
				1465	trap or generate asynchronous exceptions. Exception handling schemes
				1466	that are recognized by LLVM to handle asynchronous exceptions, such
				1467	as SEH, will still provide their implementation defined semantics.
Manoj Gupta	77eeac3	2018-07-09 22:27:23 +0000	[diff] [blame]	1468	``"null-pointer-is-valid"``
				1469	If ``"null-pointer-is-valid"`` is set to ``"true"``, then the ``null`` address
				1470	in address-space 0 is considered to be a valid address for memory loads and
				1471	stores. Any analysis or optimization should not treat dereferencing a
				1472	pointer to ``null`` as undefined behavior in this function.
				1473	Note: Comparing the address of a global variable to ``null`` may still
				1474	evaluate to false because of a limitation in querying this attribute inside
				1475	constant expressions.
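					For illustration only, a function that opts in might look like the
					following sketch (the function name is hypothetical):

					.. code-block:: llvm

					   define i32 @read_address_zero() "null-pointer-is-valid"="true" {
					     ; In this function, the load from null in address space 0 is not
					     ; to be treated as undefined behavior by analyses or optimizations.
					     %v = load i32, i32* null
					     ret i32 %v
					   }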
Matt Morehouse	3181941	2018-03-22 19:50:10 +0000	[diff] [blame]	1476	``optforfuzzing``
				1477	This attribute indicates that this function should be optimized
				1478	for maximum fuzzing signal.
Andrea Di Biagio	377496b	2013-08-23 11:53:55 +0000	[diff] [blame]	1479	``optnone``
Paul Robinson	a2550a6	2015-11-30 21:56:16 +0000	[diff] [blame]	1480	This function attribute indicates that most optimization passes will skip
				1481	this function, with the exception of interprocedural optimization passes.
				1482	Code generation defaults to the "fast" instruction selector.
Andrea Di Biagio	377496b	2013-08-23 11:53:55 +0000	[diff] [blame]	1483	This attribute cannot be used together with the ``alwaysinline``
				1484	attribute; this attribute is also incompatible
				1485	with the ``minsize`` attribute and the ``optsize`` attribute.
Andrew Trick	d4d1d9c	2013-10-31 17:18:07 +0000	[diff] [blame]	1486
Paul Robinson	dcbe35b	2013-11-18 21:44:03 +0000	[diff] [blame]	1487	This attribute requires the ``noinline`` attribute to be specified on
				1488	the function as well, so the function is never inlined into any caller.
Andrea Di Biagio	377496b	2013-08-23 11:53:55 +0000	[diff] [blame]	1489	Only functions with the ``alwaysinline`` attribute are valid
Paul Robinson	dcbe35b	2013-11-18 21:44:03 +0000	[diff] [blame]	1490	candidates for inlining into the body of this function.
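					A small sketch of the required pairing (the function name is
					hypothetical); note that ``noinline`` is spelled out alongside
					``optnone``:

					.. code-block:: llvm

					   ; optnone must be accompanied by noinline, and may not be combined
					   ; with alwaysinline, minsize or optsize.
					   define i32 @keep_unoptimized(i32 %x) optnone noinline {
					     %y = add i32 %x, 0
					     ret i32 %y
					   }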
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1491	``optsize``
				1492	This attribute suggests that optimization passes and code generator
				1493	passes make choices that keep the code size of this function low,
Andrea Di Biagio	9b5d23b	2013-08-09 18:42:18 +0000	[diff] [blame]	1494	and otherwise do optimizations specifically to reduce code size as
				1495	long as they do not significantly impact runtime performance.
Sanjoy Das	c0441c2	2016-04-19 05:24:47 +0000	[diff] [blame]	1496	``"patchable-function"``
				1497	This attribute tells the code generator that the code
				1498	generated for this function needs to follow certain conventions that
				1499	make it possible for a runtime function to patch over it later.
				1500	The exact effect of this attribute depends on its string value,
Charles Davis	e9c32c7	2016-08-08 21:20:15 +0000	[diff] [blame]	1501	for which there currently is one legal possibility:
Sanjoy Das	c0441c2	2016-04-19 05:24:47 +0000	[diff] [blame]	1502
				1503	* ``"prologue-short-redirect"`` - This style of patchable
				1504	function is intended to support patching a function prologue to
				1505	redirect control away from the function in a thread safe
				1506	manner. It guarantees that the first instruction of the
				1507	function will be large enough to accommodate a short jump
				1508	instruction, and will be sufficiently aligned to allow being
				1509	fully changed via an atomic compare-and-swap instruction.
				1510	While the first requirement can be satisfied by inserting a large
				1511	enough NOP, LLVM can and will try to re-purpose an existing
				1512	instruction (i.e. one that would have to be emitted anyway) as
				1513	the patchable instruction larger than a short jump.
				1514
				1515	``"prologue-short-redirect"`` is currently only supported on
				1516	x86-64.
				1517
				1518	This attribute by itself does not imply restrictions on
				1519	inter-procedural optimizations. All of the semantic effects the
				1520	patching may have must be separately conveyed via the linkage type.
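					As a sketch, the attribute is written as a string key/value pair on
					the function (the function name is hypothetical):

					.. code-block:: llvm

					   ; Requests a prologue that can later be patched with a short jump.
					   define void @patch_target() "patchable-function"="prologue-short-redirect" {
					     ret void
					   }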
whitequark	ed54b4a	2017-06-21 18:46:50 +0000	[diff] [blame]	1521	``"probe-stack"``
				1522	This attribute indicates that the function will trigger a guard region
				1523	at the end of the stack. It ensures that accesses to the stack must be
				1524	no further apart than the size of the guard region to a previous
				1525	access of the stack. It takes one required string value, the name of
				1526	the stack probing function that will be called.
				1527
				1528	If a function that has a ``"probe-stack"`` attribute is inlined into
				1529	a function with another ``"probe-stack"`` attribute, the resulting
				1530	function has the ``"probe-stack"`` attribute of the caller. If a
				1531	function that has a ``"probe-stack"`` attribute is inlined into a
				1532	function that has no ``"probe-stack"`` attribute at all, the resulting
				1533	function has the ``"probe-stack"`` attribute of the callee.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1534	``readnone``
Nick Lewycky	c2ec072	2013-07-06 00:29:58 +0000	[diff] [blame]	1535	On a function, this attribute indicates that the function computes its
				1536	result (or decides to unwind an exception) based strictly on its arguments,
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1537	without dereferencing any pointer arguments or otherwise accessing
				1538	any mutable state (e.g. memory, control registers, etc) visible to
				1539	caller functions. It does not write through any pointer arguments
				1540	(including ``byval`` arguments) and never changes any state visible
Sanjoy Das	5be2e84	2017-02-13 23:19:07 +0000	[diff] [blame]	1541	to callers. This means while it cannot unwind exceptions by calling
				1542	the ``C++`` exception throwing methods (since they write to memory), there may
				1543	be non-``C++`` mechanisms that throw exceptions without writing to LLVM
				1544	visible memory.
Andrew Trick	d4d1d9c	2013-10-31 17:18:07 +0000	[diff] [blame]	1545
Nick Lewycky	c2ec072	2013-07-06 00:29:58 +0000	[diff] [blame]	1546	On an argument, this attribute indicates that the function does not
				1547	dereference that pointer argument, even though it may read or write the
Nick Lewycky	efe31f2	2013-07-06 01:04:47 +0000	[diff] [blame]	1548	memory that the pointer points to if accessed through other pointers.
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1549
				1550	If a readnone function reads or writes memory visible to the program, or
				1551	has other side-effects, the behavior is undefined. If a function reads from
				1552	or writes to a readnone pointer argument, the behavior is undefined.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1553	``readonly``
Nick Lewycky	c2ec072	2013-07-06 00:29:58 +0000	[diff] [blame]	1554	On a function, this attribute indicates that the function does not write
				1555	through any pointer arguments (including ``byval`` arguments) or otherwise
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1556	modify any state (e.g. memory, control registers, etc) visible to
				1557	caller functions. It may dereference pointer arguments and read
				1558	state that may be set in the caller. A readonly function always
				1559	returns the same value (or unwinds an exception identically) when
Sanjoy Das	5be2e84	2017-02-13 23:19:07 +0000	[diff] [blame]	1560	called with the same set of arguments and global state. This means while it
				1561	cannot unwind exceptions by calling the ``C++`` exception throwing methods
				1562	(since they write to memory), there may be non-``C++`` mechanisms that throw
				1563	exceptions without writing to LLVM visible memory.
Andrew Trick	d4d1d9c	2013-10-31 17:18:07 +0000	[diff] [blame]	1564
Nick Lewycky	c2ec072	2013-07-06 00:29:58 +0000	[diff] [blame]	1565	On an argument, this attribute indicates that the function does not write
				1566	through this pointer argument, even though it may write to the memory that
				1567	the pointer points to.
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1568
				1569	If a readonly function writes memory visible to the program, or
				1570	has other side-effects, the behavior is undefined. If a function writes to
				1571	a readonly pointer argument, the behavior is undefined.
whitequark	08b2035	2017-06-22 23:22:36 +0000	[diff] [blame]	1572	``"stack-probe-size"``
				1573	This attribute controls the behavior of stack probes: either
				1574	the ``"probe-stack"`` attribute, or ABI-required stack probes, if any.
				1575	It defines the size of the guard region. It ensures that if the function
				1576	may use more stack space than the size of the guard region, a stack probing
				1577	sequence will be emitted. It takes one required integer value, which
				1578	is 4096 by default.
				1579
				1580	If a function that has a ``"stack-probe-size"`` attribute is inlined into
				1581	a function with another ``"stack-probe-size"`` attribute, the resulting
				1582	function has the ``"stack-probe-size"`` attribute that has the lower
				1583	numeric value. If a function that has a ``"stack-probe-size"`` attribute is
				1584	inlined into a function that has no ``"stack-probe-size"`` attribute
				1585	at all, the resulting function has the ``"stack-probe-size"`` attribute
				1586	of the callee.
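					The two stack-probe attributes can appear together. The following is a
					sketch under assumptions: the probe function name ``__my_stack_probe``
					and the function name ``@big_frame`` are hypothetical, and the size is
					written as a string-valued integer:

					.. code-block:: llvm

					   ; The 16384-byte frame exceeds the 8192-byte guard region given
					   ; here, so a probing sequence is expected for this function.
					   define void @big_frame() "probe-stack"="__my_stack_probe" "stack-probe-size"="8192" {
					     %buf = alloca [16384 x i8]
					     ret void
					   }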
Hans Wennborg	89c35fc	2018-02-23 13:46:25 +0000	[diff] [blame]	1587	``"no-stack-arg-probe"``
				1588	This attribute disables ABI-required stack probes, if any.
Nicolai Haehnle	84c9f99	2016-07-04 08:01:29 +0000	[diff] [blame]	1589	``writeonly``
				1590	On a function, this attribute indicates that the function may write to but
				1591	does not read from memory.
				1592
				1593	On an argument, this attribute indicates that the function may write to but
				1594	does not read through this pointer argument (even though it may read from
				1595	the memory that the pointer points to).
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1596
				1597	If a writeonly function reads memory visible to the program, or
				1598	has other side-effects, the behavior is undefined. If a function reads
				1599	from a writeonly pointer argument, the behavior is undefined.
Igor Laevsky	39d662f	2015-07-11 10:30:36 +0000	[diff] [blame]	1600	``argmemonly``
				1601	This attribute indicates that the only memory accesses inside the function are
				1602	loads and stores from objects pointed to by its pointer-typed arguments,
				1603	with arbitrary offsets. Or in other words, all memory operations in the
				1604	function can refer to memory only using pointers based on its function
				1605	arguments.
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1606
Igor Laevsky	39d662f	2015-07-11 10:30:36 +0000	[diff] [blame]	1607	Note that ``argmemonly`` can be used together with the ``readonly`` attribute
				1608	in order to specify that the function reads only from its arguments.
Eli Friedman	0f522bd	2018-07-25 18:26:38 +0000	[diff] [blame]	1609
				1610	If an argmemonly function reads or writes memory other than the pointer
				1611	arguments, or has other side-effects, the behavior is undefined.
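					To make the combinations concrete, the declarations below sketch how
					``readnone``, ``readonly``, ``writeonly`` and ``argmemonly`` might be
					applied to functions and to pointer arguments. All function names are
					hypothetical:

					.. code-block:: llvm

					   ; Result depends only on the argument values; no memory is accessed.
					   declare i32 @add_saturating(i32, i32) readnone

					   ; Reads memory, but only through its pointer argument.
					   declare i64 @string_length(i8* readonly) readonly argmemonly

					   ; Writes memory, but only through its pointer argument.
					   declare void @zero_fill(i8* writeonly, i64) writeonly argmemonly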
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1612	``returns_twice``
				1613	This attribute indicates that this function can return twice. The C
				1614	``setjmp`` is an example of such a function. The compiler disables
				1615	some optimizations (like tail calls) in the caller of these
				1616	functions.
Peter Collingbourne	82437bf	2015-06-15 21:07:11 +0000	[diff] [blame]	1617	``safestack``
				1618	This attribute indicates that
				1619	`SafeStack <http://clang.llvm.org/docs/SafeStack.html>`_
				1620	protection is enabled for this function.
				1621
				1622	If a function that has a ``safestack`` attribute is inlined into a
				1623	function that doesn't have a ``safestack`` attribute or which has an
				1624	``ssp``, ``sspstrong`` or ``sspreq`` attribute, then the resulting
				1625	function will have a ``safestack`` attribute.
Kostya Serebryany	cf880b9	2013-02-26 06:58:09 +0000	[diff] [blame]	1626	``sanitize_address``
				1627	This attribute indicates that AddressSanitizer checks
				1628	(dynamic address safety analysis) are enabled for this function.
				1629	``sanitize_memory``
				1630	This attribute indicates that MemorySanitizer checks (dynamic detection
				1631	of accesses to uninitialized memory) are enabled for this function.
				1632	``sanitize_thread``
				1633	This attribute indicates that ThreadSanitizer checks
				1634	(dynamic thread safety analysis) are enabled for this function.
Evgeniy Stepanov	c667c1f	2017-12-09 00:21:41 +0000	[diff] [blame]	1635	``sanitize_hwaddress``
				1636	This attribute indicates that HWAddressSanitizer checks
				1637	(dynamic address safety analysis based on tagged pointers) are enabled for
				1638	this function.
Chandler Carruth	664aa86	2018-09-04 12:38:00 +0000	[diff] [blame^]	1639	``speculative_load_hardening``
				1640	This attribute indicates that
				1641	`Speculative Load Hardening <https://llvm.org/docs/SpeculativeLoadHardening.html>`_
				1642	should be enabled for the function body. This is a best-effort attempt to
				1643	mitigate all known speculative execution information leak vulnerabilities
				1644	that are based on the fundamental principles of modern processors'
				1645	speculative execution. These vulnerabilities are typically classified as
				1646	"Spectre variant #1" vulnerabilities. Notably, this does not attempt to
				1647	mitigate any vulnerabilities where the speculative execution and/or
				1648	prediction devices of specific processors can be completely undermined
				1649	(such as "Branch Target Injection", a.k.a. "Spectre variant #2"). Instead,
				1650	this is a target-independent request to harden against the completely
				1651	generic risk posed by speculative execution to incorrectly load secret data,
				1652	making it available to some micro-architectural side-channel for information
				1653	leak. For a processor without any speculative execution or predictors, this
				1654	is expected to be a no-op.
				1655
				1656	When inlining, the attribute is sticky. Inlining a function that carries
				1657	this attribute will cause the caller to gain the attribute. This is intended
				1658	to provide a maximally conservative model where the code in a function
				1659	annotated with this attribute will always (even after inlining) end up
				1660	hardened.
Matt Arsenault	b19b57e	2017-04-28 20:25:27 +0000	[diff] [blame]	1661	``speculatable``
				1662	This function attribute indicates that the function does not have any
				1663	effects besides calculating its result and does not have undefined behavior.
				1664	Note that ``speculatable`` is not enough to conclude that along any
Xin Tong	c718020	2017-05-02 23:24:12 +0000	[diff] [blame]	1665	particular execution path the number of calls to this function will not be
Matt Arsenault	b19b57e	2017-04-28 20:25:27 +0000	[diff] [blame]	1666	externally observable. This attribute is only valid on functions
				1667	and declarations, not on individual call sites. If a function is
				1668	incorrectly marked as speculatable and really does exhibit
				1669	undefined behavior, the undefined behavior may be observed even
				1670	if the call site is dead code.
				1671
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1672	``ssp``
				1673	This attribute indicates that the function should emit a stack
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	1674	smashing protector. It is in the form of a "canary" --- a random value
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1675	placed on the stack before the local variables that's checked upon
				1676	return from the function to see if it has been overwritten. A
				1677	heuristic is used to determine if a function needs stack protectors
Bill Wendling	7c8f96a	2013-01-23 06:43:53 +0000	[diff] [blame]	1678	or not. The heuristic used will enable protectors for functions with:
Dmitri Gribenko	69b5647	2013-01-29 23:14:41 +0000	[diff] [blame]	1679
Bill Wendling	7c8f96a	2013-01-23 06:43:53 +0000	[diff] [blame]	1680	- Character arrays larger than ``ssp-buffer-size`` (default 8).
				1681	- Aggregates containing character arrays larger than ``ssp-buffer-size``.
				1682	- Calls to alloca() with variable sizes or constant sizes greater than
				1683	``ssp-buffer-size``.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1684
Josh Magee	24c7f06	2014-02-01 01:36:16 +0000	[diff] [blame]	1685	Variables that are identified as requiring a protector will be arranged
				1686	on the stack such that they are adjacent to the stack protector guard.
				1687
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1688	If a function that has an ``ssp`` attribute is inlined into a
				1689	function that doesn't have an ``ssp`` attribute, then the resulting
				1690	function will have an ``ssp`` attribute.
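					As an illustrative sketch of the heuristic (function names are
					hypothetical), the first function below contains a character array
					larger than the default ``ssp-buffer-size`` of 8, while the second
					contains nothing that the heuristic listed above looks for:

					.. code-block:: llvm

					   declare void @fill(i8*)

					   define void @uses_buffer() ssp {
					     ; 16-byte character array: a candidate for a stack protector.
					     %buf = alloca [16 x i8]
					     %p = getelementptr inbounds [16 x i8], [16 x i8]* %buf, i64 0, i64 0
					     call void @fill(i8* %p)
					     ret void
					   }

					   define i32 @no_buffer(i32 %x) ssp {
					     ; No arrays and no alloca: the heuristic is not expected to trigger.
					     %y = add i32 %x, 1
					     ret i32 %y
					   }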
				1691	``sspreq``
				1692	This attribute indicates that the function should always emit a
				1693	stack smashing protector. This overrides the ``ssp`` function
				1694	attribute.
				1695
Josh Magee	24c7f06	2014-02-01 01:36:16 +0000	[diff] [blame]	1696	Variables that are identified as requiring a protector will be arranged
				1697	on the stack such that they are adjacent to the stack protector guard.
				1698	The specific layout rules are:
				1699
				1700	#. Large arrays and structures containing large arrays
				1701	(``>= ssp-buffer-size``) are closest to the stack protector.
				1702	#. Small arrays and structures containing small arrays
				1703	(``< ssp-buffer-size``) are 2nd closest to the protector.
				1704	#. Variables that have had their address taken are 3rd closest to the
				1705	protector.
				1706
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1707	If a function that has an ``sspreq`` attribute is inlined into a
				1708	function that doesn't have an ``sspreq`` attribute or which has an
Bill Wendling	d154e283	2013-01-23 06:41:41 +0000	[diff] [blame]	1709	``ssp`` or ``sspstrong`` attribute, then the resulting function will have
				1710	an ``sspreq`` attribute.
				1711	``sspstrong``
				1712	This attribute indicates that the function should emit a stack smashing
Bill Wendling	7c8f96a	2013-01-23 06:43:53 +0000	[diff] [blame]	1713	protector. This attribute causes a strong heuristic to be used when
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1714	determining if a function needs stack protectors. The strong heuristic
Bill Wendling	7c8f96a	2013-01-23 06:43:53 +0000	[diff] [blame]	1715	will enable protectors for functions with:
Dmitri Gribenko	69b5647	2013-01-29 23:14:41 +0000	[diff] [blame]	1716
Bill Wendling	7c8f96a	2013-01-23 06:43:53 +0000	[diff] [blame]	1717	- Arrays of any size and type
				1718	- Aggregates containing an array of any size and type.
				1719	- Calls to alloca().
				1720	- Local variables that have had their address taken.
				1721
Josh Magee	24c7f06	2014-02-01 01:36:16 +0000	[diff] [blame]	1722	Variables that are identified as requiring a protector will be arranged
				1723	on the stack such that they are adjacent to the stack protector guard.
				1724	The specific layout rules are:
				1725
				1726	#. Large arrays and structures containing large arrays
				1727	(``>= ssp-buffer-size``) are closest to the stack protector.
				1728	#. Small arrays and structures containing small arrays
				1729	(``< ssp-buffer-size``) are 2nd closest to the protector.
				1730	#. Variables that have had their address taken are 3rd closest to the
				1731	protector.
				1732
Bill Wendling	7c8f96a	2013-01-23 06:43:53 +0000	[diff] [blame]	1733	This overrides the ``ssp`` function attribute.
Bill Wendling	d154e283	2013-01-23 06:41:41 +0000	[diff] [blame]	1734
				1735	If a function that has an ``sspstrong`` attribute is inlined into a
				1736	function that doesn't have an ``sspstrong`` attribute, then the
				1737	resulting function will have an ``sspstrong`` attribute.
Andrew Kaylor	53a5fbb	2017-08-14 21:15:13 +0000	[diff] [blame]	1738	``strictfp``
				1739	This attribute indicates that the function was called from a scope that
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	1740	requires strict floating-point semantics. LLVM will not attempt any
				1741	optimizations that require assumptions about the floating-point rounding
				1742	mode or that might alter the state of floating-point status flags that
Andrew Kaylor	53a5fbb	2017-08-14 21:15:13 +0000	[diff] [blame]	1743	might otherwise be set or cleared by calling this function.
Reid Kleckner	5a2ab2b	2015-03-04 00:08:56 +0000	[diff] [blame]	1744	``"thunk"``
				1745	This attribute indicates that the function will delegate to some other
				1746	function with a tail call. The prototype of a thunk should not be used for
				1747	optimization purposes. The caller is expected to cast the thunk prototype to
				1748	match the thunk target prototype.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1749	``uwtable``
				1750	This attribute indicates that the ABI being targeted requires that
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	1751	an unwind table entry be produced for this function even if we can
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1752	show that no exceptions pass by it. This is normally the case for
				1753	the ELF x86-64 ABI, but it can be disabled for some compilation
				1754	units.
Oren Ben Simhon	fdd72fd	2018-03-17 13:29:46 +0000	[diff] [blame]	1755	``nocf_check``
Hiroshi Inoue	c36a1f1	2018-06-15 05:10:09 +0000	[diff] [blame]	1756	This attribute indicates that no control-flow check will be performed on
Oren Ben Simhon	fdd72fd	2018-03-17 13:29:46 +0000	[diff] [blame]	1757	the attributed entity. It disables -fcf-protection=<> for a specific
				1758	entity to fine grain the HW control flow protection mechanism. The flag
Hiroshi Inoue	c36a1f1	2018-06-15 05:10:09 +0000	[diff] [blame]	1759	is target independent and currently appertains to a function or function
Oren Ben Simhon	fdd72fd	2018-03-17 13:29:46 +0000	[diff] [blame]	1760	pointer.
Vlad Tsyrklevich	d17f61e	2018-04-03 20:10:40 +0000	[diff] [blame]	1761	``shadowcallstack``
				1762	This attribute indicates that the ShadowCallStack checks are enabled for
				1763	the function. The instrumentation checks that the return address for the
				1764	function has not changed between the function prolog and epilog. It is
				1765	currently x86_64-specific.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1766
Javed Absar	f3d7904	2017-05-11 12:28:08 +0000	[diff] [blame]	1767	.. _glattrs:
				1768
				1769	Global Attributes
				1770	-----------------
				1771
				1772	Attributes may be set to communicate additional information about a global variable.
				1773	Unlike :ref:`function attributes <fnattrs>`, attributes on a global variable
				1774	are grouped into a single :ref:`attribute group <attrgrp>`.
Sanjoy Das	b513a9f	2015-09-24 23:34:52 +0000	[diff] [blame]	1775
				1776	.. _opbundles:
				1777
				1778	Operand Bundles
				1779	---------------
				1780
Sanjoy Das	b513a9f	2015-09-24 23:34:52 +0000	[diff] [blame]	1781	Operand bundles are tagged sets of SSA values that can be associated
Sanjoy Das	b0e9d4a5	2015-09-25 00:05:40 +0000	[diff] [blame]	1782	with certain LLVM instructions (currently only ``call`` s and
				1783	``invoke`` s). In a way they are like metadata, but dropping them is
Sanjoy Das	b513a9f	2015-09-24 23:34:52 +0000	[diff] [blame]	1784	incorrect and will change program semantics.
				1785
				1786	Syntax::
David Majnemer	34cacb4	2015-10-22 01:46:38 +0000	[diff] [blame]	1787
Sanjoy Das	9f3c125	2015-11-21 09:12:07 +0000	[diff] [blame]	1788	operand bundle set ::= '[' operand bundle (, operand bundle )* ']'
Sanjoy Das	b513a9f	2015-09-24 23:34:52 +0000	[diff] [blame]	1789	operand bundle ::= tag '(' [ bundle operand ] (, bundle operand )* ')'
				1790	bundle operand ::= SSA value
				1791	tag ::= string constant
				1792
				1793	Operand bundles are not part of a function's signature, and a
				1794	given function may be called from multiple places with different kinds
				1795	of operand bundles. This reflects the fact that the operand bundles
				1796	are conceptually a part of the ``call`` (or ``invoke``), not the
				1797	callee being dispatched to.
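
					As a small sketch of the syntax (the names ``@callee`` and ``@caller``
					and the tag ``"example-tag"`` are hypothetical), the same function can
					be called with no bundle, with one bundle, or with several:

					.. code-block:: llvm

					   declare void @callee(i32)

					   define void @caller(i32 %x, i8* %p) {
					     ; No operand bundle.
					     call void @callee(i32 %x)
					     ; One bundle with a single operand.
					     call void @callee(i32 %x) [ "deopt"(i32 %x) ]
					     ; Two bundles; the second carries two operands.
					     call void @callee(i32 %x) [ "deopt"(i32 %x), "example-tag"(i8* %p, i32 0) ]
					     ret void
					   }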
				1798
				1799	Operand bundles are a generic mechanism intended to support
				1800	runtime-introspection-like functionality for managed languages. While
				1801	the exact semantics of an operand bundle depend on the bundle tag,
				1802	there are certain limitations to how much the presence of an operand
				1803	bundle can influence the semantics of a program. These restrictions
				1804	are described as the semantics of an "unknown" operand bundle. As
				1805	long as the behavior of an operand bundle is describable within these
				1806	restrictions, LLVM does not need to have special knowledge of the
				1807	operand bundle to not miscompile programs containing it.
				1808
David Majnemer	34cacb4	2015-10-22 01:46:38 +0000	[diff] [blame]	1809	- The bundle operands for an unknown operand bundle escape in unknown
				1810	ways before control is transferred to the callee or invokee.
				1811	- Calls and invokes with operand bundles have unknown read / write
				1812	effect on the heap on entry and exit (even if the call target is
Sylvestre Ledru	84666a1	2016-02-14 20:16:22 +0000	[diff] [blame]	1813	``readnone`` or ``readonly``), unless they're overridden with
Sanjoy Das	98a341b	2015-10-22 03:12:22 +0000	[diff] [blame]	1814	callsite specific attributes.
				1815	- An operand bundle at a call site cannot change the implementation
				1816	of the called function. Inter-procedural optimizations work as
				1817	usual as long as they take into account the first two properties.
Sanjoy Das	b513a9f	2015-09-24 23:34:52 +0000	[diff] [blame]	1818
Sanjoy Das	cdafd84	2015-11-11 21:38:02 +0000	[diff] [blame]	1819	More specific types of operand bundles are described below.
				1820
Sanjoy Das	b51325d	2016-03-11 19:08:34 +0000	[diff] [blame]	1821	.. _deopt_opbundles:
				1822
Sanjoy Das	cdafd84	2015-11-11 21:38:02 +0000	[diff] [blame]	1823	Deoptimization Operand Bundles
				1824	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				1825
Sanjoy Das	9f3c125	2015-11-21 09:12:07 +0000	[diff] [blame]	1826	Deoptimization operand bundles are characterized by the ``"deopt"``
Sanjoy Das	cdafd84	2015-11-11 21:38:02 +0000	[diff] [blame]	1827	operand bundle tag. These operand bundles represent an alternate
				1828	"safe" continuation for the call site they're attached to, and can be
				1829	used by a suitable runtime to deoptimize the compiled frame at the
Sanjoy Das	9f3c125	2015-11-21 09:12:07 +0000	[diff] [blame]	1830	specified call site. There can be at most one ``"deopt"`` operand
				1831	bundle attached to a call site. Exact details of deoptimization are
				1832	out of scope for the language reference, but it usually involves
				1833	rewriting a compiled frame into a set of interpreted frames.
Sanjoy Das	cdafd84	2015-11-11 21:38:02 +0000	[diff] [blame]	1834
				1835	From the compiler's perspective, deoptimization operand bundles make
				1836	the call sites they're attached to at least ``readonly``. They read
				1837	through all of their pointer typed operands (even if they're not
				1838	otherwise escaped) and the entire visible heap. Deoptimization
				1839	operand bundles do not capture their operands except during
				1840	deoptimization, in which case control will not be returned to the
				1841	compiled frame.
				1842
Sanjoy Das	2d16145	2015-11-18 06:23:38 +0000	[diff] [blame]	1843	The inliner knows how to inline through calls that have deoptimization
				1844	operand bundles. Just like inlining through a normal call site
				1845	involves composing the normal and exceptional continuations, inlining
				1846	through a call site with a deoptimization operand bundle needs to
				1847	appropriately compose the "safe" deoptimization continuation. The
				1848	inliner does this by prepending the parent's deoptimization
				1849	continuation to every deoptimization continuation in the inlined body.
				1850	E.g. inlining ``@f`` into ``@g`` in the following example
				1851
				1852	.. code-block:: llvm
				1853
				1854	define void @f() {
				1855	call void @x() ;; no deopt state
				1856	call void @y() [ "deopt"(i32 10) ]
				1857	call void @y() [ "deopt"(i32 10), "unknown"(i8* null) ]
				1858	ret void
				1859	}
				1860
				1861	define void @g() {
				1862	call void @f() [ "deopt"(i32 20) ]
				1863	ret void
				1864	}
				1865
				1866	will result in
				1867
				1868	.. code-block:: llvm
				1869
				1870	define void @g() {
				1871	call void @x() ;; still no deopt state
				1872	call void @y() [ "deopt"(i32 20, i32 10) ]
				1873	call void @y() [ "deopt"(i32 20, i32 10), "unknown"(i8* null) ]
				1874	ret void
				1875	}
				1876
				1877	It is the frontend's responsibility to structure or encode the
				1878	deoptimization state in a way that syntactically prepending the
				1879	caller's deoptimization state to the callee's deoptimization state is
				1880	semantically equivalent to composing the caller's deoptimization
				1881	continuation after the callee's deoptimization continuation.
				1882
Joseph Tremoulet	e28885e	2016-01-10 04:28:38 +0000	[diff] [blame]	1883	.. _ob_funclet:
				1884
David Majnemer	3bb88c0	2015-12-15 21:27:27 +0000	[diff] [blame]	1885	Funclet Operand Bundles
				1886	^^^^^^^^^^^^^^^^^^^^^^^
				1887
				1888	Funclet operand bundles are characterized by the ``"funclet"``
				1889	operand bundle tag. These operand bundles indicate that a call site
				1890	is within a particular funclet. There can be at most one
				1891	``"funclet"`` operand bundle attached to a call site and it must have
				1892	exactly one bundle operand.
				1893
Joseph Tremoulet	e28885e	2016-01-10 04:28:38 +0000	[diff] [blame]	1894	If any funclet EH pads have been "entered" but not "exited" (per the
				1895	`description in the EH doc\ <ExceptionHandling.html#wineh-constraints>`_),
				1896	it is undefined behavior to execute a ``call`` or ``invoke`` which:
				1897
				1898	* does not have a ``"funclet"`` bundle and is not a ``call`` to a nounwind
				1899	intrinsic, or
				1900	* has a ``"funclet"`` bundle whose operand is not the most-recently-entered
				1901	not-yet-exited funclet EH pad.
				1902
				1903	Similarly, if no funclet EH pads have been entered-but-not-yet-exited,
				1904	executing a ``call`` or ``invoke`` with a ``"funclet"`` bundle is undefined behavior.
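
As a rough illustration of the rules above, the following sketch shows a
call made from inside a catch funclet. The function names ``@may_throw``
and ``@log_failure`` are hypothetical, and the MSVC-style personality
routine and ``catchpad`` arguments are assumptions chosen only to make the
example complete.

.. code-block:: llvm

    declare void @may_throw()
    declare void @log_failure()
    declare i32 @__CxxFrameHandler3(...)

    define void @f() personality i8* bitcast (i32 (...)* @__CxxFrameHandler3 to i8*) {
    entry:
      ; No funclet EH pad has been entered here, so no "funclet" bundle is used.
      invoke void @may_throw()
              to label %exit unwind label %dispatch

    dispatch:
      %cs = catchswitch within none [label %handler] unwind to caller

    handler:
      %pad = catchpad within %cs [i8* null, i32 64, i8* null]
      ; %pad is the most-recently-entered, not-yet-exited funclet EH pad,
      ; so the call names it in its "funclet" bundle.
      call void @log_failure() [ "funclet"(token %pad) ]
      catchret from %pad to label %exit

    exit:
      ret void
    }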
				1905
Sanjoy Das	a34ce95	2016-01-20 19:50:25 +0000	[diff] [blame]	1906	GC Transition Operand Bundles
				1907	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				1908
				1909	GC transition operand bundles are characterized by the
				1910	``"gc-transition"`` operand bundle tag. These operand bundles mark a
				1911	call as a transition between a function with one GC strategy to a
				1912	function with a different GC strategy. If coordinating the transition
				1913	between GC strategies requires additional code generation at the call
				1914	site, these bundles may contain any values that are needed by the
				1915	generated code. For more details, see :ref:`GC Transitions
				1916	<gc_transition_args>`.
				1917
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1918	.. _moduleasm:
				1919
				1920	Module-Level Inline Assembly
				1921	----------------------------
				1922
Modules may contain "module-level inline asm" blocks, which correspond
				1924	to the GCC "file scope inline asm" blocks. These blocks are internally
				1925	concatenated by LLVM and treated as a single unit, but may be separated
				1926	in the ``.ll`` file if desired. The syntax is very simple:
				1927
				1928	.. code-block:: llvm
				1929
				1930	module asm "inline asm code goes here"
				1931	module asm "more can go here"
				1932
				1933	The strings can contain any character by escaping non-printable
characters. The escape sequence used is simply "\\xx" where "xx" is the
two-digit hexadecimal code of the character.
				1936
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	1937	Note that the assembly string must be parseable by LLVM's integrated assembler
				1938	(unless it is disabled), even when emitting a ``.s`` file.
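
A small sketch of the escape syntax follows. The symbol name
``example_marker`` is hypothetical, and the directives assume a target
whose assembler accepts GNU-style ``.globl`` and ``.long``; ``\0A`` and
``\09`` are the hex escapes for a newline and a tab.

.. code-block:: llvm

    ; Two separate blocks; LLVM concatenates them into one unit.
    module asm ".globl example_marker"
    ; One string holding two assembly lines, separated by an escaped newline.
    module asm "example_marker:\0A\09.long 0"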
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1939
Eli Bendersky	fdc529a	2013-06-07 19:40:08 +0000	[diff] [blame]	1940	.. _langref_datalayout:
				1941
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1942	Data Layout
				1943	-----------
				1944
				1945	A module may specify a target specific data layout string that specifies
				1946	how data is to be laid out in memory. The syntax for the data layout is
				1947	simply:
				1948
				1949	.. code-block:: llvm
				1950
				1951	target datalayout = "layout specification"
				1952
				1953	The layout specification consists of a list of specifications
				1954	separated by the minus sign character ('-'). Each specification starts
				1955	with a letter and may include other information after the letter to
				1956	define some aspect of the data layout. The specifications accepted are
				1957	as follows:
				1958
				1959	``E``
				1960	Specifies that the target lays out data in big-endian form. That is,
				1961	the bits with the most significance have the lowest address
				1962	location.
				1963	``e``
				1964	Specifies that the target lays out data in little-endian form. That
				1965	is, the bits with the least significance have the lowest address
				1966	location.
				1967	``S<size>``
				1968	Specifies the natural alignment of the stack in bits. Alignment
				1969	promotion of stack variables is limited to the natural stack
				1970	alignment to avoid dynamic stack realignment. The stack alignment
must be a multiple of 8 bits. If omitted, the natural stack
				1972	alignment defaults to "unspecified", which does not prevent any
				1973	alignment promotions.
Dylan McKay	ced2fe6	2018-02-19 09:56:22 +0000	[diff] [blame]	1974	``P<address space>``
				1975	Specifies the address space that corresponds to program memory.
				1976	Harvard architectures can use this to specify what space LLVM
				1977	should place things such as functions into. If omitted, the
				1978	program memory space defaults to the default address space of 0,
				1979	which corresponds to a Von Neumann architecture that has code
				1980	and data in the same space.
Matt Arsenault	3c1fc76	2017-04-10 22:27:50 +0000	[diff] [blame]	1981	``A<address space>``
Dylan McKay	ced2fe6	2018-02-19 09:56:22 +0000	[diff] [blame]	1982	Specifies the address space of objects created by '``alloca``'.
Matt Arsenault	3c1fc76	2017-04-10 22:27:50 +0000	[diff] [blame]	1983	Defaults to the default address space of 0.
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	1984	``p[n]:<size>:<abi>:<pref>:<idx>``
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1985	This specifies the size of a pointer and its ``<abi>`` and
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	1986	``<pref>``\erred alignments for address space ``n``. The fourth parameter
``<idx>`` is the size of the index used for address calculation. If not
				1988	specified, the default index size is equal to the pointer size. All sizes
				1989	are in bits. The address space, ``n``, is optional, and if not specified,
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	1990	denotes the default address space 0. The value of ``n`` must be
Rafael Espindola	abdd726	2014-01-06 21:40:24 +0000	[diff] [blame]	1991	in the range [1,2^23).
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	1992	``i<size>:<abi>:<pref>``
				1993	This specifies the alignment for an integer type of a given bit
				1994	``<size>``. The value of ``<size>`` must be in the range [1,2^23).
				1995	``v<size>:<abi>:<pref>``
				1996	This specifies the alignment for a vector type of a given bit
				1997	``<size>``.
				1998	``f<size>:<abi>:<pref>``
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	1999	This specifies the alignment for a floating-point type of a given bit
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2000	``<size>``. Only values of ``<size>`` that are supported by the target
				2001	will work. 32 (float) and 64 (double) are supported on all targets; 80
				2002	or 128 (different flavors of long double) are also supported on some
				2003	targets.
Rafael Espindola	abdd726	2014-01-06 21:40:24 +0000	[diff] [blame]	2004	``a:<abi>:<pref>``
				2005	This specifies the alignment for an object of aggregate type.
Rafael Espindola	5887356	2014-01-03 19:21:54 +0000	[diff] [blame]	2006	``m:<mangling>``
If present, specifies that LLVM names are mangled in the output. Symbols
				2008	prefixed with the mangling escape character ``\01`` are passed through
				2009	directly to the assembler without the escape character. The mangling style
Hans Wennborg	d4245ac	2014-01-15 02:49:17 +0000	[diff] [blame]	2010	options are
				2011
				2012	* ``e``: ELF mangling: Private symbols get a ``.L`` prefix.
				2013	* ``m``: Mips mangling: Private symbols get a ``$`` prefix.
* ``o``: Mach-O mangling: Private symbols get an ``L`` prefix. Other
				2015	symbols get a ``_`` prefix.
Reid Kleckner	f8b51c5	2018-03-16 20:13:32 +0000	[diff] [blame]	2016	* ``x``: Windows x86 COFF mangling: Private symbols get the usual prefix.
				2017	Regular C symbols get a ``_`` prefix. Functions with ``__stdcall``,
				2018	``__fastcall``, and ``__vectorcall`` have custom mangling that appends
				2019	``@N`` where N is the number of bytes used to pass parameters. C++ symbols
				2020	starting with ``?`` are not mangled in any way.
				2021	* ``w``: Windows COFF mangling: Similar to ``x``, except that normal C
				2022	symbols do not receive a ``_`` prefix.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2023	``n<size1>:<size2>:<size3>...``
				2024	This specifies a set of native integer widths for the target CPU in
				2025	bits. For example, it might contain ``n32`` for 32-bit PowerPC,
				2026	``n32:64`` for PowerPC 64, or ``n8:16:32:64`` for X86-64. Elements of
				2027	this set are considered to support most general arithmetic operations
				2028	efficiently.
Sanjoy Das	c6af5ea	2016-07-28 23:43:38 +0000	[diff] [blame]	2029	``ni:<address space0>:<address space1>:<address space2>...``
				2030	This specifies pointer types with the specified address spaces
				2031	as :ref:`Non-Integral Pointer Type <nointptrtype>` s. The ``0``
				2032	address space cannot be specified as non-integral.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2033
Rafael Espindola	abdd726	2014-01-06 21:40:24 +0000	[diff] [blame]	2034	On every specification that takes a ``<abi>:<pref>``, specifying the
				2035	``<pref>`` alignment is optional. If omitted, the preceding ``:``
				2036	should be omitted too and ``<pref>`` will be equal to ``<abi>``.
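
To show how the individual specifications combine, here is a layout string
resembling the one commonly used for x86-64 ELF targets. It is meant as an
illustration of the syntax only; the string for a real target must come
from that target's code generator.

.. code-block:: llvm

    ; e             little-endian
    ; m:e           ELF mangling (private symbols get a .L prefix)
    ; i64:64        i64 has 64-bit ABI alignment; <pref> is omitted, so it equals <abi>
    ; f80:128       80-bit floating-point values are 128-bit aligned
    ; n8:16:32:64   native integer widths
    ; S128          natural stack alignment of 128 bits
    target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

Every specification not mentioned in the string keeps its default value.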
				2037
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2038	When constructing the data layout for a given target, LLVM starts with a
				2039	default set of specifications which are then (possibly) overridden by
				2040	the specifications in the ``datalayout`` keyword. The default
				2041	specifications are given in this list:
				2042
				2043	- ``E`` - big endian
Matt Arsenault	24b49c4	2013-07-31 17:49:08 +0000	[diff] [blame]	2044	- ``p:64:64:64`` - 64-bit pointers with 64-bit alignment.
				2045	- ``p[n]:64:64:64`` - Other address spaces are assumed to be the
				2046	same as the default address space.
Patrik Hagglund	a832ab1	2013-01-30 09:02:06 +0000	[diff] [blame]	2047	- ``S0`` - natural stack alignment is unspecified
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2048	- ``i1:8:8`` - i1 is 8-bit (byte) aligned
				2049	- ``i8:8:8`` - i8 is 8-bit (byte) aligned
				2050	- ``i16:16:16`` - i16 is 16-bit aligned
				2051	- ``i32:32:32`` - i32 is 32-bit aligned
				2052	- ``i64:32:64`` - i64 has ABI alignment of 32-bits but preferred
				2053	alignment of 64-bits
Patrik Hagglund	a832ab1	2013-01-30 09:02:06 +0000	[diff] [blame]	2054	- ``f16:16:16`` - half is 16-bit aligned
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2055	- ``f32:32:32`` - float is 32-bit aligned
				2056	- ``f64:64:64`` - double is 64-bit aligned
Patrik Hagglund	a832ab1	2013-01-30 09:02:06 +0000	[diff] [blame]	2057	- ``f128:128:128`` - quad is 128-bit aligned
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2058	- ``v64:64:64`` - 64-bit vector is 64-bit aligned
				2059	- ``v128:128:128`` - 128-bit vector is 128-bit aligned
- ``a:0:64`` - aggregates have ABI alignment of 0 bits but preferred
alignment of 64 bits
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2061
				2062	When LLVM is determining the alignment for a given type, it uses the
				2063	following rules:
				2064
				2065	#. If the type sought is an exact match for one of the specifications,
				2066	that specification is used.
				2067	#. If no match is found, and the type sought is an integer type, then
				2068	the smallest integer type that is larger than the bitwidth of the
				2069	sought type is used. If none of the specifications are larger than
				2070	the bitwidth then the largest integer type is used. For example,
				2071	given the default specifications above, the i7 type will use the
				2072	alignment of i8 (next largest) while both i65 and i256 will use the
				2073	alignment of i64 (largest specified).
				2074	#. If no match is found, and the type sought is a vector type, then the
				2075	largest vector type that is smaller than the sought vector type will
				2076	be used as a fall back. This happens because <128 x double> can be
				2077	implemented in terms of 64 <2 x double>, for example.
				2078
				2079	The function of the data layout string may not be what you expect.
				2080	Notably, this is not a specification from the frontend of what alignment
				2081	the code generator should use.
				2082
				2083	Instead, if specified, the target data layout is required to match what
				2084	the ultimate code generator expects. This string is used by the
				2085	mid-level optimizers to improve code, and this only works if it matches
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	2086	what the ultimate code generator uses. There is no way to generate IR
				2087	that does not embed this target-specific detail into the IR. If you
				2088	don't specify the string, the default specifications will be used to
				2089	generate a Data Layout and the optimization phases will operate
				2090	accordingly and introduce target specificity into the IR with respect to
				2091	these default specifications.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2092
Bill Wendling	5cc9084	2013-10-18 23:41:25 +0000	[diff] [blame]	2093	.. _langref_triple:
				2094
				2095	Target Triple
				2096	-------------
				2097
				2098	A module may specify a target triple string that describes the target
				2099	host. The syntax for the target triple is simply:
				2100
				2101	.. code-block:: llvm
				2102
				2103	target triple = "x86_64-apple-macosx10.7.0"
				2104
				2105	The target triple string consists of a series of identifiers delimited
				2106	by the minus sign character ('-'). The canonical forms are:
				2107
				2108	::
				2109
				2110	ARCHITECTURE-VENDOR-OPERATING_SYSTEM
				2111	ARCHITECTURE-VENDOR-OPERATING_SYSTEM-ENVIRONMENT
				2112
				2113	This information is passed along to the backend so that it generates
				2114	code for the proper architecture. It's possible to override this on the
				2115	command line with the ``-mtriple`` command line option.
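
For comparison, a triple in the second canonical form, with an environment
component, might look like the following (architecture ``x86_64``, vendor
``unknown``, operating system ``linux``, environment ``gnu``):

.. code-block:: llvm

    target triple = "x86_64-unknown-linux-gnu"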
				2116
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2117	.. _pointeraliasing:
				2118
				2119	Pointer Aliasing Rules
				2120	----------------------
				2121
				2122	Any memory access must be done through a pointer value associated with
				2123	an address range of the memory access, otherwise the behavior is
				2124	undefined. Pointer values are associated with address ranges according
				2125	to the following rules:
				2126
				2127	- A pointer value is associated with the addresses associated with any
				2128	value it is based on.
				2129	- An address of a global variable is associated with the address range
				2130	of the variable's storage.
				2131	- The result value of an allocation instruction is associated with the
				2132	address range of the allocated storage.
				2133	- A null pointer in the default address-space is associated with no
				2134	address.
				2135	- An integer constant other than zero or a pointer value returned from
				2136	a function not defined within LLVM may be associated with address
				2137	ranges allocated through mechanisms other than those provided by
				2138	LLVM. Such ranges shall not overlap with any ranges of addresses
				2139	allocated by mechanisms provided by LLVM.
				2140
				2141	A pointer value is based on another pointer value according to the
				2142	following rules:
				2143
Sanjoy Das	6d48949	2017-09-13 18:49:22 +0000	[diff] [blame]	2144	- A pointer value formed from a scalar ``getelementptr`` operation is based on
				2145	the pointer-typed operand of the ``getelementptr``.
				2146	- The pointer in lane l of the result of a vector ``getelementptr`` operation
				2147	is based on the pointer in lane l of the vector-of-pointers-typed operand
				2148	of the ``getelementptr``.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2149	- The result value of a ``bitcast`` is based on the operand of the
				2150	``bitcast``.
				2151	- A pointer value formed by an ``inttoptr`` is based on all pointer
				2152	values that contribute (directly or indirectly) to the computation of
				2153	the pointer's value.
				2154	- The "based on" relationship is transitive.
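
The rules above can be traced through a short function. This is only a
sketch: the function name is hypothetical, and it assumes ``%p`` points to
storage of at least two ``i32`` values so that the final load is within
the associated address range.

.. code-block:: llvm

    define i8 @based_on_chain(i32* %p) {
      ; %q is based on %p (scalar getelementptr).
      %q = getelementptr i32, i32* %p, i64 1
      ; %r is based on %q (bitcast), and by transitivity on %p.
      %r = bitcast i32* %q to i8*
      ; %s is based on %r, because %r contributes to the integer %j.
      %i = ptrtoint i8* %r to i64
      %j = add i64 %i, 1
      %s = inttoptr i64 %j to i8*
      ; The access goes through a pointer associated with %p's address range.
      %v = load i8, i8* %s, align 1
      ret i8 %v
    }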
				2155
				2156	Note that this definition of "based" is intentionally similar to the
				2157	definition of "based" in C99, though it is slightly weaker.
				2158
				2159	LLVM IR does not associate types with memory. The result type of a
				2160	``load`` merely indicates the size and alignment of the memory from
				2161	which to load, as well as the interpretation of the value. The first
				2162	operand type of a ``store`` similarly only indicates the size and
				2163	alignment of the store.
				2164
				2165	Consequently, type-based alias analysis, aka TBAA, aka
				2166	``-fstrict-aliasing``, is not applicable to general unadorned LLVM IR.
				2167	:ref:`Metadata <metadata>` may be used to encode additional information
				2168	which specialized optimization passes may use to implement type-based
				2169	alias analysis.
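
As a minimal sketch of what this means in practice, the following
(hypothetically named) function stores a ``float`` and reads the same
bytes back as an ``i32``. Because memory carries no type, the load is well
defined and simply reinterprets the stored bits.

.. code-block:: llvm

    define i32 @float_bits(float %x) {
      %slot = alloca float, align 4
      store float %x, float* %slot, align 4
      ; Same address, different access type: only size and alignment matter.
      %as_int = bitcast float* %slot to i32*
      %bits = load i32, i32* %as_int, align 4
      ret i32 %bits
    }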
				2170
				2171	.. _volatile:
				2172
				2173	Volatile Memory Accesses
				2174	------------------------
				2175
				2176	Certain memory accesses, such as :ref:`load <i_load>`'s,
				2177	:ref:`store <i_store>`'s, and :ref:`llvm.memcpy <int_memcpy>`'s may be
				2178	marked ``volatile``. The optimizers must not change the number of
				2179	volatile operations or change their order of execution relative to other
				2180	volatile operations. The optimizers may change the order of volatile
				2181	operations relative to non-volatile operations. This is not Java's
				2182	"volatile" and has no cross-thread synchronization behavior.
				2183
Andrew Trick	89fc5a6	2013-01-30 21:19:35 +0000	[diff] [blame]	2184	IR-level volatile loads and stores cannot safely be optimized into
				2185	llvm.memcpy or llvm.memmove intrinsics even when those intrinsics are
				2186	flagged volatile. Likewise, the backend should never split or merge
				2187	target-legal volatile load/store instructions.
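
A short sketch of these constraints, using hypothetical names and assuming
``%ctrl`` and ``%status`` refer to device registers:

.. code-block:: llvm

    define i32 @poll_device(i32* %ctrl, i32* %status) {
      ; The volatile store must remain before both volatile loads.
      store volatile i32 1, i32* %ctrl, align 4
      ; Both loads must be kept, in this order, even though they read
      ; the same address and nothing in between appears to change it.
      %first = load volatile i32, i32* %status, align 4
      %second = load volatile i32, i32* %status, align 4
      %either = or i32 %first, %second
      ret i32 %either
    }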
				2188
Andrew Trick	7e6f928	2013-01-31 00:49:39 +0000	[diff] [blame]	2189	.. admonition:: Rationale
				2190
				2191	Platforms may rely on volatile loads and stores of natively supported
data width to be executed as a single instruction. For example, in C
				2193	this holds for an l-value of volatile primitive type with native
				2194	hardware support, but not necessarily for aggregate types. The
				2195	frontend upholds these expectations, which are intentionally
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	2196	unspecified in the IR. The rules above ensure that IR transformations
Andrew Trick	7e6f928	2013-01-31 00:49:39 +0000	[diff] [blame]	2197	do not violate the frontend's contract with the language.
				2198
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2199	.. _memmodel:
				2200
				2201	Memory Model for Concurrent Operations
				2202	--------------------------------------
				2203
				2204	The LLVM IR does not define any way to start parallel threads of
				2205	execution or to register signal handlers. Nonetheless, there are
				2206	platform-specific ways to create them, and we define LLVM IR's behavior
				2207	in their presence. This model is inspired by the C++0x memory model.
				2208
				2209	For a more informal introduction to this model, see the :doc:`Atomics`.
				2210
				2211	We define a happens-before partial order as the least partial order
				2212	that
				2213
				2214	- Is a superset of single-thread program order, and
- When ``a`` synchronizes-with ``b``, includes an edge from ``a`` to
				2216	``b``. Synchronizes-with pairs are introduced by platform-specific
				2217	techniques, like pthread locks, thread creation, thread joining,
				2218	etc., and by atomic instructions. (See also :ref:`Atomic Memory Ordering
				2219	Constraints <ordering>`).
				2220
				2221	Note that program order does not introduce happens-before edges
				2222	between a thread and signals executing inside that thread.
				2223
				2224	Every (defined) read operation (load instructions, memcpy, atomic
				2225	loads/read-modify-writes, etc.) R reads a series of bytes written by
				2226	(defined) write operations (store instructions, atomic
				2227	stores/read-modify-writes, memcpy, etc.). For the purposes of this
				2228	section, initialized globals are considered to have a write of the
				2229	initializer which is atomic and happens before any other read or write
				2230	of the memory in question. For each byte of a read R, R\ :sub:`byte`
				2231	may see any write to the same byte, except:
				2232
				2233	- If write\ :sub:`1` happens before write\ :sub:`2`, and
				2234	write\ :sub:`2` happens before R\ :sub:`byte`, then
				2235	R\ :sub:`byte` does not see write\ :sub:`1`.
				2236	- If R\ :sub:`byte` happens before write\ :sub:`3`, then
				2237	R\ :sub:`byte` does not see write\ :sub:`3`.
				2238
				2239	Given that definition, R\ :sub:`byte` is defined as follows:
				2240
				2241	- If R is volatile, the result is target-dependent. (Volatile is
				2242	supposed to give guarantees which can support ``sig_atomic_t`` in
Richard Smith	32dbdf6	2014-07-31 04:25:36 +0000	[diff] [blame]	2243	C/C++, and may be used for accesses to addresses that do not behave
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2244	like normal memory. It does not generally provide cross-thread
				2245	synchronization.)
				2246	- Otherwise, if there is no write to the same byte that happens before
				2247	R\ :sub:`byte`, R\ :sub:`byte` returns ``undef`` for that byte.
				2248	- Otherwise, if R\ :sub:`byte` may see exactly one write,
				2249	R\ :sub:`byte` returns the value written by that write.
				2250	- Otherwise, if R is atomic, and all the writes R\ :sub:`byte` may
				2251	see are atomic, it chooses one of the values written. See the :ref:`Atomic
				2252	Memory Ordering Constraints <ordering>` section for additional
				2253	constraints on how the choice is made.
				2254	- Otherwise R\ :sub:`byte` returns ``undef``.
				2255
				2256	R returns the value composed of the series of bytes it read. This
				2257	implies that some bytes within the value may be ``undef`` without
				2258	the entire value being ``undef``. Note that this only defines the
				2259	semantics of the operation; it doesn't mean that targets will emit more
				2260	than one instruction to read the series of bytes.
				2261
				2262	Note that in cases where none of the atomic intrinsics are used, this
				2263	model places only one restriction on IR transformations on top of what
				2264	is required for single-threaded execution: introducing a store to a byte
				2265	which might not otherwise be stored is not allowed in general.
				2266	(Specifically, in the case where another thread might write to and read
				2267	from an address, introducing a store can change a load that may see
				2268	exactly one write into a load that may see multiple writes.)
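
The restriction on introducing stores can be illustrated with a pair of
hypothetical functions. The second one is what a single-threaded optimizer
might like to produce from the first, and it is not a valid replacement in
general under this model.

.. code-block:: llvm

    ; Original: the store executes only when %c is true.
    define void @maybe_store(i1 %c, i32* %p) {
    entry:
      br i1 %c, label %then, label %done
    then:
      store i32 1, i32* %p, align 4
      br label %done
    done:
      ret void
    }

    ; NOT a valid replacement in general: when %c is false this version
    ; still stores to %p (writing back the old value), a store that the
    ; original never performed.
    define void @maybe_store_speculated(i1 %c, i32* %p) {
      %old = load i32, i32* %p, align 4
      %new = select i1 %c, i32 1, i32 %old
      store i32 %new, i32* %p, align 4
      ret void
    }

If another thread writes to ``%p`` while ``%c`` is false, a load in that
thread that previously could see exactly one write may now see two.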
				2269
				2270	.. _ordering:
				2271
				2272	Atomic Memory Ordering Constraints
				2273	----------------------------------
				2274
				2275	Atomic instructions (:ref:`cmpxchg <i_cmpxchg>`,
				2276	:ref:`atomicrmw <i_atomicrmw>`, :ref:`fence <i_fence>`,
				2277	:ref:`atomic load <i_load>`, and :ref:`atomic store <i_store>`) take
Tim Northover	e94a518	2014-03-11 10:48:52 +0000	[diff] [blame]	2278	ordering parameters that determine which other atomic instructions on
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2279	the same address they synchronize with. These semantics are borrowed
				2280	from Java and C++0x, but are somewhat more colloquial. If these
				2281	descriptions aren't precise enough, check those specs (see spec
				2282	references in the :doc:`atomics guide <Atomics>`).
				2283	:ref:`fence <i_fence>` instructions treat these orderings somewhat
				2284	differently since they don't take an address. See that instruction's
				2285	documentation for details.
				2286
				2287	For a simpler introduction to the ordering constraints, see the
				2288	:doc:`Atomics`.
				2289
				2290	``unordered``
				2291	The set of values that can be read is governed by the happens-before
				2292	partial order. A value cannot be read unless some operation wrote
				2293	it. This is intended to provide a guarantee strong enough to model
				2294	Java's non-volatile shared variables. This ordering cannot be
				2295	specified for read-modify-write operations; it is not strong enough
				2296	to make them atomic in any interesting way.
				2297	``monotonic``
				2298	In addition to the guarantees of ``unordered``, there is a single
				2299	total order for modifications by ``monotonic`` operations on each
				2300	address. All modification orders must be compatible with the
				2301	happens-before order. There is no guarantee that the modification
				2302	orders can be combined to a global total order for the whole program
				2303	(and this often will not be possible). The read in an atomic
				2304	read-modify-write operation (:ref:`cmpxchg <i_cmpxchg>` and
				2305	:ref:`atomicrmw <i_atomicrmw>`) reads the value in the modification
				2306	order immediately before the value it writes. If one atomic read
				2307	happens before another atomic read of the same address, the later
				2308	read must see the same value or a later value in the address's
				2309	modification order. This disallows reordering of ``monotonic`` (or
				2310	stronger) operations on the same address. If an address is written
				2311	``monotonic``-ally by one thread, and other threads ``monotonic``-ally
				2312	read that address repeatedly, the other threads must eventually see
				2313	the write. This corresponds to the C++0x/C1x
				2314	``memory_order_relaxed``.
				2315	``acquire``
				2316	In addition to the guarantees of ``monotonic``, a
				2317	synchronizes-with edge may be formed with a ``release`` operation.
				2318	This is intended to model C++'s ``memory_order_acquire``.
				2319	``release``
				2320	In addition to the guarantees of ``monotonic``, if this operation
				2321	writes a value which is subsequently read by an ``acquire``
				2322	operation, it synchronizes-with that operation. (This isn't a
				2323	complete description; see the C++0x definition of a release
				2324	sequence.) This corresponds to the C++0x/C1x
				2325	``memory_order_release``.
				2326	``acq_rel`` (acquire+release)
				2327	Acts as both an ``acquire`` and ``release`` operation on its
				2328	address. This corresponds to the C++0x/C1x ``memory_order_acq_rel``.
				2329	``seq_cst`` (sequentially consistent)
				2330	In addition to the guarantees of ``acq_rel`` (``acquire`` for an
Richard Smith	32dbdf6	2014-07-31 04:25:36 +0000	[diff] [blame]	2331	operation that only reads, ``release`` for an operation that only
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2332	writes), there is a global total order on all
				2333	sequentially-consistent operations on all addresses, which is
				2334	consistent with the happens-before partial order and with the
				2335	modification orders of all the affected addresses. Each
				2336	sequentially-consistent read sees the last preceding write to the
				2337	same address in this global order. This corresponds to the C++0x/C1x
				2338	``memory_order_seq_cst`` and Java volatile.
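
As a sketch of how ``release`` and ``acquire`` pair up, consider the
following message-passing pattern. The globals and function names are
hypothetical, and the two functions are assumed to run on different
threads created by some platform-specific mechanism.

.. code-block:: llvm

    @data = global i32 0
    @flag = global i32 0

    define void @producer() {
      ; Ordinary store, published by the release store that follows it.
      store i32 42, i32* @data, align 4
      store atomic i32 1, i32* @flag release, align 4
      ret void
    }

    define i32 @consumer() {
    entry:
      br label %spin
    spin:
      ; Once this acquire load reads the value written by the release
      ; store, the release store synchronizes-with it.
      %f = load atomic i32, i32* @flag acquire, align 4
      %ready = icmp eq i32 %f, 1
      br i1 %ready, label %done, label %spin
    done:
      ; The store to @data happens before this load, so it sees 42.
      %v = load i32, i32* @data, align 4
      ret i32 %v
    }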
				2339
Konstantin Zhuravlyov	bb80d3e	2017-07-11 22:23:00 +0000	[diff] [blame]	2340	.. _syncscope:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2341
Konstantin Zhuravlyov	bb80d3e	2017-07-11 22:23:00 +0000	[diff] [blame]	2342	If an atomic operation is marked ``syncscope("singlethread")``, it only
				2343	synchronizes with and only participates in the seq\_cst total orderings of
				2344	other operations running in the same thread (for example, in signal handlers).
				2345
				2346	If an atomic operation is marked ``syncscope("<target-scope>")``, where
``<target-scope>`` is a target-specific synchronization scope, then it is target
dependent whether it synchronizes with and participates in the seq\_cst total
				2349	orderings of other operations.
				2350
				2351	Otherwise, an atomic operation that is not marked ``syncscope("singlethread")``
				2352	or ``syncscope("<target-scope>")`` synchronizes with and participates in the
				2353	seq\_cst total orderings of other operations that are not marked
				2354	``syncscope("singlethread")`` or ``syncscope("<target-scope>")``.
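
For instance, operations that only need to be ordered with respect to a
signal handler running in the same thread could be written as follows.
This is a sketch with hypothetical names.

.. code-block:: llvm

    @handler_flag = global i32 0

    define void @set_flag_for_signal_handler() {
      ; Ordered only against other operations in this thread,
      ; such as a signal handler that interrupts it.
      store atomic i32 1, i32* @handler_flag syncscope("singlethread") release, align 4
      fence syncscope("singlethread") seq_cst
      ret void
    }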
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2355
Sanjay Patel	54b161e	2018-03-20 16:38:22 +0000	[diff] [blame]	2356	.. _floatenv:
				2357
				2358	Floating-Point Environment
				2359	--------------------------
				2360
				2361	The default LLVM floating-point environment assumes that floating-point
				2362	instructions do not have side effects. Results assume the round-to-nearest
				2363	rounding mode. No floating-point exception state is maintained in this
				2364	environment. Therefore, there is no attempt to create or preserve invalid
Chandler Carruth	297620d	2018-08-06 02:02:09 +0000	[diff] [blame]	2365	operation (SNaN) or division-by-zero exceptions.
Sanjay Patel	54b161e	2018-03-20 16:38:22 +0000	[diff] [blame]	2366
				2367	The benefit of this exception-free assumption is that floating-point
				2368	operations may be speculated freely without any other fast-math relaxations
				2369	to the floating-point model.
				2370
				2371	Code that requires different behavior than this should use the
Sanjay Patel	ec95e0e	2018-03-20 17:05:19 +0000	[diff] [blame]	2372	:ref:`Constrained Floating-Point Intrinsics <constrainedfp>`.
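
A sketch of what such code might look like is shown below. The function
name is hypothetical, and the metadata strings for the rounding mode and
exception behavior are assumptions that should be checked against the
constrained intrinsics section itself.

.. code-block:: llvm

    declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)

    define double @strict_add(double %a, double %b) {
      ; Rounding mode is read from the environment at run time, and
      ; floating-point exceptions are treated as observable.
      %sum = call double @llvm.experimental.constrained.fadd.f64(
                 double %a, double %b,
                 metadata !"round.dynamic",
                 metadata !"fpexcept.strict")
      ret double %sum
    }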
Sanjay Patel	54b161e	2018-03-20 16:38:22 +0000	[diff] [blame]	2373
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2374	.. _fastmath:
				2375
				2376	Fast-Math Flags
				2377	---------------
				2378
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	2379	LLVM IR floating-point operations (:ref:`fadd <i_fadd>`,
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2380	:ref:`fsub <i_fsub>`, :ref:`fmul <i_fmul>`, :ref:`fdiv <i_fdiv>`,
Matt Arsenault	74b73e5	2017-01-10 18:06:38 +0000	[diff] [blame]	2381	:ref:`frem <i_frem>`, :ref:`fcmp <i_fcmp>`) and :ref:`call <i_call>`
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	2382	may use the following flags to enable otherwise unsafe
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	2383	floating-point transformations.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2384
				2385	``nnan``
				2386	No NaNs - Allow optimizations to assume the arguments and result are not
NaN. If an argument is a NaN, or the result would be a NaN, it produces
				2388	a :ref:`poison value <poisonvalues>` instead.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2389
				2390	``ninf``
				2391	No Infs - Allow optimizations to assume the arguments and result are not
Eli Friedman	d3a3087	2018-07-17 20:31:42 +0000	[diff] [blame]	2392	+/-Inf. If an argument is +/-Inf, or the result would be +/-Inf, it
				2393	produces a :ref:`poison value <poisonvalues>` instead.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2394
				2395	``nsz``
				2396	No Signed Zeros - Allow optimizations to treat the sign of a zero
				2397	argument or result as insignificant.
				2398
				2399	``arcp``
				2400	Allow Reciprocal - Allow optimizations to use the reciprocal of an
				2401	argument rather than perform division.
				2402
Adam Nemet	cd847a8	2017-03-28 20:11:52 +0000	[diff] [blame]	2403	``contract``
				2404	Allow floating-point contraction (e.g. fusing a multiply followed by an
				2405	addition into a fused multiply-and-add).
				2406
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	2407	``afn``
				2408	Approximate functions - Allow substitution of approximate calculations for
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	2409	functions (sin, log, sqrt, etc). See floating-point intrinsic definitions
				2410	for places where this can apply to LLVM's intrinsic math functions.
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	2411
				2412	``reassoc``
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	2413	Allow reassociation transformations for floating-point instructions.
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	2414	This may dramatically change results in floating-point.
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	2415
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2416	``fast``
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	2417	This flag implies all of the others.
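
As an illustrative sketch (the function names below are hypothetical and not
part of this reference), flags are written between the opcode and the type,
and several flags may be combined on one instruction:

.. code-block:: llvm

    ; 'contract' on both instructions permits, but does not require,
    ; fusing the multiply and the add into one fused multiply-add.
    define float @muladd(float %a, float %b, float %c) {
      %mul = fmul contract float %a, %b
      %sum = fadd contract float %mul, %c
      ret float %sum
    }

    ; 'reassoc' lets the optimizer regroup the additions; 'nsz' lets it
    ; treat the sign of a zero as insignificant.
    define float @sum3(float %x, float %y, float %z) {
      %t = fadd reassoc nsz float %x, %y
      %r = fadd reassoc nsz float %t, %z
      ret float %r
    }

    ; 'arcp' allows the division to be rewritten as a multiply by 1/%y.
    define float @scale(float %x, float %y) {
      %q = fdiv arcp float %x, %y
      ret float %q
    }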
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2418
Duncan P. N. Exon Smith	0a448fb	2014-08-19 21:30:15 +0000	[diff] [blame]	2419	.. _uselistorder:
				2420
				2421	Use-list Order Directives
				2422	-------------------------
				2423
				2424	Use-list directives encode the in-memory order of each use-list, allowing the
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	2425	order to be recreated. ``<order-indexes>`` is a comma-separated list of
				2426	indexes that are assigned to the referenced value's uses. The referenced
Duncan P. N. Exon Smith	0a448fb	2014-08-19 21:30:15 +0000	[diff] [blame]	2427	value's use-list is immediately sorted by these indexes.
				2428
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	2429	Use-list directives may appear at function scope or global scope. They are not
				2430	instructions, and have no effect on the semantics of the IR. When they're at
Duncan P. N. Exon Smith	0a448fb	2014-08-19 21:30:15 +0000	[diff] [blame]	2431	function scope, they must appear after the terminator of the final basic block.
				2432
				2433	If basic blocks have their address taken via ``blockaddress()`` expressions,
				2434	``uselistorder_bb`` can be used to reorder their use-lists from outside their
				2435	function's scope.
				2436
				2437	:Syntax:
				2438
				2439	::
				2440
    uselistorder <ty> <value>, { <order-indexes> }
    uselistorder_bb @function, %block, { <order-indexes> }
				2443
				2444	:Examples:
				2445
				2446	::
				2447
    define void @foo(i32 %arg1, i32 %arg2) {
    entry:
      ; ... instructions ...
    bb:
      ; ... instructions ...

      ; At function scope.
      uselistorder i32 %arg1, { 1, 0, 2 }
      uselistorder label %bb, { 1, 0 }
    }

    ; At global scope.
    uselistorder i32* @global, { 1, 2, 0 }
    uselistorder i32 7, { 1, 0 }
    uselistorder i32 (i32) @bar, { 1, 0 }
    uselistorder_bb @foo, %bb, { 5, 1, 3, 2, 0, 4 }
				2464
Teresa Johnson	de9b8b4	2016-04-22 13:09:17 +0000	[diff] [blame]	2465	.. _source_filename:
				2466
				2467	Source Filename
				2468	---------------
				2469
				2470	The source filename string is set to the original module identifier,
				2471	which will be the name of the compiled source file when compiling from
				2472	source through the clang front end, for example. It is then preserved through
				2473	the IR and bitcode.
				2474
				2475	This is currently necessary to generate a consistent unique global
				2476	identifier for local functions used in profile data, which prepends the
				2477	source file name to the local function name.
				2478
				2479	The syntax for the source file name is simply:
				2480
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	2481	.. code-block:: text
Teresa Johnson	de9b8b4	2016-04-22 13:09:17 +0000	[diff] [blame]	2482
				2483	source_filename = "/path/to/source.c"
				2484
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2485	.. _typesystem:
				2486
				2487	Type System
				2488	===========
				2489
				2490	The LLVM type system is one of the most important features of the
				2491	intermediate representation. Being typed enables a number of
				2492	optimizations to be performed on the intermediate representation
				2493	directly, without having to do extra analyses on the side before the
				2494	transformation. A strong type system makes it easier to read the
				2495	generated code and enables novel analyses and transformations that are
				2496	not feasible to perform on normal three address code representations.
				2497
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2498	.. _t_void:
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	2499
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2500	Void Type
				2501	---------
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2502
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2503	:Overview:
				2504
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2505
				2506	The void type does not represent any value and has no size.
				2507
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2508	:Syntax:
				2509
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2510
				2511	::
				2512
				2513	void
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2514
				2515
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2516	.. _t_function:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2517
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2518	Function Type
				2519	-------------
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2520
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2521	:Overview:
				2522
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2523
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2524	The function type can be thought of as a function signature. It consists of a
				2525	return type and a list of formal parameter types. The return type of a function
				2526	type is a void type or first class type --- except for :ref:`label <t_label>`
				2527	and :ref:`metadata <t_metadata>` types.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2528
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2529	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2530
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2531	::
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2532
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2533	<returntype> (<parameter list>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2534
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2535	...where '``<parameter list>``' is a comma-separated list of type
				2536	specifiers. Optionally, the parameter list may include a type ``...``, which
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	2537	indicates that the function takes a variable number of arguments. Variable
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2538	argument functions can access their arguments with the :ref:`variable argument
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	2539	handling intrinsic <int_varargs>` functions. '``<returntype>``' is any type
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2540	except :ref:`label <t_label>` and :ref:`metadata <t_metadata>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2541
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2542	:Examples:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2543
+--------------------------+------------------------------------------------------------------------+
| ``i32 (i32)``            | A function taking an ``i32``, returning an ``i32``.                    |
+--------------------------+------------------------------------------------------------------------+
| ``float (i16, i32 *) *`` | :ref:`Pointer <t_pointer>` to a function that takes an ``i16`` and a   |
|                          | :ref:`pointer <t_pointer>` to ``i32``, returning ``float``.            |
+--------------------------+------------------------------------------------------------------------+
| ``i32 (i8*, ...)``       | A vararg function that takes at least one :ref:`pointer <t_pointer>`   |
|                          | to ``i8`` (char in C), which returns an integer. This is the signature |
|                          | for ``printf`` in LLVM.                                                |
+--------------------------+------------------------------------------------------------------------+
| ``{i32, i32} (i32)``     | A function taking an ``i32``, returning a :ref:`structure <t_struct>`  |
|                          | containing two ``i32`` values.                                         |
+--------------------------+------------------------------------------------------------------------+
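
The ``printf`` signature from the table can be exercised as follows. This is a
minimal sketch using the typed pointer syntax of this document; the names
``@.fmt`` and ``@show`` are hypothetical.

.. code-block:: llvm

    declare i32 @printf(i8*, ...)

    ; "%d\n" plus a terminating NUL: four bytes in total.
    @.fmt = private unnamed_addr constant [4 x i8] c"%d\0A\00"

    define i32 @show(i32 %x) {
      %fmt = getelementptr inbounds [4 x i8], [4 x i8]* @.fmt, i64 0, i64 0
      ; A call to a vararg function spells out the full function type.
      %n = call i32 (i8*, ...) @printf(i8* %fmt, i32 %x)
      ret i32 %n
    }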
				2553
				2554	.. _t_firstclass:
				2555
				2556	First Class Types
				2557	-----------------
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2558
				2559	The :ref:`first class <t_firstclass>` types are perhaps the most important.
				2560	Values of these types are the only ones which can be produced by
				2561	instructions.
				2562
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2563	.. _t_single_value:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2564
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2565	Single Value Types
				2566	^^^^^^^^^^^^^^^^^^
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2567
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2568	These are the types that are valid in registers from CodeGen's perspective.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2569
				2570	.. _t_integer:
				2571
				2572	Integer Type
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2573	""""""""""""
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2574
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2575	:Overview:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2576
The integer type specifies an arbitrary bit width for the desired
integer. Any bit width from 1 bit to 2\ :sup:`23`\ -1 (about 8 million)
can be specified.
				2580
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2581	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2582
				2583	::
				2584
				2585	iN
				2586
				2587	The number of bits the integer will occupy is specified by the ``N``
				2588	value.
				2589
:Examples:

+--------------+----------------------------------------------+
| ``i1``       | a single-bit integer.                        |
+--------------+----------------------------------------------+
| ``i32``      | a 32-bit integer.                            |
+--------------+----------------------------------------------+
| ``i1942652`` | a really big integer of over 1 million bits. |
+--------------+----------------------------------------------+
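
Bit widths need not be powers of two. The sketch below (function names are
hypothetical) uses a 7-bit and a 24-bit integer like any other integer type:

.. code-block:: llvm

    ; Addition on i7 wraps modulo 2^7.
    define i7 @add7(i7 %a, i7 %b) {
      %s = add i7 %a, %b
      ret i7 %s
    }

    ; Comparisons produce an i1, which can be widened like any other integer.
    define i32 @is_negative(i24 %x) {
      %neg = icmp slt i24 %x, 0
      %r = zext i1 %neg to i32
      ret i32 %r
    }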
				2600
				2601	.. _t_floating:
				2602
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	2603	Floating-Point Types
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2604	""""""""""""""""""""
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2605
.. list-table::
   :header-rows: 1

   * - Type
     - Description

   * - ``half``
     - 16-bit floating-point value

   * - ``float``
     - 32-bit floating-point value

   * - ``double``
     - 64-bit floating-point value

   * - ``fp128``
     - 128-bit floating-point value (112-bit mantissa)

   * - ``x86_fp80``
     - 80-bit floating-point value (X87)

   * - ``ppc_fp128``
     - 128-bit floating-point value (two 64-bits)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2629
The binary formats of half, float, double, and fp128 correspond to the
IEEE-754-2008 specifications for binary16, binary32, binary64, and binary128
respectively.
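
As a small sketch (function names are hypothetical), values move between
these types with ``fpext`` and ``fptrunc``. Widening between the IEEE
formats preserves the value, while narrowing may round:

.. code-block:: llvm

    define double @widen(half %h) {
      %f = fpext half %h to float
      %d = fpext float %f to double
      ret double %d
    }

    define half @narrow(double %d) {
      %h = fptrunc double %d to half
      ret half %h
    }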
				2633
Reid Kleckner	9a16d08	2014-03-05 02:41:37 +0000	[diff] [blame]	2634	X86_mmx Type
				2635	""""""""""""
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2636
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2637	:Overview:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2638
Reid Kleckner	9a16d08	2014-03-05 02:41:37 +0000	[diff] [blame]	2639	The x86_mmx type represents a value held in an MMX register on an x86
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2640	machine. The operations allowed on it are quite limited: parameters and
				2641	return values, load and store, and bitcast. User-specified MMX
				2642	instructions are represented as intrinsic or asm calls with arguments
				2643	and/or results of this type. There are no arrays, vectors or constants
				2644	of this type.
				2645
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2646	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2647
				2648	::
				2649
Reid Kleckner	9a16d08	2014-03-05 02:41:37 +0000	[diff] [blame]	2650	x86_mmx
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2651
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2652
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2653	.. _t_pointer:
				2654
				2655	Pointer Type
				2656	""""""""""""
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2657
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2658	:Overview:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2659
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2660	The pointer type is used to specify memory locations. Pointers are
				2661	commonly used to reference objects in memory.
				2662
				2663	Pointer types may have an optional address space attribute defining the
				2664	numbered address space where the pointed-to object resides. The default
				2665	address space is number zero. The semantics of non-zero address spaces
				2666	are target-specific.
				2667
Note that LLVM does not permit pointers to void (``void*``) nor does it
permit pointers to labels (``label*``). Use ``i8*`` instead.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2670
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2671	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2672
				2673	::
				2674
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2675	<type> *
				2676
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2677	:Examples:
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2678
+-----------------------+----------------------------------------------------------------+
| ``[4 x i32]*``        | A :ref:`pointer <t_pointer>` to :ref:`array <t_array>` of      |
|                       | four ``i32`` values.                                           |
+-----------------------+----------------------------------------------------------------+
| ``i32 (i32*) *``      | A :ref:`pointer <t_pointer>` to a :ref:`function <t_function>` |
|                       | that takes an ``i32*``, returning an ``i32``.                  |
+-----------------------+----------------------------------------------------------------+
| ``i32 addrspace(5)*`` | A :ref:`pointer <t_pointer>` to an ``i32`` value that resides  |
|                       | in address space #5.                                           |
+-----------------------+----------------------------------------------------------------+
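
The following sketch shows these pointer types in use, written in the typed
pointer syntax described in this section. All function names are
hypothetical.

.. code-block:: llvm

    ; Load through a pointer into address space 5.
    define i32 @load_as5(i32 addrspace(5)* %p) {
      %v = load i32, i32 addrspace(5)* %p
      ret i32 %v
    }

    ; i8* serves as the generic byte pointer, since void* is not permitted.
    define i8* @as_bytes(i32* %p) {
      %q = bitcast i32* %p to i8*
      ret i8* %q
    }

    ; Indirect call through a pointer to a function taking an i32*.
    define i32 @apply(i32 (i32*)* %fn, i32* %arg) {
      %r = call i32 %fn(i32* %arg)
      ret i32 %r
    }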
				2686
				2687	.. _t_vector:
				2688
				2689	Vector Type
				2690	"""""""""""
				2691
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2692	:Overview:
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2693
A vector type is a simple derived type that represents a vector of
elements. Vector types are used when multiple primitive data values are
operated on in parallel using a single instruction (SIMD). A vector type
requires a size (number of elements) and an underlying primitive data
type. Vector types are considered :ref:`first class <t_firstclass>`.
				2699
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2700	:Syntax:
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2701
				2702	::
				2703
				2704	< <# elements> x <elementtype> >
				2705
				2706	The number of elements is a constant integer value larger than 0;
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	2707	elementtype may be any integer, floating-point or pointer type. Vectors
Manuel Jacob	961f787	2014-07-30 12:30:06 +0000	[diff] [blame]	2708	of size zero are not allowed.
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2709
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2710	:Examples:
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2711
+-----------------+------------------------------------------------+
| ``<4 x i32>``   | Vector of 4 32-bit integer values.             |
+-----------------+------------------------------------------------+
| ``<8 x float>`` | Vector of 8 32-bit floating-point values.      |
+-----------------+------------------------------------------------+
| ``<2 x i64>``   | Vector of 2 64-bit integer values.             |
+-----------------+------------------------------------------------+
| ``<4 x i64*>``  | Vector of 4 pointers to 64-bit integer values. |
+-----------------+------------------------------------------------+
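
A brief sketch of how vector values are used (function names are
hypothetical): ordinary arithmetic instructions operate element-wise, and
single lanes are read with ``extractelement``.

.. code-block:: llvm

    ; Element-wise addition of two 4 x i32 vectors.
    define <4 x i32> @vadd(<4 x i32> %a, <4 x i32> %b) {
      %s = add <4 x i32> %a, %b
      ret <4 x i32> %s
    }

    ; Read the element at index 2.
    define i32 @lane2(<4 x i32> %v) {
      %e = extractelement <4 x i32> %v, i32 2
      ret i32 %e
    }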
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2721
				2722	.. _t_label:
				2723
				2724	Label Type
				2725	^^^^^^^^^^
				2726
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2727	:Overview:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2728
				2729	The label type represents code labels.
				2730
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2731	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2732
				2733	::
				2734
				2735	label
				2736
David Majnemer	b611e3f	2015-08-14 05:09:07 +0000	[diff] [blame]	2737	.. _t_token:
				2738
				2739	Token Type
				2740	^^^^^^^^^^
				2741
				2742	:Overview:
				2743
				2744	The token type is used when a value is associated with an instruction
				2745	but all uses of the value must not attempt to introspect or obscure it.
				2746	As such, it is not appropriate to have a :ref:`phi <i_phi>` or
				2747	:ref:`select <i_select>` of type token.
				2748
				2749	:Syntax:
				2750
				2751	::
				2752
				2753	token
				2754
				2755
				2756
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2757	.. _t_metadata:
				2758
				2759	Metadata Type
				2760	^^^^^^^^^^^^^
				2761
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2762	:Overview:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2763
				2764	The metadata type represents embedded metadata. No derived types may be
				2765	created from metadata except for :ref:`function <t_function>` arguments.
				2766
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2767	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2768
				2769	::
				2770
				2771	metadata
				2772
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2773	.. _t_aggregate:
				2774
				2775	Aggregate Types
				2776	^^^^^^^^^^^^^^^
				2777
				2778	Aggregate Types are a subset of derived types that can contain multiple
				2779	member types. :ref:`Arrays <t_array>` and :ref:`structs <t_struct>` are
				2780	aggregate types. :ref:`Vectors <t_vector>` are not considered to be
				2781	aggregate types.
				2782
				2783	.. _t_array:
				2784
				2785	Array Type
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2786	""""""""""
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2787
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2788	:Overview:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2789
				2790	The array type is a very simple derived type that arranges elements
				2791	sequentially in memory. The array type requires a size (number of
				2792	elements) and an underlying data type.
				2793
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2794	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2795
				2796	::
				2797
				2798	[<# elements> x <elementtype>]
				2799
				2800	The number of elements is a constant integer value; ``elementtype`` may
				2801	be any type with a size.
				2802
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2803	:Examples:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2804
+----------------+------------------------------------+
| ``[40 x i32]`` | Array of 40 32-bit integer values. |
+----------------+------------------------------------+
| ``[41 x i32]`` | Array of 41 32-bit integer values. |
+----------------+------------------------------------+
| ``[4 x i8]``   | Array of 4 8-bit integer values.   |
+----------------+------------------------------------+
				2812
				2813	Here are some examples of multidimensional arrays:
				2814
+---------------------------+--------------------------------------------------------+
| ``[3 x [4 x i32]]``       | 3x4 array of 32-bit integer values.                    |
+---------------------------+--------------------------------------------------------+
| ``[12 x [10 x float]]``   | 12x10 array of single precision floating-point values. |
+---------------------------+--------------------------------------------------------+
| ``[2 x [3 x [4 x i16]]]`` | 2x3x4 array of 16-bit integer values.                  |
+---------------------------+--------------------------------------------------------+
				2822
				2823	There is no restriction on indexing beyond the end of the array implied
				2824	by a static type (though there are restrictions on indexing beyond the
				2825	bounds of an allocated object in some cases). This means that
				2826	single-dimension 'variable sized array' addressing can be implemented in
				2827	LLVM with a zero length array type. An implementation of 'pascal style
				2828	arrays' in LLVM could use the type "``{ i32, [0 x float]}``", for
				2829	example.
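
The 'pascal style array' layout mentioned above could be indexed as in the
following sketch. The type name ``%pascal_array`` and the function
``@element`` are hypothetical, and the caller is assumed to have allocated
enough storage behind the zero-length array.

.. code-block:: llvm

    ; Length field followed by a variable number of floats.
    %pascal_array = type { i32, [0 x float] }

    define float @element(%pascal_array* %arr, i64 %i) {
      ; Indices: step through the pointer (0), select field 1, then
      ; index past the static bound of the [0 x float] array.
      %p = getelementptr %pascal_array, %pascal_array* %arr, i64 0, i32 1, i64 %i
      %v = load float, float* %p
      ret float %v
    }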
				2830
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2831	.. _t_struct:
				2832
				2833	Structure Type
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2834	""""""""""""""
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2835
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2836	:Overview:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2837
				2838	The structure type is used to represent a collection of data members
				2839	together in memory. The elements of a structure may be any type that has
				2840	a size.
				2841
				2842	Structures in memory are accessed using '``load``' and '``store``' by
				2843	getting a pointer to a field with the '``getelementptr``' instruction.
				2844	Structures in registers are accessed using the '``extractvalue``' and
				2845	'``insertvalue``' instructions.
				2846
				2847	Structures may optionally be "packed" structures, which indicate that
				2848	the alignment of the struct is one byte, and that there is no padding
				2849	between the elements. In non-packed structs, padding between field types
				2850	is inserted as defined by the DataLayout string in the module, which is
				2851	required to match what the underlying code generator expects.
				2852
				2853	Structures can either be "literal" or "identified". A literal structure
				2854	is defined inline with other types (e.g. ``{i32, i32}*``) whereas
				2855	identified types are always defined at the top level with a name.
				2856	Literal types are uniqued by their contents and can never be recursive
				2857	or opaque since there is no way to write one. Identified types can be
				2858	recursive, can be opaqued, and are never uniqued.
				2859
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2860	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2861
				2862	::
				2863
				2864	%T1 = type { <type list> } ; Identified normal struct type
				2865	%T2 = type <{ <type list> }> ; Identified packed struct type
				2866
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2867	:Examples:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2868
+----------------------------+------------------------------------------------------------------------+
| ``{ i32, i32, i32 }``      | A triple of three ``i32`` values.                                      |
+----------------------------+------------------------------------------------------------------------+
| ``{ float, i32 (i32) * }`` | A pair, where the first element is a ``float`` and the second element  |
|                            | is a :ref:`pointer <t_pointer>` to a :ref:`function <t_function>`      |
|                            | that takes an ``i32``, returning an ``i32``.                           |
+----------------------------+------------------------------------------------------------------------+
| ``<{ i8, i32 }>``          | A packed struct known to be 5 bytes in size.                           |
+----------------------------+------------------------------------------------------------------------+
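
The two access styles described in the overview can be sketched as follows.
The identified type ``%pair`` and the function names are hypothetical.

.. code-block:: llvm

    %pair = type { i32, float }

    ; In memory: compute the field address, then load.
    define float @second_from_memory(%pair* %p) {
      %fp = getelementptr inbounds %pair, %pair* %p, i32 0, i32 1
      %v = load float, float* %fp
      ret float %v
    }

    ; In registers: read a field with extractvalue.
    define i32 @first(%pair %agg) {
      %v = extractvalue %pair %agg, 0
      ret i32 %v
    }

    ; In registers: produce an updated copy with insertvalue.
    define %pair @set_first(%pair %agg, i32 %x) {
      %r = insertvalue %pair %agg, i32 %x, 0
      ret %pair %r
    }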
				2876
				2877	.. _t_opaque:
				2878
				2879	Opaque Structure Types
Rafael Espindola	0801334	2013-12-07 19:34:20 +0000	[diff] [blame]	2880	""""""""""""""""""""""
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2881
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2882	:Overview:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2883
				2884	Opaque structure types are used to represent named structure types that
				2885	do not have a body specified. This corresponds (for example) to the C
				2886	notion of a forward declared structure.
				2887
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2888	:Syntax:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2889
				2890	::
				2891
				2892	%X = type opaque
				2893	%52 = type opaque
				2894
Rafael Espindola	2f6d7b9	2013-12-10 14:53:22 +0000	[diff] [blame]	2895	:Examples:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2896
+------------+-----------------+
| ``opaque`` | An opaque type. |
+------------+-----------------+
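
A typical use, sketched here with hypothetical names, is to pass pointers to
an opaque type between functions without the module ever knowing the
structure's layout:

.. code-block:: llvm

    %handle = type opaque

    declare %handle* @handle_open()
    declare void @handle_close(%handle*)

    define void @use() {
      ; The pointer can be passed around, but the body is unknown here,
      ; so no field of %handle can be accessed.
      %h = call %handle* @handle_open()
      call void @handle_close(%handle* %h)
      ret void
    }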
				2900
Sean Silva	1703e70	2014-04-08 21:06:22 +0000	[diff] [blame]	2901	.. _constants:
				2902
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2903	Constants
				2904	=========
				2905
				2906	LLVM has several different basic types of constants. This section
				2907	describes them all and their syntax.
				2908
				2909	Simple Constants
				2910	----------------
				2911
**Boolean constants**
    The two strings '``true``' and '``false``' are both valid constants
    of the ``i1`` type.
**Integer constants**
    Standard integers (such as '4') are constants of the
    :ref:`integer <t_integer>` type. Negative numbers may be used with
    integer types.
**Floating-point constants**
    Floating-point constants use standard decimal notation (e.g.
    123.421), exponential notation (e.g. 1.23421e+2), or a more precise
    hexadecimal notation (see below). The assembler requires the exact
    decimal value of a floating-point constant. For example, the
    assembler accepts 1.25 but rejects 1.3 because 1.3 is a repeating
    decimal in binary. Floating-point constants must have a
    :ref:`floating-point <t_floating>` type.
**Null pointer constants**
    The identifier '``null``' is recognized as a null pointer constant
    and must be of :ref:`pointer type <t_pointer>`.
**Token constants**
    The identifier '``none``' is recognized as an empty token constant
    and must be of :ref:`token type <t_token>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2933
The one non-intuitive notation for constants is the hexadecimal form of
floating-point constants. For example, the form
'``double 0x432ff973cafa8000``' is equivalent to (but harder to read
than) '``double 4.5e+15``'. The only time hexadecimal floating-point
constants are required (and the only time that they are generated by the
disassembler) is when a floating-point constant must be emitted but it
cannot be represented as a decimal floating-point number in a reasonable
number of digits. For example, NaNs, infinities, and other special
values are represented in their IEEE hexadecimal format so that assembly
and disassembly do not cause any bits to change in the constants.
				2944
				2945	When using the hexadecimal form, constants of types half, float, and
				2946	double are represented using the 16-digit form shown above (which
matches the IEEE 754 representation for double); half and float values
Dmitri Gribenko	4dc2ba1	2013-01-16 23:40:37 +0000	[diff] [blame]	2948	must, however, be exactly representable as IEEE 754 half and single
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2949	precision, respectively. Hexadecimal format is always used for long
				2950	double, and there are three forms of long double. The 80-bit format used
				2951	by x86 is represented as ``0xK`` followed by 20 hexadecimal digits. The
				2952	128-bit format used by PowerPC (two adjacent doubles) is represented by
				2953	``0xM`` followed by 32 hexadecimal digits. The IEEE 128-bit format is
Richard Sandiford	ae426b4	2013-05-03 14:32:27 +0000	[diff] [blame]	2954	represented by ``0xL`` followed by 32 hexadecimal digits. Long doubles
				2955	will only work if they match the long double format on your target.
				2956	The IEEE 16-bit format (half precision) is represented by ``0xH``
				2957	followed by 4 hexadecimal digits. All hexadecimal formats are big-endian
				2958	(sign bit at the left).
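
As a sketch of the hexadecimal forms described above, the following
module-level definitions spell out a few values explicitly. The global names
are hypothetical and chosen only for illustration, and the ``x86_fp80`` line
assumes a target whose long double uses the x86 80-bit format.

.. code-block:: llvm

    @one_double = global double 0x3FF0000000000000        ; 1.0
    @one_float  = global float 0x3FF0000000000000         ; 1.0, written in the 16-digit double form
    @pos_inf    = global double 0x7FF0000000000000        ; +infinity
    @quiet_nan  = global double 0x7FF8000000000000        ; a quiet NaN
    @one_half   = global half 0xH3C00                     ; 1.0 in IEEE half precision
    @one_fp80   = global x86_fp80 0xK3FFF8000000000000000 ; 1.0 in the x86 80-bit format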
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2959
Reid Kleckner	9a16d08	2014-03-05 02:41:37 +0000	[diff] [blame]	2960	There are no constants of type x86_mmx.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2961
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	2962	.. _complexconstants:
				2963
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2964	Complex Constants
				2965	-----------------
				2966
				2967	Complex constants are a (potentially recursive) combination of simple
				2968	constants and smaller complex constants.
				2969
				2970	Structure constants
				2971	Structure constants are represented with notation similar to
				2972	structure type definitions (a comma separated list of elements,
				2973	surrounded by braces (``{}``)). For example:
				2974	"``{ i32 4, float 17.0, i32* @G }``", where "``@G``" is declared as
				2975	"``@G = external global i32``". Structure constants must have
				2976	:ref:`structure type <t_struct>`, and the number and types of elements
				2977	must match those specified by the type.
				2978	Array constants
				2979	Array constants are represented with notation similar to array type
				2980	definitions (a comma separated list of elements, surrounded by
				2981	square brackets (``[]``)). For example:
				2982	"``[ i32 42, i32 11, i32 74 ]``". Array constants must have
				2983	:ref:`array type <t_array>`, and the number and types of elements must
Daniel Sanders	f605184	2014-09-11 12:02:59 +0000	[diff] [blame]	2984	match those specified by the type. As a special case, character array
				2985	constants may also be represented as a double-quoted string using the ``c``
				2986	prefix. For example: "``c"Hello World\0A\00"``".
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	2987	Vector constants
				2988	Vector constants are represented with notation similar to vector
				2989	type definitions (a comma separated list of elements, surrounded by
				2990	less-than/greater-than's (``<>``)). For example:
				2991	"``< i32 42, i32 11, i32 74, i32 100 >``". Vector constants
				2992	must have :ref:`vector type <t_vector>`, and the number and types of
				2993	elements must match those specified by the type.
				2994	Zero initialization
    The string '``zeroinitializer``' can be used to zero-initialize a
    value of any type, including scalar and
				2997	:ref:`aggregate <t_aggregate>` types. This is often used to avoid
				2998	having to print large zero initializers (e.g. for large arrays) and
				2999	is always exactly equivalent to using explicit zero initializers.
				3000	Metadata node
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	3001	A metadata node is a constant tuple without types. For example:
				3002	"``!{!0, !{!2, !0}, !"test"}``". Metadata can reference constant values,
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	3003	for example: "``!{!0, i32 0, i8* @global, i64 (i64)* @function, !"str"}``".
				3004	Unlike other typed constants that are meant to be interpreted as part of
				3005	the instruction stream, metadata is a place to attach additional
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3006	information such as debug info.
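
The complex constant forms above can be sketched as initializers of global
variables. Apart from ``@G``, which is taken from the structure example, the
global names here are hypothetical.

.. code-block:: llvm

    @G = external global i32

    ; Structure constant: element count and types match the struct type.
    @S = global { i32, float, i32* } { i32 4, float 17.0, i32* @G }

    ; Array constant, and the equivalent c"..." form for a character array.
    @A   = global [3 x i32] [ i32 42, i32 11, i32 74 ]
    @Str = constant [13 x i8] c"Hello World\0A\00"

    ; Vector constant.
    @V = global <4 x i32> < i32 42, i32 11, i32 74, i32 100 >

    ; zeroinitializer avoids spelling out 1024 explicit zeros.
    @Zero = global [1024 x i32] zeroinitializer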
				3007
				3008	Global Variable and Function Addresses
				3009	--------------------------------------
				3010
				3011	The addresses of :ref:`global variables <globalvars>` and
				3012	:ref:`functions <functionstructure>` are always implicitly valid
				3013	(link-time) constants. These constants are explicitly referenced when
				3014	the :ref:`identifier for the global <identifiers>` is used and always have
				3015	:ref:`pointer <t_pointer>` type. For example, the following is a legal LLVM
				3016	file:
				3017
				3018	.. code-block:: llvm
				3019
				3020	@X = global i32 17
				3021	@Y = global i32 42
    @Z = global [2 x i32*] [ i32* @X, i32* @Y ]
				3023
				3024	.. _undefvalues:
				3025
				3026	Undefined Values
				3027	----------------
				3028
				3029	The string '``undef``' can be used anywhere a constant is expected, and
				3030	indicates that the user of the value may receive an unspecified
				3031	bit-pattern. Undefined values may be of any type (other than '``label``'
				3032	or '``void``') and be used anywhere a constant is permitted.
				3033
				3034	Undefined values are useful because they indicate to the compiler that
				3035	the program is well defined no matter what value is used. This gives the
				3036	compiler more freedom to optimize. Here are some examples of
				3037	(potentially surprising) transformations that are valid (in pseudo IR):
				3038
				3039	.. code-block:: llvm
				3040
				3041	%A = add %X, undef
				3042	%B = sub %X, undef
				3043	%C = xor %X, undef
				3044	Safe:
				3045	%A = undef
				3046	%B = undef
				3047	%C = undef
				3048
				3049	This is safe because all of the output bits are affected by the undef
				3050	bits. Any output bit can have a zero or one depending on the input bits.
				3051
				3052	.. code-block:: llvm
				3053
				3054	%A = or %X, undef
				3055	%B = and %X, undef
				3056	Safe:
				3057	%A = -1
				3058	%B = 0
Sanjoy Das	151493a	2016-09-15 01:56:58 +0000	[diff] [blame]	3059	Safe:
				3060	%A = %X ;; By choosing undef as 0
				3061	%B = %X ;; By choosing undef as -1
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3062	Unsafe:
				3063	%A = undef
				3064	%B = undef
				3065
				3066	These logical operations have bits that are not always affected by the
				3067	input. For example, if ``%X`` has a zero bit, then the output of the
				3068	'``and``' operation will always be a zero for that bit, no matter what
				3069	the corresponding bit from the '``undef``' is. As such, it is unsafe to
				3070	optimize or assume that the result of the '``and``' is '``undef``'.
				3071	However, it is safe to assume that all bits of the '``undef``' could be
				3072	0, and optimize the '``and``' to 0. Likewise, it is safe to assume that
				3073	all the bits of the '``undef``' operand to the '``or``' could be set,
				3074	allowing the '``or``' to be folded to -1.
				3075
				3076	.. code-block:: llvm
				3077
				3078	%A = select undef, %X, %Y
				3079	%B = select undef, 42, %Y
				3080	%C = select %X, %Y, undef
				3081	Safe:
				3082	%A = %X (or %Y)
				3083	%B = 42 (or %Y)
				3084	%C = %Y
				3085	Unsafe:
				3086	%A = undef
				3087	%B = undef
				3088	%C = undef
				3089
				3090	This set of examples shows that undefined '``select``' (and conditional
				3091	branch) conditions can go either way, but they have to come from one
				3092	of the two operands. In the ``%A`` example, if ``%X`` and ``%Y`` were
				3093	both known to have a clear low bit, then ``%A`` would have to have a
				3094	cleared low bit. However, in the ``%C`` example, the optimizer is
				3095	allowed to assume that the '``undef``' operand could be the same as
				3096	``%Y``, allowing the whole '``select``' to be eliminated.
				3097
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	3098	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3099
				3100	%A = xor undef, undef
				3101
				3102	%B = undef
				3103	%C = xor %B, %B
				3104
				3105	%D = undef
Jonathan Roelofs	ec81c0b	2014-10-16 19:28:10 +0000	[diff] [blame]	3106	%E = icmp slt %D, 4
    %F = icmp sge %D, 4
				3108
				3109	Safe:
				3110	%A = undef
				3111	%B = undef
				3112	%C = undef
				3113	%D = undef
				3114	%E = undef
				3115	%F = undef
				3116
This example points out that two '``undef``' operands are not
necessarily the same. This can surprise people who assume that
"``X^X``" is always zero, even if ``X`` is undefined (the behavior also
matches C semantics). This isn't true for a number of reasons, but the
				3121	short answer is that an '``undef``' "variable" can arbitrarily change
				3122	its value over its "live range". This is true because the variable
				3123	doesn't actually have a live range. Instead, the value is logically
				3124	read from arbitrary registers that happen to be around when needed, so
				3125	the value is not necessarily consistent over time. In fact, ``%A`` and
				3126	``%C`` need to have the same semantics or the core LLVM "replace all
				3127	uses with" concept would not hold.
				3128
				3129	.. code-block:: llvm
				3130
Sanjay Patel	3aaf6a0	2018-03-09 15:27:48 +0000	[diff] [blame]	3131	%A = sdiv undef, %X
				3132	%B = sdiv %X, undef
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3133	Safe:
Sanjay Patel	3aaf6a0	2018-03-09 15:27:48 +0000	[diff] [blame]	3134	%A = 0
    %B: unreachable
				3136
				3137	These examples show the crucial difference between an undefined value
				3138	and undefined behavior. An undefined value (like '``undef``') is
				3139	allowed to have an arbitrary bit-pattern. This means that the ``%A``
Sanjay Patel	3aaf6a0	2018-03-09 15:27:48 +0000	[diff] [blame]	3140	operation can be constant folded to '``0``', because the '``undef``'
				3141	could be zero, and zero divided by any value is zero.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3142	However, in the second example, we can make a more aggressive
				3143	assumption: because the ``undef`` is allowed to be an arbitrary value,
				3144	we are allowed to assume that it could be zero. Since a divide by zero
				3145	has undefined behavior, we are allowed to assume that the operation
				3146	does not execute at all. This allows us to delete the divide and all
				3147	code after it. Because the undefined operation "can't happen", the
				3148	optimizer can assume that it occurs in dead code.
				3149
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	3150	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3151
				3152	a: store undef -> %X
				3153	b: store %X -> undef
				3154	Safe:
				3155	a: <deleted>
				3156	b: unreachable
				3157
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	3158	A store of an undefined value can be assumed to not have any effect;
				3159	we can assume that the value is overwritten with bits that happen to
				3160	match what was already there. However, a store to an undefined
location could clobber arbitrary memory; therefore, it has undefined
				3162	behavior.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3163
				3164	.. _poisonvalues:
				3165
				3166	Poison Values
				3167	-------------
				3168
Poison values are similar to :ref:`undef values <undefvalues>`; however,
				3170	they also represent the fact that an instruction or constant expression
Richard Smith	32dbdf6	2014-07-31 04:25:36 +0000	[diff] [blame]	3171	that cannot evoke side effects has nevertheless detected a condition
				3172	that results in undefined behavior.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3173
				3174	There is currently no way of representing a poison value in the IR; they
				3175	only exist when produced by operations such as :ref:`add <i_add>` with
				3176	the ``nsw`` flag.
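
For instance, a minimal sketch of an operation that can produce a poison
value (the function name ``@inc`` is hypothetical):

.. code-block:: llvm

    define i32 @inc(i32 %x) {
      ; If %x is the largest signed i32 value, the addition overflows in the
      ; signed sense and, because of the nsw flag, %r is a poison value.
      %r = add nsw i32 %x, 1
      ret i32 %r
    }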
				3177
				3178	Poison value behavior is defined in terms of value dependence:
				3179
				3180	- Values other than :ref:`phi <i_phi>` nodes depend on their operands.
				3181	- :ref:`Phi <i_phi>` nodes depend on the operand corresponding to
				3182	their dynamic predecessor basic block.
				3183	- Function arguments depend on the corresponding actual argument values
				3184	in the dynamic callers of their functions.
				3185	- :ref:`Call <i_call>` instructions depend on the :ref:`ret <i_ret>`
				3186	instructions that dynamically transfer control back to them.
				3187	- :ref:`Invoke <i_invoke>` instructions depend on the
				3188	:ref:`ret <i_ret>`, :ref:`resume <i_resume>`, or exception-throwing
				3189	call instructions that dynamically transfer control back to them.
				3190	- Non-volatile loads and stores depend on the most recent stores to all
				3191	of the referenced memory addresses, following the order in the IR
				3192	(including loads and stores implied by intrinsics such as
  :ref:`@llvm.memcpy <int_memcpy>`).
				3194	- An instruction with externally visible side effects depends on the
				3195	most recent preceding instruction with externally visible side
				3196	effects, following the order in the IR. (This includes :ref:`volatile
				3197	operations <volatile>`.)
				3198	- An instruction control-depends on a :ref:`terminator
				3199	instruction <terminators>` if the terminator instruction has
				3200	multiple successors and the instruction is always executed when
				3201	control transfers to one of the successors, and may not be executed
				3202	when control is transferred to another.
				3203	- Additionally, an instruction also control-depends on a terminator
				3204	instruction if the set of instructions it otherwise depends on would
				3205	be different if the terminator had transferred control to a different
				3206	successor.
				3207	- Dependence is transitive.
				3208
Richard Smith	32dbdf6	2014-07-31 04:25:36 +0000	[diff] [blame]	3209	Poison values have the same behavior as :ref:`undef values <undefvalues>`,
				3210	with the additional effect that any instruction that has a dependence
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3211	on a poison value has undefined behavior.
				3212
				3213	Here are some examples:
				3214
				3215	.. code-block:: llvm
				3216
				3217	entry:
				3218	%poison = sub nuw i32 0, 1 ; Results in a poison value.
				3219	%still_poison = and i32 %poison, 0 ; 0, but also poison.
David Blaikie	16a97eb	2015-03-04 22:02:58 +0000	[diff] [blame]	3220	%poison_yet_again = getelementptr i32, i32* @h, i32 %still_poison
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3221	store i32 0, i32* %poison_yet_again ; memory at @h[0] is poisoned
				3222
				3223	store i32 %poison, i32* @g ; Poison value stored to memory.
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	3224	%poison2 = load i32, i32* @g ; Poison value loaded back from memory.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3225
				3226	store volatile i32 %poison, i32* @g ; External observation; undefined behavior.
				3227
				3228	%narrowaddr = bitcast i32* @g to i16*
				3229	%wideaddr = bitcast i32* @g to i64*
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	3230	%poison3 = load i16, i16* %narrowaddr ; Returns a poison value.
				3231	%poison4 = load i64, i64* %wideaddr ; Returns a poison value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3232
				3233	%cmp = icmp slt i32 %poison, 0 ; Returns a poison value.
				3234	br i1 %cmp, label %true, label %end ; Branch to either destination.
				3235
				3236	true:
				3237	store volatile i32 0, i32* @g ; This is control-dependent on %cmp, so
				3238	; it has undefined behavior.
				3239	br label %end
				3240
				3241	end:
				3242	%p = phi i32 [ 0, %entry ], [ 1, %true ]
				3243	; Both edges into this PHI are
				3244	; control-dependent on %cmp, so this
				3245	; always results in a poison value.
				3246
				3247	store volatile i32 0, i32* @g ; This would depend on the store in %true
				3248	; if %cmp is true, or the store in %entry
				3249	; otherwise, so this is undefined behavior.
				3250
				3251	br i1 %cmp, label %second_true, label %second_end
				3252	; The same branch again, but this time the
				3253	; true block doesn't have side effects.
				3254
				3255	second_true:
				3256	; No side effects!
				3257	ret void
				3258
				3259	second_end:
				3260	store volatile i32 0, i32* @g ; This time, the instruction always depends
				3261	; on the store in %end. Also, it is
				3262	; control-equivalent to %end, so this is
				3263	; well-defined (ignoring earlier undefined
				3264	; behavior in this example).
				3265
				3266	.. _blockaddress:
				3267
				3268	Addresses of Basic Blocks
				3269	-------------------------
				3270
				3271	``blockaddress(@function, %block)``
				3272
				3273	The '``blockaddress``' constant computes the address of the specified
				3274	basic block in the specified function, and always has an ``i8*`` type.
				3275	Taking the address of the entry block is illegal.
				3276
				3277	This value only has defined behavior when used as an operand to the
				3278	':ref:`indirectbr <i_indirectbr>`' instruction, or for comparisons
against null. Pointer equality tests between label addresses result in
undefined behavior, although comparison against null is allowed and
no label is equal to the null pointer. This may be passed around as an
opaque pointer-sized value as long as the bits are not inspected. This
				3283	allows ``ptrtoint`` and arithmetic to be performed on these values so
				3284	long as the original value is reconstituted before the ``indirectbr``
				3285	instruction.
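
A minimal sketch of the intended use, assuming a hypothetical function
``@dispatch`` with two non-entry blocks whose addresses are taken and then
consumed by ``indirectbr``:

.. code-block:: llvm

    define i32 @dispatch(i1 %c) {
    entry:
      ; Pick one of two block addresses; both have type i8*.
      %target = select i1 %c, i8* blockaddress(@dispatch, %one),
                              i8* blockaddress(@dispatch, %two)
      ; Every possible destination must be listed.
      indirectbr i8* %target, [ label %one, label %two ]

    one:
      ret i32 1

    two:
      ret i32 2
    }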
				3286
				3287	Finally, some targets may provide defined semantics when using the value
				3288	as the operand to an inline assembly, but that is target specific.
				3289
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	3290	.. _constantexprs:
				3291
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3292	Constant Expressions
				3293	--------------------
				3294
				3295	Constant expressions are used to allow expressions involving other
				3296	constants to be used as constants. Constant expressions may be of any
				3297	:ref:`first class <t_firstclass>` type and may involve any LLVM operation
				3298	that does not have side effects (e.g. load and call are not supported).
				3299	The following is the syntax for constant expressions:
				3300
				3301	``trunc (CST to TYPE)``
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	3302	Perform the :ref:`trunc operation <i_trunc>` on constants.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3303	``zext (CST to TYPE)``
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	3304	Perform the :ref:`zext operation <i_zext>` on constants.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3305	``sext (CST to TYPE)``
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	3306	Perform the :ref:`sext operation <i_sext>` on constants.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3307	``fptrunc (CST to TYPE)``
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3308	Truncate a floating-point constant to another floating-point type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3309	The size of CST must be larger than the size of TYPE. Both types
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3310	must be floating-point.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3311	``fpext (CST to TYPE)``
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3312	Floating-point extend a constant to another type. The size of CST
    must be smaller than or equal to the size of TYPE. Both types must be
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3314	floating-point.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3315	``fptoui (CST to TYPE)``
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3316	Convert a floating-point constant to the corresponding unsigned
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3317	integer constant. TYPE must be a scalar or vector integer type. CST
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3318	must be of scalar or vector floating-point type. Both CST and TYPE
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3319	must be scalars, or vectors of the same number of elements. If the
Eli Friedman	c065bb2	2018-06-08 21:33:33 +0000	[diff] [blame]	3320	value won't fit in the integer type, the result is a
				3321	:ref:`poison value <poisonvalues>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3322	``fptosi (CST to TYPE)``
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3323	Convert a floating-point constant to the corresponding signed
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3324	integer constant. TYPE must be a scalar or vector integer type. CST
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3325	must be of scalar or vector floating-point type. Both CST and TYPE
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3326	must be scalars, or vectors of the same number of elements. If the
Eli Friedman	c065bb2	2018-06-08 21:33:33 +0000	[diff] [blame]	3327	value won't fit in the integer type, the result is a
				3328	:ref:`poison value <poisonvalues>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3329	``uitofp (CST to TYPE)``
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3330	Convert an unsigned integer constant to the corresponding
				3331	floating-point constant. TYPE must be a scalar or vector floating-point
				3332	type. CST must be of scalar or vector integer type. Both CST and TYPE must
Eli Friedman	3f1ce09	2018-06-14 22:58:48 +0000	[diff] [blame]	3333	be scalars, or vectors of the same number of elements.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3334	``sitofp (CST to TYPE)``
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3335	Convert a signed integer constant to the corresponding floating-point
				3336	constant. TYPE must be a scalar or vector floating-point type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3337	CST must be of scalar or vector integer type. Both CST and TYPE must
Eli Friedman	3f1ce09	2018-06-14 22:58:48 +0000	[diff] [blame]	3338	be scalars, or vectors of the same number of elements.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3339	``ptrtoint (CST to TYPE)``
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	3340	Perform the :ref:`ptrtoint operation <i_ptrtoint>` on constants.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3341	``inttoptr (CST to TYPE)``
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	3342	Perform the :ref:`inttoptr operation <i_inttoptr>` on constants.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3343	This one is really dangerous!
				3344	``bitcast (CST to TYPE)``
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	3345	Convert a constant, CST, to another TYPE.
				3346	The constraints of the operands are the same as those for the
				3347	:ref:`bitcast instruction <i_bitcast>`.
Matt Arsenault	b03bd4d	2013-11-15 01:34:59 +0000	[diff] [blame]	3348	``addrspacecast (CST to TYPE)``
    Convert a constant pointer or constant vector of pointers, CST, to another
				3350	TYPE in a different address space. The constraints of the operands are the
				3351	same as those for the :ref:`addrspacecast instruction <i_addrspacecast>`.
David Blaikie	f72d05b	2015-03-13 18:20:45 +0000	[diff] [blame]	3352	``getelementptr (TY, CSTPTR, IDX0, IDX1, ...)``, ``getelementptr inbounds (TY, CSTPTR, IDX0, IDX1, ...)``
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3353	Perform the :ref:`getelementptr operation <i_getelementptr>` on
				3354	constants. As with the :ref:`getelementptr <i_getelementptr>`
David Blaikie	f91b030	2017-06-19 05:34:21 +0000	[diff] [blame]	3355	instruction, the index list may have one or more indexes, which are
David Blaikie	f72d05b	2015-03-13 18:20:45 +0000	[diff] [blame]	3356	required to make sense for the type of "pointer to TY".
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3357	``select (COND, VAL1, VAL2)``
				3358	Perform the :ref:`select operation <i_select>` on constants.
				3359	``icmp COND (VAL1, VAL2)``
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	3360	Perform the :ref:`icmp operation <i_icmp>` on constants.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3361	``fcmp COND (VAL1, VAL2)``
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	3362	Perform the :ref:`fcmp operation <i_fcmp>` on constants.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3363	``extractelement (VAL, IDX)``
				3364	Perform the :ref:`extractelement operation <i_extractelement>` on
				3365	constants.
				3366	``insertelement (VAL, ELT, IDX)``
				3367	Perform the :ref:`insertelement operation <i_insertelement>` on
				3368	constants.
				3369	``shufflevector (VEC1, VEC2, IDXMASK)``
				3370	Perform the :ref:`shufflevector operation <i_shufflevector>` on
				3371	constants.
				3372	``extractvalue (VAL, IDX0, IDX1, ...)``
				3373	Perform the :ref:`extractvalue operation <i_extractvalue>` on
				3374	constants. The index list is interpreted in a similar manner as
				3375	indices in a ':ref:`getelementptr <i_getelementptr>`' operation. At
				3376	least one index value must be specified.
				3377	``insertvalue (VAL, ELT, IDX0, IDX1, ...)``
				3378	Perform the :ref:`insertvalue operation <i_insertvalue>` on constants.
				3379	The index list is interpreted in a similar manner as indices in a
				3380	':ref:`getelementptr <i_getelementptr>`' operation. At least one index
				3381	value must be specified.
				3382	``OPCODE (LHS, RHS)``
    Perform the specified operation on the LHS and RHS constants. OPCODE
				3384	may be any of the :ref:`binary <binaryops>` or :ref:`bitwise
				3385	binary <bitwiseops>` operations. The constraints on operands are
				3386	the same as those for the corresponding instruction (e.g. no bitwise
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3387	operations on floating-point values are allowed).
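
As an illustration of the syntax above, the following sketch uses a few
constant expressions as global initializers. The global names are
hypothetical; the point is only that each initializer is itself a constant
built from other constants.

.. code-block:: llvm

    @arr = global [4 x i32] [ i32 1, i32 2, i32 3, i32 4 ]

    ; Address of the third element, computed by a getelementptr expression.
    @third = global i32* getelementptr inbounds ([4 x i32], [4 x i32]* @arr, i64 0, i64 2)

    ; The same global viewed as an i8*, via a bitcast expression.
    @bytes = global i8* bitcast ([4 x i32]* @arr to i8*)

    ; Nested expressions: a ptrtoint feeding a binary add.
    @past = global i64 add (i64 ptrtoint ([4 x i32]* @arr to i64), i64 16)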
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3388
				3389	Other Values
				3390	============
				3391
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	3392	.. _inlineasmexprs:
				3393
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3394	Inline Assembler Expressions
				3395	----------------------------
				3396
				3397	LLVM supports inline assembler expressions (as opposed to :ref:`Module-Level
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3398	Inline Assembly <moduleasm>`) through the use of a special value. This value
				3399	represents the inline assembler as a template string (containing the
				3400	instructions to emit), a list of operand constraints (stored as a string), a
				3401	flag that indicates whether or not the inline asm expression has side effects,
				3402	and a flag indicating whether the function containing the asm needs to align its
				3403	stack conservatively.
				3404
				3405	The template string supports argument substitution of the operands using "``$``"
				3406	followed by a number, to indicate substitution of the given register/memory
				3407	location, as specified by the constraint string. "``${NUM:MODIFIER}``" may also
				3408	be used, where ``MODIFIER`` is a target-specific annotation for how to print the
				3409	operand (See :ref:`inline-asm-modifiers`).
				3410
				3411	A literal "``$``" may be included by using "``$$``" in the template. To include
				3412	other special characters into the output, the usual "``\XX``" escapes may be
				3413	used, just as in other strings. Note that after template substitution, the
				3414	resulting assembly string is parsed by LLVM's integrated assembler unless it is
				3415	disabled -- even when emitting a ``.s`` file -- and thus must contain assembly
				3416	syntax known to LLVM.
				3417
LLVM also supports a few more substitutions useful for writing inline assembly:
				3419
				3420	- ``${:uid}``: Expands to a decimal integer unique to this inline assembly blob.
				3421	This substitution is useful when declaring a local label. Many standard
				3422	compiler optimizations, such as inlining, may duplicate an inline asm blob.
				3423	Adding a blob-unique identifier ensures that the two labels will not conflict
				3424	during assembly. This is used to implement `GCC's %= special format
				3425	string <https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html>`_.
				3426	- ``${:comment}``: Expands to the comment character of the current target's
				3427	assembly dialect. This is usually ``#``, but many targets use other strings,
				3428	such as ``;``, ``//``, or ``!``.
				3429	- ``${:private}``: Expands to the assembler private label prefix. Labels with
				3430	this prefix will not appear in the symbol table of the assembled object.
				3431	Typically the prefix is ``L``, but targets may use other strings. ``.L`` is
				3432	relatively popular.
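
As a sketch of how these substitutions combine, the following call declares
and jumps to a local label that stays unique even if the asm blob is
duplicated. It assumes an x86 target using the default ATT dialect, and the
label stem ``skip`` is hypothetical.

.. code-block:: llvm

    ; With a private prefix of ".L" and a unique id of, say, 0, the template
    ; expands to "jmp .Lskip0" followed by the label ".Lskip0:".
    call void asm sideeffect "jmp ${:private}skip${:uid}\0A${:private}skip${:uid}:", ""()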
				3433
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3434	LLVM's support for inline asm is modeled closely on the requirements of Clang's
				3435	GCC-compatible inline-asm support. Thus, the feature-set and the constraint and
				3436	modifier codes listed here are similar or identical to those in GCC's inline asm
				3437	support. However, to be clear, the syntax of the template and constraint strings
				3438	described here is not the same as the syntax accepted by GCC and Clang, and,
				3439	while most constraint letters are passed through as-is by Clang, some get
				3440	translated to other codes when converting from the C source to the LLVM
				3441	assembly.
				3442
				3443	An example inline assembler expression is:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	3444
				3445	.. code-block:: llvm
				3446
				3447	i32 (i32) asm "bswap $0", "=r,r"
				3448
				3449	Inline assembler expressions may only be used as the callee operand
				3450	of a :ref:`call <i_call>` or an :ref:`invoke <i_invoke>` instruction.
				3451	Thus, typically we have:
				3452
				3453	.. code-block:: llvm
				3454
				3455	%X = call i32 asm "bswap $0", "=r,r"(i32 %Y)
				3456
				3457	Inline asms with side effects not visible in the constraint list must be
				3458	marked as having side effects. This is done through the use of the
				3459	'``sideeffect``' keyword, like so:
				3460
				3461	.. code-block:: llvm
				3462
				3463	call void asm sideeffect "eieio", ""()
				3464
				3465	In some cases inline asms will contain code that will not work unless
				3466	the stack is aligned in some way, such as calls or SSE instructions on
				3467	x86, yet will not contain code that does that alignment within the asm.
				3468	The compiler should make conservative assumptions about what the asm
				3469	might contain and should generate its usual stack alignment code in the
				3470	prologue if the '``alignstack``' keyword is present:
				3471
				3472	.. code-block:: llvm
				3473
				3474	call void asm alignstack "eieio", ""()
				3475
				3476	Inline asms also support using non-standard assembly dialects. The
				3477	assumed dialect is ATT. When the '``inteldialect``' keyword is present,
				3478	the inline asm is using the Intel dialect. Currently, ATT and Intel are
				3479	the only supported dialects. An example is:
				3480
				3481	.. code-block:: llvm
				3482
				3483	call void asm inteldialect "eieio", ""()
				3484
				3485	If multiple keywords appear, the '``sideeffect``' keyword must come
				3486	first, the '``alignstack``' keyword second and the '``inteldialect``'
				3487	keyword last.
				3488
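As a minimal sketch of that ordering, the call below combines all three
keywords. The function name ``@all_keywords`` and the ``nop`` template are
illustrative assumptions, and an x86 target is assumed.

.. code-block:: llvm

   define void @all_keywords() {
     ; Keywords in the required order: sideeffect, alignstack, inteldialect.
     call void asm sideeffect alignstack inteldialect "nop", ""()
     ret void
   }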
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3489	Inline Asm Constraint String
				3490	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				3491
				3492	The constraint list is a comma-separated string, each element containing one or
				3493	more constraint codes.
				3494
				3495	For each element in the constraint list an appropriate register or memory
				3496	operand will be chosen, and it will be made available to assembly template
				3497	string expansion as ``$0`` for the first constraint in the list, ``$1`` for the
				3498	second, etc.
				3499
				3500	There are three different types of constraints, which are distinguished by a
				3501	prefix symbol in front of the constraint code: Output, Input, and Clobber. The
				3502	constraints must always be given in that order: outputs first, then inputs, then
				3503	clobbers. They cannot be intermingled.
				3504
				3505	There are also three different categories of constraint codes:
				3506
				3507	- Register constraint. This is either a register class, or a fixed physical
				3508	register. This kind of constraint will allocate a register, and if necessary,
				3509	bitcast the argument or result to the appropriate type.
				3510	- Memory constraint. This kind of constraint is for use with an instruction
				3511	taking a memory operand. Different constraints allow for different addressing
				3512	modes used by the target.
				3513	- Immediate value constraint. This kind of constraint is for an integer or other
				3514	immediate value which can be rendered directly into an instruction. The
				3515	various target-specific constraints allow the selection of a value in the
				3516	proper range for the instruction you wish to use it with.
				3517
				3518	Output constraints
				3519	""""""""""""""""""
				3520
				3521	Output constraints are specified by an "``=``" prefix (e.g. "``=r``"). This
				3522	indicates that the assembly will write to this operand, and the operand will
				3523	then be made available as a return value of the ``asm`` expression. Output
				3524	constraints do not consume an argument from the call instruction. (Except, see
				3525	below about indirect outputs).
				3526
				3527	Normally, it is expected that no output locations are written to by the assembly
				3528	expression until all of the inputs have been read. As such, LLVM may assign
				3529	the same register to an output and an input. If this is not safe (e.g. if the
				3530	assembly contains two instructions, where the first writes to one output, and
				3531	the second reads an input and writes to a second output), then the "``&``"
				3532	modifier must be used (e.g. "``=&r``") to specify that the output is an
Sylvestre Ledru	84666a1	2016-02-14 20:16:22 +0000	[diff] [blame]	3533	"early-clobber" output. Marking an output as "early-clobber" ensures that LLVM
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3534	will not use the same register for any inputs (other than an input tied to this
				3535	output).
				3536
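The situation described above can be sketched as follows, assuming an x86
target and AT&T syntax. The function name ``@early_clobber`` and the template
are hypothetical. The first instruction writes output ``$0`` before the second
instruction reads input ``$2``, so ``$0`` is marked early-clobber.

.. code-block:: llvm

   define { i32, i32 } @early_clobber(i32 %in) {
     ; $0 = first output (early-clobber), $1 = second output, $2 = input.
     ; Without "&", LLVM could place $0 and $2 in the same register, and
     ; the xorl would destroy the input before the movl reads it.
     %pair = call { i32, i32 } asm "xorl $0, $0\0A\09movl $2, $1", "=&r,=r,r"(i32 %in)
     ret { i32, i32 } %pair
   }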
				3537	Input constraints
				3538	"""""""""""""""""
				3539
				3540	Input constraints do not have a prefix -- just the constraint codes. Each input
				3541	constraint will consume one argument from the call instruction. It is not
				3542	permitted for the asm to write to any input register or memory location (unless
				3543	that input is tied to an output). Note also that multiple inputs may all be
				3544	assigned to the same register, if LLVM can determine that they necessarily all
				3545	contain the same value.
				3546
				3547	Instead of providing a Constraint Code, input constraints may also "tie"
				3548	themselves to an output constraint, by providing an integer as the constraint
				3549	string. Tied inputs still consume an argument from the call instruction, and
				3550	take up a position in the asm template numbering as is usual -- they will simply
				3551	be constrained to always use the same register as the output they've been tied
				3552	to. For example, a constraint string of "``=r,0``" says to assign a register for
				3553	output, and use that register as an input as well (it being the 0'th
				3554	constraint).
				3555
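A small sketch of a tied input, assuming an x86 target (the function name
``@swap_in_place`` is illustrative). The input occupies position ``$1`` in the
template numbering, but it always lands in the same register as ``$0``:

.. code-block:: llvm

   define i32 @swap_in_place(i32 %x) {
     ; "=r" is constraint 0; the input "0" is tied to it, so %x is placed
     ; in the register that will also hold the result.
     %y = call i32 asm "bswap $0", "=r,0"(i32 %x)
     ret i32 %y
   }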
				3556	It is permitted to tie an input to an "early-clobber" output. In that case, no
				3557	other input may share the same register as the input tied to the early-clobber
				3558	(even when the other input has the same value).
				3559
				3560	You may only tie an input to an output which has a register constraint, not a
				3561	memory constraint. Only a single input may be tied to an output.
				3562
				3563	There is also an "interesting" feature which deserves a bit of explanation: if a
				3564	register class constraint allocates a register which is too small for the value
				3565	type operand provided as input, the input value will be split into multiple
				3566	registers, and all of them passed to the inline asm.
				3567
				3568	However, this feature is often not as useful as you might think.
				3569
				3570	Firstly, the registers are not guaranteed to be consecutive. So, on those
				3571	architectures that have instructions which operate on multiple consecutive
				3572	registers, this is not an appropriate way to support them. (e.g. the 32-bit
				3573	SparcV8 has a 64-bit load instruction which takes a single 32-bit register. The
				3574	hardware then loads into both the named register and the next register. This
				3575	feature of inline asm would not be useful to support that.)
				3576
				3577	A few of the targets provide a template string modifier allowing explicit access
				3578	to the second register of a two-register operand (e.g. MIPS ``L``, ``M``, and
				3579	``D``). On such an architecture, you can actually access the second allocated
				3580	register (yet, still, not any subsequent ones). But, in that case, you're still
				3581	probably better off simply splitting the value into two separate operands, for
				3582	clarity. (e.g. see the description of the ``A`` constraint on X86, which,
				3583	despite existing only for use with this feature, is not really a good idea to
				3584	use)
				3585
				3586	Indirect inputs and outputs
				3587	"""""""""""""""""""""""""""
				3588
				3589	Indirect output or input constraints can be specified by the "``*``" modifier
				3590	(which goes after the "``=``" in case of an output). This indicates that the asm
				3591	will write to or read from the contents of an address provided as an input
				3592	argument. (Note that in this way, indirect outputs act more like an input than
				3593	an output: just like an input, they consume an argument of the call expression,
				3594	rather than producing a return value. An indirect output constraint is an
				3595	"output" only in that the asm is expected to write to the contents of the input
				3596	memory location, instead of just read from it).
				3597
				3598	This is most typically used for a memory constraint, e.g. "``=*m``", to pass the
				3599	address of a variable as a value.
				3600
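For illustration, a sketch of an indirect memory output, assuming an x86
target and the typed-pointer syntax used elsewhere in this document. The
function name ``@store_via_asm`` is hypothetical. LLVM releases that use
opaque pointers spell the pointer argument as ``ptr elementtype(i32) %p``
instead.

.. code-block:: llvm

   define void @store_via_asm(i32* %p, i32 %v) {
     ; "=*m" consumes %p as a call argument and produces no return value;
     ; the asm writes %v ($1) to the memory that %p addresses ($0).
     call void asm "movl $1, $0", "=*m,r"(i32* %p, i32 %v)
     ret void
   }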
				3601	It is also possible to use an indirect register constraint, but only on output
				3602	(e.g. "``=*r``"). This will cause LLVM to allocate a register for an output
				3603	value normally, and then, separately emit a store to the address provided as
				3604	input, after the provided inline asm. (It's not clear what value this
				3605	functionality provides, compared to writing the store explicitly after the asm
				3606	statement, and it can only produce worse code, since it bypasses many
				3607	optimization passes. I would recommend not using it.)
				3608
				3609
				3610	Clobber constraints
				3611	"""""""""""""""""""
				3612
				3613	A clobber constraint is indicated by a "``~``" prefix. A clobber does not
				3614	consume an input operand, nor generate an output. Clobbers cannot use any of the
				3615	general constraint code letters -- they may use only explicit register
				3616	constraints, e.g. "``~{eax}``". The one exception is that a clobber string of
				3617	"``~{memory}``" indicates that the assembly writes to arbitrary undeclared
				3618	memory locations -- not only the memory pointed to by a declared indirect
				3619	output.
				3620
Peter Zotov	0025723	2016-08-30 10:48:31 +0000	[diff] [blame]	3621	Note that clobbering named registers that are also present in output
				3622	constraints is not legal.
				3623
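A sketch of both kinds of clobber, assuming an x86 target. The function name
``@clobbers`` is illustrative, and ``~{flags}`` is the name assumed here for
the x86 flags register. Using ``~{memory}`` with a fence instruction is a
common convention for keeping LLVM from moving memory accesses across the asm.

.. code-block:: llvm

   define void @clobbers() {
     ; The template overwrites EAX and the flags; neither is an operand,
     ; so both are listed as clobbers.
     call void asm sideeffect "xorl %eax, %eax", "~{eax},~{flags}"()
     ; Declares an effect on memory that is not named by any operand.
     call void asm sideeffect "mfence", "~{memory}"()
     ret void
   }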
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3624
				3625	Constraint Codes
				3626	""""""""""""""""
				3627	After a potential prefix comes the constraint code, or codes.
				3628
				3629	A Constraint Code is either a single letter (e.g. "``r``"), a "``^``" character
				3630	followed by two letters (e.g. "``^wc``"), or "``{``" register-name "``}``"
				3631	(e.g. "``{eax}``").
				3632
				3633	The one and two letter constraint codes are typically chosen to be the same as
				3634	GCC's constraint codes.
				3635
				3636	A single constraint may include one or more constraint codes in it, leaving
				3637	it up to LLVM to choose which one to use. This is included mainly for
				3638	compatibility with the translation of GCC inline asm coming from clang.
				3639
				3640	There are two ways to specify alternatives, and either or both may be used in an
				3641	inline asm constraint list:
				3642
				3643	1) Append the codes to each other, making a constraint code set. E.g. "``im``"
				3644	or "``{eax}m``". This means "choose any of the options in the set". The
				3645	choice of constraint is made independently for each constraint in the
				3646	constraint list.
				3647
				3648	2) Use "``\|``" between constraint code sets, creating alternatives. Every
				3649	constraint in the constraint list must have the same number of alternative
				3650	sets. With this syntax, the same alternative in all of the items in the
				3651	constraint list will be chosen together.
				3652
				3653	Putting those together, you might have a two operand constraint string like
				3654	``"rm\|r,ri\|rm"``. This indicates that if operand 0 is ``r`` or ``m``, then
				3655	operand 1 may be one of ``r`` or ``i``. If operand 0 is ``r``, then operand 1
				3656	may be one of ``r`` or ``m``. But, operand 0 and 1 cannot both be of type m.
				3657
				3658	However, the use of either of the alternatives features is NOT recommended, as
				3659	LLVM is not able to make an intelligent choice about which one to use. (At the
				3660	point it currently needs to choose, not enough information is available to do so
				3661	in a smart way.) Thus, it simply tries to make a choice that's most likely to
				3662	compile, not one that will give optimal performance. (e.g., given "``rm``", it'll
				3663	always choose to use memory, not registers). And, if given multiple registers,
				3664	or multiple register classes, it will simply choose the first one. (In fact, it
				3665	doesn't currently even ensure explicitly specified physical registers are
				3666	unique, so specifying multiple physical registers as alternatives, like
				3667	``{r11}{r12},{r11}{r12}``, will assign r11 to both operands, not at all what was
				3668	intended.)
				3669
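For reference, a constraint code set looks like this in practice. This is a
sketch assuming an x86 target; the function name ``@add_alternatives`` is
hypothetical. Operand 2 may be satisfied by either a register or an immediate,
and the choice is left to LLVM as described above.

.. code-block:: llvm

   define i32 @add_alternatives(i32 %a, i32 %b) {
     ; Operand 0 is the output, operand 1 is %a tied to it, and operand 2
     ; uses the code set "ri" (register or immediate).
     %sum = call i32 asm "addl $2, $0", "=r,0,ri,~{flags}"(i32 %a, i32 %b)
     ret i32 %sum
   }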
				3670	Supported Constraint Code List
				3671	""""""""""""""""""""""""""""""
				3672
				3673	The constraint codes are, in general, expected to behave the same way they do in
				3674	GCC. LLVM's support is often implemented on an 'as-needed' basis, to support C
				3675	inline asm code which was supported by GCC. A mismatch in behavior between LLVM
				3676	and GCC likely indicates a bug in LLVM.
				3677
				3678	Some constraint codes are typically supported by all targets:
				3679
				3680	- ``r``: A register in the target's general purpose register class.
				3681	- ``m``: A memory address operand. It is target-specific what addressing modes
				3682	are supported, typical examples are register, or register + register offset,
				3683	or register + immediate offset (of some target-specific size).
				3684	- ``i``: An integer constant (of target-specific width). Allows either a simple
				3685	immediate, or a relocatable value.
				3686	- ``n``: An integer constant -- not including relocatable values.
				3687	- ``s``: An integer constant, but allowing only relocatable values.
				3688	- ``X``: Allows an operand of any kind, no constraint whatsoever. Typically
				3689	useful to pass a label for an asm branch or call.
				3690
				3691	.. FIXME: but that surely isn't actually okay to jump out of an asm
				3692	block without telling llvm about the control transfer???)
				3693
				3694	- ``{register-name}``: Requires exactly the named physical register.
				3695
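A brief sketch of the ``{register-name}`` form, assuming an x86 target (the
function name ``@fixed_registers`` is illustrative). Because both operands are
pinned, the template can name the physical registers directly:

.. code-block:: llvm

   define i32 @fixed_registers(i32 %x) {
     ; The input is pinned to ECX and the output to EAX.
     %r = call i32 asm "movl %ecx, %eax", "={eax},{ecx}"(i32 %x)
     ret i32 %r
   }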
				3696	Other constraints are target-specific:
				3697
				3698	AArch64:
				3699
				3700	- ``z``: An immediate integer 0. Outputs ``WZR`` or ``XZR``, as appropriate.
				3701	- ``I``: An immediate integer valid for an ``ADD`` or ``SUB`` instruction,
				3702	i.e. 0 to 4095 with optional shift by 12.
				3703	- ``J``: An immediate integer that, when negated, is valid for an ``ADD`` or
				3704	``SUB`` instruction, i.e. -1 to -4095 with optional left shift by 12.
				3705	- ``K``: An immediate integer that is valid for the 'bitmask immediate 32' of a
				3706	logical instruction like ``AND``, ``EOR``, or ``ORR`` with a 32-bit register.
				3707	- ``L``: An immediate integer that is valid for the 'bitmask immediate 64' of a
				3708	logical instruction like ``AND``, ``EOR``, or ``ORR`` with a 64-bit register.
				3709	- ``M``: An immediate integer for use with the ``MOV`` assembly alias on a
				3710	32-bit register. This is a superset of ``K``: in addition to the bitmask
				3711	immediate, also allows immediate integers which can be loaded with a single
				3712	``MOVZ`` or ``MOVN`` instruction.
				3713	- ``N``: An immediate integer for use with the ``MOV`` assembly alias on a
				3714	64-bit register. This is a superset of ``L``.
				3715	- ``Q``: Memory address operand must be in a single register (no
				3716	offsets). (However, LLVM currently does this for the ``m`` constraint as
				3717	well.)
				3718	- ``r``: A 32 or 64-bit integer register (W* or X*).
				3719	- ``w``: A 32, 64, or 128-bit floating-point/SIMD register.
				3720	- ``x``: A lower 128-bit floating-point/SIMD register (``V0`` to ``V15``).
				3721
				3722	AMDGPU:
				3723
				3724	- ``r``: A 32 or 64-bit integer register.
				3725	- ``[0-9]v``: The 32-bit VGPR register, number 0-9.
				3726	- ``[0-9]s``: The 32-bit SGPR register, number 0-9.
				3727
				3728
				3729	All ARM modes:
				3730
				3731	- ``Q``, ``Um``, ``Un``, ``Uq``, ``Us``, ``Ut``, ``Uv``, ``Uy``: Memory address
				3732	operand. Treated the same as constraint ``m``, at the moment.
				3733
				3734	ARM and ARM's Thumb2 mode:
				3735
				3736	- ``j``: An immediate integer between 0 and 65535 (valid for ``MOVW``)
				3737	- ``I``: An immediate integer valid for a data-processing instruction.
				3738	- ``J``: An immediate integer between -4095 and 4095.
				3739	- ``K``: An immediate integer whose bitwise inverse is valid for a
				3740	data-processing instruction. (Can be used with template modifier "``B``" to
				3741	print the inverted value).
				3742	- ``L``: An immediate integer whose negation is valid for a data-processing
				3743	instruction. (Can be used with template modifier "``n``" to print the negated
				3744	value).
				3745	- ``M``: A power of two or an integer between 0 and 32.
				3746	- ``N``: Invalid immediate constraint.
				3747	- ``O``: Invalid immediate constraint.
				3748	- ``r``: A general-purpose 32-bit integer register (``r0-r15``).
				3749	- ``l``: In Thumb2 mode, low 32-bit GPR registers (``r0-r7``). In ARM mode, same
				3750	as ``r``.
				3751	- ``h``: In Thumb2 mode, a high 32-bit GPR register (``r8-r15``). In ARM mode,
				3752	invalid.
				3753	- ``w``: A 32, 64, or 128-bit floating-point/SIMD register: ``s0-s31``,
				3754	``d0-d31``, or ``q0-q15``.
				3755	- ``x``: A 32, 64, or 128-bit floating-point/SIMD register: ``s0-s15``,
				3756	``d0-d7``, or ``q0-q3``.
Pablo Barrio	e28cb83	2018-02-15 14:44:22 +0000	[diff] [blame]	3757	- ``t``: A low floating-point/SIMD register: ``s0-s31``, ``d0-d15``, or
				3758	``q0-q7``.
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3759
				3760	ARM's Thumb1 mode:
				3761
				3762	- ``I``: An immediate integer between 0 and 255.
				3763	- ``J``: An immediate integer between -255 and -1.
				3764	- ``K``: An immediate integer between 0 and 255, with optional left-shift by
				3765	some amount.
				3766	- ``L``: An immediate integer between -7 and 7.
				3767	- ``M``: An immediate integer which is a multiple of 4 between 0 and 1020.
				3768	- ``N``: An immediate integer between 0 and 31.
				3769	- ``O``: An immediate integer which is a multiple of 4 between -508 and 508.
				3770	- ``r``: A low 32-bit GPR register (``r0-r7``).
				3771	- ``l``: A low 32-bit GPR register (``r0-r7``).
				3772	- ``h``: A high GPR register (``r8-r15``).
				3773	- ``w``: A 32, 64, or 128-bit floating-point/SIMD register: ``s0-s31``,
				3774	``d0-d31``, or ``q0-q15``.
				3775	- ``x``: A 32, 64, or 128-bit floating-point/SIMD register: ``s0-s15``,
				3776	``d0-d7``, or ``q0-q3``.
Pablo Barrio	e28cb83	2018-02-15 14:44:22 +0000	[diff] [blame]	3777	- ``t``: A low floating-point/SIMD register: ``s0-s31``, ``d0-d15``, or
				3778	``q0-q7``.
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3779
				3780
				3781	Hexagon:
				3782
				3783	- ``o``, ``v``: A memory address operand, treated the same as constraint ``m``,
				3784	at the moment.
				3785	- ``r``: A 32 or 64-bit register.
				3786
				3787	MSP430:
				3788
				3789	- ``r``: An 8 or 16-bit register.
				3790
				3791	MIPS:
				3792
				3793	- ``I``: An immediate signed 16-bit integer.
				3794	- ``J``: An immediate integer zero.
				3795	- ``K``: An immediate unsigned 16-bit integer.
				3796	- ``L``: An immediate 32-bit integer, where the lower 16 bits are 0.
				3797	- ``N``: An immediate integer between -65535 and -1.
				3798	- ``O``: An immediate signed 15-bit integer.
				3799	- ``P``: An immediate integer between 1 and 65535.
				3800	- ``m``: A memory address operand. In MIPS-SE mode, allows a base address
				3801	register plus 16-bit immediate offset. In MIPS mode, just a base register.
				3802	- ``R``: A memory address operand. In MIPS-SE mode, allows a base address
				3803	register plus a 9-bit signed offset. In MIPS mode, the same as constraint
				3804	``m``.
				3805	- ``ZC``: A memory address operand, suitable for use in a ``pref``, ``ll``, or
				3806	``sc`` instruction on the given subtarget (details vary).
				3807	- ``r``, ``d``, ``y``: A 32 or 64-bit GPR register.
				3808	- ``f``: A 32 or 64-bit FPU register (``F0-F31``), or a 128-bit MSA register
Daniel Sanders	3745e02	2015-07-13 09:24:21 +0000	[diff] [blame]	3809	(``W0-W31``). In the case of MSA registers, it is recommended to use the ``w``
				3810	argument modifier for compatibility with GCC.
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3811	- ``c``: A 32-bit or 64-bit GPR register suitable for indirect jump (always
				3812	``25``).
				3813	- ``l``: The ``lo`` register, 32 or 64-bit.
				3814	- ``x``: Invalid.
				3815
				3816	NVPTX:
				3817
				3818	- ``b``: A 1-bit integer register.
				3819	- ``c`` or ``h``: A 16-bit integer register.
				3820	- ``r``: A 32-bit integer register.
				3821	- ``l`` or ``N``: A 64-bit integer register.
				3822	- ``f``: A 32-bit float register.
				3823	- ``d``: A 64-bit float register.
				3824
				3825
				3826	PowerPC:
				3827
				3828	- ``I``: An immediate signed 16-bit integer.
				3829	- ``J``: An immediate unsigned 16-bit integer, shifted left 16 bits.
				3830	- ``K``: An immediate unsigned 16-bit integer.
				3831	- ``L``: An immediate signed 16-bit integer, shifted left 16 bits.
				3832	- ``M``: An immediate integer greater than 31.
				3833	- ``N``: An immediate integer that is an exact power of 2.
				3834	- ``O``: The immediate integer constant 0.
				3835	- ``P``: An immediate integer constant whose negation is a signed 16-bit
				3836	constant.
				3837	- ``es``, ``o``, ``Q``, ``Z``, ``Zy``: A memory address operand, currently
				3838	treated the same as ``m``.
				3839	- ``r``: A 32 or 64-bit integer register.
				3840	- ``b``: A 32 or 64-bit integer register, excluding ``R0`` (that is:
				3841	``R1-R31``).
				3842	- ``f``: A 32 or 64-bit float register (``F0-F31``), or when QPX is enabled, a
				3843	128 or 256-bit QPX register (``Q0-Q31``; aliases the ``F`` registers).
				3844	- ``v``: For ``4 x f32`` or ``4 x f64`` types, when QPX is enabled, a
				3845	128 or 256-bit QPX register (``Q0-Q31``), otherwise a 128-bit
				3846	altivec vector register (``V0-V31``).
				3847
				3848	.. FIXME: is this a bug that v accepts QPX registers? I think this
				3849	is supposed to only use the altivec vector registers?
				3850
				3851	- ``y``: Condition register (``CR0-CR7``).
				3852	- ``wc``: An individual CR bit in a CR register.
				3853	- ``wa``, ``wd``, ``wf``: Any 128-bit VSX vector register, from the full VSX
				3854	register set (overlapping both the floating-point and vector register files).
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3855	- ``ws``: A 32 or 64-bit floating-point register, from the full VSX register
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3856	set.
				3857
				3858	Sparc:
				3859
				3860	- ``I``: An immediate 13-bit signed integer.
				3861	- ``r``: A 32-bit integer register.
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3862	- ``f``: Any floating-point register on SparcV8, or a floating-point
James Y Knight	d4e1b00	2017-05-12 15:59:10 +0000	[diff] [blame]	3863	register in the "low" half of the registers on SparcV9.
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3864	- ``e``: Any floating-point register. (Same as ``f`` on SparcV8.)
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3865
				3866	SystemZ:
				3867
				3868	- ``I``: An immediate unsigned 8-bit integer.
				3869	- ``J``: An immediate unsigned 12-bit integer.
				3870	- ``K``: An immediate signed 16-bit integer.
				3871	- ``L``: An immediate signed 20-bit integer.
				3872	- ``M``: An immediate integer 0x7fffffff.
Ulrich Weigand	daae87aa	2016-06-13 14:24:05 +0000	[diff] [blame]	3873	- ``Q``: A memory address operand with a base address and a 12-bit immediate
				3874	unsigned displacement.
				3875	- ``R``: A memory address operand with a base address, a 12-bit immediate
				3876	unsigned displacement, and an index register.
				3877	- ``S``: A memory address operand with a base address and a 20-bit immediate
				3878	signed displacement.
				3879	- ``T``: A memory address operand with a base address, a 20-bit immediate
				3880	signed displacement, and an index register.
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3881	- ``r`` or ``d``: A 32, 64, or 128-bit integer register.
				3882	- ``a``: A 32, 64, or 128-bit integer address register (excludes R0, which in an
				3883	address context evaluates as zero).
				3884	- ``h``: A 32-bit value in the high part of a 64-bit data register
				3885	(LLVM-specific)
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	3886	- ``f``: A 32, 64, or 128-bit floating-point register.
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3887
				3888	X86:
				3889
				3890	- ``I``: An immediate integer between 0 and 31.
				3891	- ``J``: An immediate integer between 0 and 63.
				3892	- ``K``: An immediate signed 8-bit integer.
				3893	- ``L``: An immediate integer, 0xff or 0xffff or (in 64-bit mode only)
				3894	0xffffffff.
				3895	- ``M``: An immediate integer between 0 and 3.
				3896	- ``N``: An immediate unsigned 8-bit integer.
				3897	- ``O``: An immediate integer between 0 and 127.
				3898	- ``e``: An immediate 32-bit signed integer.
				3899	- ``Z``: An immediate 32-bit unsigned integer.
				3900	- ``o``, ``v``: Treated the same as ``m``, at the moment.
				3901	- ``q``: An 8, 16, 32, or 64-bit register which can be accessed as an 8-bit
				3902	``l`` integer register. On X86-32, this is the ``a``, ``b``, ``c``, and ``d``
				3903	registers, and on X86-64, it is all of the integer registers.
				3904	- ``Q``: An 8, 16, 32, or 64-bit register which can be accessed as an 8-bit
				3905	``h`` integer register. This is the ``a``, ``b``, ``c``, and ``d`` registers.
				3906	- ``r`` or ``l``: An 8, 16, 32, or 64-bit integer register.
				3907	- ``R``: An 8, 16, 32, or 64-bit "legacy" integer register -- one which has
				3908	existed since i386, and can be accessed without the REX prefix.
				3909	- ``f``: A 32, 64, or 80-bit '387 FPU stack pseudo-register.
				3910	- ``y``: A 64-bit MMX register, if MMX is enabled.
				3911	- ``x``: If SSE is enabled: a 32 or 64-bit scalar operand, or 128-bit vector
				3912	operand in a SSE register. If AVX is also enabled, can also be a 256-bit
				3913	vector operand in an AVX register. If AVX-512 is also enabled, can also be a
				3914	512-bit vector operand in an AVX512 register. Otherwise, an error.
				3915	- ``Y``: The same as ``x``, if SSE2 is enabled, otherwise an error.
				3916	- ``A``: Special case: allocates EAX first, then EDX, for a single operand (in
				3917	32-bit mode, a 64-bit integer operand will get split into two registers). It
				3918	is not recommended to use this constraint, as in 64-bit mode, the 64-bit
				3919	operand will get allocated only to RAX -- if two 32-bit operands are needed,
				3920	you're better off splitting it yourself, before passing it to the asm
				3921	statement.
				3922
				3923	XCore:
				3924
				3925	- ``r``: A 32-bit integer register.
				3926
				3927
				3928	.. _inline-asm-modifiers:
				3929
				3930	Asm template argument modifiers
				3931	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				3932
				3933	In the asm template string, modifiers can be used on the operand reference, like
				3934	"``${0:n}``".
				3935
				3936	The modifiers are, in general, expected to behave the same way they do in
				3937	GCC. LLVM's support is often implemented on an 'as-needed' basis, to support C
				3938	inline asm code which was supported by GCC. A mismatch in behavior between LLVM
				3939	and GCC likely indicates a bug in LLVM.
				3940
				3941	Target-independent:
				3942
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	3943	- ``c``: Print an immediate integer constant unadorned, without
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	3944	the target-specific immediate punctuation (e.g. no ``$`` prefix).
				3945	- ``n``: Negate and print immediate integer constant unadorned, without the
				3946	target-specific immediate punctuation (e.g. no ``$`` prefix).
				3947	- ``l``: Print as an unadorned label, without the target-specific label
				3948	punctuation (e.g. no ``$`` prefix).
				3949
				3950	AArch64:
				3951
				3952	- ``w``: Print a GPR register with a ``w`` name instead of ``x`` name. E.g.,
				3953	instead of ``x30``, print ``w30``.
				3954	- ``x``: Print a GPR register with a ``x*`` name. (this is the default, anyhow).
				3955	- ``b``, ``h``, ``s``, ``d``, ``q``: Print a floating-point/SIMD register with a
				3956	``b``, ``h``, ``s``, ``d``, or ``q*`` name, rather than the default of
				3957	``v*``.
				3958
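As a sketch of the ``w`` modifier, assuming an AArch64 target (the function
name ``@add32`` is illustrative). Without the modifier, each operand would be
printed with its default ``x`` name:

.. code-block:: llvm

   define i32 @add32(i32 %a, i32 %b) {
     ; ${N:w} prints operand N with its w-register name (e.g. w0), which is
     ; what a 32-bit add expects.
     %r = call i32 asm "add ${0:w}, ${1:w}, ${2:w}", "=r,r,r"(i32 %a, i32 %b)
     ret i32 %r
   }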
				3959	AMDGPU:
				3960
				3961	- ``r``: No effect.
				3962
				3963	ARM:
				3964
				3965	- ``a``: Print an operand as an address (with ``[`` and ``]`` surrounding a
				3966	register).
				3967	- ``P``: No effect.
				3968	- ``q``: No effect.
				3969	- ``y``: Print a VFP single-precision register as an indexed double (e.g. print
				3970	as ``d4[1]`` instead of ``s9``)
				3971	- ``B``: Bitwise invert and print an immediate integer constant without ``#``
				3972	prefix.
				3973	- ``L``: Print the low 16 bits of an immediate integer constant.
				3974	- ``M``: Print as a register set suitable for ldm/stm. Also prints all
				3975	register operands subsequent to the specified one (!), so use carefully.
				3976	- ``Q``: Print the low-order register of a register-pair, or the low-order
				3977	register of a two-register operand.
				3978	- ``R``: Print the high-order register of a register-pair, or the high-order
				3979	register of a two-register operand.
				3980	- ``H``: Print the second register of a register-pair. (On a big-endian system,
				3981	``H`` is equivalent to ``Q``, and on a little-endian system, ``H`` is equivalent
				3982	to ``R``.)
				3983
				3984	.. FIXME: H doesn't currently support printing the second register
				3985	of a two-register operand.
				3986
				3987	- ``e``: Print the low doubleword register of a NEON quad register.
				3988	- ``f``: Print the high doubleword register of a NEON quad register.
				3989	- ``m``: Print the base register of a memory operand without the ``[`` and ``]``
				3990	adornment.
				3991
				3992	Hexagon:
				3993
				3994	- ``L``: Print the second register of a two-register operand. Requires that it
				3995	has been allocated consecutively to the first.
				3996
				3997	.. FIXME: why is it restricted to consecutive ones? And there's
				3998	nothing that ensures that happens, is there?
				3999
				4000	- ``I``: Print the letter 'i' if the operand is an integer constant, otherwise
				4001	nothing. Used to print 'addi' vs 'add' instructions.
				4002
				4003	MSP430:
				4004
				4005	No additional modifiers.
				4006
				4007	MIPS:
				4008
				4009	- ``X``: Print an immediate integer as hexadecimal.
				4010	- ``x``: Print the low 16 bits of an immediate integer as hexadecimal.
				4011	- ``d``: Print an immediate integer as decimal.
				4012	- ``m``: Subtract one and print an immediate integer as decimal.
				4013	- ``z``: Print $0 if an immediate zero, otherwise print normally.
				4014	- ``L``: Print the low-order register of a two-register operand, or prints the
				4015	address of the low-order word of a double-word memory operand.
				4016
				4017	.. FIXME: L seems to be missing memory operand support.
				4018
				4019	- ``M``: Print the high-order register of a two-register operand, or prints the
				4020	address of the high-order word of a double-word memory operand.
				4021
				4022	.. FIXME: M seems to be missing memory operand support.
				4023
- ``D``: Print the second register of a two-register operand, or print the
  second word of a double-word memory operand. (On a big-endian system, ``D`` is
  equivalent to ``L``, and on a little-endian system, ``D`` is equivalent to
  ``M``.)
Daniel Sanders	3745e02	2015-07-13 09:24:21 +0000	[diff] [blame]	4028	- ``w``: No effect. Provided for compatibility with GCC which requires this
				4029	modifier in order to print MSA registers (``W0-W31``) with the ``f``
				4030	constraint.
James Y Knight	bc832ed	2015-07-08 18:08:36 +0000	[diff] [blame]	4031
				4032	NVPTX:
				4033
				4034	- ``r``: No effect.
				4035
				4036	PowerPC:
				4037
				4038	- ``L``: Print the second register of a two-register operand. Requires that it
				4039	has been allocated consecutively to the first.
				4040
				4041	.. FIXME: why is it restricted to consecutive ones? And there's
				4042	nothing that ensures that happens, is there?
				4043
				4044	- ``I``: Print the letter 'i' if the operand is an integer constant, otherwise
				4045	nothing. Used to print 'addi' vs 'add' instructions.
- ``y``: For a memory operand, print the operand formatted for a two-register
  X-form instruction. (Currently always prints ``r0,OPERAND``.)
- ``U``: Print 'u' if the memory operand is an update form, and nothing
  otherwise. (NOTE: LLVM does not support update form, so this currently
  always prints nothing.)
- ``X``: Print 'x' if the memory operand is an indexed form. (NOTE: LLVM does
  not support indexed form, so this currently always prints nothing.)
				4053
				4054	Sparc:
				4055
				4056	- ``r``: No effect.
				4057
				4058	SystemZ:
				4059
				4060	SystemZ implements only ``n``, and does not support any of the other
				4061	target-independent modifiers.
				4062
				4063	X86:
				4064
				4065	- ``c``: Print an unadorned integer or symbol name. (The latter is
				4066	target-specific behavior for this typically target-independent modifier).
				4067	- ``A``: Print a register name with a '``*``' before it.
				4068	- ``b``: Print an 8-bit register name (e.g. ``al``); do nothing on a memory
				4069	operand.
				4070	- ``h``: Print the upper 8-bit register name (e.g. ``ah``); do nothing on a
				4071	memory operand.
				4072	- ``w``: Print the 16-bit register name (e.g. ``ax``); do nothing on a memory
				4073	operand.
				4074	- ``k``: Print the 32-bit register name (e.g. ``eax``); do nothing on a memory
				4075	operand.
				4076	- ``q``: Print the 64-bit register name (e.g. ``rax``), if 64-bit registers are
				4077	available, otherwise the 32-bit register name; do nothing on a memory operand.
				4078	- ``n``: Negate and print an unadorned integer, or, for operands other than an
				4079	immediate integer (e.g. a relocatable symbol expression), print a '-' before
				4080	the operand. (The behavior for relocatable symbol expressions is a
				4081	target-specific behavior for this typically target-independent modifier)
				4082	- ``H``: Print a memory reference with additional offset +8.
				4083	- ``P``: Print a memory reference or operand for use as the argument of a call
				4084	instruction. (E.g. omit ``(rip)``, even though it's PC-relative.)
				4085
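As a rough illustration of the X86 register-size modifiers, the sketch below
zero-extends the low byte of a value. The function name ``@low_byte`` and the
choice of the ``q`` constraint are assumptions made for this example, and the
template assumes AT&T syntax. ``${1:b}`` requests the 8-bit name of whichever
register operand 1 was assigned, while ``$0`` prints the output register
unmodified.

.. code-block:: llvm

    ; Hypothetical example: operand 1 is printed with its 8-bit register name.
    define i32 @low_byte(i32 %x) {
      %r = call i32 asm "movzbl ${1:b}, $0", "=r,q"(i32 %x)
      ret i32 %r
    }
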
				4086	XCore:
				4087
				4088	No additional modifiers.
				4089
				4090
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4091	Inline Asm Metadata
				4092	^^^^^^^^^^^^^^^^^^^
				4093
The call instructions that wrap inline asm nodes may have a
"``!srcloc``" MDNode attached to them that contains a list of constant
integers. If present, the code generator will use the integer as the
location cookie value when reporting errors through the ``LLVMContext``
error reporting mechanisms. This allows a front-end to correlate backend
errors that occur with inline asm back to the source code that produced
it. For example:
				4101
				4102	.. code-block:: llvm
				4103
				4104	call void asm sideeffect "something bad", ""(), !srcloc !42
				4105	...
				4106	!42 = !{ i32 1234567 }
				4107
				4108	It is up to the front-end to make sense of the magic numbers it places
				4109	in the IR. If the MDNode contains multiple constants, the code generator
				4110	will use the one that corresponds to the line of the asm that the error
				4111	occurs on.
				4112
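A sketch of the multi-constant case, assuming a two-line asm string (the lines
are separated by ``\0A``) and one cookie per line. The node number ``!43`` and
both cookie values are made up for illustration.

.. code-block:: llvm

    ; Line 1 of the asm is "nop", line 2 is "something bad".
    call void asm sideeffect "nop\0Asomething bad", ""(), !srcloc !43
    ...
    ; One location cookie per asm line, in order.
    !43 = !{ i32 1234567, i32 1234568 }

Under that assumption, an error on the second asm line would be reported with
the second cookie, ``1234568``.
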
				4113	.. _metadata:
				4114
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4115	Metadata
				4116	========
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4117
				4118	LLVM IR allows metadata to be attached to instructions in the program
				4119	that can convey extra information about the code to the optimizers and
				4120	code generator. One example application of metadata is source-level
				4121	debug information. There are two metadata primitives: strings and nodes.
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4122
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4123	Metadata does not have a type, and is not a value. If referenced from a
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4124	``call`` instruction, it uses the ``metadata`` type.
				4125
All metadata are identified in syntax by an exclamation point ('``!``').
				4127
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4128	.. _metadata-string:
				4129
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4130	Metadata Nodes and Metadata Strings
				4131	-----------------------------------
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4132
				4133	A metadata string is a string surrounded by double quotes. It can
				4134	contain any character by escaping non-printable characters with
				4135	"``\xx``" where "``xx``" is the two digit hex code. For example:
				4136	"``!"test\00"``".
				4137
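Two more escaped strings, given only as an illustrative sketch: ``\09`` is the
hex code for a tab character and ``\22`` is the hex code for a double quote.
The node number is arbitrary.

.. code-block:: llvm

    ; A tuple holding a string with an embedded tab and one with embedded quotes.
    !0 = !{ !"tab\09separated", !"a \22quoted\22 word" }
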
Metadata nodes are represented with notation similar to structure
constants (a comma separated list of elements, surrounded by braces and
preceded by an exclamation point). Metadata nodes can have any values as
their operands. For example:
				4142
				4143	.. code-block:: llvm
				4144
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4145	!{ !"test\00", i32 10}
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4146
Duncan P. N. Exon Smith	090a19b	2015-01-08 22:38:29 +0000	[diff] [blame]	4147	Metadata nodes that aren't uniqued use the ``distinct`` keyword. For example:
				4148
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4149	.. code-block:: text
Duncan P. N. Exon Smith	090a19b	2015-01-08 22:38:29 +0000	[diff] [blame]	4150
				4151	!0 = distinct !{!"test\00", i32 10}
				4152
Duncan P. N. Exon Smith	9901034	2015-01-08 23:50:26 +0000	[diff] [blame]	4153	``distinct`` nodes are useful when nodes shouldn't be merged based on their
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4154	content. They can also occur when transformations cause uniquing collisions
Duncan P. N. Exon Smith	9901034	2015-01-08 23:50:26 +0000	[diff] [blame]	4155	when metadata operands change.
				4156
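The difference can be sketched with four nodes that have identical content
(the node numbers are arbitrary). The first two are uniqued, so they are
eligible to be merged into a single node; the last two are ``distinct`` and
remain separate nodes even though their operands match.

.. code-block:: text

    !0 = !{!"test\00", i32 10}
    !1 = !{!"test\00", i32 10}           ; same content as !0, may be merged with it
    !2 = distinct !{!"test\00", i32 10}  ; never merged by content
    !3 = distinct !{!"test\00", i32 10}  ; separate from !2
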
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4157	A :ref:`named metadata <namedmetadatastructure>` is a collection of
				4158	metadata nodes, which can be looked up in the module symbol table. For
				4159	example:
				4160
				4161	.. code-block:: llvm
				4162
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4163	!foo = !{!4, !3}
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4164
Adrian Prantl	1b842da	2017-07-28 20:44:29 +0000	[diff] [blame]	4165	Metadata can be used as function arguments. Here the ``llvm.dbg.value``
				4166	intrinsic is using three metadata arguments:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4167
				4168	.. code-block:: llvm
				4169
Adrian Prantl	abe0475	2017-07-28 20:21:02 +0000	[diff] [blame]	4170	call void @llvm.dbg.value(metadata !24, metadata !25, metadata !26)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4171
Peter Collingbourne	5010868	2015-11-06 02:41:02 +0000	[diff] [blame]	4172	Metadata can be attached to an instruction. Here metadata ``!21`` is attached
				4173	to the ``add`` instruction using the ``!dbg`` identifier:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4174
				4175	.. code-block:: llvm
				4176
				4177	%indvar.next = add i64 %indvar, 1, !dbg !21
				4178
Metadata can also be attached to a function or a global variable. Here metadata
``!22`` is attached to the ``f1`` and ``f2`` functions, and the globals ``g1``
and ``g2`` using the ``!dbg`` identifier:
Peter Collingbourne	5010868	2015-11-06 02:41:02 +0000	[diff] [blame]	4182
				4183	.. code-block:: llvm
				4184
Peter Collingbourne	7b5b7c7	2017-01-25 21:50:14 +0000	[diff] [blame]	4185	declare !dbg !22 void @f1()
				4186	define void @f2() !dbg !22 {
Peter Collingbourne	5010868	2015-11-06 02:41:02 +0000	[diff] [blame]	4187	ret void
				4188	}
				4189
Peter Collingbourne	7b5b7c7	2017-01-25 21:50:14 +0000	[diff] [blame]	4190	@g1 = global i32 0, !dbg !22
				4191	@g2 = external global i32, !dbg !22
				4192
A transformation is required to drop any metadata attachment that it does not
know or knows it can't preserve. Currently there is an exception for metadata
attachment to globals for ``!type`` and ``!absolute_symbol`` which can't be
unconditionally dropped unless the global is itself deleted.
				4197
				4198	Metadata attached to a module using named metadata may not be dropped, with
				4199	the exception of debug metadata (named metadata with the name ``!llvm.dbg.*``).
				4200
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4201	More information about specific metadata nodes recognized by the
				4202	optimizers and code generator is found below.
				4203
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4204	.. _specialized-metadata:
				4205
Duncan P. N. Exon Smith	6a48483	2015-01-13 21:10:44 +0000	[diff] [blame]	4206	Specialized Metadata Nodes
				4207	^^^^^^^^^^^^^^^^^^^^^^^^^^
				4208
				4209	Specialized metadata nodes are custom data structures in metadata (as opposed
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4210	to generic tuples). Their fields are labelled, and can be specified in any
Duncan P. N. Exon Smith	6a48483	2015-01-13 21:10:44 +0000	[diff] [blame]	4211	order.
				4212
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4213	These aren't inherently debug info centric, but currently all the specialized
				4214	metadata nodes are related to debug info.
				4215
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4216	.. _DICompileUnit:
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4217
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4218	DICompileUnit
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4219	"""""""""""""
				4220
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4221	``DICompileUnit`` nodes represent a compile unit. The ``enums:``,
Adrian Prantl	6c2497f	2017-06-12 23:59:43 +0000	[diff] [blame]	4222	``retainedTypes:``, ``globals:``, ``imports:`` and ``macros:`` fields are tuples
				4223	containing the debug info to be emitted along with the compile unit, regardless
				4224	of code optimizations (some nodes are only emitted if there are references to
				4225	them from instructions). The ``debugInfoForProfiling:`` field is a boolean
				4226	indicating whether or not line-table discriminators are updated to provide
				4227	more-accurate debug info for profiling results.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4228
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4229	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4230
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4231	!0 = !DICompileUnit(language: DW_LANG_C99, file: !1, producer: "clang",
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4232	isOptimized: true, flags: "-O2", runtimeVersion: 2,
Adrian Prantl	b808951	2016-04-01 00:16:49 +0000	[diff] [blame]	4233	splitDebugFilename: "abc.debug", emissionKind: FullDebug,
Adrian Prantl	6c2497f	2017-06-12 23:59:43 +0000	[diff] [blame]	4234	enums: !2, retainedTypes: !3, globals: !4, imports: !5,
				4235	macros: !6, dwoId: 0x0abcd)
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4236
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4237	Compile unit descriptors provide the root scope for objects declared in a
Adrian Prantl	6c2497f	2017-06-12 23:59:43 +0000	[diff] [blame]	4238	specific compilation unit. File descriptors are defined using this scope. These
				4239	descriptors are collected by a named metadata node ``!llvm.dbg.cu``. They keep
				4240	track of global variables, type information, and imported entities (declarations
				4241	and namespaces).
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4242
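A minimal sketch of how a compile unit is collected by ``!llvm.dbg.cu``. The
file name, directory and node numbers are placeholders chosen for this
example, and only a few of the compile unit's fields are shown.

.. code-block:: text

    ; The named metadata node lists every compile unit in the module.
    !llvm.dbg.cu = !{!0}

    !0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1,
                                 producer: "clang", emissionKind: FullDebug)
    !1 = !DIFile(filename: "a.c", directory: "/path/to/dir")
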
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4243	.. _DIFile:
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4244
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4245	DIFile
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4246	""""""
				4247
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4248	``DIFile`` nodes represent files. The ``filename:`` can include slashes.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4249
Aaron Ballman	b3c5151	2017-01-17 21:48:31 +0000	[diff] [blame]	4250	.. code-block:: none
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4251
Amjad Aboud	7faeecc	2016-12-25 10:12:09 +0000	[diff] [blame]	4252	!0 = !DIFile(filename: "path/to/file", directory: "/path/to/dir",
				4253	checksumkind: CSK_MD5,
				4254	checksum: "000102030405060708090a0b0c0d0e0f")
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4255
Files are sometimes used in ``scope:`` fields, and are the only valid target
for ``file:`` fields.
Valid values for the ``checksumkind:`` field are: {CSK_None, CSK_MD5, CSK_SHA1}.
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4259
Michael Kuperstein	605308a	2015-05-14 10:58:59 +0000	[diff] [blame]	4260	.. _DIBasicType:
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4261
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4262	DIBasicType
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4263	"""""""""""
				4264
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4265	``DIBasicType`` nodes represent primitive types, such as ``int``, ``bool`` and
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4266	``float``. ``tag:`` defaults to ``DW_TAG_base_type``.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4267
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4268	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4269
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4270	!0 = !DIBasicType(name: "unsigned char", size: 8, align: 8,
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4271	encoding: DW_ATE_unsigned_char)
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4272	!1 = !DIBasicType(tag: DW_TAG_unspecified_type, name: "decltype(nullptr)")
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4273
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4274	The ``encoding:`` describes the details of the type. Usually it's one of the
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4275	following:
				4276
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4277	.. code-block:: text
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4278
				4279	DW_ATE_address = 1
				4280	DW_ATE_boolean = 2
				4281	DW_ATE_float = 4
				4282	DW_ATE_signed = 5
				4283	DW_ATE_signed_char = 6
				4284	DW_ATE_unsigned = 7
				4285	DW_ATE_unsigned_char = 8
				4286
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4287	.. _DISubroutineType:
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4288
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4289	DISubroutineType
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4290	""""""""""""""""
				4291
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4292	``DISubroutineType`` nodes represent subroutine types. Their ``types:`` field
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4293	refers to a tuple; the first operand is the return type, while the rest are the
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4294	types of the formal arguments in order. If the first operand is ``null``, that
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4295	represents a function with no return value (such as ``void foo() {}`` in C++).
				4296
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4297	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4298
    !0 = !DIBasicType(name: "int", size: 32, align: 32, encoding: DW_ATE_signed)
    !1 = !DIBasicType(name: "char", size: 8, align: 8, encoding: DW_ATE_signed_char)
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4301	!2 = !DISubroutineType(types: !{null, !0, !1}) ; void (int, char)
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4302
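For contrast, a sketch of two more signatures built the same way: one with a
non-``null`` return type and one with no formal arguments. The node numbers
are arbitrary.

.. code-block:: text

    !0 = !DIBasicType(name: "int", size: 32, align: 32, encoding: DW_ATE_signed)
    !1 = !DIBasicType(name: "char", size: 8, align: 8, encoding: DW_ATE_signed_char)
    !2 = !DISubroutineType(types: !{!0, !1}) ; int (char)
    !3 = !DISubroutineType(types: !{null})   ; void ()
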
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4303	.. _DIDerivedType:
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4304
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4305	DIDerivedType
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4306	"""""""""""""
				4307
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4308	``DIDerivedType`` nodes represent types derived from other types, such as
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4309	qualified types.
				4310
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4311	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4312
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4313	!0 = !DIBasicType(name: "unsigned char", size: 8, align: 8,
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4314	encoding: DW_ATE_unsigned_char)
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4315	!1 = !DIDerivedType(tag: DW_TAG_pointer_type, baseType: !0, size: 32,
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4316	align: 32)
				4317
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4318	The following ``tag:`` values are valid:
				4319
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4320	.. code-block:: text
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4321
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4322	DW_TAG_member = 13
				4323	DW_TAG_pointer_type = 15
				4324	DW_TAG_reference_type = 16
				4325	DW_TAG_typedef = 22
Duncan P. N. Exon Smith	a3f3de1	2016-04-16 22:46:47 +0000	[diff] [blame]	4326	DW_TAG_inheritance = 28
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4327	DW_TAG_ptr_to_member_type = 31
				4328	DW_TAG_const_type = 38
Duncan P. N. Exon Smith	a3f3de1	2016-04-16 22:46:47 +0000	[diff] [blame]	4329	DW_TAG_friend = 42
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4330	DW_TAG_volatile_type = 53
				4331	DW_TAG_restrict_type = 55
Victor Leschuk	e1156c2	2016-10-31 19:09:38 +0000	[diff] [blame]	4332	DW_TAG_atomic_type = 71
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4333
Duncan P. N. Exon Smith	a59d3e5	2016-04-23 21:08:00 +0000	[diff] [blame]	4334	.. _DIDerivedTypeMember:
				4335
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4336	``DW_TAG_member`` is used to define a member of a :ref:`composite type
Duncan P. N. Exon Smith	90990cd	2016-04-17 00:45:00 +0000	[diff] [blame]	4337	<DICompositeType>`. The type of the member is the ``baseType:``. The
``offset:`` is the member's bit offset. If the composite type has an ODR
``identifier:`` and does not set ``flags: DIFlagFwdDecl``, then the member is
uniqued based only on its ``name:`` and ``scope:``.
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4341
Duncan P. N. Exon Smith	a3f3de1	2016-04-16 22:46:47 +0000	[diff] [blame]	4342	``DW_TAG_inheritance`` and ``DW_TAG_friend`` are used in the ``elements:``
				4343	field of :ref:`composite types <DICompositeType>` to describe parents and
				4344	friends.
				4345
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4346	``DW_TAG_typedef`` is used to provide a name for the ``baseType:``.
				4347
				4348	``DW_TAG_pointer_type``, ``DW_TAG_reference_type``, ``DW_TAG_const_type``,
Victor Leschuk	e1156c2	2016-10-31 19:09:38 +0000	[diff] [blame]	4349	``DW_TAG_volatile_type``, ``DW_TAG_restrict_type`` and ``DW_TAG_atomic_type``
				4350	are used to qualify the ``baseType:``.
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4351
				4352	Note that the ``void *`` type is expressed as a type derived from NULL.
				4353
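The tags above can be chained, as in the following sketch. The typedef name
``byte`` and the 64-bit pointer size are assumptions for this example. The
last node shows ``void *`` expressed as a pointer whose ``baseType:`` is
``null``.

.. code-block:: text

    !0 = !DIBasicType(name: "unsigned char", size: 8,
                      encoding: DW_ATE_unsigned_char)
    !1 = !DIDerivedType(tag: DW_TAG_typedef, name: "byte", baseType: !0)
    !2 = !DIDerivedType(tag: DW_TAG_const_type, baseType: !1)         ; const byte
    !3 = !DIDerivedType(tag: DW_TAG_pointer_type, baseType: !2,
                        size: 64)                                     ; const byte *
    !4 = !DIDerivedType(tag: DW_TAG_pointer_type, baseType: null,
                        size: 64)                                     ; void *
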
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4354	.. _DICompositeType:
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4355
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4356	DICompositeType
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4357	"""""""""""""""
				4358
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4359	``DICompositeType`` nodes represent types composed of other types, like
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4360	structures and unions. ``elements:`` points to a tuple of the composed types.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4361
				4362	If the source language supports ODR, the ``identifier:`` field gives the unique
Duncan P. N. Exon Smith	a59d3e5	2016-04-23 21:08:00 +0000	[diff] [blame]	4363	identifier used for type merging between modules. When specified,
				4364	:ref:`subprogram declarations <DISubprogramDeclaration>` and :ref:`member
				4365	derived types <DIDerivedTypeMember>` that reference the ODR-type in their
				4366	``scope:`` change uniquing rules.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4367
Duncan P. N. Exon Smith	5ab2be0	2016-04-17 03:58:21 +0000	[diff] [blame]	4368	For a given ``identifier:``, there should only be a single composite type that
				4369	does not have ``flags: DIFlagFwdDecl`` set. LLVM tools that link modules
				4370	together will unique such definitions at parse time via the ``identifier:``
				4371	field, even if the nodes are ``distinct``.
				4372
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4373	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4374
    !0 = !DIEnumerator(name: "SixKind", value: 6)
				4376	!1 = !DIEnumerator(name: "SevenKind", value: 7)
				4377	!2 = !DIEnumerator(name: "NegEightKind", value: -8)
				4378	!3 = !DICompositeType(tag: DW_TAG_enumeration_type, name: "Enum", file: !12,
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4379	line: 2, size: 32, align: 32, identifier: "_M4Enum",
				4380	elements: !{!0, !1, !2})
				4381
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4382	The following ``tag:`` values are valid:
				4383
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4384	.. code-block:: text
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4385
				4386	DW_TAG_array_type = 1
				4387	DW_TAG_class_type = 2
				4388	DW_TAG_enumeration_type = 4
				4389	DW_TAG_structure_type = 19
				4390	DW_TAG_union_type = 23
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4391
				4392	For ``DW_TAG_array_type``, the ``elements:`` should be :ref:`subrange
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4393	descriptors <DISubrange>`, each representing the range of subscripts at that
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4394	level of indexing. The ``DIFlagVector`` flag to ``flags:`` indicates that an
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4395	array type is a native packed vector.
				4396
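A sketch of a two-dimensional array, assuming a 32-bit ``int`` element type.
There is one subrange per level of indexing, outermost first, and the total
size is 3 * 4 * 32 = 384 bits. The node numbers are arbitrary.

.. code-block:: text

    !0 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
    !1 = !DISubrange(count: 3)
    !2 = !DISubrange(count: 4)
    !3 = !DICompositeType(tag: DW_TAG_array_type, baseType: !0, size: 384,
                          elements: !{!1, !2}) ; int[3][4]
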
				4397	For ``DW_TAG_enumeration_type``, the ``elements:`` should be :ref:`enumerator
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4398	descriptors <DIEnumerator>`, each representing the definition of an enumeration
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4399	value for the set. All enumeration type descriptors are collected in the
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4400	``enums:`` field of the :ref:`compile unit <DICompileUnit>`.
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4401
				4402	For ``DW_TAG_structure_type``, ``DW_TAG_class_type``, and
				4403	``DW_TAG_union_type``, the ``elements:`` should be :ref:`derived types
Duncan P. N. Exon Smith	a3f3de1	2016-04-16 22:46:47 +0000	[diff] [blame]	4404	<DIDerivedType>` with ``tag: DW_TAG_member``, ``tag: DW_TAG_inheritance``, or
				4405	``tag: DW_TAG_friend``; or :ref:`subprograms <DISubprogram>` with
				4406	``isDefinition: false``.
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4407
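A sketch of a structure with two members, assuming a 32-bit ``int``. The
struct name ``Pair`` and the member names are hypothetical. Each member is a
``DW_TAG_member`` derived type whose ``scope:`` refers back to the structure
and whose ``offset:`` is given in bits.

.. code-block:: text

    !0 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
    !1 = !DICompositeType(tag: DW_TAG_structure_type, name: "Pair", size: 64,
                          elements: !{!2, !3})
    !2 = !DIDerivedType(tag: DW_TAG_member, name: "first", scope: !1,
                        baseType: !0, size: 32, offset: 0)
    !3 = !DIDerivedType(tag: DW_TAG_member, name: "second", scope: !1,
                        baseType: !0, size: 32, offset: 32)
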
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4408	.. _DISubrange:
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4409
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4410	DISubrange
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4411	""""""""""
				4412
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4413	``DISubrange`` nodes are the elements for ``DW_TAG_array_type`` variants of
Sander de Smalen	1cb9431	2018-01-24 10:30:23 +0000	[diff] [blame]	4414	:ref:`DICompositeType`.
				4415
- ``count: -1`` indicates an empty array.
- ``count: !10`` describes the count with a :ref:`DILocalVariable`.
- ``count: !12`` describes the count with a :ref:`DIGlobalVariable`.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4419
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4420	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4421
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4422	!0 = !DISubrange(count: 5, lowerBound: 0) ; array counting from 0
				4423	!1 = !DISubrange(count: 5, lowerBound: 1) ; array counting from 1
				4424	!2 = !DISubrange(count: -1) ; empty array.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4425
Sander de Smalen	fdf4091	2018-01-24 09:56:07 +0000	[diff] [blame]	4426	; Scopes used in rest of example
				4427	!6 = !DIFile(filename: "vla.c", directory: "/path/to/file")
Chandler Carruth	24dd211	2018-08-06 02:30:01 +0000	[diff] [blame]	4428	!7 = distinct !DICompileUnit(language: DW_LANG_C99, file: !6)
				4429	!8 = distinct !DISubprogram(name: "foo", scope: !7, file: !6, line: 5)
Sander de Smalen	fdf4091	2018-01-24 09:56:07 +0000	[diff] [blame]	4430
				4431	; Use of local variable as count value
				4432	!9 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
				4433	!10 = !DILocalVariable(name: "count", scope: !8, file: !6, line: 42, type: !9)
Chandler Carruth	24dd211	2018-08-06 02:30:01 +0000	[diff] [blame]	4434	!11 = !DISubrange(count: !10, lowerBound: 0)
Sander de Smalen	fdf4091	2018-01-24 09:56:07 +0000	[diff] [blame]	4435
				4436	; Use of global variable as count value
				4437	!12 = !DIGlobalVariable(name: "count", scope: !8, file: !6, line: 22, type: !9)
Chandler Carruth	24dd211	2018-08-06 02:30:01 +0000	[diff] [blame]	4438	!13 = !DISubrange(count: !12, lowerBound: 0)
Sander de Smalen	fdf4091	2018-01-24 09:56:07 +0000	[diff] [blame]	4439
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4440	.. _DIEnumerator:
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4441
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4442	DIEnumerator
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4443	""""""""""""
				4444
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4445	``DIEnumerator`` nodes are the elements for ``DW_TAG_enumeration_type``
				4446	variants of :ref:`DICompositeType`.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4447
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4448	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4449
    !0 = !DIEnumerator(name: "SixKind", value: 6)
				4451	!1 = !DIEnumerator(name: "SevenKind", value: 7)
				4452	!2 = !DIEnumerator(name: "NegEightKind", value: -8)
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4453
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4454	DITemplateTypeParameter
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4455	"""""""""""""""""""""""
				4456
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4457	``DITemplateTypeParameter`` nodes represent type parameters to generic source
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4458	language constructs. They are used (optionally) in :ref:`DICompositeType` and
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4459	:ref:`DISubprogram` ``templateParams:`` fields.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4460
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4461	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4462
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4463	!0 = !DITemplateTypeParameter(name: "Ty", type: !1)
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4464
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4465	DITemplateValueParameter
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4466	""""""""""""""""""""""""
				4467
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4468	``DITemplateValueParameter`` nodes represent value parameters to generic source
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4469	language constructs. ``tag:`` defaults to ``DW_TAG_template_value_parameter``,
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4470	but if specified can also be set to ``DW_TAG_GNU_template_template_param`` or
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4471	``DW_TAG_GNU_template_param_pack``. They are used (optionally) in
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4472	:ref:`DICompositeType` and :ref:`DISubprogram` ``templateParams:`` fields.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4473
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4474	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4475
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4476	!0 = !DITemplateValueParameter(name: "Ty", type: !1, value: i32 7)
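
As a rough sketch of how the two template parameter kinds might appear together
in a ``templateParams:`` list, consider a hypothetical C++ type
``Array<int, 7>``. The names, sizes, and node numbers below are illustrative
assumptions, and ``!1`` (the file) and ``!2`` (the element list) are elided.

.. code-block:: text

    !0 = !DICompositeType(tag: DW_TAG_class_type, name: "Array<int, 7>",
                          file: !1, line: 3, size: 224, elements: !2,
                          templateParams: !3)
    !3 = !{!4, !5}
    !4 = !DITemplateTypeParameter(name: "T", type: !6)
    !5 = !DITemplateValueParameter(name: "N", type: !6, value: i32 7)
    !6 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)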
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4477
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4478	DINamespace
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4479	"""""""""""
				4480
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4481	``DINamespace`` nodes represent namespaces in the source language.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4482
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4483	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4484
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4485	!0 = !DINamespace(name: "myawesomeproject", scope: !1, file: !2, line: 7)
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4486
Sander de Smalen	1cb9431	2018-01-24 10:30:23 +0000	[diff] [blame]	4487	.. _DIGlobalVariable:
				4488
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4489	DIGlobalVariable
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4490	""""""""""""""""
				4491
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4492	``DIGlobalVariable`` nodes represent global variables in the source language.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4493
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4494	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4495
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4496	!0 = !DIGlobalVariable(name: "foo", linkageName: "foo", scope: !1,
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4497	file: !2, line: 7, type: !3, isLocal: true,
				4498	isDefinition: false, variable: i32* @foo,
				4499	declaration: !4)
				4500
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4501	All global variables should be referenced by the ``globals:`` field of a
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4502	:ref:`compile unit <DICompileUnit>`.
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4503
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4504	.. _DISubprogram:
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4505
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4506	DISubprogram
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4507	""""""""""""
				4508
Peter Collingbourne	5010868	2015-11-06 02:41:02 +0000	[diff] [blame]	4509	``DISubprogram`` nodes represent functions from the source language. A
				4510	``DISubprogram`` may be attached to a function definition using ``!dbg``
				4511	metadata. The ``variables:`` field points at :ref:`variables <DILocalVariable>`
				4512	that must be retained, even if their IR counterparts are optimized out.
				4513	The ``type:`` field must point at a :ref:`DISubroutineType`.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4514
Duncan P. N. Exon Smith	a59d3e5	2016-04-23 21:08:00 +0000	[diff] [blame]	4515	.. _DISubprogramDeclaration:
				4516
Duncan P. N. Exon Smith	05ebfd0	2016-04-17 02:30:20 +0000	[diff] [blame]	4517	When ``isDefinition: false``, subprograms describe a declaration in the type
Duncan P. N. Exon Smith	a59d3e5	2016-04-23 21:08:00 +0000	[diff] [blame]	4518	tree as opposed to a definition of a function. If the scope is a composite
				4519	type with an ODR ``identifier:`` and that does not set ``flags: DIFwdDecl``,
				4520	then the subprogram declaration is uniqued based only on its ``linkageName:``
				4521	and ``scope:``.
Duncan P. N. Exon Smith	05ebfd0	2016-04-17 02:30:20 +0000	[diff] [blame]	4522
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4523	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4524
Peter Collingbourne	5010868	2015-11-06 02:41:02 +0000	[diff] [blame]	4525	define void @_Z3foov() !dbg !0 {
				4526	...
				4527	}
				4528
				4529	!0 = distinct !DISubprogram(name: "foo", linkageName: "_Z3foov", scope: !1,
				4530	file: !2, line: 7, type: !3, isLocal: true,
Duncan P. N. Exon Smith	05ebfd0	2016-04-17 02:30:20 +0000	[diff] [blame]	4531	isDefinition: true, scopeLine: 8,
Peter Collingbourne	5010868	2015-11-06 02:41:02 +0000	[diff] [blame]	4532	containingType: !4,
				4533	virtuality: DW_VIRTUALITY_pure_virtual,
				4534	virtualIndex: 10, flags: DIFlagPrototyped,
Adrian Prantl	6c2497f	2017-06-12 23:59:43 +0000	[diff] [blame]	4535	isOptimized: true, unit: !5, templateParams: !6,
				4536	declaration: !7, variables: !8, thrownTypes: !9)
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4537
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4538	.. _DILexicalBlock:
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4539
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4540	DILexicalBlock
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4541	""""""""""""""
				4542
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4543	``DILexicalBlock`` nodes describe nested blocks within a :ref:`subprogram
Bruce Mitchener	e9ffb45	2015-09-12 01:17:08 +0000	[diff] [blame]	4544	<DISubprogram>`. The line number and column numbers are used to distinguish
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4545	two lexical blocks at the same depth. They are valid targets for ``scope:``
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4546	fields.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4547
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4548	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4549
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4550	!0 = distinct !DILexicalBlock(scope: !1, file: !2, line: 7, column: 35)
Duncan P. N. Exon Smith	d937cd9	2015-03-17 23:41:05 +0000	[diff] [blame]	4551
				4552	Usually lexical blocks are ``distinct`` to prevent node merging based on
				4553	operands.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4554
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4555	.. _DILexicalBlockFile:
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4556
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4557	DILexicalBlockFile
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4558	""""""""""""""""""
				4559
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4560	``DILexicalBlockFile`` nodes are used to discriminate between sections of a
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4561	:ref:`lexical block <DILexicalBlock>`. The ``file:`` field can be changed to
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4562	indicate textual inclusion, or the ``discriminator:`` field can be used to
				4563	discriminate between control flow within a single block in the source language.
				4564
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4565	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4566
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4567	!0 = !DILexicalBlock(scope: !3, file: !4, line: 7, column: 35)
				4568	!1 = !DILexicalBlockFile(scope: !0, file: !4, discriminator: 0)
				4569	!2 = !DILexicalBlockFile(scope: !0, file: !4, discriminator: 1)
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4570
Michael Kuperstein	605308a	2015-05-14 10:58:59 +0000	[diff] [blame]	4571	.. _DILocation:
				4572
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4573	DILocation
Duncan P. N. Exon Smith	6a48483	2015-01-13 21:10:44 +0000	[diff] [blame]	4574	""""""""""
				4575
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4576	``DILocation`` nodes represent source debug locations. The ``scope:`` field is
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4577	mandatory, and points at a :ref:`DILexicalBlockFile`, a
				4578	:ref:`DILexicalBlock`, or a :ref:`DISubprogram`.
Duncan P. N. Exon Smith	6a48483	2015-01-13 21:10:44 +0000	[diff] [blame]	4579
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4580	.. code-block:: text
Duncan P. N. Exon Smith	6a48483	2015-01-13 21:10:44 +0000	[diff] [blame]	4581
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4582	!0 = !DILocation(line: 2900, column: 42, scope: !1, inlinedAt: !2)
Duncan P. N. Exon Smith	6a48483	2015-01-13 21:10:44 +0000	[diff] [blame]	4583
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4584	.. _DILocalVariable:
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4585
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4586	DILocalVariable
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4587	"""""""""""""""
				4588
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	4589	``DILocalVariable`` nodes represent local variables in the source language. If
Duncan P. N. Exon Smith	ed013cd	2015-07-31 18:58:39 +0000	[diff] [blame]	4590	the ``arg:`` field is set to non-zero, then this variable is a subprogram
				4591	parameter, and it will be included in the ``variables:`` field of its
				4592	:ref:`DISubprogram`.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4593
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4594	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4595
Duncan P. N. Exon Smith	ed013cd	2015-07-31 18:58:39 +0000	[diff] [blame]	4596	!0 = !DILocalVariable(name: "this", arg: 1, scope: !3, file: !2, line: 7,
				4597	type: !3, flags: DIFlagArtificial)
				4598	!1 = !DILocalVariable(name: "x", arg: 2, scope: !4, file: !2, line: 7,
				4599	type: !3)
				4600	!2 = !DILocalVariable(name: "y", scope: !5, file: !2, line: 7, type: !3)
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4601
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4602	DIExpression
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4603	""""""""""""
				4604
Adrian Prantl	b44c776	2017-03-22 18:01:01 +0000	[diff] [blame]	4605	``DIExpression`` nodes represent expressions that are inspired by the DWARF
				4606	expression language. They are used in :ref:`debug intrinsics<dbg_intrinsics>`
				4607	(such as ``llvm.dbg.declare`` and ``llvm.dbg.value``) to describe how the
Vedant Kumar	8a05b01	2018-07-28 00:33:47 +0000	[diff] [blame]	4608	referenced LLVM variable relates to the source language variable. Debug
				4609	intrinsics are interpreted left-to-right: start by pushing the value/address
				4610	operand of the intrinsic onto a stack, then repeatedly push and evaluate
				4611	opcodes from the DIExpression until the final variable description is produced.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4612
Vedant Kumar	8a05b01	2018-07-28 00:33:47 +0000	[diff] [blame]	4613	The currently supported opcode vocabulary is limited:
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4614
Adrian Prantl	6825fb6	2017-04-18 01:21:53 +0000	[diff] [blame]	4615	- ``DW_OP_deref`` dereferences the top of the expression stack.
Florian Hahn	ffc498d	2017-06-14 13:14:38 +0000	[diff] [blame]	4616	- ``DW_OP_plus`` pops the last two entries from the expression stack, adds
				4617	them together and appends the result to the expression stack.
				4618	- ``DW_OP_minus`` pops the last two entries from the expression stack, subtracts
				4619	the last entry from the second last entry and appends the result to the
				4620	expression stack.
Florian Hahn	c9c403c	2017-06-13 16:54:44 +0000	[diff] [blame]	4621	- ``DW_OP_plus_uconst, 93`` adds ``93`` to the working expression.
Adrian Prantl	b44c776	2017-03-22 18:01:01 +0000	[diff] [blame]	4622	- ``DW_OP_LLVM_fragment, 16, 8`` specifies the offset and size (``16`` and ``8``
				4623	here, respectively) of the variable fragment from the working expression. Note
Hiroshi Inoue	760c0c9	2018-01-16 13:19:48 +0000	[diff] [blame]	4624	that, contrary to ``DW_OP_bit_piece``, the offset describes the location
Adrian Prantl	b44c776	2017-03-22 18:01:01 +0000	[diff] [blame]	4625	within the described source variable.
Konstantin Zhuravlyov	f9b41cd	2017-03-08 00:28:57 +0000	[diff] [blame]	4626	- ``DW_OP_swap`` swaps the top two stack entries.
				4627	- ``DW_OP_xderef`` provides an extended dereference mechanism. The entry at the top
				4628	of the stack is treated as an address. The second stack entry is treated as an
				4629	address space identifier.
Adrian Prantl	b44c776	2017-03-22 18:01:01 +0000	[diff] [blame]	4630	- ``DW_OP_stack_value`` marks a constant value.
				4631
Adrian Prantl	6825fb6	2017-04-18 01:21:53 +0000	[diff] [blame]	4632	DWARF specifies three kinds of simple location descriptions: Register, memory,
Vedant Kumar	8a05b01	2018-07-28 00:33:47 +0000	[diff] [blame]	4633	and implicit location descriptions. Note that a location description is
				4634	defined over certain ranges of a program, i.e., the location of a variable may
				4635	change over the course of the program. Register and memory location
				4636	descriptions describe the concrete location of a source variable (in the
				4637	sense that a debugger might modify its value), whereas implicit locations
				4638	describe merely the actual value of a source variable which might not exist
				4639	in registers or in memory (see ``DW_OP_stack_value``).
				4640
				4641	A ``llvm.dbg.addr`` or ``llvm.dbg.declare`` intrinsic describes an indirect
				4642	value (the address) of a source variable. The first operand of the intrinsic
				4643	must be an address of some kind. A DIExpression attached to the intrinsic
				4644	refines this address to produce a concrete location for the source variable.
				4645
				4646	A ``llvm.dbg.value`` intrinsic describes the direct value of a source variable.
				4647	The first operand of the intrinsic may be a direct or indirect value. A
				4648	DIExpression attached to the intrinsic refines the first operand to produce a
				4649	direct value. For example, if the first operand is an indirect value, it may be
				4650	necessary to insert ``DW_OP_deref`` into the DIExpression in order to produce a
				4651	valid debug intrinsic.
				4652
				4653	.. note::
				4654
				4655	A DIExpression is interpreted in the same way regardless of which kind of
				4656	debug intrinsic it's attached to.
Adrian Prantl	6825fb6	2017-04-18 01:21:53 +0000	[diff] [blame]	4657
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	4658	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4659
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4660	!0 = !DIExpression(DW_OP_deref)
Florian Hahn	c9c403c	2017-06-13 16:54:44 +0000	[diff] [blame]	4661	!1 = !DIExpression(DW_OP_plus_uconst, 3)
Florian Hahn	ffc498d	2017-06-14 13:14:38 +0000	[diff] [blame]	4662	!2 = !DIExpression(DW_OP_constu, 3, DW_OP_plus)
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4663	!3 = !DIExpression(DW_OP_bit_piece, 3, 7)
Florian Hahn	ffc498d	2017-06-14 13:14:38 +0000	[diff] [blame]	4664	!4 = !DIExpression(DW_OP_deref, DW_OP_constu, 3, DW_OP_plus, DW_OP_LLVM_fragment, 3, 7)
Konstantin Zhuravlyov	f9b41cd	2017-03-08 00:28:57 +0000	[diff] [blame]	4665	!5 = !DIExpression(DW_OP_constu, 2, DW_OP_swap, DW_OP_xderef)
Adrian Prantl	b44c776	2017-03-22 18:01:01 +0000	[diff] [blame]	4666	!6 = !DIExpression(DW_OP_constu, 42, DW_OP_stack_value)
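
To illustrate how the first operand and the attached expression combine, the
following sketch shows three debug intrinsic calls. The value names
(``%x.addr``, ``%p``, ``%a``) and node numbers are assumptions made for this
example: ``!10`` to ``!12`` stand for :ref:`DILocalVariable` nodes and ``!20``
for a :ref:`DILocation`.

.. code-block:: text

    ; x lives in memory at %x.addr; the empty expression leaves the
    ; address unchanged.
    call void @llvm.dbg.declare(metadata i32* %x.addr, metadata !10,
                                metadata !DIExpression()), !dbg !20

    ; The first operand is an address, so DW_OP_deref is needed to
    ; obtain the direct value of y.
    call void @llvm.dbg.value(metadata i32* %p, metadata !11,
                              metadata !DIExpression(DW_OP_deref)), !dbg !20

    ; z has no storage of its own; its value is %a + 4, described as an
    ; implicit location via DW_OP_stack_value.
    call void @llvm.dbg.value(metadata i32 %a, metadata !12,
                              metadata !DIExpression(DW_OP_plus_uconst, 4,
                                                     DW_OP_stack_value)), !dbg !20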
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4667
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4668	DIObjCProperty
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4669	""""""""""""""
				4670
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4671	``DIObjCProperty`` nodes represent Objective-C property nodes.
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4672
Chandler Carruth	4a73aa1	2018-08-06 03:35:36 +0000	[diff] [blame]	4673	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4674
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4675	!3 = !DIObjCProperty(name: "foo", file: !1, line: 7, setter: "setFoo",
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4676	getter: "getFoo", attributes: 7, type: !2)
				4677
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4678	DIImportedEntity
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4679	""""""""""""""""
				4680
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4681	``DIImportedEntity`` nodes represent entities (such as modules) imported into a
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4682	compile unit.
				4683
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4684	.. code-block:: text
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4685
Duncan P. N. Exon Smith	a9308c4	2015-04-29 16:38:44 +0000	[diff] [blame]	4686	!2 = !DIImportedEntity(tag: DW_TAG_imported_module, name: "foo", scope: !0,
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	4687	entity: !1, line: 7)
				4688
Amjad Aboud	a9bcf16	2015-12-10 12:56:35 +0000	[diff] [blame]	4689	DIMacro
				4690	"""""""
				4691
				4692	``DIMacro`` nodes represent the definition or undefinition of a macro identifier.
				4693	The ``name:`` field is the macro identifier, followed by macro parameters when
Sylvestre Ledru	7d54050	2016-07-02 19:28:40 +0000	[diff] [blame]	4694	defining a function-like macro, and the ``value:`` field is the token-string
Amjad Aboud	a9bcf16	2015-12-10 12:56:35 +0000	[diff] [blame]	4695	used to expand the macro identifier.
				4696
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4697	.. code-block:: text
Amjad Aboud	a9bcf16	2015-12-10 12:56:35 +0000	[diff] [blame]	4698
				4699	!2 = !DIMacro(macinfo: DW_MACINFO_define, line: 7, name: "foo(x)",
				4700	value: "((x) + 1)")
				4701	!3 = !DIMacro(macinfo: DW_MACINFO_undef, line: 30, name: "foo")
				4702
				4703	DIMacroFile
				4704	"""""""""""
				4705
				4706	``DIMacroFile`` nodes represent inclusion of source files.
				4707	The ``nodes:`` field is a list of ``DIMacro`` and ``DIMacroFile`` nodes that
				4708	appear in the included source file.
				4709
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	4710	.. code-block:: text
Amjad Aboud	a9bcf16	2015-12-10 12:56:35 +0000	[diff] [blame]	4711
				4712	!2 = !DIMacroFile(macinfo: DW_MACINFO_start_file, line: 7, file: !1,
				4713	nodes: !3)
				4714
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4715	'``tbaa``' Metadata
				4716	^^^^^^^^^^^^^^^^^^^
				4717
				4718	In LLVM IR, memory does not have types, so LLVM's own type system is not
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4719	suitable for doing type based alias analysis (TBAA). Instead, metadata is
				4720	added to the IR to describe a type system of a higher level language. This
				4721	can be used to implement C/C++ strict type aliasing rules, but it can also
				4722	be used to implement custom alias analysis behavior for other languages.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4723
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4724	This description of LLVM's TBAA system is broken into two parts:
				4725	:ref:`Semantics<tbaa_node_semantics>` talks about high level issues, and
				4726	:ref:`Representation<tbaa_node_representation>` talks about the metadata
				4727	encoding of various entities.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4728
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4729	It is always possible to trace any TBAA node to a "root" TBAA node (details
				4730	in the :ref:`Representation<tbaa_node_representation>` section). TBAA
				4731	nodes with different roots have an unknown aliasing relationship, and LLVM
				4732	conservatively infers ``MayAlias`` between them. The rules mentioned in
				4733	this section only pertain to TBAA nodes living under the same root.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4734
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4735	.. _tbaa_node_semantics:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4736
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4737	Semantics
				4738	"""""""""
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4739
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4740	The TBAA metadata system, referred to as "struct path TBAA" (not to be
				4741	confused with ``tbaa.struct``), consists of the following high level
				4742	concepts: Type Descriptors, further subdivided into scalar type
				4743	descriptors and struct type descriptors; and Access Tags.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4744
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4745	Type descriptors describe the type system of the higher level language
				4746	being compiled. Scalar type descriptors describe types that do not
				4747	contain other types. Each scalar type has a parent type, which must also
				4748	be a scalar type or the TBAA root. Via this parent relation, scalar types
				4749	within a TBAA root form a tree. Struct type descriptors denote types
				4750	that contain a sequence of other type descriptors, at known offsets. These
				4751	contained type descriptors can either be struct type descriptors themselves
				4752	or scalar type descriptors.
				4753
				4754	Access tags are metadata nodes attached to load and store instructions.
				4755	Access tags use type descriptors to describe the location being accessed
				4756	in terms of the type system of the higher level language. Access tags are
				4757	tuples consisting of a base type, an access type and an offset. The base
				4758	type is a scalar type descriptor or a struct type descriptor, the access
				4759	type is a scalar type descriptor, and the offset is a constant integer.
				4760
				4761	The access tag ``(BaseTy, AccessTy, Offset)`` can describe one of two
				4762	things:
				4763
				4764	* If ``BaseTy`` is a struct type, the tag describes a memory access (load
				4765	or store) of a value of type ``AccessTy`` contained in the struct type
				4766	``BaseTy`` at offset ``Offset``.
				4767
				4768	* If ``BaseTy`` is a scalar type, ``Offset`` must be 0 and ``BaseTy`` and
				4769	``AccessTy`` must be the same; and the access tag describes a scalar
				4770	access with scalar type ``AccessTy``.
				4771
				4772	We first define an ``ImmediateParent`` relation on ``(BaseTy, Offset)``
				4773	tuples this way:
				4774
				4775	* If ``BaseTy`` is a scalar type then ``ImmediateParent(BaseTy, 0)`` is
				4776	``(ParentTy, 0)`` where ``ParentTy`` is the parent of the scalar type as
				4777	described in the TBAA metadata. ``ImmediateParent(BaseTy, Offset)`` is
				4778	undefined if ``Offset`` is non-zero.
				4779
				4780	* If ``BaseTy`` is a struct type then ``ImmediateParent(BaseTy, Offset)``
				4781	is ``(NewTy, NewOffset)`` where ``NewTy`` is the type contained in
				4782	``BaseTy`` at offset ``Offset`` and ``NewOffset`` is ``Offset`` adjusted
				4783	to be relative within that inner type.
				4784
				4785	A memory access with an access tag ``(BaseTy1, AccessTy1, Offset1)``
				4786	aliases a memory access with an access tag ``(BaseTy2, AccessTy2,
				4787	Offset2)`` if either ``(BaseTy1, Offset1)`` is reachable from ``(BaseTy2,
				4788	Offset2)`` via the ``ImmediateParent`` relation or vice versa.
				4789
				4790	As a concrete example, the type descriptor graph for the following program
				4791
				4792	.. code-block:: c
				4793
				4794	struct Inner {
				4795	int i; // offset 0
				4796	float f; // offset 4
				4797	};
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	4798
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4799	struct Outer {
				4800	float f; // offset 0
				4801	double d; // offset 4
				4802	struct Inner inner_a; // offset 12
				4803	};
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	4804
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4805	void f(struct Outer* outer, struct Inner* inner, float* f, int* i, char* c) {
				4806	outer->f = 0; // tag0: (OuterStructTy, FloatScalarTy, 0)
				4807	outer->inner_a.i = 0; // tag1: (OuterStructTy, IntScalarTy, 12)
Fangrui Song	74d6a74	2018-05-29 05:38:05 +0000	[diff] [blame]	4808	outer->inner_a.f = 0.0; // tag2: (OuterStructTy, FloatScalarTy, 16)
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4809	*f = 0.0; // tag3: (FloatScalarTy, FloatScalarTy, 0)
				4810	}
				4811
				4812	is (note that in C and C++, ``char`` can be used to access any arbitrary
				4813	type):
				4814
				4815	.. code-block:: text
				4816
				4817	Root = "TBAA Root"
				4818	CharScalarTy = ("char", Root, 0)
				4819	FloatScalarTy = ("float", CharScalarTy, 0)
				4820	DoubleScalarTy = ("double", CharScalarTy, 0)
				4821	IntScalarTy = ("int", CharScalarTy, 0)
				4822	InnerStructTy = {"Inner", (IntScalarTy, 0), (FloatScalarTy, 4)}
				4823	OuterStructTy = {"Outer", (FloatScalarTy, 0), (DoubleScalarTy, 4),
				4824	(InnerStructTy, 12)}
				4825
				4826
				4827	with (e.g.) ``ImmediateParent(OuterStructTy, 12)`` = ``(InnerStructTy,
				4828	0)``, ``ImmediateParent(InnerStructTy, 0)`` = ``(IntScalarTy, 0)``, and
				4829	``ImmediateParent(IntScalarTy, 0)`` = ``(CharScalarTy, 0)``.
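
As a worked illustration, the aliasing rule can be applied to the four access
tags in the example by following ``ImmediateParent`` from each tag's
``(BaseTy, Offset)`` pair. This is a sketch that applies the rule exactly as
stated in this section; it is not a description of how the analysis is
implemented.

.. code-block:: text

    tag0: (OuterStructTy, 0)  -> (FloatScalarTy, 0) -> (CharScalarTy, 0)
    tag1: (OuterStructTy, 12) -> (InnerStructTy, 0) -> (IntScalarTy, 0)
                              -> (CharScalarTy, 0)
    tag2: (OuterStructTy, 16) -> (InnerStructTy, 4) -> (FloatScalarTy, 0)
                              -> (CharScalarTy, 0)
    tag3: (FloatScalarTy, 0)  -> (CharScalarTy, 0)

    tag0 vs tag3: (FloatScalarTy, 0) is reachable from (OuterStructTy, 0)
                  => may alias
    tag2 vs tag3: (FloatScalarTy, 0) is reachable from (OuterStructTy, 16)
                  => may alias
    tag1 vs tag3: neither pair is reachable from the other
                  => no alias
    tag0 vs tag1: neither pair is reachable from the other
                  => no alias

A ``char`` access, tagged ``(CharScalarTy, CharScalarTy, 0)``, is reachable
from every chain above, which matches the note that ``char`` can be used to
access any type.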
				4830
				4831	.. _tbaa_node_representation:
				4832
				4833	Representation
				4834	""""""""""""""
				4835
				4836	The root node of a TBAA type hierarchy is an ``MDNode`` with 0 operands or
				4837	with exactly one ``MDString`` operand.
				4838
				4839	Scalar type descriptors are represented as ``MDNode`` s with two
				4840	operands. The first operand is an ``MDString`` denoting the name of the
				4841	scalar type. LLVM does not assign meaning to the value of this operand; it
				4842	only cares about it being an ``MDString``. The second operand is an
				4843	``MDNode`` which points to the parent for said scalar type descriptor,
				4844	which is either another scalar type descriptor or the TBAA root. Scalar
				4845	type descriptors can have an optional third argument, but that must be the
				4846	constant integer zero.
				4847
				4848	Struct type descriptors are represented as ``MDNode`` s with an odd number
				4849	of operands greater than 1. The first operand is an ``MDString`` denoting
				4850	the name of the struct type. Like in scalar type descriptors the actual
				4851	value of this name operand is irrelevant to LLVM. After the name operand,
				4852	the struct type descriptors have a sequence of alternating ``MDNode`` and
				4853	``ConstantInt`` operands. With N starting from 1, the 2N - 1 th operand,
				4854	an ``MDNode``, denotes a contained field, and the 2N th operand, a
				4855	``ConstantInt``, is the offset of the said contained field. The offsets
				4856	must be in non-decreasing order.
				4857
				4858	Access tags are represented as ``MDNode`` s with either 3 or 4 operands.
				4859	The first operand is an ``MDNode`` pointing to the node representing the
				4860	base type. The second operand is an ``MDNode`` pointing to the node
				4861	representing the access type. The third operand is a ``ConstantInt`` that
				4862	states the offset of the access. If a fourth field is present, it must be
				4863	a ``ConstantInt`` valued at 0 or 1. If it is 1 then the access tag states
				4864	that the location being accessed is "constant" (meaning
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4865	``pointsToConstantMemory`` should return true; see `other useful
Sanjoy Das	a3ff994	2017-02-13 23:14:03 +0000	[diff] [blame]	4866	AliasAnalysis methods <AliasAnalysis.html#OtherItfs>`_). The TBAA root of
				4867	the access type and the base type of an access tag must be the same, and
				4868	that is the TBAA root of the access tag.
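
As an illustration, the type descriptors and access tags for the ``Inner`` /
``Outer`` example from the :ref:`Semantics<tbaa_node_semantics>` section could
be encoded along the following lines. The node numbers and the value name
``%f`` are arbitrary choices made for this sketch.

.. code-block:: llvm

    !0 = !{!"TBAA Root"}                                ; Root
    !1 = !{!"char", !0, i64 0}                          ; CharScalarTy
    !2 = !{!"float", !1, i64 0}                         ; FloatScalarTy
    !3 = !{!"double", !1, i64 0}                        ; DoubleScalarTy
    !4 = !{!"int", !1, i64 0}                           ; IntScalarTy
    !5 = !{!"Inner", !4, i64 0, !2, i64 4}              ; InnerStructTy
    !6 = !{!"Outer", !2, i64 0, !3, i64 4, !5, i64 12}  ; OuterStructTy

    !7 = !{!6, !2, i64 0}     ; tag0: (OuterStructTy, FloatScalarTy, 0)
    !8 = !{!6, !4, i64 12}    ; tag1: (OuterStructTy, IntScalarTy, 12)
    !9 = !{!6, !2, i64 16}    ; tag2: (OuterStructTy, FloatScalarTy, 16)
    !10 = !{!2, !2, i64 0}    ; tag3: (FloatScalarTy, FloatScalarTy, 0)

An access tag is then attached to a load or store with ``!tbaa``, for example
``store float 0.000000e+00, float* %f, align 4, !tbaa !10`` for the ``*f = 0.0``
statement.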
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4869
				4870	'``tbaa.struct``' Metadata
				4871	^^^^^^^^^^^^^^^^^^^^^^^^^^
				4872
				4873	The :ref:`llvm.memcpy <int_memcpy>` intrinsic is often used to implement
				4874	aggregate assignment operations in C and similar languages; however, it
				4875	is defined to copy a contiguous region of memory, which is more than
				4876	strictly necessary for aggregate types which contain holes due to
				4877	padding. Also, it doesn't contain any TBAA information about the fields
				4878	of the aggregate.
				4879
				4880	``!tbaa.struct`` metadata can describe which memory subregions in a
				4881	memcpy are padding and what the TBAA tags of the struct are.
				4882
				4883	The current metadata format is very simple. ``!tbaa.struct`` metadata
				4884	nodes are a list of operands which are in conceptual groups of three.
				4885	For each group of three, the first operand gives the byte offset of a
				4886	field, the second gives its size in bytes, and the third gives
				4887	its TBAA tag. For example:
				4888
				4889	.. code-block:: llvm
				4890
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4891	!4 = !{ i64 0, i64 4, !1, i64 8, i64 4, !2 }
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4892
				4893	This describes a struct with two fields. The first is at offset 0 bytes
				4894	with size 4 bytes, and has tbaa tag !1. The second is at offset 8 bytes
				4895	and has size 4 bytes and has tbaa tag !2.
				4896
				4897	Note that the fields need not be contiguous. In this example, there is a
				4898	4 byte gap between the two fields. This gap represents padding which
				4899	does not carry useful data and need not be preserved.
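
As a sketch of how such a node might be used, the copy of a hypothetical
12-byte struct with a 4-byte hole in the middle could carry the metadata as
shown here. The value names ``%dst`` and ``%src`` are assumptions, ``!1`` and
``!2`` stand for TBAA access tags defined elsewhere, and the exact signature of
the ``llvm.memcpy`` intrinsic depends on the LLVM version in use.

.. code-block:: llvm

    ; struct S { int a; /* 4 bytes of padding */ float b; };  (assumed layout)
    call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %dst, i8* align 4 %src,
                                         i64 12, i1 false), !tbaa.struct !4

    !4 = !{ i64 0, i64 4, !1, i64 8, i64 4, !2 }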
				4900
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4901	'``noalias``' and '``alias.scope``' Metadata
Dan Liew	bafdcba	2014-07-28 13:33:51 +0000	[diff] [blame]	4902	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4903
				4904	``noalias`` and ``alias.scope`` metadata provide the ability to specify generic
				4905	noalias memory-access sets. This means that some collection of memory access
				4906	instructions (loads, stores, memory-accessing calls, etc.) that carry
				4907	``noalias`` metadata can be specified not to alias with some other
				4908	collection of memory access instructions that carry ``alias.scope`` metadata.
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4909	Each type of metadata specifies a list of scopes where each scope has an id and
Adam Nemet	569a5b3	2016-04-27 00:52:48 +0000	[diff] [blame]	4910	a domain.
				4911
				4912	When evaluating an aliasing query, if for some domain, the set
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4913	of scopes with that domain in one instruction's ``alias.scope`` list is a
Arch D. Robison	96cf7ab	2015-02-24 20:11:49 +0000	[diff] [blame]	4914	subset of (or equal to) the set of scopes for that domain in another
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4915	instruction's ``noalias`` list, then the two memory accesses are assumed not to
				4916	alias.
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4917
Adam Nemet	569a5b3	2016-04-27 00:52:48 +0000	[diff] [blame]	4918	Because scopes in one domain don't affect scopes in other domains, separate
				4919	domains can be used to compose multiple independent noalias sets. This is
				4920	used for example during inlining. As the noalias function parameters are
				4921	turned into noalias scope metadata, a new domain is used every time the
				4922	function is inlined.
				4923
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4924	The metadata identifying each domain is itself a list containing one or two
				4925	entries. The first entry is the name of the domain. Note that if the name is a
Bruce Mitchener	e9ffb45	2015-09-12 01:17:08 +0000	[diff] [blame]	4926	string then it can be combined across functions and translation units. A
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4927	self-reference can be used to create globally unique domain names. A
				4928	descriptive string may optionally be provided as a second list entry.
				4929
				4930	The metadata identifying each scope is also itself a list containing two or
				4931	three entries. The first entry is the name of the scope. Note that if the name
Bruce Mitchener	e9ffb45	2015-09-12 01:17:08 +0000	[diff] [blame]	4932	is a string then it can be combined across functions and translation units. A
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4933	self-reference can be used to create globally unique scope names. A metadata
				4934	reference to the scope's domain is the second entry. A descriptive string may
				4935	optionally be provided as a third list entry.
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4936
				4937	For example,
				4938
				4939	.. code-block:: llvm
				4940
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4941	; Two scope domains:
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4942	!0 = !{!0}
				4943	!1 = !{!1}
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4944
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4945	; Some scopes in these domains:
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4946	!2 = !{!2, !0}
				4947	!3 = !{!3, !0}
				4948	!4 = !{!4, !1}
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4949
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4950	; Some scope lists:
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4951	!5 = !{!4} ; A list containing only scope !4
				4952	!6 = !{!4, !3, !2}
				4953	!7 = !{!3}
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4954
				4955	; These two instructions don't alias:
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	4956	%0 = load float, float* %c, align 4, !alias.scope !5
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4957	store float %0, float* %arrayidx.i, align 4, !noalias !5
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4958
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4959	; These two instructions also don't alias (for domain !1, the set of scopes
				4960	; in the !alias.scope equals that in the !noalias list):
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	4961	%2 = load float, float* %c, align 4, !alias.scope !5
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4962	store float %2, float* %arrayidx.i2, align 4, !noalias !6
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4963
Adam Nemet	0a8416f	2015-05-11 08:30:28 +0000	[diff] [blame]	4964	; These two instructions may alias (for domain !0, the set of scopes in
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4965	; the !noalias list is not a superset of, or equal to, the scopes in the
				4966	; !alias.scope list):
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	4967	%2 = load float, float* %c, align 4, !alias.scope !6
Hal Finkel	029cde6	2014-07-25 15:50:02 +0000	[diff] [blame]	4968	store float %0, float* %arrayidx.i, align 4, !noalias !7
Hal Finkel	9414665	2014-07-24 14:25:39 +0000	[diff] [blame]	4969
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4970	'``fpmath``' Metadata
				4971	^^^^^^^^^^^^^^^^^^^^^
				4972
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	4973	``fpmath`` metadata may be attached to any instruction of floating-point
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4974	type. It can be used to express the maximum acceptable error in the
				4975	result of that instruction, in ULPs, thus potentially allowing the
				4976	compiler to use a more efficient but less accurate method of computing
				4977	it. ULP is defined as follows:
				4978
				4979	If ``x`` is a real number that lies between two finite consecutive
				4980	floating-point numbers ``a`` and ``b``, without being equal to one
				4981	of them, then ``ulp(x) = |b - a|``, otherwise ``ulp(x)`` is the
				4982	distance between the two non-equal finite floating-point numbers
				4983	nearest ``x``. Moreover, ``ulp(NaN)`` is ``NaN``.
				4984
Matt Arsenault	82f4151	2016-06-27 19:43:15 +0000	[diff] [blame]	4985	The metadata node shall consist of a single positive float type number
				4986	representing the maximum relative error, for example:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4987
				4988	.. code-block:: llvm
				4989
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	4990	!0 = !{ float 2.5 } ; maximum acceptable inaccuracy is 2.5 ULPs
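
A short sketch of how such a node could be attached to an instruction follows.
The value names ``%x``, ``%y`` and ``%q`` are hypothetical and chosen only for
illustration:

.. code-block:: llvm

    ; The division result may be off by up to 2.5 ULPs.
    %q = fdiv float %x, %y, !fpmath !0
    ...
    !0 = !{ float 2.5 }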
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4991
Philip Reames	f8bf9dd	2015-02-27 23:14:50 +0000	[diff] [blame]	4992	.. _range-metadata:
				4993
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	4994	'``range``' Metadata
				4995	^^^^^^^^^^^^^^^^^^^^
				4996
Jingyue Wu	37fcb59	2014-06-19 16:50:16 +0000	[diff] [blame]	4997	``range`` metadata may be attached only to ``load``, ``call`` and ``invoke`` of
				4998	integer types. It expresses the possible ranges the loaded value or the value
Eli Friedman	e15a111	2018-07-17 20:38:11 +0000	[diff] [blame]	4999	returned by the called function at this call site is in. If the loaded or
				5000	returned value is not in the specified range, the behavior is undefined. The
				5001	ranges are represented with a flattened list of integers. The loaded value or
				5002	the value returned is known to be in the union of the ranges defined by each
				5003	consecutive pair. Each pair has the following properties:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5004
				5005	- The type must match the type loaded or returned by the instruction.
				5006	- The pair ``a,b`` represents the range ``[a,b)``.
				5007	- Both ``a`` and ``b`` are constants.
				5008	- The range is allowed to wrap.
				5009	- The range should not represent the full or empty set. That is,
				5010	``a!=b``.
				5011
				5012	In addition, the pairs must be in signed order of the lower bound and
				5013	they must be non-contiguous.
				5014
				5015	Examples:
				5016
				5017	.. code-block:: llvm
				5018
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	5019	%a = load i8, i8* %x, align 1, !range !0 ; Can only be 0 or 1
				5020	%b = load i8, i8* %y, align 1, !range !1 ; Can only be 255 (-1), 0 or 1
Jingyue Wu	37fcb59	2014-06-19 16:50:16 +0000	[diff] [blame]	5021	%c = call i8 @foo(), !range !2 ; Can only be 0, 1, 3, 4 or 5
				5022	%d = invoke i8 @bar() to label %cont
				5023	unwind label %lpad, !range !3 ; Can only be -2, -1, 3, 4 or 5
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5024	...
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5025	!0 = !{ i8 0, i8 2 }
				5026	!1 = !{ i8 255, i8 2 }
				5027	!2 = !{ i8 0, i8 2, i8 3, i8 6 }
				5028	!3 = !{ i8 -2, i8 0, i8 3, i8 6 }
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5029
Peter Collingbourne	235c275	2016-12-08 19:01:00 +0000	[diff] [blame]	5030	'``absolute_symbol``' Metadata
				5031	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5032
				5033	``absolute_symbol`` metadata may be attached to a global variable
				5034	declaration. It marks the declaration as a reference to an absolute symbol,
				5035	which causes the backend to use absolute relocations for the symbol even
				5036	in position independent code, and expresses the possible ranges that the
				5037	global variable's address (not its value) is in, in the same format as
Peter Collingbourne	d88f928	2017-01-20 21:56:37 +0000	[diff] [blame]	5038	``range`` metadata, with the extension that the pair ``all-ones,all-ones``
				5039	may be used to represent the full set.
Peter Collingbourne	235c275	2016-12-08 19:01:00 +0000	[diff] [blame]	5040
Peter Collingbourne	d88f928	2017-01-20 21:56:37 +0000	[diff] [blame]	5041	Example (assuming 64-bit pointers):
Peter Collingbourne	235c275	2016-12-08 19:01:00 +0000	[diff] [blame]	5042
				5043	.. code-block:: llvm
				5044
				5045	@a = external global i8, !absolute_symbol !0 ; Absolute symbol in range [0,256)
Peter Collingbourne	d88f928	2017-01-20 21:56:37 +0000	[diff] [blame]	5046	@b = external global i8, !absolute_symbol !1 ; Absolute symbol in range [0,2^64)
Peter Collingbourne	235c275	2016-12-08 19:01:00 +0000	[diff] [blame]	5047
				5048	...
				5049	!0 = !{ i64 0, i64 256 }
Peter Collingbourne	d88f928	2017-01-20 21:56:37 +0000	[diff] [blame]	5050	!1 = !{ i64 -1, i64 -1 }
Peter Collingbourne	235c275	2016-12-08 19:01:00 +0000	[diff] [blame]	5051
Matthew Simpson	36bbc8c	2017-10-16 22:22:11 +0000	[diff] [blame]	5052	'``callees``' Metadata
				5053	^^^^^^^^^^^^^^^^^^^^^^
				5054
				5055	``callees`` metadata may be attached to indirect call sites. If ``callees``
				5056	metadata is attached to a call site, and the callee is not among the set of
				5057	functions provided by the metadata, the behavior is undefined. The intent of
				5058	this metadata is to facilitate optimizations such as indirect-call promotion.
				5059	For example, in the code below, the call instruction may only target the
				5060	``add`` or ``sub`` functions:
				5061
				5062	.. code-block:: llvm
				5063
				5064	%result = call i64 %binop(i64 %x, i64 %y), !callees !0
				5065
				5066	...
				5067	!0 = !{i64 (i64, i64)* @add, i64 (i64, i64)* @sub}
				5068
Sanjay Patel	a99ab1f	2015-09-02 19:06:43 +0000	[diff] [blame]	5069	'``unpredictable``' Metadata
Sanjay Patel	1f12b34	2015-09-02 19:35:31 +0000	[diff] [blame]	5070	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Sanjay Patel	a99ab1f	2015-09-02 19:06:43 +0000	[diff] [blame]	5071
				5072	``unpredictable`` metadata may be attached to any branch or switch
				5073	instruction. It can be used to express the unpredictability of control
				5074	flow. Similar to the ``llvm.expect`` intrinsic, it may be used to alter
				5075	optimizations related to compare and branch instructions. The metadata
				5076	is treated as a boolean value; if it exists, it signals that the branch
				5077	or switch that it is attached to is completely unpredictable.
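
A minimal sketch, assuming an empty metadata node is sufficient because only
the presence of the metadata matters. The value and label names are
hypothetical:

.. code-block:: llvm

    ; The direction of this branch is assumed to be unpredictable.
    br i1 %cmp, label %if.then, label %if.else, !unpredictable !0
    ...
    !0 = !{}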
				5078
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5079	'``llvm.loop``'
				5080	^^^^^^^^^^^^^^^
				5081
				5082	It is sometimes useful to attach information to loop constructs. Currently,
				5083	loop metadata is implemented as metadata attached to the branch instruction
				5084	in the loop latch block. This type of metadata refers to a metadata node that is
Matt Arsenault	24b49c4	2013-07-31 17:49:08 +0000	[diff] [blame]	5085	guaranteed to be separate for each loop. The loop identifier metadata is
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5086	specified with the name ``llvm.loop``.
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5087
				5088	The loop identifier metadata is implemented using a metadata that refers to
Michael Liao	a769908	2013-03-06 18:24:34 +0000	[diff] [blame]	5089	itself to avoid merging it with any other identifier metadata, e.g.,
				5090	during module linkage or function inlining. That is, each loop should refer
				5091	to its own identification metadata even if the loops reside in separate functions.
				5092	The following example contains loop identifier metadata for two separate loop
Pekka Jaaskelainen	119a2b6	2013-02-22 12:03:07 +0000	[diff] [blame]	5093	constructs:
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5094
				5095	.. code-block:: llvm
Paul Redmond	eaaed3b	2013-02-21 17:20:45 +0000	[diff] [blame]	5096
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5097	!0 = !{!0}
				5098	!1 = !{!1}
Pekka Jaaskelainen	119a2b6	2013-02-22 12:03:07 +0000	[diff] [blame]	5099
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5100	The loop identifier metadata can be used to specify additional
				5101	per-loop metadata. Any operands after the first operand can be treated
				5102	as user-defined metadata. For example, the ``llvm.loop.unroll.count``
				5103	metadata suggests an unroll factor to the loop unroller:
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5104
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5105	.. code-block:: llvm
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5106
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5107	br i1 %exitcond, label %._crit_edge, label %.lr.ph, !llvm.loop !0
				5108	...
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5109	!0 = !{!0, !1}
				5110	!1 = !{!"llvm.loop.unroll.count", i32 4}
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5111
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5112	'``llvm.loop.vectorize``' and '``llvm.loop.interleave``'
				5113	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5114
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5115	Metadata prefixed with ``llvm.loop.vectorize`` or ``llvm.loop.interleave`` are
				5116	used to control per-loop vectorization and interleaving parameters such as
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	5117	vectorization width and interleave count. These metadata should be used in
				5118	conjunction with ``llvm.loop`` loop identification metadata. The
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5119	``llvm.loop.vectorize`` and ``llvm.loop.interleave`` metadata are only
				5120	optimization hints and the optimizer will only interleave and vectorize loops if
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	5121	it believes it is safe to do so. The ``llvm.mem.parallel_loop_access`` metadata,
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5122	which contains information about loop-carried memory dependencies, can be helpful
				5123	in determining the safety of these transformations.
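
As an illustrative sketch, the hints can be listed as extra operands of the
loop identifier node attached to the latch branch. The label names and the
chosen width and count are hypothetical:

.. code-block:: llvm

    br i1 %exitcond, label %for.end, label %for.body, !llvm.loop !0
    ...
    !0 = !{!0, !1, !2}                            ; loop identifier plus two hints
    !1 = !{!"llvm.loop.vectorize.width", i32 4}   ; suggested vectorization width
    !2 = !{!"llvm.loop.interleave.count", i32 2}  ; suggested interleave count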
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5124
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5125	'``llvm.loop.interleave.count``' Metadata
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5126	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5127
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5128	This metadata suggests an interleave count to the loop interleaver.
				5129	The first operand is the string ``llvm.loop.interleave.count`` and the
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5130	second operand is an integer specifying the interleave count. For
				5131	example:
				5132
				5133	.. code-block:: llvm
				5134
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5135	!0 = !{!"llvm.loop.interleave.count", i32 4}
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5136
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5137	Note that setting ``llvm.loop.interleave.count`` to 1 disables interleaving
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	5138	multiple iterations of the loop. If ``llvm.loop.interleave.count`` is set to 0
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5139	then the interleave count will be determined automatically.
				5140
				5141	'``llvm.loop.vectorize.enable``' Metadata
Dan Liew	9a1829d	2014-07-22 14:59:38 +0000	[diff] [blame]	5142	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5143
				5144	This metadata selectively enables or disables vectorization for the loop. The
				5145	first operand is the string ``llvm.loop.vectorize.enable`` and the second operand
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	5146	is a bit. If the bit operand value is 1, vectorization is enabled. A value of
Mark Heffernan	9d20e42	2014-07-21 23:11:03 +0000	[diff] [blame]	5147	0 disables vectorization:
				5148
				5149	.. code-block:: llvm
				5150
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5151	!0 = !{!"llvm.loop.vectorize.enable", i1 0}
				5152	!1 = !{!"llvm.loop.vectorize.enable", i1 1}
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5153
				5154	'``llvm.loop.vectorize.width``' Metadata
				5155	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5156
				5157	This metadata sets the target width of the vectorizer. The first
				5158	operand is the string ``llvm.loop.vectorize.width`` and the second
				5159	operand is an integer specifying the width. For example:
				5160
				5161	.. code-block:: llvm
				5162
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5163	!0 = !{!"llvm.loop.vectorize.width", i32 4}
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5164
				5165	Note that setting ``llvm.loop.vectorize.width`` to 1 disables
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	5166	vectorization of the loop. If ``llvm.loop.vectorize.width`` is set to
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5167	0 or if the loop does not have this metadata the width will be
				5168	determined automatically.
				5169
				5170	'``llvm.loop.unroll``'
				5171	^^^^^^^^^^^^^^^^^^^^^^
				5172
				5173	Metadata prefixed with ``llvm.loop.unroll`` are loop unrolling
				5174	optimization hints such as the unroll factor. ``llvm.loop.unroll``
				5175	metadata should be used in conjunction with ``llvm.loop`` loop
				5176	identification metadata. The ``llvm.loop.unroll`` metadata are only
				5177	optimization hints and the unrolling will only be performed if the
				5178	optimizer believes it is safe to do so.
				5179
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5180	'``llvm.loop.unroll.count``' Metadata
				5181	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5182
				5183	This metadata suggests an unroll factor to the loop unroller. The
				5184	first operand is the string ``llvm.loop.unroll.count`` and the second
				5185	operand is a positive integer specifying the unroll factor. For
				5186	example:
				5187
				5188	.. code-block:: llvm
				5189
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5190	!0 = !{!"llvm.loop.unroll.count", i32 4}
Mark Heffernan	893752a	2014-07-18 19:24:51 +0000	[diff] [blame]	5191
				5192	If the trip count of the loop is less than the unroll count the loop
				5193	will be partially unrolled.
				5194
Mark Heffernan	e6b4ba1	2014-07-23 17:31:37 +0000	[diff] [blame]	5195	'``llvm.loop.unroll.disable``' Metadata
				5196	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5197
Mark Heffernan	3e32a4e	2015-06-30 22:48:51 +0000	[diff] [blame]	5198	This metadata disables loop unrolling. The metadata has a single operand
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	5199	which is the string ``llvm.loop.unroll.disable``. For example:
Mark Heffernan	e6b4ba1	2014-07-23 17:31:37 +0000	[diff] [blame]	5200
				5201	.. code-block:: llvm
				5202
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5203	!0 = !{!"llvm.loop.unroll.disable"}
Mark Heffernan	e6b4ba1	2014-07-23 17:31:37 +0000	[diff] [blame]	5204
Kevin Qin	715b01e	2015-03-09 06:14:18 +0000	[diff] [blame]	5205	'``llvm.loop.unroll.runtime.disable``' Metadata
Dan Liew	868b074	2015-03-11 13:34:49 +0000	[diff] [blame]	5206	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Kevin Qin	715b01e	2015-03-09 06:14:18 +0000	[diff] [blame]	5207
Mark Heffernan	3e32a4e	2015-06-30 22:48:51 +0000	[diff] [blame]	5208	This metadata disables runtime loop unrolling. The metadata has a single
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	5209	operand which is the string ``llvm.loop.unroll.runtime.disable``. For example:
Kevin Qin	715b01e	2015-03-09 06:14:18 +0000	[diff] [blame]	5210
				5211	.. code-block:: llvm
				5212
				5213	!0 = !{!"llvm.loop.unroll.runtime.disable"}
				5214
Mark Heffernan	8939154	2015-08-10 17:28:08 +0000	[diff] [blame]	5215	'``llvm.loop.unroll.enable``' Metadata
				5216	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5217
				5218	This metadata suggests that the loop should be fully unrolled if the trip count
				5219	is known at compile time and partially unrolled if the trip count is not known
				5220	at compile time. The metadata has a single operand which is the string
				5221	``llvm.loop.unroll.enable``. For example:
				5222
				5223	.. code-block:: llvm
				5224
				5225	!0 = !{!"llvm.loop.unroll.enable"}
				5226
Mark Heffernan	e6b4ba1	2014-07-23 17:31:37 +0000	[diff] [blame]	5227	'``llvm.loop.unroll.full``' Metadata
				5228	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5229
Mark Heffernan	3e32a4e	2015-06-30 22:48:51 +0000	[diff] [blame]	5230	This metadata suggests that the loop should be unrolled fully. The
				5231	metadata has a single operand which is the string ``llvm.loop.unroll.full``.
Mark Heffernan	e6b4ba1	2014-07-23 17:31:37 +0000	[diff] [blame]	5232	For example:
				5233
				5234	.. code-block:: llvm
				5235
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5236	!0 = !{!"llvm.loop.unroll.full"}
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5237
David Green	7fbf06c	2018-07-19 12:37:00 +0000	[diff] [blame]	5238	'``llvm.loop.unroll_and_jam``'
				5239	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5240
				5241	This metadata is treated very similarly to the ``llvm.loop.unroll`` metadata
				5242	above, but affects the unroll and jam pass. In addition, any loop with
				5243	``llvm.loop.unroll`` metadata but no ``llvm.loop.unroll_and_jam`` metadata will
				5244	disable unroll and jam (so ``llvm.loop.unroll`` metadata will be left to the
				5245	unroller, and ``llvm.loop.unroll.disable`` metadata will disable unroll and jam
				5246	too).
				5247
				5248	The metadata for unroll and jam otherwise is the same as for ``unroll``.
				5249	``llvm.loop.unroll_and_jam.enable``, ``llvm.loop.unroll_and_jam.disable`` and
				5250	``llvm.loop.unroll_and_jam.count`` do the same as for unroll.
				5251	``llvm.loop.unroll_and_jam.full`` is not supported. Again these are only hints
				5252	and the normal safety checks will still be performed.
				5253
				5254	'``llvm.loop.unroll_and_jam.count``' Metadata
				5255	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5256
				5257	This metadata suggests an unroll and jam factor to use, similarly to
				5258	``llvm.loop.unroll.count``. The first operand is the string
				5259	``llvm.loop.unroll_and_jam.count`` and the second operand is a positive integer
				5260	specifying the unroll factor. For example:
				5261
				5262	.. code-block:: llvm
				5263
				5264	!0 = !{!"llvm.loop.unroll_and_jam.count", i32 4}
				5265
				5266	If the trip count of the loop is less than the unroll count the loop
				5267	will be partially unrolled and jammed.
				5268
				5269	'``llvm.loop.unroll_and_jam.disable``' Metadata
				5270	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5271
				5272	This metadata disables loop unroll and jam. The metadata has a single
				5273	operand which is the string ``llvm.loop.unroll_and_jam.disable``. For example:
				5274
				5275	.. code-block:: llvm
				5276
				5277	!0 = !{!"llvm.loop.unroll_and_jam.disable"}
				5278
				5279	'``llvm.loop.unroll_and_jam.enable``' Metadata
				5280	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5281
				5282	This metadata suggests that the loop should be fully unrolled and jammed if the
				5283	trip count is known at compile time and partially unrolled if the trip count is
				5284	not known at compile time. The metadata has a single operand which is the
				5285	string ``llvm.loop.unroll_and_jam.enable``. For example:
				5286
				5287	.. code-block:: llvm
				5288
				5289	!0 = !{!"llvm.loop.unroll_and_jam.enable"}
				5290
Ashutosh Nema	df6763a	2016-02-06 07:47:48 +0000	[diff] [blame]	5291	'``llvm.loop.licm_versioning.disable``' Metadata
Ashutosh Nema	5f0e472	2016-02-06 09:24:37 +0000	[diff] [blame]	5292	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ashutosh Nema	df6763a	2016-02-06 07:47:48 +0000	[diff] [blame]	5293
				5294	This metadata indicates that the loop should not be versioned for the purpose
				5295	of enabling loop-invariant code motion (LICM). The metadata has a single operand
				5296	which is the string ``llvm.loop.licm_versioning.disable``. For example:
				5297
				5298	.. code-block:: llvm
				5299
				5300	!0 = !{!"llvm.loop.licm_versioning.disable"}
				5301
Adam Nemet	d2fa414	2016-04-27 05:28:18 +0000	[diff] [blame]	5302	'``llvm.loop.distribute.enable``' Metadata
Adam Nemet	55dc0af	2016-04-27 05:59:51 +0000	[diff] [blame]	5303	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Adam Nemet	d2fa414	2016-04-27 05:28:18 +0000	[diff] [blame]	5304
				5305	Loop distribution allows splitting a loop into multiple loops. Currently,
				5306	this is only performed if the entire loop cannot be vectorized due to unsafe
Hiroshi Inoue	b93daec	2017-07-02 12:44:27 +0000	[diff] [blame]	5307	memory dependencies. The transformation will attempt to isolate the unsafe
Adam Nemet	d2fa414	2016-04-27 05:28:18 +0000	[diff] [blame]	5308	dependencies into their own loop.
				5309
				5310	This metadata can be used to selectively enable or disable distribution of the
				5311	loop. The first operand is the string ``llvm.loop.distribute.enable`` and the
				5312	second operand is a bit. If the bit operand value is 1, distribution is
				5313	enabled. A value of 0 disables distribution:
				5314
				5315	.. code-block:: llvm
				5316
				5317	!0 = !{!"llvm.loop.distribute.enable", i1 0}
				5318	!1 = !{!"llvm.loop.distribute.enable", i1 1}
				5319
				5320	This metadata should be used in conjunction with ``llvm.loop`` loop
				5321	identification metadata.
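
A minimal sketch of that combination, with hypothetical label names, places
the hint as the second operand of the loop identifier node:

.. code-block:: llvm

    br i1 %exitcond, label %for.end, label %for.body, !llvm.loop !0
    ...
    !0 = !{!0, !1}                                 ; loop identifier plus hint
    !1 = !{!"llvm.loop.distribute.enable", i1 1}   ; request distribution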
				5322
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5323	'``llvm.mem``'
				5324	^^^^^^^^^^^^^^^
				5325
				5326	Metadata types used to annotate memory accesses with information helpful
				5327	for optimizations are prefixed with ``llvm.mem``.
				5328
				5329	'``llvm.mem.parallel_loop_access``' Metadata
				5330	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5331
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	5332	The ``llvm.mem.parallel_loop_access`` metadata refers to a loop identifier,
				5333	or metadata containing a list of loop identifiers for nested loops.
				5334	The metadata is attached to memory accessing instructions and denotes that
				5335	no loop-carried memory dependence exists between it and other instructions denoted
Hal Finkel	411d31a	2016-04-26 02:00:36 +0000	[diff] [blame]	5336	with the same loop identifier. The metadata on memory reads also implies that
				5337	if-conversion (i.e. speculative execution within a loop iteration) is safe.
Pekka Jaaskelainen	23b222cc	2014-05-23 11:35:46 +0000	[diff] [blame]	5338
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	5339	Precisely, given two instructions ``m1`` and ``m2`` that both have the
				5340	``llvm.mem.parallel_loop_access`` metadata, with ``L1`` and ``L2`` being the
				5341	set of loops associated with that metadata, respectively, then there is no loop
				5342	carried dependence between ``m1`` and ``m2`` for loops in both ``L1`` and
Pekka Jaaskelainen	23b222cc	2014-05-23 11:35:46 +0000	[diff] [blame]	5343	``L2``.
				5344
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	5345	As a special case, if all memory accessing instructions in a loop have
				5346	``llvm.mem.parallel_loop_access`` metadata that refers to that loop, then the
				5347	loop has no loop carried memory dependences and is considered to be a parallel
				5348	loop.
Pekka Jaaskelainen	23b222cc	2014-05-23 11:35:46 +0000	[diff] [blame]	5349
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	5350	Note that if not all memory access instructions have such metadata referring to
				5351	the loop, then the loop is not considered trivially parallel. Additional
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	5352	memory dependence analysis is required to make that determination. As a fail
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	5353	safe mechanism, this causes loops that were originally parallel to be considered
				5354	sequential (if optimization passes that are unaware of the parallel semantics
Pekka Jaaskelainen	23b222cc	2014-05-23 11:35:46 +0000	[diff] [blame]	5355	insert new memory instructions into the loop body).
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5356
				5357	Example of a loop that is considered parallel due to its correct use of
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5358	both ``llvm.loop`` and ``llvm.mem.parallel_loop_access``
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5359	metadata types that refer to the same loop identifier metadata:
				5360
				5361	.. code-block:: llvm
				5362
				5363	for.body:
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5364	...
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	5365	%val0 = load i32, i32* %arrayidx, !llvm.mem.parallel_loop_access !0
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5366	...
Tobias Grosser	fbe95dc	2014-03-05 13:36:04 +0000	[diff] [blame]	5367	store i32 %val0, i32* %arrayidx1, !llvm.mem.parallel_loop_access !0
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5368	...
				5369	br i1 %exitcond, label %for.end, label %for.body, !llvm.loop !0
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5370
				5371	for.end:
				5372	...
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5373	!0 = !{!0}
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5374
				5375	It is also possible to have nested parallel loops. In that case the
				5376	memory accesses refer to a list of loop identifier metadata nodes instead of
				5377	the loop identifier metadata node directly:
				5378
				5379	.. code-block:: llvm
				5380
				5381	outer.for.body:
Tobias Grosser	fbe95dc	2014-03-05 13:36:04 +0000	[diff] [blame]	5382	...
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	5383	%val1 = load i32, i32* %arrayidx3, !llvm.mem.parallel_loop_access !2
Tobias Grosser	fbe95dc	2014-03-05 13:36:04 +0000	[diff] [blame]	5384	...
				5385	br label %inner.for.body
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5386
				5387	inner.for.body:
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5388	...
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	5389	%val0 = load i32, i32* %arrayidx1, !llvm.mem.parallel_loop_access !0
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5390	...
Tobias Grosser	fbe95dc	2014-03-05 13:36:04 +0000	[diff] [blame]	5391	store i32 %val0, i32* %arrayidx2, !llvm.mem.parallel_loop_access !0
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5392	...
				5393	br i1 %exitcond, label %inner.for.end, label %inner.for.body, !llvm.loop !1
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5394
				5395	inner.for.end:
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5396	...
Tobias Grosser	fbe95dc	2014-03-05 13:36:04 +0000	[diff] [blame]	5397	store i32 %val1, i32* %arrayidx4, !llvm.mem.parallel_loop_access !2
Paul Redmond	5fdf836	2013-05-28 20:00:34 +0000	[diff] [blame]	5398	...
				5399	br i1 %exitcond, label %outer.for.end, label %outer.for.body, !llvm.loop !2
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5400
				5401	outer.for.end: ; preds = %for.body
				5402	...
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5403	!0 = !{!1, !2} ; a list of loop identifiers
				5404	!1 = !{!1} ; an identifier for the inner loop
				5405	!2 = !{!2} ; an identifier for the outer loop
Pekka Jaaskelainen	0d23725	2013-02-13 18:08:57 +0000	[diff] [blame]	5406
Hiroshi Yamauchi	dce9def	2017-11-02 22:26:51 +0000	[diff] [blame]	5407	'``irr_loop``' Metadata
				5408	^^^^^^^^^^^^^^^^^^^^^^^
				5409
``irr_loop`` metadata may be attached to the terminator instruction of a basic
block that is an irreducible loop header (note that an irreducible loop has more
than one header basic block). If ``irr_loop`` metadata is attached to the
terminator instruction of a basic block that is not an irreducible loop
header, the behavior is undefined. The intent of this metadata is to improve the
accuracy of block frequency propagation. For example, in the code below, the
block ``header0`` may have a loop header weight (relative to the other headers of
the irreducible loop) of 100:
				5418
				5419	.. code-block:: llvm
				5420
				5421	header0:
				5422	...
				5423	br i1 %cmp, label %t1, label %t2, !irr_loop !0
				5424
				5425	...
   !0 = !{!"loop_header_weight", i64 100}
				5427
				5428	Irreducible loop header weights are typically based on profile data.
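
As a further sketch (the function, block, and value names and the weight of 300
are illustrative assumptions, not real profile data), both headers of a
two-header irreducible loop might carry weights as follows. Here ``header1`` is
weighted three times as heavily as ``header0``:

.. code-block:: llvm

   define void @irreducible(i1 %c0, i1 %c1, i1 %c2) {
   entry:
     ; The cycle can be entered at either header, which makes it irreducible.
     br i1 %c0, label %header0, label %header1

   header0:
     br i1 %c1, label %header1, label %exit, !irr_loop !0

   header1:
     br i1 %c2, label %header0, label %exit, !irr_loop !1

   exit:
     ret void
   }

   !0 = !{!"loop_header_weight", i64 100}
   !1 = !{!"loop_header_weight", i64 300}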
				5429
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5430	'``invariant.group``' Metadata
				5431	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				5432
Piotr Padlewski	74b155f	2018-04-08 13:53:04 +0000	[diff] [blame]	5433	The experimental ``invariant.group`` metadata may be attached to
Piotr Padlewski	ce35826	2018-05-18 23:53:46 +0000	[diff] [blame]	5434	``load``/``store`` instructions referencing a single metadata with no entries.
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5435	The existence of the ``invariant.group`` metadata on the instruction tells
				5436	the optimizer that every ``load`` and ``store`` to the same pointer operand
Piotr Padlewski	ce35826	2018-05-18 23:53:46 +0000	[diff] [blame]	5437	can be assumed to load or store the same
Piotr Padlewski	5dde809	2018-05-03 11:03:01 +0000	[diff] [blame]	5438	value (but see the ``llvm.launder.invariant.group`` intrinsic which affects
Piotr Padlewski	da36215	2016-12-30 18:45:07 +0000	[diff] [blame]	5439	when two pointers are considered the same). Pointers returned by bitcast or
				5440	getelementptr with only zero indices are considered the same.
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5441
				5442	Examples:
				5443
				5444	.. code-block:: llvm
				5445
				5446	@unknownPtr = external global i8
				5447	...
				5448	%ptr = alloca i8
				5449	store i8 42, i8* %ptr, !invariant.group !0
				5450	call void @foo(i8* %ptr)
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5451
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5452	%a = load i8, i8* %ptr, !invariant.group !0 ; Can assume that value under %ptr didn't change
				5453	call void @foo(i8* %ptr)
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5454
				5455	%newPtr = call i8* @getPointer(i8* %ptr)
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5456	%c = load i8, i8* %newPtr, !invariant.group !0 ; Can't assume anything, because we only have information about %ptr
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5457
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5458	%unknownValue = load i8, i8* @unknownPtr
				5459	store i8 %unknownValue, i8* %ptr, !invariant.group !0 ; Can assume that %unknownValue == 42
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5460
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5461	call void @foo(i8* %ptr)
Piotr Padlewski	5dde809	2018-05-03 11:03:01 +0000	[diff] [blame]	5462	%newPtr2 = call i8* @llvm.launder.invariant.group(i8* %ptr)
				5463	%d = load i8, i8* %newPtr2, !invariant.group !0 ; Can't step through launder.invariant.group to get value of %ptr
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5464
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5465	...
				5466	declare void @foo(i8*)
				5467	declare i8* @getPointer(i8*)
Piotr Padlewski	5dde809	2018-05-03 11:03:01 +0000	[diff] [blame]	5468	declare i8* @llvm.launder.invariant.group(i8*)
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5469
Piotr Padlewski	ce35826	2018-05-18 23:53:46 +0000	[diff] [blame]	5470	!0 = !{}
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5471
Piotr Padlewski	f8486e3	2017-04-12 07:59:35 +0000	[diff] [blame]	5472	The invariant.group metadata must be dropped when replacing one pointer by
				5473	another based on aliasing information. This is because invariant.group is tied
				5474	to the SSA value of the pointer operand.
				5475
				5476	.. code-block:: llvm
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5477
Piotr Padlewski	f8486e3	2017-04-12 07:59:35 +0000	[diff] [blame]	5478	%v = load i8, i8* %x, !invariant.group !0
				5479	; if %x mustalias %y then we can replace the above instruction with
				5480	%v = load i8, i8* %y
				5481
Piotr Padlewski	74b155f	2018-04-08 13:53:04 +0000	[diff] [blame]	5482	Note that this is an experimental feature, which means that its semantics might
				5483	change in the future.
Piotr Padlewski	f8486e3	2017-04-12 07:59:35 +0000	[diff] [blame]	5484
Peter Collingbourne	a333db8	2016-07-26 22:31:30 +0000	[diff] [blame]	5485	'``type``' Metadata
				5486	^^^^^^^^^^^^^^^^^^^
				5487
				5488	See :doc:`TypeMetadata`.
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5489
Evgeniy Stepanov	51c962f72	2017-03-17 22:17:24 +0000	[diff] [blame]	5490	'``associated``' Metadata
Evgeniy Stepanov	4d490de	2017-03-17 22:31:13 +0000	[diff] [blame]	5491	^^^^^^^^^^^^^^^^^^^^^^^^^
Evgeniy Stepanov	51c962f72	2017-03-17 22:17:24 +0000	[diff] [blame]	5492
				5493	The ``associated`` metadata may be attached to a global object
				5494	declaration with a single argument that references another global object.
				5495
				5496	This metadata prevents discarding of the global object in linker GC
				5497	unless the referenced object is also discarded. The linker support for
				5498	this feature is spotty. For best compatibility, globals carrying this
				5499	metadata may also:
				5500
				5501	- Be in a comdat with the referenced global.
				5502	- Be in @llvm.compiler.used.
				5503	- Have an explicit section with a name which is a valid C identifier.
				5504
				5505	It does not have any effect on non-ELF targets.
				5506
				5507	Example:
				5508
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	5509	.. code-block:: text
Evgeniy Stepanov	4d490de	2017-03-17 22:31:13 +0000	[diff] [blame]	5510
Evgeniy Stepanov	51c962f72	2017-03-17 22:17:24 +0000	[diff] [blame]	5511	$a = comdat any
				5512	@a = global i32 1, comdat $a
				5513	@b = internal global i32 2, comdat $a, section "abc", !associated !0
				5514	!0 = !{i32* @a}
				5515
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	5516
Teresa Johnson	d72f51c	2017-06-15 15:57:12 +0000	[diff] [blame]	5517	'``prof``' Metadata
				5518	^^^^^^^^^^^^^^^^^^^
				5519
				5520	The ``prof`` metadata is used to record profile data in the IR.
				5521	The first operand of the metadata node indicates the profile metadata
				5522	type. There are currently 3 types:
				5523	:ref:`branch_weights<prof_node_branch_weights>`,
				5524	:ref:`function_entry_count<prof_node_function_entry_count>`, and
				5525	:ref:`VP<prof_node_VP>`.
				5526
				5527	.. _prof_node_branch_weights:
				5528
				5529	branch_weights
				5530	""""""""""""""
				5531
Branch weight metadata attached to a branch, select, switch or call instruction
represents the likelihood of the associated branch being taken.
				5534	For more information, see :doc:`BranchWeightMetadata`.
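
A minimal sketch of a conditional branch carrying branch weights (the function
and block names and the weights 99 and 1 are illustrative assumptions):

.. code-block:: llvm

   define i32 @select_path(i1 %cond) {
   entry:
     br i1 %cond, label %likely, label %unlikely, !prof !0

   likely:
     ret i32 1

   unlikely:
     ret i32 0
   }

   ; The first operand names the profile type. The remaining operands are the
   ; weights of the true and false successors, in that order.
   !0 = !{!"branch_weights", i32 99, i32 1}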
				5535
				5536	.. _prof_node_function_entry_count:
				5537
				5538	function_entry_count
				5539	""""""""""""""""""""
				5540
				5541	Function entry count metadata can be attached to function definitions
				5542	to record the number of times the function is called. Used with BFI
				5543	information, it is also used to derive the basic block profile count.
				5544	For more information, see :doc:`BranchWeightMetadata`.
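
A minimal sketch of a function definition annotated with an entry count (the
function name and the count of 1000 are illustrative assumptions):

.. code-block:: llvm

   define void @hot_function() !prof !0 {
   entry:
     ret void
   }

   ; The function was entered 1000 times in the profiled run.
   !0 = !{!"function_entry_count", i64 1000}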
				5545
				5546	.. _prof_node_VP:
				5547
				5548	VP
				5549	""
				5550
				5551	VP (value profile) metadata can be attached to instructions that have
				5552	value profile information. Currently this is indirect calls (where it
				5553	records the hottest callees) and calls to memory intrinsics such as memcpy,
				5554	memmove, and memset (where it records the hottest byte lengths).
				5555
Each VP metadata node contains the string "VP", then a uint32_t value for the
value profiling kind, a uint64_t value for the total number of times the
instruction is executed, followed by pairs of a uint64_t profile value and its
uint64_t execution count.
The value profiling kind is 0 for indirect call targets and 1 for memory
operations. For indirect call targets, each profile value is a hash
of the callee function name, and for memory operations each value is the
byte length.
				5563
				5564	Note that the value counts do not need to add up to the total count
				5565	listed in the third operand (in practice only the top hottest values
				5566	are tracked and reported).
				5567
				5568	Indirect call example:
				5569
				5570	.. code-block:: llvm
				5571
				5572	call void %f(), !prof !1
				5573	!1 = !{!"VP", i32 0, i64 1600, i64 7651369219802541373, i64 1030, i64 -4377547752858689819, i64 410}
				5574
Note that the VP type is 0 (the second operand), which indicates that this is
indirect call value profile data. The third operand indicates that the
indirect call executed 1600 times. The 4th and 6th operands give the
hashes of the names of the 2 hottest target functions (this is the same hash
used to represent function names in the profile database), and the 5th and 7th
operands give the number of times each of the respective target functions
was called.
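
A memory operation example can be sketched in the same way. The counts and byte
lengths below are made up for illustration, and the ``llvm.memcpy`` declaration
assumes the four-operand form of the intrinsic:

.. code-block:: llvm

   define void @copy(i8* %dst, i8* %src, i64 %n) {
   entry:
     call void @llvm.memcpy.p0i8.p0i8.i64(i8* %dst, i8* %src, i64 %n, i1 false), !prof !0
     ret void
   }

   declare void @llvm.memcpy.p0i8.p0i8.i64(i8*, i8*, i64, i1)

   ; Kind 1 (memory operation), 2000 executions in total: a length of 8 bytes
   ; was seen 1500 times and a length of 64 bytes was seen 300 times.
   !0 = !{!"VP", i32 1, i64 2000, i64 8, i64 1500, i64 64, i64 300}

Here the two recorded counts sum to 1800 rather than 2000, which is allowed
because only the hottest values are tracked.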
				5582
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5583	Module Flags Metadata
				5584	=====================
				5585
Information about the module as a whole is difficult to convey to LLVM's
subsystems. The LLVM IR isn't sufficient to transmit this information.
The ``llvm.module.flags`` named metadata exists in order to facilitate
this. These flags are in the form of key/value pairs, much like a
dictionary, making it easy for any subsystem that cares about a flag to
look it up.
				5592
				5593	The ``llvm.module.flags`` metadata contains a list of metadata triplets.
				5594	Each triplet has the following form:
				5595
- The first element is a behavior flag, which specifies the behavior
  when two (or more) modules are merged together and the linker encounters
  two (or more) metadata entries with the same ID. The supported behaviors
  are described below.
				5600	- The second element is a metadata string that is a unique ID for the
Daniel Dunbar	25c4b57	2013-01-15 01:22:53 +0000	[diff] [blame]	5601	metadata. Each module may only have one flag entry for each unique ID (not
				5602	including entries with the Require behavior).
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5603	- The third element is the value of the flag.
				5604
When two (or more) modules are merged together, the resulting
``llvm.module.flags`` metadata is the union of the modules' flags. That is, for
each unique metadata ID string, there will be exactly one entry in the merged
module's ``llvm.module.flags`` metadata table, and the value for that entry will
be determined by the merge behavior flag, as described below. The only exception
is that entries with the Require behavior are always preserved.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5611
				5612	The following behaviors are supported:
				5613
				5614	.. list-table::
				5615	:header-rows: 1
				5616	:widths: 10 90
				5617
				5618	* - Value
				5619	- Behavior
				5620
				5621	* - 1
				5622	- Error
Daniel Dunbar	25c4b57	2013-01-15 01:22:53 +0000	[diff] [blame]	5623	Emits an error if two values disagree, otherwise the resulting value
				5624	is that of the operands.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5625
				5626	* - 2
				5627	- Warning
Daniel Dunbar	25c4b57	2013-01-15 01:22:53 +0000	[diff] [blame]	5628	Emits a warning if two values disagree. The result value will be the
				5629	operand for the flag from the first module being linked.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5630
				5631	* - 3
				5632	- Require
Daniel Dunbar	25c4b57	2013-01-15 01:22:53 +0000	[diff] [blame]	5633	Adds a requirement that another module flag be present and have a
				5634	specified value after linking is performed. The value must be a
				5635	metadata pair, where the first element of the pair is the ID of the
				5636	module flag to be restricted, and the second element of the pair is
				5637	the value the module flag should be restricted to. This behavior can
				5638	be used to restrict the allowable results (via triggering of an
				5639	error) of linking IDs with the Override behavior.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5640
				5641	* - 4
				5642	- Override
Daniel Dunbar	25c4b57	2013-01-15 01:22:53 +0000	[diff] [blame]	5643	Uses the specified value, regardless of the behavior or value of the
				5644	other module. If both modules specify Override, but the values
				5645	differ, an error will be emitted.
				5646
Daniel Dunbar	d77d9fb	2013-01-16 21:38:56 +0000	[diff] [blame]	5647	* - 5
				5648	- Append
				5649	Appends the two values, which are required to be metadata nodes.
				5650
				5651	* - 6
				5652	- AppendUnique
				5653	Appends the two values, which are required to be metadata
				5654	nodes. However, duplicate entries in the second list are dropped
				5655	during the append operation.
				5656
Steven Wu	86a511e	2017-08-15 16:16:33 +0000	[diff] [blame]	5657	* - 7
				5658	- Max
				5659	Takes the max of the two values, which are required to be integers.
				5660
Daniel Dunbar	25c4b57	2013-01-15 01:22:53 +0000	[diff] [blame]	5661	It is an error for a particular unique flag ID to have multiple behaviors,
				5662	except in the case of Require (which adds restrictions on another metadata
				5663	value) or Override.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5664
				5665	An example of module flags:
				5666
				5667	.. code-block:: llvm
				5668
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5669	!0 = !{ i32 1, !"foo", i32 1 }
				5670	!1 = !{ i32 4, !"bar", i32 37 }
				5671	!2 = !{ i32 2, !"qux", i32 42 }
				5672	!3 = !{ i32 3, !"qux",
				5673	!{
				5674	!"foo", i32 1
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5675	}
				5676	}
				5677	!llvm.module.flags = !{ !0, !1, !2, !3 }
				5678
				5679	- Metadata ``!0`` has the ID ``!"foo"`` and the value '1'. The behavior
				5680	if two or more ``!"foo"`` flags are seen is to emit an error if their
				5681	values are not equal.
				5682
				5683	- Metadata ``!1`` has the ID ``!"bar"`` and the value '37'. The
				5684	behavior if two or more ``!"bar"`` flags are seen is to use the value
Daniel Dunbar	25c4b57	2013-01-15 01:22:53 +0000	[diff] [blame]	5685	'37'.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5686
				5687	- Metadata ``!2`` has the ID ``!"qux"`` and the value '42'. The
				5688	behavior if two or more ``!"qux"`` flags are seen is to emit a
				5689	warning if their values are not equal.
				5690
				5691	- Metadata ``!3`` has the ID ``!"qux"`` and the value:
				5692
				5693	::
				5694
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	5695	!{ !"foo", i32 1 }
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5696
Daniel Dunbar	25c4b57	2013-01-15 01:22:53 +0000	[diff] [blame]	5697	The behavior is to emit an error if the ``llvm.module.flags`` does not
				5698	contain a flag with the ID ``!"foo"`` that has the value '1' after linking is
				5699	performed.
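
As a sketch of how the Max and Append behaviors combine values (the flag names
``level`` and ``opts`` and all values are hypothetical), consider linking a
second module into the module below. The second module and the expected result
are shown as comments because they would live in separate files:

.. code-block:: llvm

   ; Module A
   !llvm.module.flags = !{!0, !1}
   !0 = !{i32 7, !"level", i32 2}        ; Max
   !1 = !{i32 5, !"opts", !{!"-a"}}      ; Append

   ; Module B (a separate file):
   ;   !llvm.module.flags = !{!0, !1}
   ;   !0 = !{i32 7, !"level", i32 5}
   ;   !1 = !{i32 5, !"opts", !{!"-b"}}

   ; Expected flags after linking B into A, following the behavior table:
   ;   !{i32 7, !"level", i32 5}             ; max(2, 5)
   ;   !{i32 5, !"opts", !{!"-a", !"-b"}}    ; the two nodes appended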
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5700
				5701	Objective-C Garbage Collection Module Flags Metadata
				5702	----------------------------------------------------
				5703
				5704	On the Mach-O platform, Objective-C stores metadata about garbage
				5705	collection in a special section called "image info". The metadata
				5706	consists of a version number and a bitmask specifying what types of
				5707	garbage collection are supported (if any) by the file. If two or more
				5708	modules are linked together their garbage collection metadata needs to
				5709	be merged rather than appended together.
				5710
				5711	The Objective-C garbage collection module flags metadata consists of the
				5712	following key-value pairs:
				5713
				5714	.. list-table::
				5715	:header-rows: 1
				5716	:widths: 30 70
				5717
				5718	* - Key
				5719	- Value
				5720
Daniel Dunbar	1dc66ca	2013-01-17 18:57:32 +0000	[diff] [blame]	5721	* - ``Objective-C Version``
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	5722	- [Required] --- The Objective-C ABI version. Valid values are 1 and 2.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5723
Daniel Dunbar	1dc66ca	2013-01-17 18:57:32 +0000	[diff] [blame]	5724	* - ``Objective-C Image Info Version``
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	5725	- [Required] --- The version of the image info section. Currently
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5726	always 0.
				5727
Daniel Dunbar	1dc66ca	2013-01-17 18:57:32 +0000	[diff] [blame]	5728	* - ``Objective-C Image Info Section``
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	5729	- [Required] --- The section to place the metadata. Valid values are
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5730	``"__OBJC, __image_info, regular"`` for Objective-C ABI version 1, and
				5731	``"__DATA,__objc_imageinfo, regular, no_dead_strip"`` for
				5732	Objective-C ABI version 2.
				5733
Daniel Dunbar	1dc66ca	2013-01-17 18:57:32 +0000	[diff] [blame]	5734	* - ``Objective-C Garbage Collection``
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	5735	- [Required] --- Specifies whether garbage collection is supported or
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5736	not. Valid values are 0, for no garbage collection, and 2, for garbage
				5737	collection supported.
				5738
Daniel Dunbar	1dc66ca	2013-01-17 18:57:32 +0000	[diff] [blame]	5739	* - ``Objective-C GC Only``
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	5740	- [Optional] --- Specifies that only garbage collection is supported.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	5741	If present, its value must be 6. This flag requires that the
				5742	``Objective-C Garbage Collection`` flag have the value 2.
				5743
				5744	Some important flag interactions:
				5745
				5746	- If a module with ``Objective-C Garbage Collection`` set to 0 is
				5747	merged with a module with ``Objective-C Garbage Collection`` set to
				5748	2, then the resulting module has the
				5749	``Objective-C Garbage Collection`` flag set to 0.
				5750	- A module with ``Objective-C Garbage Collection`` set to 0 cannot be
				5751	merged with a module with ``Objective-C GC Only`` set to 6.
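
A minimal sketch of these flags for a module using Objective-C ABI version 2
without garbage collection. The merge behavior chosen for each flag (Error
here) is an assumption for illustration; a front end may choose differently:

.. code-block:: llvm

   !llvm.module.flags = !{!0, !1, !2, !3}
   !0 = !{i32 1, !"Objective-C Version", i32 2}
   !1 = !{i32 1, !"Objective-C Image Info Version", i32 0}
   !2 = !{i32 1, !"Objective-C Image Info Section", !"__DATA,__objc_imageinfo, regular, no_dead_strip"}
   !3 = !{i32 1, !"Objective-C Garbage Collection", i32 0}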
				5752
Oliver Stannard	5dc2934	2014-06-20 10:08:11 +0000	[diff] [blame]	5753	C type width Module Flags Metadata
				5754	----------------------------------
				5755
				5756	The ARM backend emits a section into each generated object file describing the
				5757	options that it was compiled with (in a compiler-independent way) to prevent
				5758	linking incompatible objects, and to allow automatic library selection. Some
				5759	of these options are not visible at the IR level, namely wchar_t width and enum
				5760	width.
				5761
				5762	To pass this information to the backend, these options are encoded in module
				5763	flags metadata, using the following key-value pairs:
				5764
				5765	.. list-table::
				5766	:header-rows: 1
				5767	:widths: 30 70
				5768
				5769	* - Key
				5770	- Value
				5771
				5772	* - short_wchar
				5773	- * 0 --- sizeof(wchar_t) == 4
				5774	* 1 --- sizeof(wchar_t) == 2
				5775
				5776	* - short_enum
				5777	- * 0 --- Enums are at least as large as an ``int``.
				5778	* 1 --- Enums are stored in the smallest integer type which can
				5779	represent all of its values.
				5780
For example, the following metadata section specifies that the module was
compiled with a ``wchar_t`` width of 4 bytes, and the underlying type of an
enum is the smallest type which can represent all of its values::

    !llvm.module.flags = !{!0, !1}
    !0 = !{i32 1, !"short_wchar", i32 0}
    !1 = !{i32 1, !"short_enum", i32 1}
Oliver Stannard	5dc2934	2014-06-20 10:08:11 +0000	[diff] [blame]	5788
Peter Collingbourne	89061b2	2017-06-12 20:10:48 +0000	[diff] [blame]	5789	Automatic Linker Flags Named Metadata
				5790	=====================================
				5791
				5792	Some targets support embedding flags to the linker inside individual object
				5793	files. Typically this is used in conjunction with language extensions which
				5794	allow source files to explicitly declare the libraries they depend on, and have
				5795	these automatically be transmitted to the linker via object files.
				5796
				5797	These flags are encoded in the IR using named metadata with the name
				5798	``!llvm.linker.options``. Each operand is expected to be a metadata node
				5799	which should be a list of other metadata nodes, each of which should be a
				5800	list of metadata strings defining linker options.
				5801
				5802	For example, the following metadata section specifies two separate sets of
				5803	linker options, presumably to link against ``libz`` and the ``Cocoa``
				5804	framework::
				5805
    !0 = !{ !"-lz" }
    !1 = !{ !"-framework", !"Cocoa" }
    !llvm.linker.options = !{ !0, !1 }
				5809
				5810	The metadata encoding as lists of lists of options, as opposed to a collapsed
				5811	list of options, is chosen so that the IR encoding can use multiple option
				5812	strings to specify e.g., a single library, while still having that specifier be
				5813	preserved as an atomic element that can be recognized by a target specific
				5814	assembly writer or object file emitter.
				5815
				5816	Each individual option is required to be either a valid option for the target's
				5817	linker, or an option that is reserved by the target specific assembly writer or
				5818	object file emitter. No other aspect of these options is defined by the IR.
				5819
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5820	.. _summary:
				5821
				5822	ThinLTO Summary
				5823	===============
				5824
Compiling with `ThinLTO <https://clang.llvm.org/docs/ThinLTO.html>`_
builds a compact summary of the module that is emitted into the bitcode.
The summary is also emitted into the LLVM assembly, where it is identified
by a caret ('``^``').
				5829
*Note that the summary entries are temporarily skipped when parsing the
assembly, although parsing support is actively being implemented. The
following describes when the summary entries will be parsed once implemented.*
The summary will be parsed into a ModuleSummaryIndex object under the
same conditions where the summary index is currently built from bitcode.
Specifically, this happens in tools that test the Thin Link portion of a
ThinLTO compile (i.e. llvm-lto and llvm-lto2), or when parsing a combined index
for a distributed ThinLTO backend via clang's "``-fthinlto-index=<>``" flag.
Additionally, it will be parsed into a bitcode output, along with the Module
IR, via the "``llvm-as``" tool. Tools that parse the Module IR for the purposes
of optimization (e.g. "``clang -x ir``" and "``opt``") will ignore the
summary entries (just as they currently ignore summary entries in a bitcode
input file).
				5843
				5844	There are currently 3 types of summary entries in the LLVM assembly:
				5845	:ref:`module paths<module_path_summary>`,
				5846	:ref:`global values<gv_summary>`, and
				5847	:ref:`type identifiers<typeid_summary>`.
				5848
				5849	.. _module_path_summary:
				5850
				5851	Module Path Summary Entry
				5852	-------------------------
				5853
				5854	Each module path summary entry lists a module containing global values included
				5855	in the summary. For a single IR module there will be one such entry, but
				5856	in a combined summary index produced during the thin link, there will be
				5857	one module path entry per linked module with summary.
				5858
				5859	Example:
				5860
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5861	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5862
				5863	^0 = module: (path: "/path/to/file.o", hash: (2468601609, 1329373163, 1565878005, 638838075, 3148790418))
				5864
				5865	The ``path`` field is a string path to the bitcode file, and the ``hash``
				5866	field is the 160-bit SHA-1 hash of the IR bitcode contents, used for
				5867	incremental builds and caching.
				5868
				5869	.. _gv_summary:
				5870
				5871	Global Value Summary Entry
				5872	--------------------------
				5873
				5874	Each global value summary entry corresponds to a global value defined or
				5875	referenced by a summarized module.
				5876
				5877	Example:
				5878
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5879	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5880
				5881	^4 = gv: (name: "f"[, summaries: (Summary)[, (Summary)]*]?) ; guid = 14740650423002898831
				5882
				5883	For declarations, there will not be a summary list. For definitions, a
				5884	global value will contain a list of summaries, one per module containing
				5885	a definition. There can be multiple entries in a combined summary index
				5886	for symbols with weak linkage.
				5887
				5888	Each ``Summary`` format will depend on whether the global value is a
				5889	:ref:`function<function_summary>`, :ref:`variable<variable_summary>`, or
				5890	:ref:`alias<alias_summary>`.
				5891
				5892	.. _function_summary:
				5893
				5894	Function Summary
				5895	^^^^^^^^^^^^^^^^
				5896
				5897	If the global value is a function, the ``Summary`` entry will look like:
				5898
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5899	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5900
    function: (module: ^0, flags: (linkage: external, notEligibleToImport: 0, live: 0, dsoLocal: 0), insts: 2[, FuncFlags]?[, Calls]?[, TypeIdInfo]?[, Refs]?)
				5902
				5903	The ``module`` field includes the summary entry id for the module containing
				5904	this definition, and the ``flags`` field contains information such as
				5905	the linkage type, a flag indicating whether it is legal to import the
				5906	definition, whether it is globally live and whether the linker resolved it
				5907	to a local definition (the latter two are populated during the thin link).
				5908	The ``insts`` field contains the number of IR instructions in the function.
				5909	Finally, there are several optional fields: :ref:`FuncFlags<funcflags_summary>`,
				5910	:ref:`Calls<calls_summary>`, :ref:`TypeIdInfo<typeidinfo_summary>`,
				5911	:ref:`Refs<refs_summary>`.
				5912
				5913	.. _variable_summary:
				5914
				5915	Global Variable Summary
				5916	^^^^^^^^^^^^^^^^^^^^^^^
				5917
				5918	If the global value is a variable, the ``Summary`` entry will look like:
				5919
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5920	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5921
    variable: (module: ^0, flags: (linkage: external, notEligibleToImport: 0, live: 0, dsoLocal: 0)[, Refs]?)
				5923
				5924	The variable entry contains a subset of the fields in a
				5925	:ref:`function summary <function_summary>`, see the descriptions there.
				5926
				5927	.. _alias_summary:
				5928
				5929	Alias Summary
				5930	^^^^^^^^^^^^^
				5931
				5932	If the global value is an alias, the ``Summary`` entry will look like:
				5933
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5934	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5935
				5936	alias: (module: ^0, flags: (linkage: external, notEligibleToImport: 0, live: 0, dsoLocal: 0), aliasee: ^2)
				5937
				5938	The ``module`` and ``flags`` fields are as described for a
				5939	:ref:`function summary <function_summary>`. The ``aliasee`` field
				5940	contains a reference to the global value summary entry of the aliasee.
				5941
				5942	.. _funcflags_summary:
				5943
				5944	Function Flags
				5945	^^^^^^^^^^^^^^
				5946
				5947	The optional ``FuncFlags`` field looks like:
				5948
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5949	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5950
				5951	funcFlags: (readNone: 0, readOnly: 0, noRecurse: 0, returnDoesNotAlias: 0)
				5952
				5953	If unspecified, flags are assumed to hold the conservative ``false`` value of
				5954	``0``.
				5955
				5956	.. _calls_summary:
				5957
				5958	Calls
				5959	^^^^^
				5960
				5961	The optional ``Calls`` field looks like:
				5962
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5963	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5964
				5965	calls: ((Callee)[, (Callee)]*)
				5966
				5967	where each ``Callee`` looks like:
				5968
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5969	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5970
				5971	callee: ^1[, hotness: None]?[, relbf: 0]?
				5972
				5973	The ``callee`` refers to the summary entry id of the callee. At most one
				5974	of ``hotness`` (which can take the values ``Unknown``, ``Cold``, ``None``,
				5975	``Hot``, and ``Critical``), and ``relbf`` (which holds the integer
				5976	branch frequency relative to the entry frequency, scaled down by 2^8)
				5977	may be specified. The defaults are ``Unknown`` and ``0``, respectively.
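
As an illustrative sketch (the summary ids ``^1`` and ``^2`` and the ``relbf``
value are hypothetical, not taken from a real index), a ``Calls`` field
describing two call edges might be written as:

.. code-block:: text

   calls: ((callee: ^1, relbf: 256), (callee: ^2))

Here the second ``Callee`` gives neither ``hotness`` nor ``relbf``, so the
defaults stated above apply to it.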
				5978
				5979	.. _refs_summary:
				5980
				5981	Refs
				5982	^^^^
				5983
				5984	The optional ``Refs`` field looks like:
				5985
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	5986	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	5987
				5988	refs: ((Ref)[, (Ref)]*)
				5989
				5990	where each ``Ref`` contains a reference to the summary id of the referenced
				5991	value (e.g. ``^1``).
				5992
				5993	.. _typeidinfo_summary:
				5994
				5995	TypeIdInfo
				5996	^^^^^^^^^^
				5997
				5998	The optional ``TypeIdInfo`` field, used for
				5999	`Control Flow Integrity <http://clang.llvm.org/docs/ControlFlowIntegrity.html>`_,
				6000	looks like:
				6001
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6002	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6003
				6004	typeIdInfo: [(TypeTests)]?[, (TypeTestAssumeVCalls)]?[, (TypeCheckedLoadVCalls)]?[, (TypeTestAssumeConstVCalls)]?[, (TypeCheckedLoadConstVCalls)]?
				6005
				6006	These optional fields have the following forms:
				6007
				6008	TypeTests
				6009	"""""""""
				6010
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6011	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6012
				6013	typeTests: (TypeIdRef[, TypeIdRef]*)
				6014
				6015	Where each ``TypeIdRef`` refers to a :ref:`type id<typeid_summary>`
				6016	by summary id or ``GUID``.
				6017
				6018	TypeTestAssumeVCalls
				6019	""""""""""""""""""""
				6020
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6021	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6022
				6023	typeTestAssumeVCalls: (VFuncId[, VFuncId]*)
				6024
				6025	Where each VFuncId has the format:
				6026
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6027	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6028
				6029	vFuncId: (TypeIdRef, offset: 16)
				6030
				6031	Where each ``TypeIdRef`` refers to a :ref:`type id<typeid_summary>`
by summary id or ``GUID`` preceded by a ``guid:`` tag.
				6033
				6034	TypeCheckedLoadVCalls
				6035	"""""""""""""""""""""
				6036
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6037	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6038
				6039	typeCheckedLoadVCalls: (VFuncId[, VFuncId]*)
				6040
				6041	Where each VFuncId has the format described for ``TypeTestAssumeVCalls``.
				6042
				6043	TypeTestAssumeConstVCalls
				6044	"""""""""""""""""""""""""
				6045
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6046	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6047
				6048	typeTestAssumeConstVCalls: (ConstVCall[, ConstVCall]*)
				6049
				6050	Where each ConstVCall has the format:
				6051
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6052	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6053
				6054	VFuncId, args: (Arg[, Arg]*)
				6055
				6056	and where each VFuncId has the format described for ``TypeTestAssumeVCalls``,
				6057	and each Arg is an integer argument number.
				6058
				6059	TypeCheckedLoadConstVCalls
				6060	""""""""""""""""""""""""""
				6061
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6062	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6063
				6064	typeCheckedLoadConstVCalls: (ConstVCall[, ConstVCall]*)
				6065
				6066	Where each ConstVCall has the format described for
				6067	``TypeTestAssumeConstVCalls``.
				6068
				6069	.. _typeid_summary:
				6070
				6071	Type ID Summary Entry
				6072	---------------------
				6073
				6074	Each type id summary entry corresponds to a type identifier resolution
				6075	which is generated during the LTO link portion of the compile when building
				6076	with `Control Flow Integrity <http://clang.llvm.org/docs/ControlFlowIntegrity.html>`_,
				6077	so these are only present in a combined summary index.
				6078
				6079	Example:
				6080
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6081	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6082
				6083	^4 = typeid: (name: "_ZTS1A", summary: (typeTestRes: (kind: allOnes, sizeM1BitWidth: 7[, alignLog2: 0]?[, sizeM1: 0]?[, bitMask: 0]?[, inlineBits: 0]?)[, WpdResolutions]?)) ; guid = 7004155349499253778
				6084
				6085	The ``typeTestRes`` gives the type test resolution ``kind`` (which may
				6086	be ``unsat``, ``byteArray``, ``inline``, ``single``, or ``allOnes``), and
				6087	the ``size-1`` bit width. It is followed by optional flags, which default to 0,
				6088	and an optional WpdResolutions (whole program devirtualization resolution)
				6089	field that looks like:
				6090
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6091	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6092
   wpdResolutions: ((offset: 0, WpdRes)[, (offset: 1, WpdRes)]*)
				6094
				6095	where each entry is a mapping from the given byte offset to the whole-program
				6096	devirtualization resolution WpdRes, that has one of the following formats:
				6097
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6098	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6099
				6100	wpdRes: (kind: branchFunnel)
				6101	wpdRes: (kind: singleImpl, singleImplName: "_ZN1A1nEi")
				6102	wpdRes: (kind: indir)
				6103
				6104	Additionally, each wpdRes has an optional ``resByArg`` field, which
				6105	describes the resolutions for calls with all constant integer arguments:
				6106
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6107	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6108
				6109	resByArg: (ResByArg[, ResByArg]*)
				6110
				6111	where ResByArg is:
				6112
Chandler Carruth	3a56e3f	2018-08-06 09:46:59 +0000	[diff] [blame]	6113	.. code-block:: text
Teresa Johnson	08d5b4e	2018-05-26 02:34:13 +0000	[diff] [blame]	6114
   args: (Arg[, Arg]*), byArg: (kind: UniformRetVal[, info: 0]?[, byte: 0]?[, bit: 0]?)
				6116
				6117	Where the ``kind`` can be ``Indir``, ``UniformRetVal``, ``UniqueRetVal``
				6118	or ``VirtualConstProp``. The ``info`` field is only used if the kind
				6119	is ``UniformRetVal`` (indicates the uniform return value), or
				6120	``UniqueRetVal`` (holds the return value associated with the unique vtable
				6121	(0 or 1)). The ``byte`` and ``bit`` fields are only used if the target does
				6122	not support the use of absolute symbols to store constants.
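
Putting these pieces together, a type id entry that carries a single
whole-program devirtualization resolution might look like the following
sketch. It reuses the name, GUID and ``singleImplName`` from the examples
above; the combination itself is an assumption made for illustration, not
output copied from a real combined index:

.. code-block:: text

   ^4 = typeid: (name: "_ZTS1A", summary: (typeTestRes: (kind: allOnes, sizeM1BitWidth: 7), wpdResolutions: ((offset: 0, wpdRes: (kind: singleImpl, singleImplName: "_ZN1A1nEi"))))) ; guid = 7004155349499253778

The optional ``typeTestRes`` flags are omitted here and therefore default to 0.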
				6123
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	6124	.. _intrinsicglobalvariables:
				6125
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6126	Intrinsic Global Variables
				6127	==========================
				6128
				6129	LLVM has a number of "magic" global variables that contain data that
				6130	affect code generation or other IR semantics. These are documented here.
				6131	All globals of this sort should have a section specified as
				6132	"``llvm.metadata``". This section and all globals that start with
				6133	"``llvm.``" are reserved for use by LLVM.
				6134
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	6135	.. _gv_llvmused:
				6136
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6137	The '``llvm.used``' Global Variable
				6138	-----------------------------------
				6139
Rafael Espindola	74f2e46	2013-04-22 14:58:02 +0000	[diff] [blame]	6140	The ``@llvm.used`` global is an array which has
Paul Redmond	219ef81	2013-05-30 17:24:32 +0000	[diff] [blame]	6141	:ref:`appending linkage <linkage_appending>`. This array contains a list of
Rafael Espindola	70a729d	2013-06-11 13:18:13 +0000	[diff] [blame]	6142	pointers to named global variables, functions and aliases which may optionally
				6143	have a pointer cast formed of bitcast or getelementptr. For example, a legal
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6144	use of it is:
				6145
				6146	.. code-block:: llvm
				6147
				6148	@X = global i8 4
				6149	@Y = global i32 123
				6150
				6151	@llvm.used = appending global [2 x i8*] [
				6152	i8* @X,
				6153	i8* bitcast (i32* @Y to i8*)
				6154	], section "llvm.metadata"
				6155
Rafael Espindola	74f2e46	2013-04-22 14:58:02 +0000	[diff] [blame]	6156	If a symbol appears in the ``@llvm.used`` list, then the compiler, assembler,
				6157	and linker are required to treat the symbol as if there is a reference to the
Rafael Espindola	70a729d	2013-06-11 13:18:13 +0000	[diff] [blame]	6158	symbol that it cannot see (which is why they have to be named). For example, if
				6159	a variable has internal linkage and no references other than that from the
				6160	``@llvm.used`` list, it cannot be deleted. This is commonly used to represent
				6161	references from inline asms and other things the compiler cannot "see", and
corresponds to "``__attribute__((used))``" in GNU C.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6163
				6164	On some targets, the code generator must emit a directive to the
				6165	assembler or object file to prevent the assembler and linker from
removing the symbol.
				6167
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	6168	.. _gv_llvmcompilerused:
				6169
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6170	The '``llvm.compiler.used``' Global Variable
				6171	--------------------------------------------
				6172
				6173	The ``@llvm.compiler.used`` directive is the same as the ``@llvm.used``
				6174	directive, except that it only prevents the compiler from touching the
				6175	symbol. On targets that support it, this allows an intelligent linker to
				6176	optimize references to the symbol without being impeded as it would be
				6177	by ``@llvm.used``.
				6178
This construct should be used only in rare circumstances, and should not be
exposed to source languages.
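
A minimal sketch, written by analogy with the ``@llvm.used`` example above
and assuming a hypothetical internal variable ``@keep_me``:

.. code-block:: llvm

   ; @keep_me has internal linkage and no other references in the module.
   @keep_me = internal global i32 0

   ; Listing it here keeps the compiler from deleting it; unlike
   ; @llvm.used, the linker is not constrained.
   @llvm.compiler.used = appending global [1 x i8*] [
     i8* bitcast (i32* @keep_me to i8*)
   ], section "llvm.metadata"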
				6181
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	6182	.. _gv_llvmglobalctors:
				6183
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6184	The '``llvm.global_ctors``' Global Variable
				6185	-------------------------------------------
				6186
				6187	.. code-block:: llvm
				6188
   %0 = type { i32, void ()*, i8* }
   @llvm.global_ctors = appending global [1 x %0] [%0 { i32 65535, void ()* @ctor, i8* @data }]
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6191
				6192	The ``@llvm.global_ctors`` array contains a list of constructor
Reid Kleckner	fceb76f	2014-05-16 20:39:27 +0000	[diff] [blame]	6193	functions, priorities, and an optional associated global or function.
				6194	The functions referenced by this array will be called in ascending order
				6195	of priority (i.e. lowest first) when the module is loaded. The order of
				6196	functions with the same priority is not defined.
				6197
				6198	If the third field is present, non-null, and points to a global variable
				6199	or function, the initializer function will only run if the associated
				6200	data from the current module is not discarded.
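
The ordering and the associated-data rule can be sketched with two
constructors. The names ``@early``, ``@late`` and ``@data`` and the priority
``101`` are hypothetical, chosen only for illustration:

.. code-block:: llvm

   @data = global i8 0

   define void @early() {
     ret void
   }

   define void @late() {
     ret void
   }

   ; @early (priority 101) is called before @late (priority 65535).
   ; @early has a null third field, so it has no associated data;
   ; @late only runs if @data from this module is not discarded.
   @llvm.global_ctors = appending global [2 x { i32, void ()*, i8* }] [
     { i32, void ()*, i8* } { i32 101, void ()* @early, i8* null },
     { i32, void ()*, i8* } { i32 65535, void ()* @late, i8* @data }
   ]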
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6201
Eli Bendersky	0220e6b	2013-06-07 20:24:43 +0000	[diff] [blame]	6202	.. _llvmglobaldtors:
				6203
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6204	The '``llvm.global_dtors``' Global Variable
				6205	-------------------------------------------
				6206
				6207	.. code-block:: llvm
				6208
   %0 = type { i32, void ()*, i8* }
   @llvm.global_dtors = appending global [1 x %0] [%0 { i32 65535, void ()* @dtor, i8* @data }]
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6211
Reid Kleckner	fceb76f	2014-05-16 20:39:27 +0000	[diff] [blame]	6212	The ``@llvm.global_dtors`` array contains a list of destructor
				6213	functions, priorities, and an optional associated global or function.
				6214	The functions referenced by this array will be called in descending
Reid Kleckner	bffbcc5	2014-05-27 21:35:17 +0000	[diff] [blame]	6215	order of priority (i.e. highest first) when the module is unloaded. The
Reid Kleckner	fceb76f	2014-05-16 20:39:27 +0000	[diff] [blame]	6216	order of functions with the same priority is not defined.
				6217
				6218	If the third field is present, non-null, and points to a global variable
				6219	or function, the destructor function will only run if the associated
				6220	data from the current module is not discarded.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6221
				6222	Instruction Reference
				6223	=====================
				6224
				6225	The LLVM instruction set consists of several different classifications
				6226	of instructions: :ref:`terminator instructions <terminators>`, :ref:`binary
				6227	instructions <binaryops>`, :ref:`bitwise binary
				6228	instructions <bitwiseops>`, :ref:`memory instructions <memoryops>`, and
				6229	:ref:`other instructions <otherops>`.
				6230
				6231	.. _terminators:
				6232
				6233	Terminator Instructions
				6234	-----------------------
				6235
				6236	As mentioned :ref:`previously <functionstructure>`, every basic block in a
				6237	program ends with a "Terminator" instruction, which indicates which
				6238	block should be executed after the current block is finished. These
				6239	terminator instructions typically yield a '``void``' value: they produce
				6240	control flow, not values (the one exception being the
				6241	':ref:`invoke <i_invoke>`' instruction).
				6242
				6243	The terminator instructions are: ':ref:`ret <i_ret>`',
				6244	':ref:`br <i_br>`', ':ref:`switch <i_switch>`',
				6245	':ref:`indirectbr <i_indirectbr>`', ':ref:`invoke <i_invoke>`',
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6246	':ref:`resume <i_resume>`', ':ref:`catchswitch <i_catchswitch>`',
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6247	':ref:`catchret <i_catchret>`',
				6248	':ref:`cleanupret <i_cleanupret>`',
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6249	and ':ref:`unreachable <i_unreachable>`'.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6250
				6251	.. _i_ret:
				6252
				6253	'``ret``' Instruction
				6254	^^^^^^^^^^^^^^^^^^^^^
				6255
				6256	Syntax:
				6257	"""""""
				6258
				6259	::
				6260
				6261	ret <type> <value> ; Return a value from a non-void function
				6262	ret void ; Return from void function
				6263
				6264	Overview:
				6265	"""""""""
				6266
				6267	The '``ret``' instruction is used to return control flow (and optionally
				6268	a value) from a function back to the caller.
				6269
There are two forms of the '``ret``' instruction: one that returns a
value and then transfers control, and one that only transfers
control.
				6273
				6274	Arguments:
				6275	""""""""""
				6276
				6277	The '``ret``' instruction optionally accepts a single argument, the
				6278	return value. The type of the return value must be a ':ref:`first
				6279	class <t_firstclass>`' type.
				6280
A function is not :ref:`well formed <wellformed>` if it has a non-void
				6282	return type and contains a '``ret``' instruction with no return value or
				6283	a return value with a type that does not match its type, or if it has a
				6284	void return type and contains a '``ret``' instruction with a return
				6285	value.
				6286
				6287	Semantics:
				6288	""""""""""
				6289
				6290	When the '``ret``' instruction is executed, control flow returns back to
				6291	the calling function's context. If the caller is a
				6292	":ref:`call <i_call>`" instruction, execution continues at the
				6293	instruction after the call. If the caller was an
				6294	":ref:`invoke <i_invoke>`" instruction, execution continues at the
				6295	beginning of the "normal" destination block. If the instruction returns
				6296	a value, that value shall set the call or invoke instruction's return
				6297	value.
				6298
				6299	Example:
				6300	""""""""
				6301
				6302	.. code-block:: llvm
				6303
				6304	ret i32 5 ; Return an integer value of 5
				6305	ret void ; Return from a void function
				6306	ret { i32, i8 } { i32 4, i8 2 } ; Return a struct of values 4 and 2
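
To show how the returned value reaches the caller, the following sketch
uses two hypothetical functions, ``@callee`` and ``@caller``:

.. code-block:: llvm

   define i32 @callee() {
     ret i32 7                   ; becomes the value of the call below
   }

   define i32 @caller() {
     %v = call i32 @callee()     ; %v is set by the callee's 'ret'
     %r = add i32 %v, 1          ; execution continues after the call
     ret i32 %r
   }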
				6307
				6308	.. _i_br:
				6309
				6310	'``br``' Instruction
				6311	^^^^^^^^^^^^^^^^^^^^
				6312
				6313	Syntax:
				6314	"""""""
				6315
				6316	::
				6317
				6318	br i1 <cond>, label <iftrue>, label <iffalse>
				6319	br label <dest> ; Unconditional branch
				6320
				6321	Overview:
				6322	"""""""""
				6323
				6324	The '``br``' instruction is used to cause control flow to transfer to a
				6325	different basic block in the current function. There are two forms of
				6326	this instruction, corresponding to a conditional branch and an
				6327	unconditional branch.
				6328
				6329	Arguments:
				6330	""""""""""
				6331
				6332	The conditional branch form of the '``br``' instruction takes a single
				6333	'``i1``' value and two '``label``' values. The unconditional form of the
				6334	'``br``' instruction takes a single '``label``' value as a target.
				6335
				6336	Semantics:
				6337	""""""""""
				6338
				6339	Upon execution of a conditional '``br``' instruction, the '``i1``'
				6340	argument is evaluated. If the value is ``true``, control flows to the
'``iftrue``' ``label`` argument. If the value is ``false``, control flows
				6342	to the '``iffalse``' ``label`` argument.
				6343
				6344	Example:
				6345	""""""""
				6346
				6347	.. code-block:: llvm
				6348
				6349	Test:
				6350	%cond = icmp eq i32 %a, %b
				6351	br i1 %cond, label %IfEqual, label %IfUnequal
				6352	IfEqual:
				6353	ret i32 1
				6354	IfUnequal:
				6355	ret i32 0
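
Both forms commonly appear together in loops. The following sketch, with a
hypothetical function ``@count``, enters the loop through an unconditional
branch and leaves it through a conditional one:

.. code-block:: llvm

   define i32 @count(i32 %n) {
   entry:
     br label %loop                          ; unconditional form
   loop:
     %i = phi i32 [ 0, %entry ], [ %next, %loop ]
     %next = add i32 %i, 1
     %done = icmp eq i32 %next, %n
     br i1 %done, label %exit, label %loop   ; conditional form
   exit:
     ret i32 %next
   }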
				6356
				6357	.. _i_switch:
				6358
				6359	'``switch``' Instruction
				6360	^^^^^^^^^^^^^^^^^^^^^^^^
				6361
				6362	Syntax:
				6363	"""""""
				6364
				6365	::
				6366
				6367	switch <intty> <value>, label <defaultdest> [ <intty> <val>, label <dest> ... ]
				6368
				6369	Overview:
				6370	"""""""""
				6371
				6372	The '``switch``' instruction is used to transfer control flow to one of
				6373	several different places. It is a generalization of the '``br``'
				6374	instruction, allowing a branch to occur to one of many possible
				6375	destinations.
				6376
				6377	Arguments:
				6378	""""""""""
				6379
				6380	The '``switch``' instruction uses three parameters: an integer
				6381	comparison value '``value``', a default '``label``' destination, and an
				6382	array of pairs of comparison value constants and '``label``'s. The table
				6383	is not allowed to contain duplicate constant entries.
				6384
				6385	Semantics:
				6386	""""""""""
				6387
				6388	The ``switch`` instruction specifies a table of values and destinations.
				6389	When the '``switch``' instruction is executed, this table is searched
				6390	for the given value. If the value is found, control flow is transferred
				6391	to the corresponding destination; otherwise, control flow is transferred
				6392	to the default destination.
				6393
				6394	Implementation:
				6395	"""""""""""""""
				6396
				6397	Depending on properties of the target machine and the particular
				6398	``switch`` instruction, this instruction may be code generated in
				6399	different ways. For example, it could be generated as a series of
				6400	chained conditional branches or with a lookup table.
				6401
				6402	Example:
				6403	""""""""
				6404
				6405	.. code-block:: llvm
				6406
				6407	; Emulate a conditional br instruction
				6408	%Val = zext i1 %value to i32
				6409	switch i32 %Val, label %truedest [ i32 0, label %falsedest ]
				6410
				6411	; Emulate an unconditional br instruction
				6412	switch i32 0, label %dest [ ]
				6413
				6414	; Implement a jump table:
				6415	switch i32 %val, label %otherwise [ i32 0, label %onzero
				6416	i32 1, label %onone
				6417	i32 2, label %ontwo ]
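
For context, a jump table like the one above can be placed in a complete
function. This is a sketch only; the function name ``@classify`` and the
returned constants are hypothetical:

.. code-block:: llvm

   define i32 @classify(i32 %val) {
   entry:
     switch i32 %val, label %otherwise [ i32 0, label %onzero
                                         i32 1, label %onone ]
   onzero:
     ret i32 10
   onone:
     ret i32 11
   otherwise:                    ; reached for any value not in the table
     ret i32 -1
   }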
				6418
				6419	.. _i_indirectbr:
				6420
				6421	'``indirectbr``' Instruction
				6422	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				6423
				6424	Syntax:
				6425	"""""""
				6426
				6427	::
				6428
				6429	indirectbr <somety>* <address>, [ label <dest1>, label <dest2>, ... ]
				6430
				6431	Overview:
				6432	"""""""""
				6433
				6434	The '``indirectbr``' instruction implements an indirect branch to a
				6435	label within the current function, whose address is specified by
				6436	"``address``". Address must be derived from a
				6437	:ref:`blockaddress <blockaddress>` constant.
				6438
				6439	Arguments:
				6440	""""""""""
				6441
				6442	The '``address``' argument is the address of the label to jump to. The
				6443	rest of the arguments indicate the full set of possible destinations
				6444	that the address may point to. Blocks are allowed to occur multiple
				6445	times in the destination list, though this isn't particularly useful.
				6446
				6447	This destination list is required so that dataflow analysis has an
				6448	accurate understanding of the CFG.
				6449
				6450	Semantics:
				6451	""""""""""
				6452
				6453	Control transfers to the block specified in the address argument. All
				6454	possible destination blocks must be listed in the label list, otherwise
				6455	this instruction has undefined behavior. This implies that jumps to
				6456	labels defined in other functions have undefined behavior as well.
				6457
				6458	Implementation:
				6459	"""""""""""""""
				6460
				6461	This is typically implemented with a jump through a register.
				6462
				6463	Example:
				6464	""""""""
				6465
				6466	.. code-block:: llvm
				6467
				6468	indirectbr i8* %Addr, [ label %bb1, label %bb2, label %bb3 ]
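
The address operand is typically loaded from a table of ``blockaddress``
constants. The following sketch assumes a hypothetical function
``@dispatch`` and table ``@targets``; it is only well defined when ``%idx``
is 0 or 1:

.. code-block:: llvm

   @targets = internal constant [2 x i8*] [
     i8* blockaddress(@dispatch, %bb1),
     i8* blockaddress(@dispatch, %bb2)
   ]

   define i32 @dispatch(i32 %idx) {
   entry:
     %slot = getelementptr inbounds [2 x i8*], [2 x i8*]* @targets, i32 0, i32 %idx
     %addr = load i8*, i8** %slot
     ; Every block whose address may be loaded is listed as a destination.
     indirectbr i8* %addr, [ label %bb1, label %bb2 ]
   bb1:
     ret i32 1
   bb2:
     ret i32 2
   }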
				6469
				6470	.. _i_invoke:
				6471
				6472	'``invoke``' Instruction
				6473	^^^^^^^^^^^^^^^^^^^^^^^^
				6474
				6475	Syntax:
				6476	"""""""
				6477
				6478	::
				6479
   <result> = invoke [cconv] [ret attrs] [addrspace(<num>)] <ty>|<fnty> <fnptrval>(<function args>) [fn attrs]
                 [operand bundles] to label <normal label> unwind label <exception label>
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6482
				6483	Overview:
				6484	"""""""""
				6485
				6486	The '``invoke``' instruction causes control to transfer to a specified
				6487	function, with the possibility of control flow transfer to either the
				6488	'``normal``' label or the '``exception``' label. If the callee function
				6489	returns with the "``ret``" instruction, control flow will return to the
				6490	"normal" label. If the callee (or any indirect callees) returns via the
				6491	":ref:`resume <i_resume>`" instruction or other exception handling
				6492	mechanism, control is interrupted and continued at the dynamically
				6493	nearest "exception" label.
				6494
The '``exception``' label is a `landing
pad <ExceptionHandling.html#overview>`_ for the exception. As such, the
'``exception``' label is required to have the
":ref:`landingpad <i_landingpad>`" instruction, which contains the
information about the behavior of the program after unwinding happens,
as its first non-PHI instruction. The restrictions on the
"``landingpad``" instruction tightly couple it to the "``invoke``"
instruction, so that the important information contained within the
"``landingpad``" instruction can't be lost through normal code motion.
				6504
				6505	Arguments:
				6506	""""""""""
				6507
				6508	This instruction requires several arguments:
				6509
				6510	#. The optional "cconv" marker indicates which :ref:`calling
				6511	convention <callingconv>` the call should use. If none is
				6512	specified, the call defaults to using C calling conventions.
				6513	#. The optional :ref:`Parameter Attributes <paramattrs>` list for return
				6514	values. Only '``zeroext``', '``signext``', and '``inreg``' attributes
				6515	are valid here.
#. The optional addrspace attribute can be used to indicate the address space
				6517	of the called function. If it is not specified, the program address space
				6518	from the :ref:`datalayout string<langref_datalayout>` will be used.
David Blaikie	b83cf10	2016-07-13 17:21:34 +0000	[diff] [blame]	6519	#. '``ty``': the type of the call instruction itself which is also the
				6520	type of the return value. Functions that return no value are marked
				6521	``void``.
				6522	#. '``fnty``': shall be the signature of the function being invoked. The
				6523	argument types must match the types implied by this signature. This
				6524	type can be omitted if the function is not varargs.
				6525	#. '``fnptrval``': An LLVM value containing a pointer to a function to
				6526	be invoked. In most cases, this is a direct function invocation, but
				6527	indirect ``invoke``'s are just as possible, calling an arbitrary pointer
				6528	to function value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6529	#. '``function args``': argument list whose types match the function
				6530	signature argument types and parameter attributes. All arguments must
				6531	be of :ref:`first class <t_firstclass>` type. If the function signature
				6532	indicates the function accepts a variable number of arguments, the
				6533	extra arguments can be specified.
				6534	#. '``normal label``': the label reached when the called function
				6535	executes a '``ret``' instruction.
				6536	#. '``exception label``': the label reached when a callee returns via
				6537	the :ref:`resume <i_resume>` instruction or other exception handling
				6538	mechanism.
George Burgess IV	8a464a7	2017-04-13 05:00:31 +0000	[diff] [blame]	6539	#. The optional :ref:`function attributes <fnattrs>` list.
Sanjoy Das	b513a9f	2015-09-24 23:34:52 +0000	[diff] [blame]	6540	#. The optional :ref:`operand bundles <opbundles>` list.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6541
				6542	Semantics:
				6543	""""""""""
				6544
				6545	This instruction is designed to operate as a standard '``call``'
				6546	instruction in most regards. The primary difference is that it
				6547	establishes an association with a label, which is used by the runtime
				6548	library to unwind the stack.
				6549
				6550	This instruction is used in languages with destructors to ensure that
				6551	proper cleanup is performed in the case of either a ``longjmp`` or a
				6552	thrown exception. Additionally, this is important for implementation of
				6553	'``catch``' clauses in high-level languages that support them.
				6554
				6555	For the purposes of the SSA form, the definition of the value returned
				6556	by the '``invoke``' instruction is deemed to occur on the edge from the
				6557	current block to the "normal" label. If the callee unwinds then no
				6558	return value is available.
				6559
				6560	Example:
				6561	""""""""
				6562
				6563	.. code-block:: llvm
				6564
				6565	%retval = invoke i32 @Test(i32 15) to label %Continue
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6566	unwind label %TestCleanup ; i32:retval set
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6567	%retval = invoke coldcc i32 %Testfnptr(i32 15) to label %Continue
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6568	unwind label %TestCleanup ; i32:retval set
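
The fragments above omit the surrounding function. A fuller sketch, assuming
the Itanium C++ personality routine ``@__gxx_personality_v0`` and a
hypothetical callee ``@may_throw``, shows the normal destination, the landing
pad and the continued unwinding together:

.. code-block:: llvm

   declare i32 @__gxx_personality_v0(...)
   declare i32 @may_throw(i32)

   define i32 @guarded() personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) {
   entry:
     %retval = invoke i32 @may_throw(i32 15)
             to label %Continue unwind label %Cleanup
   Continue:                     ; reached when @may_throw returns via 'ret'
     ret i32 %retval
   Cleanup:                      ; reached when @may_throw unwinds
     %exn = landingpad { i8*, i32 }
             cleanup
     resume { i8*, i32 } %exn    ; continue propagating the exception
   }

Note that ``%retval`` is only used in ``%Continue``, since no return value is
available on the unwind edge.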
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6569
				6570	.. _i_resume:
				6571
				6572	'``resume``' Instruction
				6573	^^^^^^^^^^^^^^^^^^^^^^^^
				6574
				6575	Syntax:
				6576	"""""""
				6577
				6578	::
				6579
				6580	resume <type> <value>
				6581
				6582	Overview:
				6583	"""""""""
				6584
				6585	The '``resume``' instruction is a terminator instruction that has no
				6586	successors.
				6587
				6588	Arguments:
				6589	""""""""""
				6590
				6591	The '``resume``' instruction requires one argument, which must have the
				6592	same type as the result of any '``landingpad``' instruction in the same
				6593	function.
				6594
				6595	Semantics:
				6596	""""""""""
				6597
				6598	The '``resume``' instruction resumes propagation of an existing
				6599	(in-flight) exception whose unwinding was interrupted with a
				6600	:ref:`landingpad <i_landingpad>` instruction.
				6601
				6602	Example:
				6603	""""""""
				6604
				6605	.. code-block:: llvm
				6606
				6607	resume { i8*, i32 } %exn
				6608
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6609	.. _i_catchswitch:
				6610
				6611	'``catchswitch``' Instruction
Akira Hatanaka	cedf8e9	2015-12-14 05:15:40 +0000	[diff] [blame]	6612	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6613
				6614	Syntax:
				6615	"""""""
				6616
				6617	::
				6618
				6619	<resultval> = catchswitch within <parent> [ label <handler1>, label <handler2>, ... ] unwind to caller
				6620	<resultval> = catchswitch within <parent> [ label <handler1>, label <handler2>, ... ] unwind label <default>
				6621
				6622	Overview:
				6623	"""""""""
				6624
				6625	The '``catchswitch``' instruction is used by `LLVM's exception handling system
				6626	<ExceptionHandling.html#overview>`_ to describe the set of possible catch handlers
				6627	that may be executed by the :ref:`EH personality routine <personalityfn>`.
				6628
				6629	Arguments:
				6630	""""""""""
				6631
				6632	The ``parent`` argument is the token of the funclet that contains the
				6633	``catchswitch`` instruction. If the ``catchswitch`` is not inside a funclet,
				6634	this operand may be the token ``none``.
				6635
Joseph Tremoulet	e28885e	2016-01-10 04:28:38 +0000	[diff] [blame]	6636	The ``default`` argument is the label of another basic block beginning with
				6637	either a ``cleanuppad`` or ``catchswitch`` instruction. This unwind destination
				6638	must be a legal target with respect to the ``parent`` links, as described in
				6639	the `exception handling documentation\ <ExceptionHandling.html#wineh-constraints>`_.
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6640
Joseph Tremoulet	e28885e	2016-01-10 04:28:38 +0000	[diff] [blame]	6641	The ``handlers`` are a nonempty list of successor blocks that each begin with a
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6642	:ref:`catchpad <i_catchpad>` instruction.
				6643
				6644	Semantics:
				6645	""""""""""
				6646
				6647	Executing this instruction transfers control to one of the successors in
				6648	``handlers``, if appropriate, or continues to unwind via the unwind label if
				6649	present.
				6650
				6651	The ``catchswitch`` is both a terminator and a "pad" instruction, meaning that
				6652	it must be both the first non-phi instruction and last instruction in the basic
				6653	block. Therefore, it must be the only non-phi instruction in the block.
				6654
				6655	Example:
				6656	""""""""
				6657
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	6658	.. code-block:: text
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6659
				6660	dispatch1:
				6661	%cs1 = catchswitch within none [label %handler0, label %handler1] unwind to caller
				6662	dispatch2:
				6663	%cs2 = catchswitch within %parenthandler [label %handler0] unwind label %cleanup
				6664
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6665	.. _i_catchret:
				6666
				6667	'``catchret``' Instruction
				6668	^^^^^^^^^^^^^^^^^^^^^^^^^^
				6669
				6670	Syntax:
				6671	"""""""
				6672
				6673	::
				6674
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6675	catchret from <token> to label <normal>
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6676
				6677	Overview:
				6678	"""""""""
				6679
				6680	The '``catchret``' instruction is a terminator instruction that has a
				6681	single successor.
				6682
				6683
				6684	Arguments:
				6685	""""""""""
				6686
Joseph Tremoulet	8220bcc	2015-08-23 00:26:33 +0000	[diff] [blame]	6687	The first argument to a '``catchret``' indicates which ``catchpad`` it
				6688	exits. It must be a :ref:`catchpad <i_catchpad>`.
				6689	The second argument to a '``catchret``' specifies where control will
				6690	transfer to next.
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6691
				6692	Semantics:
				6693	""""""""""
				6694
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6695	The '``catchret``' instruction ends an existing (in-flight) exception whose
				6696	unwinding was interrupted with a :ref:`catchpad <i_catchpad>` instruction. The
				6697	:ref:`personality function <personalityfn>` gets a chance to execute arbitrary
				6698	code to, for example, destroy the active exception. Control then transfers to
				6699	``normal``.
Joseph Tremoulet	9ce71f7	2015-09-03 09:09:43 +0000	[diff] [blame]	6700
Joseph Tremoulet	e28885e	2016-01-10 04:28:38 +0000	[diff] [blame]	6701	The ``token`` argument must be a token produced by a ``catchpad`` instruction.
				6702	If the specified ``catchpad`` is not the most-recently-entered not-yet-exited
				6703	funclet pad (as described in the `EH documentation\ <ExceptionHandling.html#wineh-constraints>`_),
				6704	the ``catchret``'s behavior is undefined.
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6705
				6706	Example:
				6707	""""""""
				6708
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	6709	.. code-block:: text
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6710
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6711	catchret from %catch to label %continue
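
As a minimal sketch of how ``catchswitch``, ``catchpad`` and ``catchret`` fit
together, the function below catches whatever a call throws and then resumes
normal execution. The names ``@f`` and ``@may_throw``, the choice of
``@__CxxFrameHandler3`` as the personality, and the ``catchpad`` arguments are
assumptions made for illustration. The typed-pointer spelling is also an
assumption; releases that use opaque pointers would write ``ptr`` instead.

.. code-block:: llvm

   declare void @may_throw()
   declare i32 @__CxxFrameHandler3(...)

   define void @f() personality i8* bitcast (i32 (...)* @__CxxFrameHandler3 to i8*) {
   entry:
     invoke void @may_throw()
             to label %continue unwind label %dispatch

   dispatch:
     ; The catchswitch is the only non-phi instruction in its block.
     %cs = catchswitch within none [label %handler] unwind to caller

   handler:
     ; catchpad arguments are personality-specific; these values are assumed.
     %catch = catchpad within %cs [i8* null, i32 64, i8* null]
     catchret from %catch to label %continue

   continue:
     ret void
   }

The ``catchret`` names the token produced by the ``catchpad`` it exits, and its
successor is an ordinary block outside the funclet.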
Joseph Tremoulet	9ce71f7	2015-09-03 09:09:43 +0000	[diff] [blame]	6712
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6713	.. _i_cleanupret:
				6714
				6715	'``cleanupret``' Instruction
				6716	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				6717
				6718	Syntax:
				6719	"""""""
				6720
				6721	::
				6722
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6723	cleanupret from <value> unwind label <continue>
				6724	cleanupret from <value> unwind to caller
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6725
				6726	Overview:
				6727	"""""""""
				6728
				6729	The '``cleanupret``' instruction is a terminator instruction that has
				6730	an optional successor.
				6731
				6732
				6733	Arguments:
				6734	""""""""""
				6735
Joseph Tremoulet	8220bcc	2015-08-23 00:26:33 +0000	[diff] [blame]	6736	The '``cleanupret``' instruction requires one argument, which indicates
				6737	which ``cleanuppad`` it exits, and must be a :ref:`cleanuppad <i_cleanuppad>`.
Joseph Tremoulet	e28885e	2016-01-10 04:28:38 +0000	[diff] [blame]	6738	If the specified ``cleanuppad`` is not the most-recently-entered not-yet-exited
				6739	funclet pad (as described in the `EH documentation\ <ExceptionHandling.html#wineh-constraints>`_),
				6740	the ``cleanupret``'s behavior is undefined.
				6741
				6742	The '``cleanupret``' instruction also has an optional successor, ``continue``,
				6743	which must be the label of another basic block beginning with either a
				6744	``cleanuppad`` or ``catchswitch`` instruction. This unwind destination must
				6745	be a legal target with respect to the ``parent`` links, as described in the
				6746	`exception handling documentation\ <ExceptionHandling.html#wineh-constraints>`_.
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6747
				6748	Semantics:
				6749	""""""""""
				6750
				6751	The '``cleanupret``' instruction indicates to the
				6752	:ref:`personality function <personalityfn>` that one
				6753	:ref:`cleanuppad <i_cleanuppad>` it transferred control to has ended.
				6754	It transfers control to ``continue`` or unwinds out of the function.
Joseph Tremoulet	9ce71f7	2015-09-03 09:09:43 +0000	[diff] [blame]	6755
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6756	Example:
				6757	""""""""
				6758
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	6759	.. code-block:: text
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6760
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	6761	cleanupret from %cleanup unwind to caller
				6762	cleanupret from %cleanup unwind label %continue
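
A minimal sketch of a cleanup funclet that ends with ``cleanupret`` might look
like the following. The names ``@g``, ``@may_throw`` and ``@release`` and the
choice of ``@__CxxFrameHandler3`` as the personality are hypothetical, and the
typed-pointer spelling assumes a release that predates opaque pointers.

.. code-block:: llvm

   declare void @may_throw()
   declare void @release()
   declare i32 @__CxxFrameHandler3(...)

   define void @g() personality i8* bitcast (i32 (...)* @__CxxFrameHandler3 to i8*) {
   entry:
     invoke void @may_throw()
             to label %done unwind label %cleanup

   cleanup:
     %cp = cleanuppad within none []
     ; Calls inside the funclet carry a "funclet" bundle naming its pad.
     call void @release() [ "funclet"(token %cp) ]
     ; No enclosing handler here, so unwinding continues into the caller.
     cleanupret from %cp unwind to caller

   done:
     ret void
   }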
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	6763
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6764	.. _i_unreachable:
				6765
				6766	'``unreachable``' Instruction
				6767	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				6768
				6769	Syntax:
				6770	"""""""
				6771
				6772	::
				6773
				6774	unreachable
				6775
				6776	Overview:
				6777	"""""""""
				6778
				6779	The '``unreachable``' instruction has no defined semantics. This
				6780	instruction is used to inform the optimizer that a particular portion of
				6781	the code is not reachable. This can be used to indicate that the code
				6782	after a no-return function cannot be reached, and other facts.
				6783
				6784	Semantics:
				6785	""""""""""
				6786
				6787	The '``unreachable``' instruction has no defined semantics.
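
As an illustration of the no-return case mentioned in the overview, the sketch
below places ``unreachable`` after a call that never returns. The function name
``@checked`` is hypothetical, and ``@abort`` is assumed to be declared
``noreturn``.

.. code-block:: llvm

   declare void @abort() noreturn

   define i32 @checked(i32 %x) {
   entry:
     %bad = icmp slt i32 %x, 0
     br i1 %bad, label %fail, label %ok

   fail:
     call void @abort()
     ; Every block needs a terminator; control never gets past the call.
     unreachable

   ok:
     ret i32 %x
   }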
				6788
				6789	.. _binaryops:
				6790
				6791	Binary Operations
				6792	-----------------
				6793
				6794	Binary operators are used to do most of the computation in a program.
				6795	They require two operands of the same type, execute an operation on
				6796	them, and produce a single value. The operands might represent multiple
				6797	data, as is the case with the :ref:`vector <t_vector>` data type. The
				6798	result value has the same type as its operands.
				6799
				6800	There are several different binary operators:
				6801
				6802	.. _i_add:
				6803
				6804	'``add``' Instruction
				6805	^^^^^^^^^^^^^^^^^^^^^
				6806
				6807	Syntax:
				6808	"""""""
				6809
				6810	::
				6811
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6812	<result> = add <ty> <op1>, <op2> ; yields ty:result
				6813	<result> = add nuw <ty> <op1>, <op2> ; yields ty:result
				6814	<result> = add nsw <ty> <op1>, <op2> ; yields ty:result
				6815	<result> = add nuw nsw <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6816
				6817	Overview:
				6818	"""""""""
				6819
				6820	The '``add``' instruction returns the sum of its two operands.
				6821
				6822	Arguments:
				6823	""""""""""
				6824
				6825	The two arguments to the '``add``' instruction must be
				6826	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				6827	arguments must have identical types.
				6828
				6829	Semantics:
				6830	""""""""""
				6831
				6832	The value produced is the integer sum of the two operands.
				6833
				6834	If the sum has unsigned overflow, the result returned is the
				6835	mathematical result modulo 2\ :sup:`n`\ , where n is the bit width of
				6836	the result.
				6837
				6838	Because LLVM integers use a two's complement representation, this
				6839	instruction is appropriate for both signed and unsigned integers.
				6840
				6841	``nuw`` and ``nsw`` stand for "No Unsigned Wrap" and "No Signed Wrap",
				6842	respectively. If the ``nuw`` and/or ``nsw`` keywords are present, the
				6843	result value of the ``add`` is a :ref:`poison value <poisonvalues>` if
				6844	unsigned and/or signed overflow, respectively, occurs.
				6845
				6846	Example:
				6847	""""""""
				6848
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	6849	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6850
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6851	<result> = add i32 4, %var ; yields i32:result = 4 + %var
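
The difference between a plain ``add`` and the ``nuw``/``nsw`` variants can be
sketched with three small functions. The function names are hypothetical, and
the values in the comments are worked examples for ``i8`` operands.

.. code-block:: llvm

   define i8 @add_wrap(i8 %a, i8 %b) {
     ; Wraps modulo 2^8: unsigned 200 + 100 = 300 yields 44.
     %sum = add i8 %a, %b
     ret i8 %sum
   }

   define i8 @add_nuw(i8 %a, i8 %b) {
     ; Poison if the unsigned sum exceeds 255 (e.g. 200 + 100).
     %sum = add nuw i8 %a, %b
     ret i8 %sum
   }

   define i8 @add_nsw(i8 %a, i8 %b) {
     ; Poison if the signed sum leaves [-128, 127] (e.g. 100 + 100).
     %sum = add nsw i8 %a, %b
     ret i8 %sum
   }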
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6852
				6853	.. _i_fadd:
				6854
				6855	'``fadd``' Instruction
				6856	^^^^^^^^^^^^^^^^^^^^^^
				6857
				6858	Syntax:
				6859	"""""""
				6860
				6861	::
				6862
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6863	<result> = fadd [fast-math flags]* <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6864
				6865	Overview:
				6866	"""""""""
				6867
				6868	The '``fadd``' instruction returns the sum of its two operands.
				6869
				6870	Arguments:
				6871	""""""""""
				6872
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	6873	The two arguments to the '``fadd``' instruction must be
				6874	:ref:`floating-point <t_floating>` or :ref:`vector <t_vector>` of
				6875	floating-point values. Both arguments must have identical types.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6876
				6877	Semantics:
				6878	""""""""""
				6879
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	6880	The value produced is the floating-point sum of the two operands.
Sanjay Patel	ec95e0e	2018-03-20 17:05:19 +0000	[diff] [blame]	6881	This instruction is assumed to execute in the default :ref:`floating-point
				6882	environment <floatenv>`.
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	6883	This instruction can also take any number of :ref:`fast-math
				6884	flags <fastmath>`, which are optimization hints to enable otherwise
				6885	unsafe floating-point optimizations.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6886
				6887	Example:
				6888	""""""""
				6889
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	6890	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6891
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6892	<result> = fadd float 4.0, %var ; yields float:result = 4.0 + %var
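
The following sketch shows where fast-math flags appear in the instruction. The
function names are hypothetical; the flags used (``nnan``, ``ninf`` and
``fast``) are described in the :ref:`fast-math flags <fastmath>` section.

.. code-block:: llvm

   define float @strict_sum(float %a, float %b) {
     ; No flags: the optimizer must preserve the exact floating-point result.
     %s = fadd float %a, %b
     ret float %s
   }

   define float @finite_sum(float %a, float %b) {
     ; The optimizer may assume neither operands nor result are NaN or infinite.
     %s = fadd nnan ninf float %a, %b
     ret float %s
   }

   define float @fast_sum(float %a, float %b) {
     ; 'fast' implies all of the individual fast-math flags.
     %s = fadd fast float %a, %b
     ret float %s
   }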
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6893
				6894	'``sub``' Instruction
				6895	^^^^^^^^^^^^^^^^^^^^^
				6896
				6897	Syntax:
				6898	"""""""
				6899
				6900	::
				6901
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6902	<result> = sub <ty> <op1>, <op2> ; yields ty:result
				6903	<result> = sub nuw <ty> <op1>, <op2> ; yields ty:result
				6904	<result> = sub nsw <ty> <op1>, <op2> ; yields ty:result
				6905	<result> = sub nuw nsw <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6906
				6907	Overview:
				6908	"""""""""
				6909
				6910	The '``sub``' instruction returns the difference of its two operands.
				6911
				6912	Note that the '``sub``' instruction is used to represent the '``neg``'
				6913	instruction present in most other intermediate representations.
				6914
				6915	Arguments:
				6916	""""""""""
				6917
				6918	The two arguments to the '``sub``' instruction must be
				6919	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				6920	arguments must have identical types.
				6921
				6922	Semantics:
				6923	""""""""""
				6924
				6925	The value produced is the integer difference of the two operands.
				6926
				6927	If the difference has unsigned overflow, the result returned is the
				6928	mathematical result modulo 2\ :sup:`n`\ , where n is the bit width of
				6929	the result.
				6930
				6931	Because LLVM integers use a two's complement representation, this
				6932	instruction is appropriate for both signed and unsigned integers.
				6933
				6934	``nuw`` and ``nsw`` stand for "No Unsigned Wrap" and "No Signed Wrap",
				6935	respectively. If the ``nuw`` and/or ``nsw`` keywords are present, the
				6936	result value of the ``sub`` is a :ref:`poison value <poisonvalues>` if
				6937	unsigned and/or signed overflow, respectively, occurs.
				6938
				6939	Example:
				6940	""""""""
				6941
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	6942	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6943
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6944	<result> = sub i32 4, %var ; yields i32:result = 4 - %var
				6945	<result> = sub i32 0, %var ; yields i32:result = -%var
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6946
				6947	.. _i_fsub:
				6948
				6949	'``fsub``' Instruction
				6950	^^^^^^^^^^^^^^^^^^^^^^
				6951
				6952	Syntax:
				6953	"""""""
				6954
				6955	::
				6956
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6957	<result> = fsub [fast-math flags]* <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6958
				6959	Overview:
				6960	"""""""""
				6961
				6962	The '``fsub``' instruction returns the difference of its two operands.
				6963
				6964	Note that the '``fsub``' instruction is used to represent the '``fneg``'
				6965	instruction present in most other intermediate representations.
				6966
				6967	Arguments:
				6968	""""""""""
				6969
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	6970	The two arguments to the '``fsub``' instruction must be
				6971	:ref:`floating-point <t_floating>` or :ref:`vector <t_vector>` of
				6972	floating-point values. Both arguments must have identical types.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6973
				6974	Semantics:
				6975	""""""""""
				6976
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	6977	The value produced is the floating-point difference of the two operands.
Sanjay Patel	ec95e0e	2018-03-20 17:05:19 +0000	[diff] [blame]	6978	This instruction is assumed to execute in the default :ref:`floating-point
				6979	environment <floatenv>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6980	This instruction can also take any number of :ref:`fast-math
				6981	flags <fastmath>`, which are optimization hints to enable otherwise
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	6982	unsafe floating-point optimizations.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6983
				6984	Example:
				6985	""""""""
				6986
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	6987	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6988
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	6989	<result> = fsub float 4.0, %var ; yields float:result = 4.0 - %var
				6990	<result> = fsub float -0.0, %var ; yields float:result = -%var
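
The negation idiom uses ``-0.0`` rather than ``0.0`` as the first operand
because of signed zeros. The sketch below contrasts the two; the function names
are hypothetical, and the comments assume the default rounding mode.

.. code-block:: llvm

   define float @negate(float %x) {
     ; -0.0 - (+0.0) = -0.0 and -0.0 - (-0.0) = +0.0, so the sign always flips.
     %neg = fsub float -0.0, %x
     ret float %neg
   }

   define float @not_quite_negate(float %x) {
     ; 0.0 - (+0.0) = +0.0, so a positive zero input is not negated.
     %r = fsub float 0.0, %x
     ret float %r
   }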
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	6991
				6992	'``mul``' Instruction
				6993	^^^^^^^^^^^^^^^^^^^^^
				6994
				6995	Syntax:
				6996	"""""""
				6997
				6998	::
				6999
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7000	<result> = mul <ty> <op1>, <op2> ; yields ty:result
				7001	<result> = mul nuw <ty> <op1>, <op2> ; yields ty:result
				7002	<result> = mul nsw <ty> <op1>, <op2> ; yields ty:result
				7003	<result> = mul nuw nsw <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7004
				7005	Overview:
				7006	"""""""""
				7007
				7008	The '``mul``' instruction returns the product of its two operands.
				7009
				7010	Arguments:
				7011	""""""""""
				7012
				7013	The two arguments to the '``mul``' instruction must be
				7014	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				7015	arguments must have identical types.
				7016
				7017	Semantics:
				7018	""""""""""
				7019
				7020	The value produced is the integer product of the two operands.
				7021
				7022	If the result of the multiplication has unsigned overflow, the result
				7023	returned is the mathematical result modulo 2\ :sup:`n`\ , where n is the
				7024	bit width of the result.
				7025
				7026	Because LLVM integers use a two's complement representation, and the
				7027	result is the same width as the operands, this instruction returns the
				7028	correct result for both signed and unsigned integers. If a full product
				7029	(e.g. ``i32`` * ``i32`` -> ``i64``) is needed, the operands should be
				7030	sign-extended or zero-extended as appropriate to the width of the full
				7031	product.
				7032
				7033	``nuw`` and ``nsw`` stand for "No Unsigned Wrap" and "No Signed Wrap",
				7034	respectively. If the ``nuw`` and/or ``nsw`` keywords are present, the
				7035	result value of the ``mul`` is a :ref:`poison value <poisonvalues>` if
				7036	unsigned and/or signed overflow, respectively, occurs.
				7037
				7038	Example:
				7039	""""""""
				7040
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7041	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7042
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7043	<result> = mul i32 4, %var ; yields i32:result = 4 * %var
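
The full-product case described in the semantics can be sketched by widening
the operands before multiplying. The function names are hypothetical.

.. code-block:: llvm

   define i64 @smul_full(i32 %a, i32 %b) {
     ; Sign-extend both operands, then multiply at the wider width.
     %a.ext = sext i32 %a to i64
     %b.ext = sext i32 %b to i64
     ; The product of two sign-extended i32 values always fits in i64.
     %prod = mul nsw i64 %a.ext, %b.ext
     ret i64 %prod
   }

   define i64 @umul_full(i32 %a, i32 %b) {
     ; Zero-extend for an unsigned full product.
     %a.ext = zext i32 %a to i64
     %b.ext = zext i32 %b to i64
     ; (2^32 - 1)^2 is below 2^64, so no unsigned wrap can occur.
     %prod = mul nuw i64 %a.ext, %b.ext
     ret i64 %prod
   }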
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7044
				7045	.. _i_fmul:
				7046
				7047	'``fmul``' Instruction
				7048	^^^^^^^^^^^^^^^^^^^^^^
				7049
				7050	Syntax:
				7051	"""""""
				7052
				7053	::
				7054
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7055	<result> = fmul [fast-math flags]* <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7056
				7057	Overview:
				7058	"""""""""
				7059
				7060	The '``fmul``' instruction returns the product of its two operands.
				7061
				7062	Arguments:
				7063	""""""""""
				7064
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	7065	The two arguments to the '``fmul``' instruction must be
				7066	:ref:`floating-point <t_floating>` or :ref:`vector <t_vector>` of
				7067	floating-point values. Both arguments must have identical types.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7068
				7069	Semantics:
				7070	""""""""""
				7071
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	7072	The value produced is the floating-point product of the two operands.
Sanjay Patel	ec95e0e	2018-03-20 17:05:19 +0000	[diff] [blame]	7073	This instruction is assumed to execute in the default :ref:`floating-point
				7074	environment <floatenv>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7075	This instruction can also take any number of :ref:`fast-math
				7076	flags <fastmath>`, which are optimization hints to enable otherwise
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	7077	unsafe floating-point optimizations.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7078
				7079	Example:
				7080	""""""""
				7081
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7082	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7083
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7084	<result> = fmul float 4.0, %var ; yields float:result = 4.0 * %var
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7085
				7086	'``udiv``' Instruction
				7087	^^^^^^^^^^^^^^^^^^^^^^
				7088
				7089	Syntax:
				7090	"""""""
				7091
				7092	::
				7093
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7094	<result> = udiv <ty> <op1>, <op2> ; yields ty:result
				7095	<result> = udiv exact <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7096
				7097	Overview:
				7098	"""""""""
				7099
				7100	The '``udiv``' instruction returns the quotient of its two operands.
				7101
				7102	Arguments:
				7103	""""""""""
				7104
				7105	The two arguments to the '``udiv``' instruction must be
				7106	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				7107	arguments must have identical types.
				7108
				7109	Semantics:
				7110	""""""""""
				7111
				7112	The value produced is the unsigned integer quotient of the two operands.
				7113
				7114	Note that unsigned integer division and signed integer division are
				7115	distinct operations; for signed integer division, use '``sdiv``'.
				7116
Sanjay Patel	2b1f6f4	2017-03-09 16:20:52 +0000	[diff] [blame]	7117	Division by zero is undefined behavior. For vectors, if any element
				7118	of the divisor is zero, the operation has undefined behavior.
				7119
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7120
				7121	If the ``exact`` keyword is present, the result value of the ``udiv`` is
				7122	a :ref:`poison value <poisonvalues>` if ``op1`` is not a multiple of ``op2``
				7123	(as such, ``((a udiv exact b) mul b) == a``).
				7124
				7125	Example:
				7126	""""""""
				7127
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7128	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7129
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7130	<result> = udiv i32 4, %var ; yields i32:result = 4 / %var
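
Because division by zero is undefined behavior, a frontend that cannot prove
the divisor is nonzero typically guards the ``udiv``. The sketch below shows
one such guard and a use of ``exact``. The function names and the choice to
return 0 for a zero divisor are assumptions made for illustration.

.. code-block:: llvm

   define i32 @udiv_or_zero(i32 %n, i32 %d) {
   entry:
     %d.zero = icmp eq i32 %d, 0
     br i1 %d.zero, label %fallback, label %divide

   divide:
     ; Only reached when %d is nonzero, so the division is well defined.
     %q = udiv i32 %n, %d
     ret i32 %q

   fallback:
     ret i32 0
   }

   define i32 @bytes_to_words(i32 %bytes) {
     ; Poison unless %bytes is a multiple of 4.
     %words = udiv exact i32 %bytes, 4
     ret i32 %words
   }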
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7131
				7132	'``sdiv``' Instruction
				7133	^^^^^^^^^^^^^^^^^^^^^^
				7134
				7135	Syntax:
				7136	"""""""
				7137
				7138	::
				7139
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7140	<result> = sdiv <ty> <op1>, <op2> ; yields ty:result
				7141	<result> = sdiv exact <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7142
				7143	Overview:
				7144	"""""""""
				7145
				7146	The '``sdiv``' instruction returns the quotient of its two operands.
				7147
				7148	Arguments:
				7149	""""""""""
				7150
				7151	The two arguments to the '``sdiv``' instruction must be
				7152	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				7153	arguments must have identical types.
				7154
				7155	Semantics:
				7156	""""""""""
				7157
				7158	The value produced is the signed integer quotient of the two operands
				7159	rounded towards zero.
				7160
				7161	Note that signed integer division and unsigned integer division are
				7162	distinct operations; for unsigned integer division, use '``udiv``'.
				7163
Sanjay Patel	2b1f6f4	2017-03-09 16:20:52 +0000	[diff] [blame]	7164	Division by zero is undefined behavior. For vectors, if any element
				7165	of the divisor is zero, the operation has undefined behavior.
				7166	Overflow also leads to undefined behavior; this is a rare case, but can
				7167	occur, for example, by doing a 32-bit division of -2147483648 by -1.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7168
				7169	If the ``exact`` keyword is present, the result value of the ``sdiv`` is
				7170	a :ref:`poison value <poisonvalues>` if the result would be rounded.
				7171
				7172	Example:
				7173	""""""""
				7174
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7175	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7176
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7177	<result> = sdiv i32 4, %var ; yields i32:result = 4 / %var
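
Both undefined cases, a zero divisor and the overflowing division of the
minimum value by -1, can be excluded with a guard before the ``sdiv``. This is
a sketch only; the function name and the fallback value of 0 are hypothetical.

.. code-block:: llvm

   define i32 @sdiv_checked(i32 %n, i32 %d) {
   entry:
     %d.zero = icmp eq i32 %d, 0
     %n.min = icmp eq i32 %n, -2147483648
     %d.m1 = icmp eq i32 %d, -1
     %ovf = and i1 %n.min, %d.m1
     %bad = or i1 %d.zero, %ovf
     br i1 %bad, label %fallback, label %divide

   divide:
     ; Rounds toward zero: -7 divided by 2 yields -3.
     %q = sdiv i32 %n, %d
     ret i32 %q

   fallback:
     ret i32 0
   }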
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7178
				7179	.. _i_fdiv:
				7180
				7181	'``fdiv``' Instruction
				7182	^^^^^^^^^^^^^^^^^^^^^^
				7183
				7184	Syntax:
				7185	"""""""
				7186
				7187	::
				7188
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7189	<result> = fdiv [fast-math flags]* <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7190
				7191	Overview:
				7192	"""""""""
				7193
				7194	The '``fdiv``' instruction returns the quotient of its two operands.
				7195
				7196	Arguments:
				7197	""""""""""
				7198
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	7199	The two arguments to the '``fdiv``' instruction must be
				7200	:ref:`floating-point <t_floating>` or :ref:`vector <t_vector>` of
				7201	floating-point values. Both arguments must have identical types.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7202
				7203	Semantics:
				7204	""""""""""
				7205
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	7206	The value produced is the floating-point quotient of the two operands.
Sanjay Patel	ec95e0e	2018-03-20 17:05:19 +0000	[diff] [blame]	7207	This instruction is assumed to execute in the default :ref:`floating-point
				7208	environment <floatenv>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7209	This instruction can also take any number of :ref:`fast-math
				7210	flags <fastmath>`, which are optimization hints to enable otherwise
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	7211	unsafe floating-point optimizations.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7212
				7213	Example:
				7214	""""""""
				7215
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7216	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7217
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7218	<result> = fdiv float 4.0, %var ; yields float:result = 4.0 / %var
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7219
				7220	'``urem``' Instruction
				7221	^^^^^^^^^^^^^^^^^^^^^^
				7222
				7223	Syntax:
				7224	"""""""
				7225
				7226	::
				7227
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7228	<result> = urem <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7229
				7230	Overview:
				7231	"""""""""
				7232
				7233	The '``urem``' instruction returns the remainder from the unsigned
				7234	division of its two arguments.
				7235
				7236	Arguments:
				7237	""""""""""
				7238
				7239	The two arguments to the '``urem``' instruction must be
				7240	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				7241	arguments must have identical types.
				7242
				7243	Semantics:
				7244	""""""""""
				7245
				7246	This instruction returns the unsigned integer remainder of a division.
				7247	This instruction always performs an unsigned division to get the
				7248	remainder.
				7249
				7250	Note that unsigned integer remainder and signed integer remainder are
				7251	distinct operations; for signed integer remainder, use '``srem``'.
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	7252
Sanjay Patel	2b1f6f4	2017-03-09 16:20:52 +0000	[diff] [blame]	7253	Taking the remainder of a division by zero is undefined behavior.
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	7254	For vectors, if any element of the divisor is zero, the operation has
Sanjay Patel	2b1f6f4	2017-03-09 16:20:52 +0000	[diff] [blame]	7255	undefined behavior.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7256
				7257	Example:
				7258	""""""""
				7259
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7260	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7261
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7262	<result> = urem i32 4, %var ; yields i32:result = 4 % %var
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7263
				7264	'``srem``' Instruction
				7265	^^^^^^^^^^^^^^^^^^^^^^
				7266
				7267	Syntax:
				7268	"""""""
				7269
				7270	::
				7271
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7272	<result> = srem <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7273
				7274	Overview:
				7275	"""""""""
				7276
				7277	The '``srem``' instruction returns the remainder from the signed
				7278	division of its two operands. This instruction can also take
				7279	:ref:`vector <t_vector>` versions of the values in which case the elements
				7280	must be integers.
				7281
				7282	Arguments:
				7283	""""""""""
				7284
				7285	The two arguments to the '``srem``' instruction must be
				7286	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				7287	arguments must have identical types.
				7288
				7289	Semantics:
				7290	""""""""""
				7291
				7292	This instruction returns the remainder of a division (where the result
				7293	is either zero or has the same sign as the dividend, ``op1``), not the
				7294	result of a modulo operation (where the result is either zero or has the
				7295	same sign as the divisor, ``op2``). For more information about the
				7296	difference, see `The Math
				7297	Forum <http://mathforum.org/dr.math/problems/anne.4.28.99.html>`_. For a
				7298	table of how this is implemented in various languages, please see
				7299	`Wikipedia: modulo
				7300	operation <http://en.wikipedia.org/wiki/Modulo_operation>`_.
				7301
				7302	Note that signed integer remainder and unsigned integer remainder are
				7303	distinct operations; for unsigned integer remainder, use '``urem``'.
				7304
Sanjay Patel	2b1f6f4	2017-03-09 16:20:52 +0000	[diff] [blame]	7305	Taking the remainder of a division by zero is undefined behavior.
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	7306	For vectors, if any element of the divisor is zero, the operation has
Sanjay Patel	2b1f6f4	2017-03-09 16:20:52 +0000	[diff] [blame]	7307	undefined behavior.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7308	Overflow also leads to undefined behavior; this is a rare case, but can
				7309	occur, for example, by taking the remainder of a 32-bit division of
				7310	-2147483648 by -1. (The remainder doesn't actually overflow, but this
				7311	rule lets srem be implemented using instructions that return both the
				7312	result of the division and the remainder.)
				7313
				7314	Example:
				7315	""""""""
				7316
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7317	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7318
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7319	<result> = srem i32 4, %var ; yields i32:result = 4 % %var
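
To illustrate the difference between remainder and modulo, the sketch below
derives a floored modulo (result takes the sign of the divisor) from ``srem``.
The function name ``@floor_mod`` is hypothetical, and the code assumes the
caller has already excluded a zero divisor and the -2147483648 by -1 case.

.. code-block:: llvm

   define i32 @floor_mod(i32 %a, i32 %b) {
     ; srem: -7 srem 3 yields -1, and 7 srem -3 yields 1.
     %r = srem i32 %a, %b
     %r.nonzero = icmp ne i32 %r, 0
     ; The xor is negative exactly when %r and %b have different signs.
     %x = xor i32 %r, %b
     %signs.differ = icmp slt i32 %x, 0
     %fix = and i1 %r.nonzero, %signs.differ
     ; Shift the remainder by one divisor when a correction is needed.
     %adj = add i32 %r, %b
     %m = select i1 %fix, i32 %adj, i32 %r
     ret i32 %m
   }

With these definitions, -7 and 3 give 2, while 7 and -3 give -2; when the
operands have the same sign the result equals the ``srem`` result.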
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7320
				7321	.. _i_frem:
				7322
				7323	'``frem``' Instruction
				7324	^^^^^^^^^^^^^^^^^^^^^^
				7325
				7326	Syntax:
				7327	"""""""
				7328
				7329	::
				7330
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7331	<result> = frem [fast-math flags]* <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7332
				7333	Overview:
				7334	"""""""""
				7335
				7336	The '``frem``' instruction returns the remainder from the division of
				7337	its two operands.
				7338
				7339	Arguments:
				7340	""""""""""
				7341
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	7342	The two arguments to the '``frem``' instruction must be
				7343	:ref:`floating-point <t_floating>` or :ref:`vector <t_vector>` of
				7344	floating-point values. Both arguments must have identical types.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7345
				7346	Semantics:
				7347	""""""""""
				7348
Sanjay Patel	7b72240	2018-03-07 17:18:22 +0000	[diff] [blame]	7349	The value produced is the floating-point remainder of the two operands.
				7350	This is the same output as a libm '``fmod``' function, but without any
				7351	possibility of setting ``errno``. The remainder has the same sign as the
				7352	dividend.
Sanjay Patel	ec95e0e	2018-03-20 17:05:19 +0000	[diff] [blame]	7353	This instruction is assumed to execute in the default :ref:`floating-point
				7354	environment <floatenv>`.
This instruction can also take any number of :ref:`fast-math
flags <fastmath>`, which are optimization hints to enable otherwise
unsafe floating-point optimizations.
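
As a minimal sketch of the sign rule (the function name
``@frem_sign_example`` and the constants are assumptions chosen for
illustration), the remainder below is negative because the dividend is
negative:

.. code-block:: llvm

      ; Hypothetical example function, not part of the reference.
      define double @frem_sign_example() {
        %r = frem double -7.0, 3.0    ; -1.0: the sign follows the dividend (-7.0)
        ret double %r
      }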
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7358
				7359	Example:
				7360	""""""""
				7361
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7362	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7363
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7364	<result> = frem float 4.0, %var ; yields float:result = 4.0 % %var
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7365
				7366	.. _bitwiseops:
				7367
				7368	Bitwise Binary Operations
				7369	-------------------------
				7370
				7371	Bitwise binary operators are used to do various forms of bit-twiddling
				7372	in a program. They are generally very efficient instructions and can
				7373	commonly be strength reduced from other instructions. They require two
				7374	operands of the same type, execute an operation on them, and produce a
				7375	single value. The resulting value is the same type as its operands.
				7376
				7377	'``shl``' Instruction
				7378	^^^^^^^^^^^^^^^^^^^^^
				7379
				7380	Syntax:
				7381	"""""""
				7382
				7383	::
				7384
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7385	<result> = shl <ty> <op1>, <op2> ; yields ty:result
				7386	<result> = shl nuw <ty> <op1>, <op2> ; yields ty:result
				7387	<result> = shl nsw <ty> <op1>, <op2> ; yields ty:result
				7388	<result> = shl nuw nsw <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7389
				7390	Overview:
				7391	"""""""""
				7392
				7393	The '``shl``' instruction returns the first operand shifted to the left
				7394	a specified number of bits.
				7395
				7396	Arguments:
				7397	""""""""""
				7398
				7399	Both arguments to the '``shl``' instruction must be the same
				7400	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer type.
				7401	'``op2``' is treated as an unsigned value.
				7402
				7403	Semantics:
				7404	""""""""""
				7405
				7406	The value produced is ``op1`` \* 2\ :sup:`op2` mod 2\ :sup:`n`,
				7407	where ``n`` is the width of the result. If ``op2`` is (statically or
Sean Silva	b8a108c	2015-04-17 21:58:55 +0000	[diff] [blame]	7408	dynamically) equal to or larger than the number of bits in
Nuno Lopes	b2781fb	2017-06-06 08:28:17 +0000	[diff] [blame]	7409	``op1``, this instruction returns a :ref:`poison value <poisonvalues>`.
				7410	If the arguments are vectors, each vector element of ``op1`` is shifted
				7411	by the corresponding shift amount in ``op2``.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7412
Nuno Lopes	b2781fb	2017-06-06 08:28:17 +0000	[diff] [blame]	7413	If the ``nuw`` keyword is present, then the shift produces a poison
				7414	value if it shifts out any non-zero bits.
				7415	If the ``nsw`` keyword is present, then the shift produces a poison
Sanjay Patel	2896c77	2018-06-01 15:21:14 +0000	[diff] [blame]	7416	value if it shifts out any bits that disagree with the resultant sign bit.
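
The following sketch illustrates the ``nuw`` and ``nsw`` rules on
``i8`` constants. The function name ``@shl_flags_example`` and the
operand values are assumptions chosen for illustration:

.. code-block:: llvm

      ; Hypothetical example function, not part of the reference.
      define i8 @shl_flags_example() {
        %a = shl nuw nsw i8 1, 3    ; 8: only zero bits are shifted out
        %b = shl nuw i8 -128, 1     ; poison: the set top bit (0x80) is shifted out
        %c = shl nsw i8 64, 1       ; poison: shifted-out bit 0 disagrees with the resulting sign bit 1
        %d = shl nsw i8 -64, 1      ; -128: shifted-out bit 1 agrees with the resulting sign bit 1
        ret i8 %a
      }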
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7417
				7418	Example:
				7419	""""""""
				7420
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7421	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7422
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7423	<result> = shl i32 4, %var ; yields i32: 4 << %var
				7424	<result> = shl i32 4, 2 ; yields i32: 16
				7425	<result> = shl i32 1, 10 ; yields i32: 1024
      <result> = shl i32 1, 32 ; poison value: the shift amount equals the bit width
				7427	<result> = shl <2 x i32> < i32 1, i32 1>, < i32 1, i32 2> ; yields: result=<2 x i32> < i32 2, i32 4>
				7428
				7429	'``lshr``' Instruction
				7430	^^^^^^^^^^^^^^^^^^^^^^
				7431
				7432	Syntax:
				7433	"""""""
				7434
				7435	::
				7436
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7437	<result> = lshr <ty> <op1>, <op2> ; yields ty:result
				7438	<result> = lshr exact <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7439
				7440	Overview:
				7441	"""""""""
				7442
				7443	The '``lshr``' instruction (logical shift right) returns the first
				7444	operand shifted to the right a specified number of bits with zero fill.
				7445
				7446	Arguments:
				7447	""""""""""
				7448
				7449	Both arguments to the '``lshr``' instruction must be the same
				7450	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer type.
				7451	'``op2``' is treated as an unsigned value.
				7452
				7453	Semantics:
				7454	""""""""""
				7455
				7456	This instruction always performs a logical shift right operation. The
				7457	most significant bits of the result will be filled with zero bits after
				7458	the shift. If ``op2`` is (statically or dynamically) equal to or larger
Nuno Lopes	b2781fb	2017-06-06 08:28:17 +0000	[diff] [blame]	7459	than the number of bits in ``op1``, this instruction returns a :ref:`poison
				7460	value <poisonvalues>`. If the arguments are vectors, each vector element
				7461	of ``op1`` is shifted by the corresponding shift amount in ``op2``.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7462
				7463	If the ``exact`` keyword is present, the result value of the ``lshr`` is
Nuno Lopes	b2781fb	2017-06-06 08:28:17 +0000	[diff] [blame]	7464	a poison value if any of the bits shifted out are non-zero.
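
A short sketch of the ``exact`` rule, using ``i8`` constants (the
function name ``@lshr_exact_example`` and the values are assumptions
chosen for illustration):

.. code-block:: llvm

      ; Hypothetical example function, not part of the reference.
      define i8 @lshr_exact_example() {
        %a = lshr exact i8 8, 2    ; 2: both shifted-out bits are zero
        %b = lshr i8 9, 1          ; 4: without exact, the shifted-out 1 bit is simply dropped
        %c = lshr exact i8 9, 1    ; poison: a non-zero bit is shifted out
        ret i8 %a
      }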
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7465
				7466	Example:
				7467	""""""""
				7468
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7469	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7470
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7471	<result> = lshr i32 4, 1 ; yields i32:result = 2
				7472	<result> = lshr i32 4, 2 ; yields i32:result = 1
				7473	<result> = lshr i8 4, 3 ; yields i8:result = 0
				7474	<result> = lshr i8 -2, 1 ; yields i8:result = 0x7F
      <result> = lshr i32 1, 32 ; poison value: the shift amount equals the bit width
				7476	<result> = lshr <2 x i32> < i32 -2, i32 4>, < i32 1, i32 2> ; yields: result=<2 x i32> < i32 0x7FFFFFFF, i32 1>
				7477
				7478	'``ashr``' Instruction
				7479	^^^^^^^^^^^^^^^^^^^^^^
				7480
				7481	Syntax:
				7482	"""""""
				7483
				7484	::
				7485
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7486	<result> = ashr <ty> <op1>, <op2> ; yields ty:result
				7487	<result> = ashr exact <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7488
				7489	Overview:
				7490	"""""""""
				7491
				7492	The '``ashr``' instruction (arithmetic shift right) returns the first
				7493	operand shifted to the right a specified number of bits with sign
				7494	extension.
				7495
				7496	Arguments:
				7497	""""""""""
				7498
				7499	Both arguments to the '``ashr``' instruction must be the same
				7500	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer type.
				7501	'``op2``' is treated as an unsigned value.
				7502
				7503	Semantics:
				7504	""""""""""
				7505
This instruction always performs an arithmetic shift right operation.
				7507	The most significant bits of the result will be filled with the sign bit
				7508	of ``op1``. If ``op2`` is (statically or dynamically) equal to or larger
Nuno Lopes	b2781fb	2017-06-06 08:28:17 +0000	[diff] [blame]	7509	than the number of bits in ``op1``, this instruction returns a :ref:`poison
				7510	value <poisonvalues>`. If the arguments are vectors, each vector element
				7511	of ``op1`` is shifted by the corresponding shift amount in ``op2``.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7512
				7513	If the ``exact`` keyword is present, the result value of the ``ashr`` is
Nuno Lopes	b2781fb	2017-06-06 08:28:17 +0000	[diff] [blame]	7514	a poison value if any of the bits shifted out are non-zero.
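
The sketch below contrasts plain and ``exact`` arithmetic shifts on
negative ``i8`` values. The function name ``@ashr_exact_example`` and
the constants are assumptions chosen for illustration:

.. code-block:: llvm

      ; Hypothetical example function, not part of the reference.
      define i8 @ashr_exact_example() {
        %a = ashr i8 -7, 1          ; -4: the sign bit is replicated, rounding toward negative infinity
        %b = ashr exact i8 -8, 2    ; -2: both shifted-out bits are zero
        %c = ashr exact i8 -7, 1    ; poison: the shifted-out low bit is 1
        ret i8 %a
      }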
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7515
				7516	Example:
				7517	""""""""
				7518
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7519	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7520
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7521	<result> = ashr i32 4, 1 ; yields i32:result = 2
				7522	<result> = ashr i32 4, 2 ; yields i32:result = 1
				7523	<result> = ashr i8 4, 3 ; yields i8:result = 0
				7524	<result> = ashr i8 -2, 1 ; yields i8:result = -1
      <result> = ashr i32 1, 32 ; poison value: the shift amount equals the bit width
				7526	<result> = ashr <2 x i32> < i32 -2, i32 4>, < i32 1, i32 3> ; yields: result=<2 x i32> < i32 -1, i32 0>
				7527
				7528	'``and``' Instruction
				7529	^^^^^^^^^^^^^^^^^^^^^
				7530
				7531	Syntax:
				7532	"""""""
				7533
				7534	::
				7535
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7536	<result> = and <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7537
				7538	Overview:
				7539	"""""""""
				7540
				7541	The '``and``' instruction returns the bitwise logical and of its two
				7542	operands.
				7543
				7544	Arguments:
				7545	""""""""""
				7546
				7547	The two arguments to the '``and``' instruction must be
				7548	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				7549	arguments must have identical types.
				7550
				7551	Semantics:
				7552	""""""""""
				7553
				7554	The truth table used for the '``and``' instruction is:
				7555
+-----+-----+-----+
| In0 | In1 | Out |
+-----+-----+-----+
|   0 |   0 |   0 |
+-----+-----+-----+
|   0 |   1 |   0 |
+-----+-----+-----+
|   1 |   0 |   0 |
+-----+-----+-----+
|   1 |   1 |   1 |
+-----+-----+-----+
				7567
				7568	Example:
				7569	""""""""
				7570
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7571	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7572
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7573	<result> = and i32 4, %var ; yields i32:result = 4 & %var
				7574	<result> = and i32 15, 40 ; yields i32:result = 8
				7575	<result> = and i32 4, 8 ; yields i32:result = 0
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7576
				7577	'``or``' Instruction
				7578	^^^^^^^^^^^^^^^^^^^^
				7579
				7580	Syntax:
				7581	"""""""
				7582
				7583	::
				7584
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7585	<result> = or <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7586
				7587	Overview:
				7588	"""""""""
				7589
				7590	The '``or``' instruction returns the bitwise logical inclusive or of its
				7591	two operands.
				7592
				7593	Arguments:
				7594	""""""""""
				7595
				7596	The two arguments to the '``or``' instruction must be
				7597	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				7598	arguments must have identical types.
				7599
				7600	Semantics:
				7601	""""""""""
				7602
				7603	The truth table used for the '``or``' instruction is:
				7604
+-----+-----+-----+
| In0 | In1 | Out |
+-----+-----+-----+
|   0 |   0 |   0 |
+-----+-----+-----+
|   0 |   1 |   1 |
+-----+-----+-----+
|   1 |   0 |   1 |
+-----+-----+-----+
|   1 |   1 |   1 |
+-----+-----+-----+
				7616
				7617	Example:
				7618	""""""""
				7619
.. code-block:: text

      <result> = or i32 4, %var         ; yields i32:result = 4 | %var
      <result> = or i32 15, 40          ; yields i32:result = 47
      <result> = or i32 4, 8            ; yields i32:result = 12
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7625
				7626	'``xor``' Instruction
				7627	^^^^^^^^^^^^^^^^^^^^^
				7628
				7629	Syntax:
				7630	"""""""
				7631
				7632	::
				7633
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7634	<result> = xor <ty> <op1>, <op2> ; yields ty:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7635
				7636	Overview:
				7637	"""""""""
				7638
				7639	The '``xor``' instruction returns the bitwise logical exclusive or of
				7640	its two operands. The ``xor`` is used to implement the "one's
				7641	complement" operation, which is the "~" operator in C.
				7642
				7643	Arguments:
				7644	""""""""""
				7645
				7646	The two arguments to the '``xor``' instruction must be
				7647	:ref:`integer <t_integer>` or :ref:`vector <t_vector>` of integer values. Both
				7648	arguments must have identical types.
				7649
				7650	Semantics:
				7651	""""""""""
				7652
				7653	The truth table used for the '``xor``' instruction is:
				7654
+-----+-----+-----+
| In0 | In1 | Out |
+-----+-----+-----+
|   0 |   0 |   0 |
+-----+-----+-----+
|   0 |   1 |   1 |
+-----+-----+-----+
|   1 |   0 |   1 |
+-----+-----+-----+
|   1 |   1 |   0 |
+-----+-----+-----+
				7666
				7667	Example:
				7668	""""""""
				7669
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7670	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7671
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7672	<result> = xor i32 4, %var ; yields i32:result = 4 ^ %var
				7673	<result> = xor i32 15, 40 ; yields i32:result = 39
				7674	<result> = xor i32 4, 8 ; yields i32:result = 12
				7675	<result> = xor i32 %V, -1 ; yields i32:result = ~%V
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7676
				7677	Vector Operations
				7678	-----------------
				7679
				7680	LLVM supports several instructions to represent vector operations in a
				7681	target-independent manner. These instructions cover the element-access
				7682	and vector-specific operations needed to process vectors effectively.
				7683	While LLVM does directly support these vector operations, many
				7684	sophisticated algorithms will want to use target-specific intrinsics to
				7685	take full advantage of a specific target.
				7686
				7687	.. _i_extractelement:
				7688
				7689	'``extractelement``' Instruction
				7690	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				7691
				7692	Syntax:
				7693	"""""""
				7694
				7695	::
				7696
Michael J. Spencer	1f10c5ea	2014-05-01 22:12:39 +0000	[diff] [blame]	7697	<result> = extractelement <n x <ty>> <val>, <ty2> <idx> ; yields <ty>
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7698
				7699	Overview:
				7700	"""""""""
				7701
				7702	The '``extractelement``' instruction extracts a single scalar element
				7703	from a vector at a specified index.
				7704
				7705	Arguments:
				7706	""""""""""
				7707
				7708	The first operand of an '``extractelement``' instruction is a value of
				7709	:ref:`vector <t_vector>` type. The second operand is an index indicating
				7710	the position from which to extract the element. The index may be a
Michael J. Spencer	1f10c5ea	2014-05-01 22:12:39 +0000	[diff] [blame]	7711	variable of any integer type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7712
				7713	Semantics:
				7714	""""""""""
				7715
				7716	The result is a scalar of the same type as the element type of ``val``.
Its value is the value at position ``idx`` of ``val``. If ``idx`` is
equal to or greater than the length of ``val``, the result is a
:ref:`poison value <poisonvalues>`.
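
Because the index may be a variable of any integer type, a lane can be
selected at run time. The sketch below assumes a hypothetical function
``@get_lane`` that takes an ``i64`` index:

.. code-block:: llvm

      ; Hypothetical example function, not part of the reference.
      define i32 @get_lane(<4 x i32> %vec, i64 %idx) {
        ; The result is poison if %idx is not a valid index into %vec.
        %elt = extractelement <4 x i32> %vec, i64 %idx
        ret i32 %elt
      }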
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7720
				7721	Example:
				7722	""""""""
				7723
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7724	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7725
				7726	<result> = extractelement <4 x i32> %vec, i32 0 ; yields i32
				7727
				7728	.. _i_insertelement:
				7729
				7730	'``insertelement``' Instruction
				7731	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				7732
				7733	Syntax:
				7734	"""""""
				7735
				7736	::
				7737
Michael J. Spencer	1f10c5ea	2014-05-01 22:12:39 +0000	[diff] [blame]	7738	<result> = insertelement <n x <ty>> <val>, <ty> <elt>, <ty2> <idx> ; yields <n x <ty>>
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7739
				7740	Overview:
				7741	"""""""""
				7742
				7743	The '``insertelement``' instruction inserts a scalar element into a
				7744	vector at a specified index.
				7745
				7746	Arguments:
				7747	""""""""""
				7748
				7749	The first operand of an '``insertelement``' instruction is a value of
				7750	:ref:`vector <t_vector>` type. The second operand is a scalar value whose
				7751	type must equal the element type of the first operand. The third operand
				7752	is an index indicating the position at which to insert the value. The
Michael J. Spencer	1f10c5ea	2014-05-01 22:12:39 +0000	[diff] [blame]	7753	index may be a variable of any integer type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7754
				7755	Semantics:
				7756	""""""""""
				7757
				7758	The result is a vector of the same type as ``val``. Its element values
				7759	are those of ``val`` except at position ``idx``, where it gets the value
``elt``. If ``idx`` is equal to or greater than the length of ``val``,
the result is a :ref:`poison value <poisonvalues>`.
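
One common use is building a vector from scalars by chaining
``insertelement`` instructions, starting from ``undef``. The sketch
below assumes a hypothetical function ``@make_pair``:

.. code-block:: llvm

      ; Hypothetical example function, not part of the reference.
      define <2 x float> @make_pair(float %a, float %b) {
        %v0 = insertelement <2 x float> undef, float %a, i32 0   ; <%a, undef>
        %v1 = insertelement <2 x float> %v0, float %b, i32 1     ; <%a, %b>
        ret <2 x float> %v1
      }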
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7762
				7763	Example:
				7764	""""""""
				7765
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7766	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7767
				7768	<result> = insertelement <4 x i32> %vec, i32 1, i32 0 ; yields <4 x i32>
				7769
				7770	.. _i_shufflevector:
				7771
				7772	'``shufflevector``' Instruction
				7773	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				7774
				7775	Syntax:
				7776	"""""""
				7777
				7778	::
				7779
				7780	<result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> ; yields <m x <ty>>
				7781
				7782	Overview:
				7783	"""""""""
				7784
				7785	The '``shufflevector``' instruction constructs a permutation of elements
				7786	from two input vectors, returning a vector with the same element type as
				7787	the input and length that is the same as the shuffle mask.
				7788
				7789	Arguments:
				7790	""""""""""
				7791
				7792	The first two operands of a '``shufflevector``' instruction are vectors
				7793	with the same type. The third argument is a shuffle mask whose element
				7794	type is always 'i32'. The result of the instruction is a vector whose
				7795	length is the same as the shuffle mask and whose element type is the
				7796	same as the element type of the first two operands.
				7797
				7798	The shuffle mask operand is required to be a constant vector with either
				7799	constant integer or undef values.
				7800
				7801	Semantics:
				7802	""""""""""
				7803
				7804	The elements of the two input vectors are numbered from left to right
				7805	across both of the vectors. The shuffle mask operand specifies, for each
				7806	element of the result vector, which element of the two input vectors the
Sanjay Patel	6e41018	2017-04-12 18:39:53 +0000	[diff] [blame]	7807	result element gets. If the shuffle mask is undef, the result vector is
				7808	undef. If any element of the mask operand is undef, that element of the
				7809	result is undef. If the shuffle mask selects an undef element from one
				7810	of the input vectors, the resulting element is undef.
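
To make the numbering concrete: with two 4-element inputs, mask values
0 to 3 select from the first vector and 4 to 7 select from the second.
The sketch below uses hypothetical function names (``@low_halves`` and
``@splat``) chosen for illustration:

.. code-block:: llvm

      ; Hypothetical example functions, not part of the reference.
      define <4 x i32> @low_halves(<4 x i32> %v1, <4 x i32> %v2) {
        ; Result is <v1[0], v1[1], v2[0], v2[1]>.
        %r = shufflevector <4 x i32> %v1, <4 x i32> %v2,
                           <4 x i32> <i32 0, i32 1, i32 4, i32 5>
        ret <4 x i32> %r
      }

      define <4 x i32> @splat(i32 %x) {
        ; An all-zero mask copies element 0 of the first input into every lane.
        %ins = insertelement <4 x i32> undef, i32 %x, i32 0
        %r = shufflevector <4 x i32> %ins, <4 x i32> undef,
                           <4 x i32> zeroinitializer
        ret <4 x i32> %r
      }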
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7811
				7812	Example:
				7813	""""""""
				7814
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7815	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7816
				7817	<result> = shufflevector <4 x i32> %v1, <4 x i32> %v2,
				7818	<4 x i32> <i32 0, i32 4, i32 1, i32 5> ; yields <4 x i32>
				7819	<result> = shufflevector <4 x i32> %v1, <4 x i32> undef,
				7820	<4 x i32> <i32 0, i32 1, i32 2, i32 3> ; yields <4 x i32> - Identity shuffle.
				7821	<result> = shufflevector <8 x i32> %v1, <8 x i32> undef,
				7822	<4 x i32> <i32 0, i32 1, i32 2, i32 3> ; yields <4 x i32>
				7823	<result> = shufflevector <4 x i32> %v1, <4 x i32> %v2,
				7824	<8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 4, i32 5, i32 6, i32 7 > ; yields <8 x i32>
				7825
				7826	Aggregate Operations
				7827	--------------------
				7828
				7829	LLVM supports several instructions for working with
				7830	:ref:`aggregate <t_aggregate>` values.
				7831
				7832	.. _i_extractvalue:
				7833
				7834	'``extractvalue``' Instruction
				7835	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				7836
				7837	Syntax:
				7838	"""""""
				7839
				7840	::
				7841
				7842	<result> = extractvalue <aggregate type> <val>, <idx>{, <idx>}*
				7843
				7844	Overview:
				7845	"""""""""
				7846
				7847	The '``extractvalue``' instruction extracts the value of a member field
				7848	from an :ref:`aggregate <t_aggregate>` value.
				7849
				7850	Arguments:
				7851	""""""""""
				7852
				7853	The first operand of an '``extractvalue``' instruction is a value of
Arch D. Robison	a7f8f25	2015-10-14 19:10:45 +0000	[diff] [blame]	7854	:ref:`struct <t_struct>` or :ref:`array <t_array>` type. The other operands are
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7855	constant indices to specify which value to extract in a similar manner
				7856	as indices in a '``getelementptr``' instruction.
				7857
				7858	The major differences to ``getelementptr`` indexing are:
				7859
				7860	- Since the value being indexed is not a pointer, the first index is
				7861	omitted and assumed to be zero.
				7862	- At least one index must be specified.
				7863	- Not only struct indices but also array indices must be in bounds.
				7864
				7865	Semantics:
				7866	""""""""""
				7867
				7868	The result is the value at the position in the aggregate specified by
				7869	the index operands.
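
When several indices are given, each one steps one level deeper into
the aggregate. A minimal sketch, assuming a hypothetical function
``@second_array_element`` and an aggregate type chosen for
illustration:

.. code-block:: llvm

      ; Hypothetical example function, not part of the reference.
      define float @second_array_element({i32, [2 x float]} %agg) {
        ; Index 1 selects the [2 x float] member; index 1 then selects its second element.
        %f = extractvalue {i32, [2 x float]} %agg, 1, 1
        ret float %f
      }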
				7870
				7871	Example:
				7872	""""""""
				7873
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	7874	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7875
				7876	<result> = extractvalue {i32, float} %agg, 0 ; yields i32
				7877
				7878	.. _i_insertvalue:
				7879
				7880	'``insertvalue``' Instruction
				7881	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				7882
				7883	Syntax:
				7884	"""""""
				7885
				7886	::
				7887
				7888	<result> = insertvalue <aggregate type> <val>, <ty> <elt>, <idx>{, <idx>}* ; yields <aggregate type>
				7889
				7890	Overview:
				7891	"""""""""
				7892
				7893	The '``insertvalue``' instruction inserts a value into a member field in
				7894	an :ref:`aggregate <t_aggregate>` value.
				7895
				7896	Arguments:
				7897	""""""""""
				7898
				7899	The first operand of an '``insertvalue``' instruction is a value of
				7900	:ref:`struct <t_struct>` or :ref:`array <t_array>` type. The second operand is
				7901	a first-class value to insert. The following operands are constant
				7902	indices indicating the position at which to insert the value in a
similar manner as indices in an '``extractvalue``' instruction. The value
				7904	to insert must have the same type as the value identified by the
				7905	indices.
				7906
				7907	Semantics:
				7908	""""""""""
				7909
				7910	The result is an aggregate of the same type as ``val``. Its value is
				7911	that of ``val`` except that the value at the position specified by the
				7912	indices is that of ``elt``.
				7913
				7914	Example:
				7915	""""""""
				7916
				7917	.. code-block:: llvm
				7918
				7919	%agg1 = insertvalue {i32, float} undef, i32 1, 0 ; yields {i32 1, float undef}
				7920	%agg2 = insertvalue {i32, float} %agg1, float %val, 1 ; yields {i32 1, float %val}
Dan Liew	ffcfe7f	2014-09-08 21:19:46 +0000	[diff] [blame]	7921	%agg3 = insertvalue {i32, {float}} undef, float %val, 1, 0 ; yields {i32 undef, {float %val}}
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7922
				7923	.. _memoryops:
				7924
				7925	Memory Access and Addressing Operations
				7926	---------------------------------------
				7927
				7928	A key design point of an SSA-based representation is how it represents
				7929	memory. In LLVM, no memory locations are in SSA form, which makes things
				7930	very simple. This section describes how to read, write, and allocate
				7931	memory in LLVM.
				7932
				7933	.. _i_alloca:
				7934
				7935	'``alloca``' Instruction
				7936	^^^^^^^^^^^^^^^^^^^^^^^^
				7937
				7938	Syntax:
				7939	"""""""
				7940
				7941	::
				7942
Matt Arsenault	3c1fc76	2017-04-10 22:27:50 +0000	[diff] [blame]	7943	<result> = alloca [inalloca] <type> [, <ty> <NumElements>] [, align <alignment>] [, addrspace(<num>)] ; yields type addrspace(num)*:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7944
				7945	Overview:
				7946	"""""""""
				7947
				7948	The '``alloca``' instruction allocates memory on the stack frame of the
				7949	currently executing function, to be automatically released when this
				7950	function returns to its caller. The object is always allocated in the
Matt Arsenault	3c1fc76	2017-04-10 22:27:50 +0000	[diff] [blame]	7951	address space for allocas indicated in the datalayout.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7952
				7953	Arguments:
				7954	""""""""""
				7955
				7956	The '``alloca``' instruction allocates ``sizeof(<type>)*NumElements``
				7957	bytes of memory on the runtime stack, returning a pointer of the
appropriate type to the program. If "NumElements" is specified, it is
the number of elements allocated; otherwise "NumElements" defaults
to one. If a constant alignment is specified, the result of the
allocation is guaranteed to be aligned to at least that boundary. The
				7962	alignment may not be greater than ``1 << 29``. If not specified, or if
				7963	zero, the target can choose to align the allocation on any convenient
				7964	boundary compatible with the type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7965
				7966	'``type``' may be any sized type.
				7967
				7968	Semantics:
				7969	""""""""""
				7970
				7971	Memory is allocated; a pointer is returned. The operation is undefined
				7972	if there is insufficient stack space for the allocation. '``alloca``'d
				7973	memory is automatically released when the function returns. The
				7974	'``alloca``' instruction is commonly used to represent automatic
				7975	variables that must have an address available. When the function returns
				7976	(either with the ``ret`` or ``resume`` instructions), the memory is
Eli Friedman	18f882c	2018-07-11 00:02:01 +0000	[diff] [blame]	7977	reclaimed. Allocating zero bytes is legal, but the returned pointer may not
be unique. The order in which memory is allocated (i.e., which way the stack
				7979	grows) is not specified.
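
The typical pattern for an automatic variable whose address is needed
can be sketched as follows. The function name ``@alloca_example`` and
the value names are assumptions chosen for illustration:

.. code-block:: llvm

      ; Hypothetical example function, not part of the reference.
      define i32 @alloca_example(i32 %x) {
      entry:
        %slot = alloca i32, align 4           ; stack slot, released when the function returns
        store i32 %x, i32* %slot, align 4     ; write through the returned pointer
        %v = load i32, i32* %slot, align 4    ; read the value back
        ret i32 %v
      }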
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7980
				7981	Example:
				7982	""""""""
				7983
				7984	.. code-block:: llvm
				7985
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	7986	%ptr = alloca i32 ; yields i32*:ptr
				7987	%ptr = alloca i32, i32 4 ; yields i32*:ptr
				7988	%ptr = alloca i32, i32 4, align 1024 ; yields i32*:ptr
				7989	%ptr = alloca i32, align 1024 ; yields i32*:ptr
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	7990
				7991	.. _i_load:
				7992
				7993	'``load``' Instruction
				7994	^^^^^^^^^^^^^^^^^^^^^^
				7995
				7996	Syntax:
				7997	"""""""
				7998
				7999	::
				8000
Artur Pilipenko	b4d0090	2015-09-28 17:41:08 +0000	[diff] [blame]	8001	<result> = load [volatile] <ty>, <ty>* <pointer>[, align <alignment>][, !nontemporal !<index>][, !invariant.load !<index>][, !invariant.group !<index>][, !nonnull !<index>][, !dereferenceable !<deref_bytes_node>][, !dereferenceable_or_null !<deref_bytes_node>][, !align !<align_node>]
Konstantin Zhuravlyov	bb80d3e	2017-07-11 22:23:00 +0000	[diff] [blame]	8002	<result> = load atomic [volatile] <ty>, <ty>* <pointer> [syncscope("<target-scope>")] <ordering>, align <alignment> [, !invariant.group !<index>]
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8003	!<index> = !{ i32 1 }
Artur Pilipenko	253d71e	2015-09-18 12:07:10 +0000	[diff] [blame]	8004	!<deref_bytes_node> = !{i64 <dereferenceable_bytes>}
Artur Pilipenko	b4d0090	2015-09-28 17:41:08 +0000	[diff] [blame]	8005	!<align_node> = !{ i64 <value_alignment> }
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8006
				8007	Overview:
				8008	"""""""""
				8009
				8010	The '``load``' instruction is used to read from memory.
				8011
				8012	Arguments:
				8013	""""""""""
				8014
Sanjoy Das	c2cf6ef	2016-06-01 16:13:10 +0000	[diff] [blame]	8015	The argument to the ``load`` instruction specifies the memory address from which
				8016	to load. The type specified must be a :ref:`first class <t_firstclass>` type of
				8017	known size (i.e. not containing an :ref:`opaque structural type <t_opaque>`). If
				8018	the ``load`` is marked as ``volatile``, then the optimizer is not allowed to
				8019	modify the number or order of execution of this ``load`` with other
				8020	:ref:`volatile operations <volatile>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8021
JF Bastien	d1fb585	2015-12-17 22:09:19 +0000	[diff] [blame]	8022	If the ``load`` is marked as ``atomic``, it takes an extra :ref:`ordering
Konstantin Zhuravlyov	bb80d3e	2017-07-11 22:23:00 +0000	[diff] [blame]	8023	<ordering>` and optional ``syncscope("<target-scope>")`` argument. The
				8024	``release`` and ``acq_rel`` orderings are not valid on ``load`` instructions.
				8025	Atomic loads produce :ref:`defined <memmodel>` results when they may see
				8026	multiple atomic stores. The type of the pointee must be an integer, pointer, or
				8027	floating-point type whose bit width is a power of two greater than or equal to
				8028	eight and less than or equal to a target-specific size limit. ``align`` must be
				8029	explicitly specified on atomic loads, and the load has undefined behavior if the
				8030	alignment is not set to a value which is at least the size in bytes of the
JF Bastien	d1fb585	2015-12-17 22:09:19 +0000	[diff] [blame]	8031	pointee. ``!nontemporal`` does not have any defined semantics for atomic loads.
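
As a minimal sketch of the atomic form (the function names are
assumptions chosen for illustration), note that ``align`` is given
explicitly and is at least the size of the ``i32`` pointee:

.. code-block:: llvm

      ; Hypothetical example functions, not part of the reference.
      define i32 @load_acquire(i32* %p) {
        %v = load atomic i32, i32* %p acquire, align 4
        ret i32 %v
      }

      define i32 @load_singlethread(i32* %p) {
        ; The optional syncscope argument precedes the ordering.
        %v = load atomic i32, i32* %p syncscope("singlethread") monotonic, align 4
        ret i32 %v
      }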
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8032
The optional constant ``align`` argument specifies the alignment of the
operation (that is, the alignment of the memory address). A value of 0
or an omitted ``align`` argument means that the operation has the ABI
alignment for the target. It is the responsibility of the code emitter
to ensure that the alignment information is correct. Overestimating the
alignment results in undefined behavior. Underestimating the alignment
may produce less efficient code. An alignment of 1 is always safe. The
maximum possible alignment is ``1 << 29``. An alignment value higher
than the size of the loaded type implies that memory up to the alignment
value in bytes can be safely loaded without trapping in the default
address space. Accessing the high bytes can interfere with debugging
tools, so they should not be accessed if the function has the
``sanitize_thread`` or ``sanitize_address`` attributes.
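
A minimal sketch of the ``align`` and ``volatile`` markers on ordinary loads
follows. The function and value names are hypothetical, and the comments
restate the rules above rather than any particular target's behavior:

.. code-block:: llvm

    define i32 @read_unaligned(i32* %p) {
    entry:
      ; align 1 assumes nothing about the address, so it is always safe
      %v = load i32, i32* %p, align 1
      ret i32 %v
    }

    define i32 @read_twice(i32* %reg) {
    entry:
      ; both volatile loads must be kept, and kept in this order
      %a = load volatile i32, i32* %reg, align 4
      %b = load volatile i32, i32* %reg, align 4
      %sum = add i32 %a, %b
      ret i32 %sum
    }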
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8046
				8047	The optional ``!nontemporal`` metadata must reference a single
Stefanus Du Toit	736e2e2	2013-06-20 14:02:44 +0000	[diff] [blame]	8048	metadata name ``<index>`` corresponding to a metadata node with one
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8049	``i32`` entry of value 1. The existence of the ``!nontemporal``
Stefanus Du Toit	736e2e2	2013-06-20 14:02:44 +0000	[diff] [blame]	8050	metadata on the instruction tells the optimizer and code generator
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8051	that this load is not expected to be reused in the cache. The code
				8052	generator may select special instructions to save cache bandwidth, such
				8053	as the ``MOVNT`` instruction on x86.
				8054
				8055	The optional ``!invariant.load`` metadata must reference a single
Stefanus Du Toit	736e2e2	2013-06-20 14:02:44 +0000	[diff] [blame]	8056	metadata name ``<index>`` corresponding to a metadata node with no
Geoff Berry	4bda576	2016-08-31 17:39:21 +0000	[diff] [blame]	8057	entries. If a load instruction tagged with the ``!invariant.load``
				8058	metadata is executed, the optimizer may assume the memory location
				8059	referenced by the load contains the same value at all points in the
Eli Friedman	e15a111	2018-07-17 20:38:11 +0000	[diff] [blame]	8060	program where the memory location is known to be dereferenceable;
				8061	otherwise, the behavior is undefined.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8062
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	8063	The optional ``!invariant.group`` metadata must reference a single metadata name
Piotr Padlewski	ce35826	2018-05-18 23:53:46 +0000	[diff] [blame]	8064	``<index>`` corresponding to a metadata node with no entries.
				8065	See ``invariant.group`` metadata.
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	8066
Philip Reames	cdb72f3	2014-10-20 22:40:55 +0000	[diff] [blame]	8067	The optional ``!nonnull`` metadata must reference a single
				8068	metadata name ``<index>`` corresponding to a metadata node with no
				8069	entries. The existence of the ``!nonnull`` metadata on the
				8070	instruction tells the optimizer that the value loaded is known to
Eli Friedman	e15a111	2018-07-17 20:38:11 +0000	[diff] [blame]	8071	never be null. If the value is null at runtime, the behavior is undefined.
				8072	This is analogous to the ``nonnull`` attribute on parameters and return
				8073	values. This metadata can only be applied to loads of a pointer type.
Philip Reames	cdb72f3	2014-10-20 22:40:55 +0000	[diff] [blame]	8074
The optional ``!dereferenceable`` metadata must reference a single metadata
name ``<deref_bytes_node>`` corresponding to a metadata node with one ``i64``
entry. The existence of the ``!dereferenceable`` metadata on the instruction
tells the optimizer that the value loaded is known to be dereferenceable.
The number of bytes known to be dereferenceable is specified by the integer
value in the metadata node. This is analogous to the ``dereferenceable``
attribute on parameters and return values. This metadata can only be applied
to loads of a pointer type.

The optional ``!dereferenceable_or_null`` metadata must reference a single
metadata name ``<deref_bytes_node>`` corresponding to a metadata node with one
``i64`` entry. The existence of the ``!dereferenceable_or_null`` metadata on the
instruction tells the optimizer that the value loaded is known to be either
dereferenceable or null.
The number of bytes known to be dereferenceable is specified by the integer
value in the metadata node. This is analogous to the ``dereferenceable_or_null``
attribute on parameters and return values. This metadata can only be applied
to loads of a pointer type.

The optional ``!align`` metadata must reference a single metadata name
``<align_node>`` corresponding to a metadata node with one ``i64`` entry.
The existence of the ``!align`` metadata on the instruction tells the
optimizer that the value loaded is known to be aligned to a boundary specified
by the integer value in the metadata node. The alignment must be a power of 2.
This is analogous to the ``align`` attribute on parameters and return values.
This metadata can only be applied to loads of a pointer type. If the returned
value is not appropriately aligned at runtime, the behavior is undefined.
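
The metadata attachments described above can be sketched as follows. The
function name, the operands, and the particular byte counts are assumptions
made for illustration. Only the node shapes (empty node, one ``i64`` entry,
one ``i32`` entry of value 1) come from the text:

.. code-block:: llvm

    define i8* @load_with_hints(i8** %slot, i32* %stream) {
    entry:
      ; loaded pointer: never null, 16 bytes dereferenceable, 8-byte aligned
      %p = load i8*, i8** %slot, align 8, !nonnull !0, !dereferenceable !1, !align !2
      ; streaming read that is not expected to be reused in the cache
      %x = load i32, i32* %stream, align 4, !nontemporal !3
      ret i8* %p
    }

    !0 = !{}          ; empty node, as required by !nonnull
    !1 = !{i64 16}    ; number of dereferenceable bytes
    !2 = !{i64 8}     ; alignment of the loaded pointer, a power of 2
    !3 = !{i32 1}     ; the single i32 entry of value 1

Note that ``!nonnull``, ``!dereferenceable`` and ``!align`` describe the
pointer value that was loaded, not the address being loaded from.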
Artur Pilipenko	b4d0090	2015-09-28 17:41:08 +0000	[diff] [blame]	8102
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8103	Semantics:
				8104	""""""""""
				8105
				8106	The location of memory pointed to is loaded. If the value being loaded
				8107	is of scalar type then the number of bytes read does not exceed the
				8108	minimum number of bytes needed to hold all bits of the type. For
				8109	example, loading an ``i24`` reads at most three bytes. When loading a
				8110	value of a type like ``i20`` with a size that is not an integral number
				8111	of bytes, the result is undefined if the value was not originally
				8112	written using a store of the same type.
				8113
				8114	Examples:
				8115	"""""""""
				8116
				8117	.. code-block:: llvm
				8118
    %ptr = alloca i32                               ; yields i32*:ptr
    store i32 3, i32* %ptr                          ; yields void
    %val = load i32, i32* %ptr                      ; yields i32:val = i32 3
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8122
				8123	.. _i_store:
				8124
				8125	'``store``' Instruction
				8126	^^^^^^^^^^^^^^^^^^^^^^^
				8127
				8128	Syntax:
				8129	"""""""
				8130
				8131	::
				8132
    store [volatile] <ty> <value>, <ty>* <pointer>[, align <alignment>][, !nontemporal !<index>][, !invariant.group !<index>]        ; yields void
    store atomic [volatile] <ty> <value>, <ty>* <pointer> [syncscope("<target-scope>")] <ordering>, align <alignment> [, !invariant.group !<index>] ; yields void
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8135
				8136	Overview:
				8137	"""""""""
				8138
				8139	The '``store``' instruction is used to write to memory.
				8140
				8141	Arguments:
				8142	""""""""""
				8143
Sanjoy Das	c2cf6ef	2016-06-01 16:13:10 +0000	[diff] [blame]	8144	There are two arguments to the ``store`` instruction: a value to store and an
				8145	address at which to store it. The type of the ``<pointer>`` operand must be a
				8146	pointer to the :ref:`first class <t_firstclass>` type of the ``<value>``
				8147	operand. If the ``store`` is marked as ``volatile``, then the optimizer is not
				8148	allowed to modify the number or order of execution of this ``store`` with other
				8149	:ref:`volatile operations <volatile>`. Only values of :ref:`first class
				8150	<t_firstclass>` types of known size (i.e. not containing an :ref:`opaque
				8151	structural type <t_opaque>`) can be stored.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8152
JF Bastien	d1fb585	2015-12-17 22:09:19 +0000	[diff] [blame]	8153	If the ``store`` is marked as ``atomic``, it takes an extra :ref:`ordering
Konstantin Zhuravlyov	bb80d3e	2017-07-11 22:23:00 +0000	[diff] [blame]	8154	<ordering>` and optional ``syncscope("<target-scope>")`` argument. The
				8155	``acquire`` and ``acq_rel`` orderings aren't valid on ``store`` instructions.
				8156	Atomic loads produce :ref:`defined <memmodel>` results when they may see
				8157	multiple atomic stores. The type of the pointee must be an integer, pointer, or
				8158	floating-point type whose bit width is a power of two greater than or equal to
				8159	eight and less than or equal to a target-specific size limit. ``align`` must be
				8160	explicitly specified on atomic stores, and the store has undefined behavior if
				8161	the alignment is not set to a value which is at least the size in bytes of the
JF Bastien	d1fb585	2015-12-17 22:09:19 +0000	[diff] [blame]	8162	pointee. ``!nontemporal`` does not have any defined semantics for atomic stores.
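
As a hedged sketch of the atomic form (``@publish`` and its parameters are
hypothetical names), an atomic store names its ordering after the pointer
operand and must spell out ``align``:

.. code-block:: llvm

    define void @publish(i32* %flag, i32* %local, i32 %v) {
    entry:
      ; release store; align 4 is at least the 4-byte size of i32
      store atomic i32 %v, i32* %flag release, align 4
      ; seq_cst volatile store limited to the "singlethread" scope
      store atomic volatile i32 0, i32* %local syncscope("singlethread") seq_cst, align 4
      ret void
    }

Using ``acquire`` or ``acq_rel`` in either store would be rejected, since
those orderings are not valid on stores.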
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8163
Eli Bendersky	ca38084	2013-04-17 17:17:20 +0000	[diff] [blame]	8164	The optional constant ``align`` argument specifies the alignment of the
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8165	operation (that is, the alignment of the memory address). A value of 0
Eli Bendersky	ca38084	2013-04-17 17:17:20 +0000	[diff] [blame]	8166	or an omitted ``align`` argument means that the operation has the ABI
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8167	alignment for the target. It is the responsibility of the code emitter
				8168	to ensure that the alignment information is correct. Overestimating the
Eli Bendersky	ca38084	2013-04-17 17:17:20 +0000	[diff] [blame]	8169	alignment results in undefined behavior. Underestimating the
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8170	alignment may produce less efficient code. An alignment of 1 is always
Matt Arsenault	7020f25	2016-06-16 16:33:41 +0000	[diff] [blame]	8171	safe. The maximum possible alignment is ``1 << 29``. An alignment
				8172	value higher than the size of the stored type implies memory up to the
				8173	alignment value bytes can be stored to without trapping in the default
				8174	address space. Storing to the higher bytes however may result in data
				8175	races if another thread can access the same address. Introducing a
				8176	data race is not allowed. Storing to the extra bytes is not allowed
				8177	even in situations where a data race is known to not exist if the
				8178	function has the ``sanitize_address`` attribute.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8179
The optional ``!nontemporal`` metadata must reference a single metadata
name ``<index>`` corresponding to a metadata node with one ``i32`` entry of
value 1. The existence of the ``!nontemporal`` metadata on the instruction
tells the optimizer and code generator that this store is not expected to
be reused in the cache. The code generator may select special
instructions to save cache bandwidth, such as the ``MOVNT`` instruction on
x86.
				8187
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	8188	The optional ``!invariant.group`` metadata must reference a
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	8189	single metadata name ``<index>``. See ``invariant.group`` metadata.
				8190
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8191	Semantics:
				8192	""""""""""
				8193
Eli Bendersky	ca38084	2013-04-17 17:17:20 +0000	[diff] [blame]	8194	The contents of memory are updated to contain ``<value>`` at the
				8195	location specified by the ``<pointer>`` operand. If ``<value>`` is
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8196	of scalar type then the number of bytes written does not exceed the
				8197	minimum number of bytes needed to hold all bits of the type. For
				8198	example, storing an ``i24`` writes at most three bytes. When writing a
				8199	value of a type like ``i20`` with a size that is not an integral number
				8200	of bytes, it is unspecified what happens to the extra bits that do not
				8201	belong to the type, but they will typically be overwritten.
				8202
				8203	Example:
				8204	""""""""
				8205
				8206	.. code-block:: llvm
				8207
    %ptr = alloca i32                               ; yields i32*:ptr
    store i32 3, i32* %ptr                          ; yields void
    %val = load i32, i32* %ptr                      ; yields i32:val = i32 3
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8211
				8212	.. _i_fence:
				8213
				8214	'``fence``' Instruction
				8215	^^^^^^^^^^^^^^^^^^^^^^^
				8216
				8217	Syntax:
				8218	"""""""
				8219
				8220	::
				8221
    fence [syncscope("<target-scope>")] <ordering>  ; yields void
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8223
				8224	Overview:
				8225	"""""""""
				8226
				8227	The '``fence``' instruction is used to introduce happens-before edges
				8228	between operations.
				8229
				8230	Arguments:
				8231	""""""""""
				8232
				8233	'``fence``' instructions take an :ref:`ordering <ordering>` argument which
				8234	defines what synchronizes-with edges they add. They can only be given
				8235	``acquire``, ``release``, ``acq_rel``, and ``seq_cst`` orderings.
				8236
				8237	Semantics:
				8238	""""""""""
				8239
				8240	A fence A which has (at least) ``release`` ordering semantics
				8241	synchronizes with a fence B with (at least) ``acquire`` ordering
				8242	semantics if and only if there exist atomic operations X and Y, both
				8243	operating on some atomic object M, such that A is sequenced before X, X
				8244	modifies M (either directly or through some side effect of a sequence
				8245	headed by X), Y is sequenced before B, and Y observes M. This provides a
				8246	happens-before dependency between A and B. Rather than an explicit
				8247	``fence``, one (but not both) of the atomic operations X or Y might
				8248	provide a ``release`` or ``acquire`` (resp.) ordering constraint and
				8249	still synchronize-with the explicit ``fence`` and establish the
				8250	happens-before edge.
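
The rule above can be illustrated with a message-passing sketch. The globals
``@data`` and ``@flag`` and both function names are hypothetical. The
comments map each instruction to the fences A and B and the atomic operations
X and Y in the description:

.. code-block:: llvm

    @data = global i32 0
    @flag = global i32 0

    define void @producer() {
    entry:
      store i32 42, i32* @data, align 4
      fence release                                       ; fence A
      store atomic i32 1, i32* @flag monotonic, align 4   ; X modifies M
      ret void
    }

    define i32 @consumer() {
    entry:
      br label %spin

    spin:
      %f = load atomic i32, i32* @flag monotonic, align 4 ; Y observes M
      %ready = icmp eq i32 %f, 1
      br i1 %ready, label %done, label %spin

    done:
      fence acquire                                       ; fence B
      %v = load i32, i32* @data, align 4
      ret i32 %v
    }

Once the consumer has observed the value 1 written by X, fence A synchronizes
with fence B, so the non-atomic load of ``@data`` does not race with the
producer's store under the stated rule.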
				8251
				8252	A ``fence`` which has ``seq_cst`` ordering, in addition to having both
				8253	``acquire`` and ``release`` semantics specified above, participates in
				8254	the global program order of other ``seq_cst`` operations and/or fences.
				8255
Konstantin Zhuravlyov	bb80d3e	2017-07-11 22:23:00 +0000	[diff] [blame]	8256	A ``fence`` instruction can also take an optional
				8257	":ref:`syncscope <syncscope>`" argument.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8258
				8259	Example:
				8260	""""""""
				8261
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	8262	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8263
    fence acquire                                        ; yields void
    fence syncscope("singlethread") seq_cst              ; yields void
    fence syncscope("agent") seq_cst                     ; yields void
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8267
				8268	.. _i_cmpxchg:
				8269
				8270	'``cmpxchg``' Instruction
				8271	^^^^^^^^^^^^^^^^^^^^^^^^^
				8272
				8273	Syntax:
				8274	"""""""
				8275
				8276	::
				8277
    cmpxchg [weak] [volatile] <ty>* <pointer>, <ty> <cmp>, <ty> <new> [syncscope("<target-scope>")] <success ordering> <failure ordering> ; yields  { ty, i1 }
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8279
				8280	Overview:
				8281	"""""""""
				8282
				8283	The '``cmpxchg``' instruction is used to atomically modify memory. It
				8284	loads a value in memory and compares it to a given value. If they are
Tim Northover	420a216	2014-06-13 14:24:07 +0000	[diff] [blame]	8285	equal, it tries to store a new value into the memory.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8286
				8287	Arguments:
				8288	""""""""""
				8289
There are three arguments to the '``cmpxchg``' instruction: an address
to operate on, a value to compare to the value currently at that
address, and a new value to place at that address if the compared values
are equal. The type of '``<cmp>``' must be an integer or pointer type whose
bit width is a power of two greater than or equal to eight and less
than or equal to a target-specific size limit. '``<cmp>``' and '``<new>``' must
have the same type, and the type of '``<pointer>``' must be a pointer to
that type. If the ``cmpxchg`` is marked as ``volatile``, then the
optimizer is not allowed to modify the number or order of execution of
this ``cmpxchg`` with other :ref:`volatile operations <volatile>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8300
Tim Northover	e94a518	2014-03-11 10:48:52 +0000	[diff] [blame]	8301	The success and failure :ref:`ordering <ordering>` arguments specify how this
Tim Northover	1dcc9f9	2014-06-13 14:24:16 +0000	[diff] [blame]	8302	``cmpxchg`` synchronizes with other atomic operations. Both ordering parameters
				8303	must be at least ``monotonic``, the ordering constraint on failure must be no
				8304	stronger than that on success, and the failure ordering cannot be either
				8305	``release`` or ``acq_rel``.
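
A minimal sketch of a valid ordering pair follows, using a hypothetical
``@try_lock`` function in which a lock word of 0 means "free". The success
ordering is ``acquire`` and the failure ordering is ``monotonic``, which is no
stronger than the success ordering and is neither ``release`` nor ``acq_rel``:

.. code-block:: llvm

    define i1 @try_lock(i32* %lock) {
    entry:
      ; expect 0 (free), install 1 (held)
      %pair = cmpxchg i32* %lock, i32 0, i32 1 acquire monotonic
      ; element 1 of the result is the success flag
      %ok = extractvalue { i32, i1 } %pair, 1
      ret i1 %ok
    }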
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8306
Konstantin Zhuravlyov	bb80d3e	2017-07-11 22:23:00 +0000	[diff] [blame]	8307	A ``cmpxchg`` instruction can also take an optional
				8308	":ref:`syncscope <syncscope>`" argument.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8309
				8310	The pointer passed into cmpxchg must have alignment greater than or
				8311	equal to the size in memory of the operand.
				8312
				8313	Semantics:
				8314	""""""""""
				8315
Tim Northover	420a216	2014-06-13 14:24:07 +0000	[diff] [blame]	8316	The contents of memory at the location specified by the '``<pointer>``' operand
Matthias Braun	93f2b4b	2017-08-09 22:22:04 +0000	[diff] [blame]	8317	is read and compared to '``<cmp>``'; if the values are equal, '``<new>``' is
				8318	written to the location. The original value at the location is returned,
				8319	together with a flag indicating success (true) or failure (false).
Tim Northover	420a216	2014-06-13 14:24:07 +0000	[diff] [blame]	8320
				8321	If the cmpxchg operation is marked as ``weak`` then a spurious failure is
				8322	permitted: the operation may not write ``<new>`` even if the comparison
				8323	matched.
				8324
				8325	If the cmpxchg operation is strong (the default), the i1 value is 1 if and only
				8326	if the value loaded equals ``cmp``.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8327
A successful ``cmpxchg`` is a read-modify-write instruction for the purpose of
identifying release sequences. A failed ``cmpxchg`` is equivalent to an atomic
load with an ordering parameter determined by the second ordering parameter.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8331
				8332	Example:
				8333	""""""""
				8334
				8335	.. code-block:: llvm
				8336
    entry:
      %orig = load atomic i32, i32* %ptr unordered, align 4                      ; yields i32
      br label %loop

    loop:
      %cmp = phi i32 [ %orig, %entry ], [ %value_loaded, %loop ]
      %squared = mul i32 %cmp, %cmp
      %val_success = cmpxchg i32* %ptr, i32 %cmp, i32 %squared acq_rel monotonic ; yields  { i32, i1 }
      %value_loaded = extractvalue { i32, i1 } %val_success, 0
      %success = extractvalue { i32, i1 } %val_success, 1
      br i1 %success, label %done, label %loop

    done:
      ...
				8351
				8352	.. _i_atomicrmw:
				8353
				8354	'``atomicrmw``' Instruction
				8355	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8356
				8357	Syntax:
				8358	"""""""
				8359
				8360	::
				8361
    atomicrmw [volatile] <operation> <ty>* <pointer>, <ty> <value> [syncscope("<target-scope>")] <ordering>                   ; yields ty
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8363
				8364	Overview:
				8365	"""""""""
				8366
				8367	The '``atomicrmw``' instruction is used to atomically modify memory.
				8368
				8369	Arguments:
				8370	""""""""""
				8371
There are three arguments to the '``atomicrmw``' instruction: an
operation to apply, an address whose value to modify, and an argument to
the operation. The operation must be one of the following keywords:
				8375
				8376	- xchg
				8377	- add
				8378	- sub
				8379	- and
				8380	- nand
				8381	- or
				8382	- xor
				8383	- max
				8384	- min
				8385	- umax
				8386	- umin
				8387
The type of '``<value>``' must be an integer type whose bit width is a power
of two greater than or equal to eight and less than or equal to a
target-specific size limit. The type of the '``<pointer>``' operand must
be a pointer to that type. If the ``atomicrmw`` is marked as
``volatile``, then the optimizer is not allowed to modify the number or
order of execution of this ``atomicrmw`` with other :ref:`volatile
operations <volatile>`.

An ``atomicrmw`` instruction can also take an optional
":ref:`syncscope <syncscope>`" argument.
				8398
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8399	Semantics:
				8400	""""""""""
				8401
				8402	The contents of memory at the location specified by the '``<pointer>``'
				8403	operand are atomically read, modified, and written back. The original
				8404	value at the location is returned. The modification is specified by the
				8405	operation argument:
				8406
-  xchg: ``*ptr = val``
-  add: ``*ptr = *ptr + val``
-  sub: ``*ptr = *ptr - val``
-  and: ``*ptr = *ptr & val``
-  nand: ``*ptr = ~(*ptr & val)``
-  or: ``*ptr = *ptr | val``
-  xor: ``*ptr = *ptr ^ val``
-  max: ``*ptr = *ptr > val ? *ptr : val`` (using a signed comparison)
-  min: ``*ptr = *ptr < val ? *ptr : val`` (using a signed comparison)
-  umax: ``*ptr = *ptr > val ? *ptr : val`` (using an unsigned
   comparison)
-  umin: ``*ptr = *ptr < val ? *ptr : val`` (using an unsigned
   comparison)
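
The operations listed above can be sketched in use as follows. The function
``@bump_and_track_max`` and its two pointer parameters are hypothetical. The
point is that every ``atomicrmw`` returns the value that was in memory before
the modification:

.. code-block:: llvm

    define i32 @bump_and_track_max(i32* %counter, i32* %highwater) {
    entry:
      ; *counter = *counter + 1; %old is the value before the add
      %old = atomicrmw add i32* %counter, i32 1 seq_cst
      %new = add i32 %old, 1
      ; *highwater = *highwater > %new ? *highwater : %new (unsigned)
      %prev = atomicrmw umax i32* %highwater, i32 %new monotonic
      ret i32 %old
    }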
				8420
				8421	Example:
				8422	""""""""
				8423
				8424	.. code-block:: llvm
				8425
    %old = atomicrmw add i32* %ptr, i32 1 acquire                        ; yields i32
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8427
				8428	.. _i_getelementptr:
				8429
				8430	'``getelementptr``' Instruction
				8431	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8432
				8433	Syntax:
				8434	"""""""
				8435
				8436	::
				8437
    <result> = getelementptr <ty>, <ty>* <ptrval>{, [inrange] <ty> <idx>}*
    <result> = getelementptr inbounds <ty>, <ty>* <ptrval>{, [inrange] <ty> <idx>}*
    <result> = getelementptr <ty>, <ptr vector> <ptrval>, [inrange] <vector index type> <idx>
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8441
				8442	Overview:
				8443	"""""""""
				8444
				8445	The '``getelementptr``' instruction is used to get the address of a
				8446	subelement of an :ref:`aggregate <t_aggregate>` data structure. It performs
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8447	address calculation only and does not access memory. The instruction can also
				8448	be used to calculate a vector of such addresses.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8449
				8450	Arguments:
				8451	""""""""""
				8452
The first argument is always a type used as the basis for the calculations.
The second argument is always a pointer or a vector of pointers, and is the
base address to start from. The remaining arguments are indices
that indicate which of the elements of the aggregate object are indexed.
The interpretation of each index is dependent on the type being indexed
into. The first index always indexes the pointer value given as the
second argument, the second index indexes a value of the type pointed to
(not necessarily the value directly pointed to, since the first index
can be non-zero), etc. The first type indexed into must be a pointer
value; subsequent types can be arrays, vectors, and structs. Note that
subsequent types being indexed into can never be pointers, since that
would require loading the pointer before continuing calculation.
				8465
The type of each index argument depends on the type it is indexing into.
When indexing into an (optionally packed) structure, only ``i32`` integer
constants are allowed (when using a vector of indices they must all
be the same ``i32`` integer constant). When indexing into an array,
pointer or vector, integers of any width are allowed, and they are not
required to be constant. These integers are treated as signed values
where relevant.
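
A small sketch of these index rules follows. The type ``%T`` and the function
``@elem`` are hypothetical. The structure index is a constant ``i32``, while
the pointer and array indices may be of any integer width and need not be
constant:

.. code-block:: llvm

    %T = type { i32, [8 x i16] }

    define i16* @elem(%T* %base, i64 %i) {
    entry:
      ; i64 0  : index through the pointer (any integer width)
      ; i32 1  : structure field, must be a constant i32
      ; i64 %i : array element, may be a run-time value
      %p = getelementptr %T, %T* %base, i64 0, i32 1, i64 %i
      ret i16* %p
    }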
				8473
				8474	For example, let's consider a C code fragment and how it gets compiled
				8475	to LLVM:
				8476
				8477	.. code-block:: c
				8478
    struct RT {
      char A;
      int B[10][20];
      char C;
    };
    struct ST {
      int X;
      double Y;
      struct RT Z;
    };

    int *foo(struct ST *s) {
      return &s[1].Z.B[5][13];
    }
				8493
				8494	The LLVM code generated by Clang is:
				8495
				8496	.. code-block:: llvm
				8497
    %struct.RT = type { i8, [10 x [20 x i32]], i8 }
    %struct.ST = type { i32, double, %struct.RT }

    define i32* @foo(%struct.ST* %s) nounwind uwtable readnone optsize ssp {
    entry:
      %arrayidx = getelementptr inbounds %struct.ST, %struct.ST* %s, i64 1, i32 2, i32 1, i64 5, i64 13
      ret i32* %arrayidx
    }
				8506
				8507	Semantics:
				8508	""""""""""
				8509
				8510	In the example above, the first index is indexing into the
				8511	'``%struct.ST*``' type, which is a pointer, yielding a '``%struct.ST``'
				8512	= '``{ i32, double, %struct.RT }``' type, a structure. The second index
				8513	indexes into the third element of the structure, yielding a
				8514	'``%struct.RT``' = '``{ i8 , [10 x [20 x i32]], i8 }``' type, another
				8515	structure. The third index indexes into the second element of the
				8516	structure, yielding a '``[10 x [20 x i32]]``' type, an array. The two
				8517	dimensions of the array are subscripted into, yielding an '``i32``'
				8518	type. The '``getelementptr``' instruction returns a pointer to this
				8519	element, thus computing a value of '``i32*``' type.
				8520
				8521	Note that it is perfectly legal to index partially through a structure,
				8522	returning a pointer to an inner element. Because of this, the LLVM code
				8523	for the given testcase is equivalent to:
				8524
				8525	.. code-block:: llvm
				8526
    define i32* @foo(%struct.ST* %s) {
      %t1 = getelementptr %struct.ST, %struct.ST* %s, i32 1                        ; yields %struct.ST*:%t1
      %t2 = getelementptr %struct.ST, %struct.ST* %t1, i32 0, i32 2                ; yields %struct.RT*:%t2
      %t3 = getelementptr %struct.RT, %struct.RT* %t2, i32 0, i32 1                ; yields [10 x [20 x i32]]*:%t3
      %t4 = getelementptr [10 x [20 x i32]], [10 x [20 x i32]]* %t3, i32 0, i32 5  ; yields [20 x i32]*:%t4
      %t5 = getelementptr [20 x i32], [20 x i32]* %t4, i32 0, i32 13               ; yields i32*:%t5
      ret i32* %t5
    }
				8535
				8536	If the ``inbounds`` keyword is present, the result value of the
				8537	``getelementptr`` is a :ref:`poison value <poisonvalues>` if the base
				8538	pointer is not an in bounds address of an allocated object, or if any
				8539	of the addresses that would be formed by successive addition of the
				8540	offsets implied by the indices to the base address with infinitely
				8541	precise signed arithmetic are not an in bounds address of that
				8542	allocated object. The in bounds addresses for an allocated object are
				8543	all the addresses that point into the object, plus the address one byte
Eli Friedman	13f2e35	2017-02-23 00:48:18 +0000	[diff] [blame]	8544	past the end. The only in bounds address for a null pointer in the
				8545	default address-space is the null pointer itself. In cases where the
				8546	base is a vector of pointers the ``inbounds`` keyword applies to each
				8547	of the computations element-wise.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8548
				8549	If the ``inbounds`` keyword is not present, the offsets are added to the
				8550	base address with silently-wrapping two's complement arithmetic. If the
				8551	offsets have a different width from the pointer, they are sign-extended
				8552	or truncated to the width of the pointer. The result value of the
				8553	``getelementptr`` may be outside the object pointed to by the base
				8554	pointer. The result value may not necessarily be used to access memory
				8555	though, even if it happens to point into allocated storage. See the
				8556	:ref:`Pointer Aliasing Rules <pointeraliasing>` section for more
				8557	information.
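
The non-``inbounds`` address arithmetic can be sketched in C. This is only an
illustrative model, not LLVM's implementation: it assumes 32-bit pointers and a
single array index, and the helper names ``gep_i16_index`` and
``gep_i64_index`` are hypothetical.

.. code-block:: c

   #include <stdint.h>

   /* Hypothetical model: pointers are 32 bits wide. */
   static uint32_t gep_i16_index(uint32_t base, int16_t index, uint32_t elem_size) {
       /* An index narrower than the pointer is sign-extended to pointer width. */
       uint32_t idx = (uint32_t)(int32_t)index;
       /* Unsigned arithmetic wraps silently modulo 2^32. */
       return base + idx * elem_size;
   }

   static uint32_t gep_i64_index(uint32_t base, int64_t index, uint32_t elem_size) {
       /* An index wider than the pointer is truncated to pointer width. */
       uint32_t idx = (uint32_t)index;
       return base + idx * elem_size;
   }

Under these assumptions, an index of -1 over 4-byte elements moves the address
back by 4 bytes, and an address near the top of the address space wraps around
rather than trapping.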
				8558
Peter Collingbourne	d93620b	2016-11-10 22:34:55 +0000	[diff] [blame]	8559	If the ``inrange`` keyword is present before any index, loading from or
				8560	storing to any pointer derived from the ``getelementptr`` has undefined
				8561	behavior if the load or store would access memory outside of the bounds of
				8562	the element selected by the index marked as ``inrange``. The result of a
				8563	pointer comparison or ``ptrtoint`` (including ``ptrtoint``-like operations
				8564	involving memory) involving a pointer derived from a ``getelementptr`` with
				8565	the ``inrange`` keyword is undefined, with the exception of comparisons
				8566	in the case where both operands are in the range of the element selected
				8567	by the ``inrange`` keyword, inclusive of the address one past the end of
				8568	that element. Note that the ``inrange`` keyword is currently only allowed
				8569	in constant ``getelementptr`` expressions.
				8570
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8571	The getelementptr instruction is often confusing. For some more insight
				8572	into how it works, see :doc:`the getelementptr FAQ <GetElementPtr>`.
				8573
				8574	Example:
				8575	""""""""
				8576
				8577	.. code-block:: llvm
				8578
				8579	; yields [12 x i8]*:aptr
David Blaikie	16a97eb	2015-03-04 22:02:58 +0000	[diff] [blame]	8580	%aptr = getelementptr {i32, [12 x i8]}, {i32, [12 x i8]}* %saptr, i64 0, i32 1
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8581	; yields i8*:vptr
David Blaikie	16a97eb	2015-03-04 22:02:58 +0000	[diff] [blame]	8582	%vptr = getelementptr {i32, <2 x i8>}, {i32, <2 x i8>}* %svptr, i64 0, i32 1, i32 1
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8583	; yields i8*:eptr
David Blaikie	16a97eb	2015-03-04 22:02:58 +0000	[diff] [blame]	8584	%eptr = getelementptr [12 x i8], [12 x i8]* %aptr, i64 0, i32 1
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8585	; yields i32*:iptr
David Blaikie	16a97eb	2015-03-04 22:02:58 +0000	[diff] [blame]	8586	%iptr = getelementptr [10 x i32], [10 x i32]* @arr, i16 0, i16 0
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8587
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8588	Vector of pointers:
				8589	"""""""""""""""""""
				8590
				8591	The ``getelementptr`` returns a vector of pointers, instead of a single address,
				8592	when one or more of its arguments is a vector. In such cases, all vector
				8593	arguments should have the same number of elements, and every scalar argument
				8594	will be effectively broadcast into a vector during address calculation.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8595
				8596	.. code-block:: llvm
				8597
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8598	; All arguments are vectors:
				8599	; A[i] = ptrs[i] + offsets[i]*sizeof(i8)
				8600	%A = getelementptr i8, <4 x i8*> %ptrs, <4 x i64> %offsets
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	8601
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8602	; Add the same scalar offset to each pointer of a vector:
				8603	; A[i] = ptrs[i] + offset*sizeof(i8)
				8604	%A = getelementptr i8, <4 x i8*> %ptrs, i64 %offset
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	8605
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8606	; Add distinct offsets to the same pointer:
				8607	; A[i] = ptr + offsets[i]*sizeof(i8)
				8608	%A = getelementptr i8, i8* %ptr, <4 x i64> %offsets
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	8609
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8610	; In all cases described above the type of the result is <4 x i8*>
				8611
				8612	The two following instructions are equivalent:
				8613
				8614	.. code-block:: llvm
				8615
				8616	getelementptr %struct.ST, <4 x %struct.ST*> %s, <4 x i64> %ind1,
				8617	<4 x i32> <i32 2, i32 2, i32 2, i32 2>,
				8618	<4 x i32> <i32 1, i32 1, i32 1, i32 1>,
				8619	<4 x i32> %ind4,
				8620	<4 x i64> <i64 13, i64 13, i64 13, i64 13>
Sean Silva	706fba5	2015-08-06 22:56:24 +0000	[diff] [blame]	8621
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8622	getelementptr %struct.ST, <4 x %struct.ST*> %s, <4 x i64> %ind1,
				8623	i32 2, i32 1, <4 x i32> %ind4, i64 13
				8624
				8625	Let's look at the C code, where the vector version of ``getelementptr``
				8626	makes sense:
				8627
				8628	.. code-block:: c
				8629
				8630	// Let's assume that we vectorize the following loop:
   double *A, *B; int *C;
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8632	for (int i = 0; i < size; ++i) {
				8633	A[i] = B[C[i]];
				8634	}
				8635
				8636	.. code-block:: llvm
				8637
				8638	; get pointers for 8 elements from array B
				8639	%ptrs = getelementptr double, double* %B, <8 x i32> %C
				8640	; load 8 elements from array B into A
Elad Cohen	ef5798a	2017-05-03 12:28:54 +0000	[diff] [blame]	8641	%A = call <8 x double> @llvm.masked.gather.v8f64.v8p0f64(<8 x double*> %ptrs,
Elena Demikhovsky	37a4da8	2015-07-09 07:42:48 +0000	[diff] [blame]	8642	i32 8, <8 x i1> %mask, <8 x double> %passthru)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8643
				8644	Conversion Operations
				8645	---------------------
				8646
				8647	The instructions in this category are the conversion instructions
				8648	(casting) which all take a single operand and a type. They perform
				8649	various bit conversions on the operand.
				8650
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	8651	.. _i_trunc:
				8652
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8653	'``trunc .. to``' Instruction
				8654	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8655
				8656	Syntax:
				8657	"""""""
				8658
				8659	::
				8660
				8661	<result> = trunc <ty> <value> to <ty2> ; yields ty2
				8662
				8663	Overview:
				8664	"""""""""
				8665
				8666	The '``trunc``' instruction truncates its operand to the type ``ty2``.
				8667
				8668	Arguments:
				8669	""""""""""
				8670
The '``trunc``' instruction takes a value to truncate and a type to truncate
it to. Both types must be :ref:`integer <t_integer>` types, or vectors
				8673	of the same number of integers. The bit size of the ``value`` must be
				8674	larger than the bit size of the destination type, ``ty2``. Equal sized
				8675	types are not allowed.
				8676
				8677	Semantics:
				8678	""""""""""
				8679
				8680	The '``trunc``' instruction truncates the high order bits in ``value``
				8681	and converts the remaining bits to ``ty2``. Since the source size must
				8682	be larger than the destination size, ``trunc`` cannot be a no-op cast.
				8683	It will always truncate bits.
				8684
				8685	Example:
				8686	""""""""
				8687
				8688	.. code-block:: llvm
				8689
				8690	%X = trunc i32 257 to i8 ; yields i8:1
				8691	%Y = trunc i32 123 to i1 ; yields i1:true
				8692	%Z = trunc i32 122 to i1 ; yields i1:false
				8693	%W = trunc <2 x i16> <i16 8, i16 7> to <2 x i8> ; yields <i8 8, i8 7>
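
The scalar examples above can be mirrored in C as a rough sketch. The helper
names are hypothetical, and ``bool`` stands in for ``i1``; truncation simply
keeps the low-order bits.

.. code-block:: c

   #include <stdbool.h>
   #include <stdint.h>

   /* Keep the low 8 bits, discard the high 24. */
   static uint8_t trunc_i32_to_i8(uint32_t value) {
       return (uint8_t)(value & 0xFFu);
   }

   /* Keep only the lowest bit. */
   static bool trunc_i32_to_i1(uint32_t value) {
       return (value & 1u) != 0;
   }

Note that truncating to ``i1`` tests the low bit, not whether the value is
non-zero: 122 is non-zero but truncates to ``false``.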
				8694
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	8695	.. _i_zext:
				8696
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8697	'``zext .. to``' Instruction
				8698	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8699
				8700	Syntax:
				8701	"""""""
				8702
				8703	::
				8704
				8705	<result> = zext <ty> <value> to <ty2> ; yields ty2
				8706
				8707	Overview:
				8708	"""""""""
				8709
				8710	The '``zext``' instruction zero extends its operand to type ``ty2``.
				8711
				8712	Arguments:
				8713	""""""""""
				8714
				8715	The '``zext``' instruction takes a value to cast, and a type to cast it
to. Both types must be :ref:`integer <t_integer>` types, or vectors of
				8717	the same number of integers. The bit size of the ``value`` must be
				8718	smaller than the bit size of the destination type, ``ty2``.
				8719
				8720	Semantics:
				8721	""""""""""
				8722
				8723	The ``zext`` fills the high order bits of the ``value`` with zero bits
				8724	until it reaches the size of the destination type, ``ty2``.
				8725
				8726	When zero extending from i1, the result will always be either 0 or 1.
				8727
				8728	Example:
				8729	""""""""
				8730
				8731	.. code-block:: llvm
				8732
				8733	%X = zext i32 257 to i64 ; yields i64:257
				8734	%Y = zext i1 true to i32 ; yields i32:1
				8735	%Z = zext <2 x i16> <i16 8, i16 7> to <2 x i32> ; yields <i32 8, i32 7>
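
A minimal C sketch of the same behavior, assuming unsigned C integer types as
stand-ins for the IR integer types (the helper names are hypothetical):

.. code-block:: c

   #include <stdbool.h>
   #include <stdint.h>

   /* Widening an unsigned value fills the new high bits with zeros. */
   static uint64_t zext_i32_to_i64(uint32_t value) {
       return (uint64_t)value;
   }

   /* Zero extending an i1 yields 0 or 1. */
   static uint32_t zext_i1_to_i32(bool value) {
       return value ? 1u : 0u;
   }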
				8736
Bjorn Pettersson	e1285e3	2017-10-24 11:59:20 +0000	[diff] [blame]	8737	.. _i_sext:
				8738
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8739	'``sext .. to``' Instruction
				8740	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8741
				8742	Syntax:
				8743	"""""""
				8744
				8745	::
				8746
				8747	<result> = sext <ty> <value> to <ty2> ; yields ty2
				8748
				8749	Overview:
				8750	"""""""""
				8751
				8752	The '``sext``' sign extends ``value`` to the type ``ty2``.
				8753
				8754	Arguments:
				8755	""""""""""
				8756
				8757	The '``sext``' instruction takes a value to cast, and a type to cast it
to. Both types must be :ref:`integer <t_integer>` types, or vectors of
				8759	the same number of integers. The bit size of the ``value`` must be
				8760	smaller than the bit size of the destination type, ``ty2``.
				8761
				8762	Semantics:
				8763	""""""""""
				8764
				8765	The '``sext``' instruction performs a sign extension by copying the sign
				8766	bit (highest order bit) of the ``value`` until it reaches the bit size
				8767	of the type ``ty2``.
				8768
				8769	When sign extending from i1, the extension always results in -1 or 0.
				8770
				8771	Example:
				8772	""""""""
				8773
				8774	.. code-block:: llvm
				8775
   %X = sext i8 -1 to i16 ; yields i16:-1 (bit pattern 0xFFFF, 65535 if read as unsigned)
				8777	%Y = sext i1 true to i32 ; yields i32:-1
				8778	%Z = sext <2 x i16> <i16 8, i16 7> to <2 x i32> ; yields <i32 8, i32 7>
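
As an illustrative C sketch (hypothetical helper names, two's complement
integers assumed), sign extension corresponds to widening a signed value:

.. code-block:: c

   #include <stdbool.h>
   #include <stdint.h>

   /* Widening a signed value copies the sign bit into the new high bits. */
   static int16_t sext_i8_to_i16(int8_t value) {
       return (int16_t)value;
   }

   /* The single bit of an i1 is its sign bit, so true extends to all ones. */
   static int32_t sext_i1_to_i32(bool value) {
       return value ? -1 : 0;
   }

Reading the 16-bit result of ``sext_i8_to_i16(-1)`` as an unsigned number gives
65535, which is the same bit pattern as -1.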
				8779
				8780	'``fptrunc .. to``' Instruction
				8781	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8782
				8783	Syntax:
				8784	"""""""
				8785
				8786	::
				8787
				8788	<result> = fptrunc <ty> <value> to <ty2> ; yields ty2
				8789
				8790	Overview:
				8791	"""""""""
				8792
				8793	The '``fptrunc``' instruction truncates ``value`` to type ``ty2``.
				8794
				8795	Arguments:
				8796	""""""""""
				8797
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8798	The '``fptrunc``' instruction takes a :ref:`floating-point <t_floating>`
				8799	value to cast and a :ref:`floating-point <t_floating>` type to cast it to.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8800	The size of ``value`` must be larger than the size of ``ty2``. This
				8801	implies that ``fptrunc`` cannot be used to make a no-op cast.
				8802
				8803	Semantics:
				8804	""""""""""
				8805
Dan Liew	50456fb	2015-09-03 18:43:56 +0000	[diff] [blame]	8806	The '``fptrunc``' instruction casts a ``value`` from a larger
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8807	:ref:`floating-point <t_floating>` type to a smaller :ref:`floating-point
Sanjay Patel	d96a363	2018-04-03 13:05:20 +0000	[diff] [blame]	8808	<t_floating>` type.
				8809	This instruction is assumed to execute in the default :ref:`floating-point
				8810	environment <floatenv>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8811
				8812	Example:
				8813	""""""""
				8814
				8815	.. code-block:: llvm
				8816
Sanjay Patel	d96a363	2018-04-03 13:05:20 +0000	[diff] [blame]	8817	%X = fptrunc double 16777217.0 to float ; yields float:16777216.0
				8818	%Y = fptrunc double 1.0E+300 to half ; yields half:+infinity
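
The first example can be reproduced with a C sketch, assuming IEEE 754
``float`` and ``double`` and the default round-to-nearest mode. The helper name
is hypothetical; C has no portable ``half`` type, so the overflow case is shown
with ``float`` instead.

.. code-block:: c

   /* Narrow a double to float; the volatile store forces rounding to float. */
   static float fptrunc_double_to_float(double value) {
       volatile float result = (float)value;
       return result;
   }

The value 16777217.0 (2^24 + 1) lies exactly halfway between two adjacent
``float`` values and rounds to 16777216.0. Under the same assumptions, a value
far too large for the destination type, such as 1.0E+300, becomes infinity.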
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8819
				8820	'``fpext .. to``' Instruction
				8821	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8822
				8823	Syntax:
				8824	"""""""
				8825
				8826	::
				8827
				8828	<result> = fpext <ty> <value> to <ty2> ; yields ty2
				8829
				8830	Overview:
				8831	"""""""""
				8832
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8833	The '``fpext``' extends a floating-point ``value`` to a larger floating-point
				8834	value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8835
				8836	Arguments:
				8837	""""""""""
				8838
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8839	The '``fpext``' instruction takes a :ref:`floating-point <t_floating>`
				8840	``value`` to cast, and a :ref:`floating-point <t_floating>` type to cast it
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8841	to. The source type must be smaller than the destination type.
				8842
				8843	Semantics:
				8844	""""""""""
				8845
				8846	The '``fpext``' instruction extends the ``value`` from a smaller
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8847	:ref:`floating-point <t_floating>` type to a larger :ref:`floating-point
				8848	<t_floating>` type. The ``fpext`` cannot be used to make a
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8849	no-op cast because it always changes bits. Use ``bitcast`` to make a
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8850	no-op cast for a floating-point cast.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8851
				8852	Example:
				8853	""""""""
				8854
				8855	.. code-block:: llvm
				8856
				8857	%X = fpext float 3.125 to double ; yields double:3.125000e+00
				8858	%Y = fpext double %X to fp128 ; yields fp128:0xL00000000000000004000900000000000
				8859
				8860	'``fptoui .. to``' Instruction
				8861	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8862
				8863	Syntax:
				8864	"""""""
				8865
				8866	::
				8867
				8868	<result> = fptoui <ty> <value> to <ty2> ; yields ty2
				8869
				8870	Overview:
				8871	"""""""""
				8872
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8873	The '``fptoui``' converts a floating-point ``value`` to its unsigned
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8874	integer equivalent of type ``ty2``.
				8875
				8876	Arguments:
				8877	""""""""""
				8878
				8879	The '``fptoui``' instruction takes a value to cast, which must be a
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8880	scalar or vector :ref:`floating-point <t_floating>` value, and a type to
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8881	cast it to ``ty2``, which must be an :ref:`integer <t_integer>` type. If
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8882	``ty`` is a vector floating-point type, ``ty2`` must be a vector integer
type with the same number of elements as ``ty``.
				8884
				8885	Semantics:
				8886	""""""""""
				8887
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8888	The '``fptoui``' instruction converts its :ref:`floating-point
				8889	<t_floating>` operand into the nearest (rounding towards zero)
Eli Friedman	c065bb2	2018-06-08 21:33:33 +0000	[diff] [blame]	8890	unsigned integer value. If the value cannot fit in ``ty2``, the result
				8891	is a :ref:`poison value <poisonvalues>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8892
				8893	Example:
				8894	""""""""
				8895
				8896	.. code-block:: llvm
				8897
				8898	%X = fptoui double 123.0 to i32 ; yields i32:123
   %Y = fptoui float 1.0E+300 to i1 ; yields poison (value does not fit in i1)
   %Z = fptoui float 1.04E+17 to i8 ; yields poison (value does not fit in i8)
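
The fit check can be sketched in C. This is a hypothetical model, not LLVM
code: the helper returns ``false`` where the IR result would be poison, because
C has no poison value and an out-of-range conversion is undefined in C as well.

.. code-block:: c

   #include <stdbool.h>
   #include <stdint.h>

   /* Returns false when the truncated value does not fit in 8 unsigned bits. */
   static bool fptoui_double_to_i8(double value, uint8_t *out) {
       /* Values in (-1.0, 256.0) round toward zero to 0..255.
          NaN fails both comparisons and is rejected. */
       if (!(value > -1.0 && value < 256.0)) {
           return false;
       }
       *out = (uint8_t)value;  /* C conversion also rounds toward zero */
       return true;
   }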
				8901
				8902	'``fptosi .. to``' Instruction
				8903	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8904
				8905	Syntax:
				8906	"""""""
				8907
				8908	::
				8909
				8910	<result> = fptosi <ty> <value> to <ty2> ; yields ty2
				8911
				8912	Overview:
				8913	"""""""""
				8914
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8915	The '``fptosi``' instruction converts :ref:`floating-point <t_floating>`
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8916	``value`` to type ``ty2``.
				8917
				8918	Arguments:
				8919	""""""""""
				8920
				8921	The '``fptosi``' instruction takes a value to cast, which must be a
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8922	scalar or vector :ref:`floating-point <t_floating>` value, and a type to
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8923	cast it to ``ty2``, which must be an :ref:`integer <t_integer>` type. If
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8924	``ty`` is a vector floating-point type, ``ty2`` must be a vector integer
type with the same number of elements as ``ty``.
				8926
				8927	Semantics:
				8928	""""""""""
				8929
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8930	The '``fptosi``' instruction converts its :ref:`floating-point
				8931	<t_floating>` operand into the nearest (rounding towards zero)
Eli Friedman	c065bb2	2018-06-08 21:33:33 +0000	[diff] [blame]	8932	signed integer value. If the value cannot fit in ``ty2``, the result
				8933	is a :ref:`poison value <poisonvalues>`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8934
				8935	Example:
				8936	""""""""
				8937
				8938	.. code-block:: llvm
				8939
				8940	%X = fptosi double -123.0 to i32 ; yields i32:-123
   %Y = fptosi float 1.0E-247 to i1 ; yields i1:false (rounds toward zero to 0)
   %Z = fptosi float 1.04E+17 to i8 ; yields poison (value does not fit in i8)
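
A signed counterpart can be sketched the same way. The helper name is
hypothetical, and a ``false`` return marks the cases where the IR result would
be poison.

.. code-block:: c

   #include <stdbool.h>
   #include <stdint.h>

   /* Returns false when the truncated value does not fit in 8 signed bits. */
   static bool fptosi_double_to_i8(double value, int8_t *out) {
       /* Values in (-129.0, 128.0) round toward zero to -128..127.
          NaN fails both comparisons and is rejected. */
       if (!(value > -129.0 && value < 128.0)) {
           return false;
       }
       *out = (int8_t)value;  /* C conversion also rounds toward zero */
       return true;
   }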
				8943
				8944	'``uitofp .. to``' Instruction
				8945	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8946
				8947	Syntax:
				8948	"""""""
				8949
				8950	::
				8951
				8952	<result> = uitofp <ty> <value> to <ty2> ; yields ty2
				8953
				8954	Overview:
				8955	"""""""""
				8956
				8957	The '``uitofp``' instruction regards ``value`` as an unsigned integer
				8958	and converts that value to the ``ty2`` type.
				8959
				8960	Arguments:
				8961	""""""""""
				8962
				8963	The '``uitofp``' instruction takes a value to cast, which must be a
				8964	scalar or vector :ref:`integer <t_integer>` value, and a type to cast it to
``ty2``, which must be a :ref:`floating-point <t_floating>` type. If
``ty`` is a vector integer type, ``ty2`` must be a vector floating-point
type with the same number of elements as ``ty``.
				8968
				8969	Semantics:
				8970	""""""""""
				8971
				8972	The '``uitofp``' instruction interprets its operand as an unsigned
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	8973	integer quantity and converts it to the corresponding floating-point
Eli Friedman	3f1ce09	2018-06-14 22:58:48 +0000	[diff] [blame]	8974	value. If the value cannot be exactly represented, it is rounded using
				8975	the default rounding mode.
				8976
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	8977
				8978	Example:
				8979	""""""""
				8980
				8981	.. code-block:: llvm
				8982
				8983	%X = uitofp i32 257 to float ; yields float:257.0
				8984	%Y = uitofp i8 -1 to double ; yields double:255.0
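
In C terms, the examples above correspond to converting an unsigned integer to
a floating-point type. The sketch below uses hypothetical helper names and
passes the operand as a raw bit pattern:

.. code-block:: c

   #include <stdint.h>

   /* The bits are read as an unsigned number before conversion. */
   static float uitofp_i32_to_float(uint32_t bits) {
       return (float)bits;
   }

   static double uitofp_i8_to_double(uint8_t bits) {
       return (double)bits;
   }

The ``i8`` constant ``-1`` is the bit pattern 0xFF, which reads as 255 when
interpreted as unsigned.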
				8985
				8986	'``sitofp .. to``' Instruction
				8987	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				8988
				8989	Syntax:
				8990	"""""""
				8991
				8992	::
				8993
				8994	<result> = sitofp <ty> <value> to <ty2> ; yields ty2
				8995
				8996	Overview:
				8997	"""""""""
				8998
				8999	The '``sitofp``' instruction regards ``value`` as a signed integer and
				9000	converts that value to the ``ty2`` type.
				9001
				9002	Arguments:
				9003	""""""""""
				9004
				9005	The '``sitofp``' instruction takes a value to cast, which must be a
				9006	scalar or vector :ref:`integer <t_integer>` value, and a type to cast it to
``ty2``, which must be a :ref:`floating-point <t_floating>` type. If
``ty`` is a vector integer type, ``ty2`` must be a vector floating-point
type with the same number of elements as ``ty``.
				9010
				9011	Semantics:
				9012	""""""""""
				9013
				9014	The '``sitofp``' instruction interprets its operand as a signed integer
Eli Friedman	3f1ce09	2018-06-14 22:58:48 +0000	[diff] [blame]	9015	quantity and converts it to the corresponding floating-point value. If the
				9016	value cannot be exactly represented, it is rounded using the default rounding
				9017	mode.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9018
				9019	Example:
				9020	""""""""
				9021
				9022	.. code-block:: llvm
				9023
				9024	%X = sitofp i32 257 to float ; yields float:257.0
				9025	%Y = sitofp i8 -1 to double ; yields double:-1.0
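
The rounding behavior can be illustrated with a C sketch, assuming IEEE 754
``float`` and the default round-to-nearest mode (the helper names are
hypothetical):

.. code-block:: c

   #include <stdint.h>

   /* The volatile store forces the result to be rounded to float precision. */
   static float sitofp_i32_to_float(int32_t value) {
       volatile float result = (float)value;
       return result;
   }

   static double sitofp_i8_to_double(int8_t value) {
       return (double)value;
   }

A ``float`` has a 24-bit significand, so 16777217 (2^24 + 1) is not exactly
representable and rounds to 16777216.0, while small values such as 257 convert
exactly.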
				9026
				9027	.. _i_ptrtoint:
				9028
				9029	'``ptrtoint .. to``' Instruction
				9030	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				9031
				9032	Syntax:
				9033	"""""""
				9034
				9035	::
				9036
				9037	<result> = ptrtoint <ty> <value> to <ty2> ; yields ty2
				9038
				9039	Overview:
				9040	"""""""""
				9041
				9042	The '``ptrtoint``' instruction converts the pointer or a vector of
				9043	pointers ``value`` to the integer (or vector of integers) type ``ty2``.
				9044
				9045	Arguments:
				9046	""""""""""
				9047
				9048	The '``ptrtoint``' instruction takes a ``value`` to cast, which must be
Ed Maste	8ed40ce	2015-04-14 20:52:58 +0000	[diff] [blame]	9049	a value of type :ref:`pointer <t_pointer>` or a vector of pointers, and a
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9050	type to cast it to ``ty2``, which must be an :ref:`integer <t_integer>` or
				9051	a vector of integers type.
				9052
				9053	Semantics:
				9054	""""""""""
				9055
				9056	The '``ptrtoint``' instruction converts ``value`` to integer type
				9057	``ty2`` by interpreting the pointer value as an integer and either
				9058	truncating or zero extending that value to the size of the integer type.
				9059	If ``value`` is smaller than ``ty2`` then a zero extension is done. If
				9060	``value`` is larger than ``ty2`` then a truncation is done. If they are
				9061	the same size, then nothing is done (no-op cast) other than a type
				9062	change.
				9063
				9064	Example:
				9065	""""""""
				9066
				9067	.. code-block:: llvm
				9068
				9069	%X = ptrtoint i32* %P to i8 ; yields truncation on 32-bit architecture
				9070	%Y = ptrtoint i32* %P to i64 ; yields zero extension on 32-bit architecture
				9071	%Z = ptrtoint <4 x i32*> %P to <4 x i64>; yields vector zero extension for a vector of addresses on 32-bit architecture
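
The size rules can be modeled in C for a target with 32-bit pointers. This is a
sketch only: the address is represented as a plain ``uint32_t`` rather than a
real pointer, and the helper names are hypothetical.

.. code-block:: c

   #include <stdint.h>

   /* Destination narrower than the pointer: truncate. */
   static uint8_t ptrtoint_p32_to_i8(uint32_t address) {
       return (uint8_t)address;
   }

   /* Destination wider than the pointer: zero extend (never sign extend). */
   static uint64_t ptrtoint_p32_to_i64(uint32_t address) {
       return (uint64_t)address;
   }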
				9072
				9073	.. _i_inttoptr:
				9074
				9075	'``inttoptr .. to``' Instruction
				9076	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				9077
				9078	Syntax:
				9079	"""""""
				9080
				9081	::
				9082
				9083	<result> = inttoptr <ty> <value> to <ty2> ; yields ty2
				9084
				9085	Overview:
				9086	"""""""""
				9087
				9088	The '``inttoptr``' instruction converts an integer ``value`` to a
				9089	pointer type, ``ty2``.
				9090
				9091	Arguments:
				9092	""""""""""
				9093
				9094	The '``inttoptr``' instruction takes an :ref:`integer <t_integer>` value to
				9095	cast, and a type to cast it to, which must be a :ref:`pointer <t_pointer>`
				9096	type.
				9097
				9098	Semantics:
				9099	""""""""""
				9100
				9101	The '``inttoptr``' instruction converts ``value`` to type ``ty2`` by
				9102	applying either a zero extension or a truncation depending on the size
				9103	of the integer ``value``. If ``value`` is larger than the size of a
				9104	pointer then a truncation is done. If ``value`` is smaller than the size
				9105	of a pointer then a zero extension is done. If they are the same size,
				9106	nothing is done (no-op cast).
				9107
				9108	Example:
				9109	""""""""
				9110
				9111	.. code-block:: llvm
				9112
				9113	%X = inttoptr i32 255 to i32* ; yields zero extension on 64-bit architecture
				9114	%Y = inttoptr i32 255 to i32* ; yields no-op on 32-bit architecture
				9115	%Z = inttoptr i64 0 to i32* ; yields truncation on 32-bit architecture
   %W = inttoptr <4 x i32> %G to <4 x i8*> ; yields truncation of vector G to four pointers
				9117
				9118	.. _i_bitcast:
				9119
				9120	'``bitcast .. to``' Instruction
				9121	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				9122
				9123	Syntax:
				9124	"""""""
				9125
				9126	::
				9127
				9128	<result> = bitcast <ty> <value> to <ty2> ; yields ty2
				9129
				9130	Overview:
				9131	"""""""""
				9132
				9133	The '``bitcast``' instruction converts ``value`` to type ``ty2`` without
				9134	changing any bits.
				9135
				9136	Arguments:
				9137	""""""""""
				9138
				9139	The '``bitcast``' instruction takes a value to cast, which must be a
				9140	non-aggregate first class value, and a type to cast it to, which must
Matt Arsenault	24b49c4	2013-07-31 17:49:08 +0000	[diff] [blame]	9141	also be a non-aggregate :ref:`first class <t_firstclass>` type. The
				9142	bit sizes of ``value`` and the destination type, ``ty2``, must be
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	9143	identical. If the source type is a pointer, the destination type must
Matt Arsenault	24b49c4	2013-07-31 17:49:08 +0000	[diff] [blame]	9144	also be a pointer of the same size. This instruction supports bitwise
				9145	conversion of vectors to integers and to vectors of other types (as
				9146	long as they have the same size).
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9147
				9148	Semantics:
				9149	""""""""""
				9150
Matt Arsenault	24b49c4	2013-07-31 17:49:08 +0000	[diff] [blame]	9151	The '``bitcast``' instruction converts ``value`` to type ``ty2``. It
				9152	is always a no-op cast because no bits change with this
				9153	conversion. The conversion is done as if the ``value`` had been stored
				9154	to memory and read back as type ``ty2``. Pointer (or vector of
				9155	pointers) types may only be converted to other pointer (or vector of
Matt Arsenault	b03bd4d	2013-11-15 01:34:59 +0000	[diff] [blame]	9156	pointers) types with the same address space through this instruction.
				9157	To convert pointers to other types, use the :ref:`inttoptr <i_inttoptr>`
				9158	or :ref:`ptrtoint <i_ptrtoint>` instructions first.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9159
				9160	Example:
				9161	""""""""
				9162
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	9163	.. code-block:: text
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9164
				9165	%X = bitcast i8 255 to i8 ; yields i8 :-1
   %Y = bitcast i32* %x to i16* ; yields i16*:%x
   %Z = bitcast <2 x i32> %V to i64 ; yields i64: %V
   %W = bitcast <2 x i32*> %P to <2 x i64*> ; yields <2 x i64*>
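
The "stored to memory and read back" description maps naturally onto
``memcpy`` in C. The following is a sketch with hypothetical helper names; the
expected bit pattern for ``1.0f`` assumes an IEEE 754 ``float``.

.. code-block:: c

   #include <stdint.h>
   #include <string.h>

   /* Reinterpret the 32 bits of a float as an integer without changing them. */
   static uint32_t bitcast_float_to_i32(float value) {
       uint32_t bits;
       memcpy(&bits, &value, sizeof bits);
       return bits;
   }

   /* Same 8 bits, read back as a signed value (two's complement assumed). */
   static int8_t bitcast_i8_to_signed(uint8_t value) {
       int8_t result;
       memcpy(&result, &value, sizeof result);
       return result;
   }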
				9169
Matt Arsenault	b03bd4d	2013-11-15 01:34:59 +0000	[diff] [blame]	9170	.. _i_addrspacecast:
				9171
				9172	'``addrspacecast .. to``' Instruction
				9173	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				9174
				9175	Syntax:
				9176	"""""""
				9177
				9178	::
				9179
				9180	<result> = addrspacecast <pty> <ptrval> to <pty2> ; yields pty2
				9181
				9182	Overview:
				9183	"""""""""
				9184
				9185	The '``addrspacecast``' instruction converts ``ptrval`` from ``pty`` in
				9186	address space ``n`` to type ``pty2`` in address space ``m``.
				9187
				9188	Arguments:
				9189	""""""""""
				9190
				9191	The '``addrspacecast``' instruction takes a pointer or vector of pointer value
				9192	to cast and a pointer type to cast it to, which must have a different
				9193	address space.
				9194
				9195	Semantics:
				9196	""""""""""
				9197
				9198	The '``addrspacecast``' instruction converts the pointer value
				9199	``ptrval`` to type ``pty2``. It can be a no-op cast or a complex
Matt Arsenault	54a2a17	2013-11-15 05:44:56 +0000	[diff] [blame]	9200	value modification, depending on the target and the address space
				9201	pair. Pointer conversions within the same address space must be
				9202	performed with the ``bitcast`` instruction. Note that if the address space
Matt Arsenault	b03bd4d	2013-11-15 01:34:59 +0000	[diff] [blame]	9203	conversion is legal then both result and operand refer to the same memory
				9204	location.
				9205
				9206	Example:
				9207	""""""""
				9208
				9209	.. code-block:: llvm
				9210
Matt Arsenault	9c13dd0	2013-11-15 22:43:50 +0000	[diff] [blame]	9211	%X = addrspacecast i32* %x to i32 addrspace(1)* ; yields i32 addrspace(1)*:%x
				9212	%Y = addrspacecast i32 addrspace(1)* %y to i64 addrspace(2)* ; yields i64 addrspace(2)*:%y
   %Z = addrspacecast <4 x i32*> %z to <4 x float addrspace(3)*> ; yields <4 x float addrspace(3)*>:%z
Matt Arsenault	b03bd4d	2013-11-15 01:34:59 +0000	[diff] [blame]	9214
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9215	.. _otherops:
				9216
				9217	Other Operations
				9218	----------------
				9219
				9220	The instructions in this category are the "miscellaneous" instructions,
				9221	which defy better classification.
				9222
				9223	.. _i_icmp:
				9224
				9225	'``icmp``' Instruction
				9226	^^^^^^^^^^^^^^^^^^^^^^
				9227
				9228	Syntax:
				9229	"""""""
				9230
				9231	::
				9232
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	9233	<result> = icmp <cond> <ty> <op1>, <op2> ; yields i1 or <N x i1>:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9234
				9235	Overview:
				9236	"""""""""
				9237
				9238	The '``icmp``' instruction returns a boolean value or a vector of
				9239	boolean values based on comparison of its two integer, integer vector,
				9240	pointer, or pointer vector operands.
				9241
				9242	Arguments:
				9243	""""""""""
				9244
				9245	The '``icmp``' instruction takes three operands. The first operand is
				9246	the condition code indicating the kind of comparison to perform. It is
Sanjay Patel	43d4144	2016-03-30 21:38:20 +0000	[diff] [blame]	9247	not a value, just a keyword. The possible condition codes are:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9248
				9249	#. ``eq``: equal
				9250	#. ``ne``: not equal
				9251	#. ``ugt``: unsigned greater than
				9252	#. ``uge``: unsigned greater or equal
				9253	#. ``ult``: unsigned less than
				9254	#. ``ule``: unsigned less or equal
				9255	#. ``sgt``: signed greater than
				9256	#. ``sge``: signed greater or equal
				9257	#. ``slt``: signed less than
				9258	#. ``sle``: signed less or equal
				9259
The remaining two arguments must be of :ref:`integer <t_integer>` or
:ref:`pointer <t_pointer>` type, or a :ref:`vector <t_vector>` of integers
or pointers. They must also have identical types.
				9263
				9264	Semantics:
				9265	""""""""""
				9266
				9267	The '``icmp``' compares ``op1`` and ``op2`` according to the condition
				9268	code given as ``cond``. The comparison performed always yields either an
				9269	:ref:`i1 <t_integer>` or vector of ``i1`` result, as follows:
				9270
				9271	#. ``eq``: yields ``true`` if the operands are equal, ``false``
				9272	otherwise. No sign interpretation is necessary or performed.
				9273	#. ``ne``: yields ``true`` if the operands are unequal, ``false``
				9274	otherwise. No sign interpretation is necessary or performed.
				9275	#. ``ugt``: interprets the operands as unsigned values and yields
				9276	``true`` if ``op1`` is greater than ``op2``.
				9277	#. ``uge``: interprets the operands as unsigned values and yields
				9278	``true`` if ``op1`` is greater than or equal to ``op2``.
				9279	#. ``ult``: interprets the operands as unsigned values and yields
				9280	``true`` if ``op1`` is less than ``op2``.
				9281	#. ``ule``: interprets the operands as unsigned values and yields
				9282	``true`` if ``op1`` is less than or equal to ``op2``.
				9283	#. ``sgt``: interprets the operands as signed values and yields ``true``
				9284	if ``op1`` is greater than ``op2``.
				9285	#. ``sge``: interprets the operands as signed values and yields ``true``
				9286	if ``op1`` is greater than or equal to ``op2``.
				9287	#. ``slt``: interprets the operands as signed values and yields ``true``
				9288	if ``op1`` is less than ``op2``.
				9289	#. ``sle``: interprets the operands as signed values and yields ``true``
				9290	if ``op1`` is less than or equal to ``op2``.
				9291
				9292	If the operands are :ref:`pointer <t_pointer>` typed, the pointer values
				9293	are compared as if they were integers.
				9294
				9295	If the operands are integer vectors, then they are compared element by
				9296	element. The result is an ``i1`` vector with the same number of elements
				9297	as the values being compared. Otherwise, the result is an ``i1``.
				9298
				9299	Example:
				9300	""""""""
				9301
.. code-block:: text

    <result> = icmp eq i32 4, 5          ; yields: result=false
    <result> = icmp ne float* %X, %X     ; yields: result=false
    <result> = icmp ult i16  4, 5        ; yields: result=true
    <result> = icmp sgt i16  4, 5        ; yields: result=false
    <result> = icmp ule i16 -4, 5        ; yields: result=false
    <result> = icmp sge i16  4, 5        ; yields: result=false
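
As a further sketch, the following functions show how the signedness of the
condition code matters for the same bit pattern, and how a vector comparison
yields one ``i1`` per element. The function names ``@in_range`` and
``@lanes_equal`` are illustrative only and are not part of LLVM.

.. code-block:: llvm

    ; Returns true if 0 <= %x < %n, assuming %n is non-negative when viewed
    ; as a signed value. A negative %x has its top bit set, so it is a very
    ; large unsigned value and fails the single 'ult' test.
    define i1 @in_range(i32 %x, i32 %n) {
      %ok = icmp ult i32 %x, %n
      ret i1 %ok
    }

    ; Compares two vectors element by element; the result has one i1 per
    ; element.
    define <4 x i1> @lanes_equal(<4 x i32> %a, <4 x i32> %b) {
      %eq = icmp eq <4 x i32> %a, %b
      ret <4 x i1> %eq
    }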
				9310
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9311	.. _i_fcmp:
				9312
				9313	'``fcmp``' Instruction
				9314	^^^^^^^^^^^^^^^^^^^^^^
				9315
				9316	Syntax:
				9317	"""""""
				9318
::

    <result> = fcmp [fast-math flags]* <cond> <ty> <op1>, <op2>     ; yields i1 or <N x i1>:result
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9322
				9323	Overview:
				9324	"""""""""
				9325
				9326	The '``fcmp``' instruction returns a boolean value or vector of boolean
				9327	values based on comparison of its operands.
				9328
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	9329	If the operands are floating-point scalars, then the result type is a
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9330	boolean (:ref:`i1 <t_integer>`).
				9331
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	9332	If the operands are floating-point vectors, then the result type is a
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9333	vector of boolean with the same number of elements as the operands being
				9334	compared.
				9335
				9336	Arguments:
				9337	""""""""""
				9338
				9339	The '``fcmp``' instruction takes three operands. The first operand is
				9340	the condition code indicating the kind of comparison to perform. It is
Sanjay Patel	43d4144	2016-03-30 21:38:20 +0000	[diff] [blame]	9341	not a value, just a keyword. The possible condition codes are:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9342
				9343	#. ``false``: no comparison, always returns false
				9344	#. ``oeq``: ordered and equal
				9345	#. ``ogt``: ordered and greater than
				9346	#. ``oge``: ordered and greater than or equal
				9347	#. ``olt``: ordered and less than
				9348	#. ``ole``: ordered and less than or equal
				9349	#. ``one``: ordered and not equal
				9350	#. ``ord``: ordered (no nans)
				9351	#. ``ueq``: unordered or equal
				9352	#. ``ugt``: unordered or greater than
				9353	#. ``uge``: unordered or greater than or equal
				9354	#. ``ult``: unordered or less than
				9355	#. ``ule``: unordered or less than or equal
				9356	#. ``une``: unordered or not equal
				9357	#. ``uno``: unordered (either nans)
				9358	#. ``true``: no comparison, always returns true
				9359
				9360	Ordered means that neither operand is a QNAN while unordered means
				9361	that either operand may be a QNAN.
				9362
Both ``op1`` and ``op2`` must be of either a :ref:`floating-point
<t_floating>` type or a :ref:`vector <t_vector>` of floating-point type.
They must have identical types.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9366
				9367	Semantics:
				9368	""""""""""
				9369
				9370	The '``fcmp``' instruction compares ``op1`` and ``op2`` according to the
				9371	condition code given as ``cond``. If the operands are vectors, then the
				9372	vectors are compared element by element. Each comparison performed
				9373	always yields an :ref:`i1 <t_integer>` result, as follows:
				9374
				9375	#. ``false``: always yields ``false``, regardless of operands.
				9376	#. ``oeq``: yields ``true`` if both operands are not a QNAN and ``op1``
				9377	is equal to ``op2``.
				9378	#. ``ogt``: yields ``true`` if both operands are not a QNAN and ``op1``
				9379	is greater than ``op2``.
				9380	#. ``oge``: yields ``true`` if both operands are not a QNAN and ``op1``
				9381	is greater than or equal to ``op2``.
				9382	#. ``olt``: yields ``true`` if both operands are not a QNAN and ``op1``
				9383	is less than ``op2``.
				9384	#. ``ole``: yields ``true`` if both operands are not a QNAN and ``op1``
				9385	is less than or equal to ``op2``.
				9386	#. ``one``: yields ``true`` if both operands are not a QNAN and ``op1``
				9387	is not equal to ``op2``.
				9388	#. ``ord``: yields ``true`` if both operands are not a QNAN.
				9389	#. ``ueq``: yields ``true`` if either operand is a QNAN or ``op1`` is
				9390	equal to ``op2``.
				9391	#. ``ugt``: yields ``true`` if either operand is a QNAN or ``op1`` is
				9392	greater than ``op2``.
				9393	#. ``uge``: yields ``true`` if either operand is a QNAN or ``op1`` is
				9394	greater than or equal to ``op2``.
				9395	#. ``ult``: yields ``true`` if either operand is a QNAN or ``op1`` is
				9396	less than ``op2``.
				9397	#. ``ule``: yields ``true`` if either operand is a QNAN or ``op1`` is
				9398	less than or equal to ``op2``.
				9399	#. ``une``: yields ``true`` if either operand is a QNAN or ``op1`` is
				9400	not equal to ``op2``.
				9401	#. ``uno``: yields ``true`` if either operand is a QNAN.
				9402	#. ``true``: always yields ``true``, regardless of operands.
				9403
James Molloy	88eb535	2015-07-10 12:52:00 +0000	[diff] [blame]	9404	The ``fcmp`` instruction can also optionally take any number of
				9405	:ref:`fast-math flags <fastmath>`, which are optimization hints to enable
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	9406	otherwise unsafe floating-point optimizations.
James Molloy	88eb535	2015-07-10 12:52:00 +0000	[diff] [blame]	9407
Any set of fast-math flags is legal on an ``fcmp`` instruction, but the
only flags that have any effect on its semantics are those that allow
assumptions to be made about the values of input arguments; namely
``nnan``, ``ninf``, and ``reassoc``. See :ref:`fastmath` for more information.
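
A minimal sketch of a flagged comparison follows. The function name
``@less_no_nan`` is hypothetical, and the comment only paraphrases the intent
of ``nnan``; the :ref:`fast-math flags <fastmath>` section remains the
authoritative description.

.. code-block:: llvm

    ; With 'nnan' the optimizer may assume that neither operand is a NaN.
    ; If an operand is a NaN anyway, the result should not be relied upon.
    define i1 @less_no_nan(float %a, float %b) {
      %lt = fcmp nnan olt float %a, %b
      ret i1 %lt
    }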
James Molloy	88eb535	2015-07-10 12:52:00 +0000	[diff] [blame]	9412
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9413	Example:
				9414	""""""""
				9415
.. code-block:: text

    <result> = fcmp oeq float 4.0, 5.0    ; yields: result=false
    <result> = fcmp one float 4.0, 5.0    ; yields: result=true
    <result> = fcmp olt float 4.0, 5.0    ; yields: result=true
    <result> = fcmp ueq double 1.0, 2.0   ; yields: result=false
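
The difference between ordered and unordered condition codes only shows up
when an operand is a NaN. The sketch below assumes that ``fdiv float 0.0, 0.0``
produces a NaN, as IEEE 754 division does; the function name ``@nan_compare``
is illustrative only.

.. code-block:: llvm

    define void @nan_compare() {
      ; 0.0 / 0.0 is a NaN, so %nan is unordered with every value,
      ; including itself.
      %nan = fdiv float 0.0, 0.0
      %a = fcmp oeq float %nan, %nan    ; yields: false (ordered and equal)
      %b = fcmp ueq float %nan, 1.0     ; yields: true  (unordered or equal)
      %c = fcmp ord float %nan, 1.0     ; yields: false (one operand is a NaN)
      %d = fcmp uno float %nan, 1.0     ; yields: true  (one operand is a NaN)
      %e = fcmp une float %nan, %nan    ; yields: true  (unordered or not equal)
      ret void
    }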
				9422
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9423	.. _i_phi:
				9424
				9425	'``phi``' Instruction
				9426	^^^^^^^^^^^^^^^^^^^^^
				9427
				9428	Syntax:
				9429	"""""""
				9430
::

    <result> = phi <ty> [ <val0>, <label0>], ...
				9434
				9435	Overview:
				9436	"""""""""
				9437
				9438	The '``phi``' instruction is used to implement the φ node in the SSA
				9439	graph representing the function.
				9440
				9441	Arguments:
				9442	""""""""""
				9443
				9444	The type of the incoming values is specified with the first type field.
				9445	After this, the '``phi``' instruction takes a list of pairs as
				9446	arguments, with one pair for each predecessor basic block of the current
				9447	block. Only values of :ref:`first class <t_firstclass>` type may be used as
				9448	the value arguments to the PHI node. Only labels may be used as the
				9449	label arguments.
				9450
				9451	There must be no non-phi instructions between the start of a basic block
				9452	and the PHI instructions: i.e. PHI instructions must be first in a basic
				9453	block.
				9454
				9455	For the purposes of the SSA form, the use of each incoming value is
				9456	deemed to occur on the edge from the corresponding predecessor block to
				9457	the current block (but after any definition of an '``invoke``'
				9458	instruction's return value on the same edge).
				9459
				9460	Semantics:
				9461	""""""""""
				9462
				9463	At runtime, the '``phi``' instruction logically takes on the value
				9464	specified by the pair corresponding to the predecessor basic block that
				9465	executed just prior to the current block.
				9466
				9467	Example:
				9468	""""""""
				9469
.. code-block:: llvm

    Loop:       ; Infinite loop that counts from 0 on up...
      %indvar = phi i32 [ 0, %LoopHeader ], [ %nextindvar, %Loop ]
      %nextindvar = add i32 %indvar, 1
      br label %Loop
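
For a complete function, the sketch below sums the integers from 0 up to
``%n - 1``. Every ``phi`` has exactly one incoming pair per predecessor of its
block. The function and value names are illustrative only.

.. code-block:: llvm

    ; Sums the integers 0 .. %n-1. Returns 0 when %n is not positive.
    define i32 @sum_below(i32 %n) {
    entry:
      %enter = icmp sgt i32 %n, 0
      br i1 %enter, label %loop, label %exit

    loop:
      ; Predecessors: %entry (first iteration) and %loop (back edge).
      %i   = phi i32 [ 0, %entry ], [ %i.next, %loop ]
      %acc = phi i32 [ 0, %entry ], [ %acc.next, %loop ]
      %acc.next = add i32 %acc, %i
      %i.next   = add i32 %i, 1
      %again = icmp slt i32 %i.next, %n
      br i1 %again, label %loop, label %exit

    exit:
      ; Reached from %entry (no iterations) or from %loop (after the last one).
      %result = phi i32 [ 0, %entry ], [ %acc.next, %loop ]
      ret i32 %result
    }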
				9476
				9477	.. _i_select:
				9478
				9479	'``select``' Instruction
				9480	^^^^^^^^^^^^^^^^^^^^^^^^
				9481
				9482	Syntax:
				9483	"""""""
				9484
::

    <result> = select selty <cond>, <ty> <val1>, <ty> <val2>             ; yields ty

    selty is either i1 or {<N x i1>}
				9490
				9491	Overview:
				9492	"""""""""
				9493
				9494	The '``select``' instruction is used to choose one value based on a
Joerg Sonnenberger	94321ec	2014-03-26 15:30:21 +0000	[diff] [blame]	9495	condition, without IR-level branching.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9496
				9497	Arguments:
				9498	""""""""""
				9499
				9500	The '``select``' instruction requires an 'i1' value or a vector of 'i1'
				9501	values indicating the condition, and two values of the same :ref:`first
David Majnemer	40a0b59	2015-03-03 22:45:47 +0000	[diff] [blame]	9502	class <t_firstclass>` type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9503
				9504	Semantics:
				9505	""""""""""
				9506
				9507	If the condition is an i1 and it evaluates to 1, the instruction returns
				9508	the first value argument; otherwise, it returns the second value
				9509	argument.
				9510
				9511	If the condition is a vector of i1, then the value arguments must be
				9512	vectors of the same size, and the selection is done element by element.
				9513
David Majnemer	40a0b59	2015-03-03 22:45:47 +0000	[diff] [blame]	9514	If the condition is an i1 and the value arguments are vectors of the
				9515	same size, then an entire vector is selected.
				9516
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9517	Example:
				9518	""""""""
				9519
.. code-block:: llvm

    %X = select i1 true, i8 17, i8 42          ; yields i8:17
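
The scalar and vector forms described above can be sketched as follows. The
function names ``@smax`` and ``@blend`` are illustrative only.

.. code-block:: llvm

    ; Signed maximum of two integers without an IR-level branch.
    define i32 @smax(i32 %a, i32 %b) {
      %a.gt = icmp sgt i32 %a, %b
      %max = select i1 %a.gt, i32 %a, i32 %b
      ret i32 %max
    }

    ; Element-wise selection: element i of the result comes from %x where
    ; element i of %mask is true, and from %y otherwise.
    define <4 x float> @blend(<4 x i1> %mask, <4 x float> %x, <4 x float> %y) {
      %r = select <4 x i1> %mask, <4 x float> %x, <4 x float> %y
      ret <4 x float> %r
    }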
				9523
				9524	.. _i_call:
				9525
				9526	'``call``' Instruction
				9527	^^^^^^^^^^^^^^^^^^^^^^
				9528
				9529	Syntax:
				9530	"""""""
				9531
::

    <result> = [tail | musttail | notail ] call [fast-math flags] [cconv] [ret attrs] [addrspace(<num>)]
               <ty>|<fnty> <fnptrval>(<function args>) [fn attrs] [ operand bundles ]
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9536
				9537	Overview:
				9538	"""""""""
				9539
				9540	The '``call``' instruction represents a simple function call.
				9541
				9542	Arguments:
				9543	""""""""""
				9544
				9545	This instruction requires several arguments:
				9546
Reid Kleckner	5772b77	2014-04-24 20:14:34 +0000	[diff] [blame]	9547	#. The optional ``tail`` and ``musttail`` markers indicate that the optimizers
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	9548	should perform tail call optimization. The ``tail`` marker is a hint that
				9549	`can be ignored <CodeGenerator.html#sibcallopt>`_. The ``musttail`` marker
Reid Kleckner	5772b77	2014-04-24 20:14:34 +0000	[diff] [blame]	9550	means that the call must be tail call optimized in order for the program to
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	9551	be correct. The ``musttail`` marker provides these guarantees:
Reid Kleckner	5772b77	2014-04-24 20:14:34 +0000	[diff] [blame]	9552
				9553	#. The call will not cause unbounded stack growth if it is part of a
				9554	recursive cycle in the call graph.
				9555	#. Arguments with the :ref:`inalloca <attr_inalloca>` attribute are
				9556	forwarded in place.
				9557
Florian Hahn	edae5a6	2018-01-17 23:29:25 +0000	[diff] [blame]	9558	Both markers imply that the callee does not access allocas from the caller.
				9559	The ``tail`` marker additionally implies that the callee does not access
				9560	varargs from the caller, while ``musttail`` implies that varargs from the
				9561	caller are passed to the callee. Calls marked ``musttail`` must obey the
				9562	following additional rules:
Reid Kleckner	5772b77	2014-04-24 20:14:34 +0000	[diff] [blame]	9563
				9564	- The call must immediately precede a :ref:`ret <i_ret>` instruction,
				9565	or a pointer bitcast followed by a ret instruction.
				9566	- The ret instruction must return the (possibly bitcasted) value
				9567	produced by the call or void.
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	9568	- The caller and callee prototypes must match. Pointer types of
Reid Kleckner	5772b77	2014-04-24 20:14:34 +0000	[diff] [blame]	9569	parameters or return types may differ in pointee type, but not
				9570	in address space.
				9571	- The calling conventions of the caller and callee must match.
				9572	- All ABI-impacting function attributes, such as sret, byval, inreg,
				9573	returned, and inalloca, must match.
Reid Kleckner	8349864	2014-08-26 00:33:28 +0000	[diff] [blame]	9574	- The callee must be varargs iff the caller is varargs. Bitcasting a
				9575	non-varargs function to the appropriate varargs type is legal so
				9576	long as the non-varargs prefixes obey the other rules.
Reid Kleckner	5772b77	2014-04-24 20:14:34 +0000	[diff] [blame]	9577
				9578	Tail call optimization for calls marked ``tail`` is guaranteed to occur if
				9579	the following conditions are met:
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9580
				9581	- Caller and callee both have the calling convention ``fastcc``.
				9582	- The call is in tail position (ret immediately follows call and ret
				9583	uses value of call or is void).
				9584	- Option ``-tailcallopt`` is enabled, or
				9585	``llvm::GuaranteedTailCallOpt`` is ``true``.
Alp Toker	cf21875	2014-06-30 18:57:16 +0000	[diff] [blame]	9586	- `Platform-specific constraints are
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9587	met. <CodeGenerator.html#tailcallopt>`_
				9588
Akira Hatanaka	5cfcce12	2015-11-06 23:55:38 +0000	[diff] [blame]	9589	#. The optional ``notail`` marker indicates that the optimizers should not add
				9590	``tail`` or ``musttail`` markers to the call. It is used to prevent tail
				9591	call optimization from being performed on the call.
				9592
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	9593	#. The optional ``fast-math flags`` marker indicates that the call has one or more
Sanjay Patel	fa54ace	2015-12-14 21:59:03 +0000	[diff] [blame]	9594	:ref:`fast-math flags <fastmath>`, which are optimization hints to enable
				9595	otherwise unsafe floating-point optimizations. Fast-math flags are only valid
				9596	for calls that return a floating-point scalar or vector type.
				9597
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9598	#. The optional "cconv" marker indicates which :ref:`calling
				9599	convention <callingconv>` the call should use. If none is
				9600	specified, the call defaults to using C calling conventions. The
				9601	calling convention of the call must match the calling convention of
				9602	the target function, or else the behavior is undefined.
				9603	#. The optional :ref:`Parameter Attributes <paramattrs>` list for return
				9604	values. Only '``zeroext``', '``signext``', and '``inreg``' attributes
				9605	are valid here.
#. The optional addrspace attribute can be used to indicate the address space
				9607	of the called function. If it is not specified, the program address space
				9608	from the :ref:`datalayout string<langref_datalayout>` will be used.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9609	#. '``ty``': the type of the call instruction itself which is also the
				9610	type of the return value. Functions that return no value are marked
				9611	``void``.
David Blaikie	b83cf10	2016-07-13 17:21:34 +0000	[diff] [blame]	9612	#. '``fnty``': shall be the signature of the function being called. The
				9613	argument types must match the types implied by this signature. This
				9614	type can be omitted if the function is not varargs.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9615	#. '``fnptrval``': An LLVM value containing a pointer to a function to
David Blaikie	b83cf10	2016-07-13 17:21:34 +0000	[diff] [blame]	9616	be called. In most cases, this is a direct function call, but
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9617	indirect ``call``'s are just as possible, calling an arbitrary pointer
				9618	to function value.
				9619	#. '``function args``': argument list whose types match the function
				9620	signature argument types and parameter attributes. All arguments must
				9621	be of :ref:`first class <t_firstclass>` type. If the function signature
				9622	indicates the function accepts a variable number of arguments, the
				9623	extra arguments can be specified.
George Burgess IV	39c9105	2017-04-13 04:01:55 +0000	[diff] [blame]	9624	#. The optional :ref:`function attributes <fnattrs>` list.
Sanjoy Das	b513a9f	2015-09-24 23:34:52 +0000	[diff] [blame]	9625	#. The optional :ref:`operand bundles <opbundles>` list.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9626
				9627	Semantics:
				9628	""""""""""
				9629
				9630	The '``call``' instruction is used to cause control flow to transfer to
				9631	a specified function, with its incoming arguments bound to the specified
				9632	values. Upon a '``ret``' instruction in the called function, control
				9633	flow continues with the instruction after the function call, and the
				9634	return value of the function is bound to the result argument.
				9635
				9636	Example:
				9637	""""""""
				9638
.. code-block:: llvm

    %retval = call i32 @test(i32 %argc)
    call i32 (i8*, ...) @printf(i8* %msg, i32 12, i8 42)   ; yields i32
    %X = tail call i32 @foo()                              ; yields i32
    %Y = tail call fastcc i32 @foo()                       ; yields i32
    call void %foo(i8 signext 97)

    %struct.A = type { i32, i8 }
    %r = call %struct.A @foo()                             ; yields { i32, i8 }
    %gr = extractvalue %struct.A %r, 0                     ; yields i32
    %gr1 = extractvalue %struct.A %r, 1                    ; yields i8
    call void @foo() noreturn                              ; indicates that @foo never returns normally
    %ZZ = call zeroext i32 @bar()                          ; return value is zero extended
				9653
LLVM treats calls to some functions whose names and arguments match
the standard C99 library as calls to the C99 library functions, and may
perform optimizations or generate code for them under that assumption.
This is something we'd like to change in the future to provide better
support for freestanding environments and non-C-based languages.
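
As a sketch of the ``musttail`` rules and of an indirect call, consider the
functions below. The names ``@callee``, ``@forward`` and ``@apply`` are
hypothetical. In ``@forward`` the caller and callee prototypes and calling
conventions match, and the call immediately precedes a ``ret`` of its value.

.. code-block:: llvm

    declare i32 @callee(i32, i8*)

    ; A 'musttail' call: the call must be followed directly by a 'ret' that
    ; returns the value produced by the call.
    define i32 @forward(i32 %n, i8* %p) {
      %r = musttail call i32 @callee(i32 %n, i8* %p)
      ret i32 %r
    }

    ; An indirect call through a function pointer, with a return attribute.
    define zeroext i8 @apply(i8 (i8)* %fn, i8 %v) {
      %r = call zeroext i8 %fn(i8 %v)
      ret i8 %r
    }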
				9659
				9660	.. _i_va_arg:
				9661
				9662	'``va_arg``' Instruction
				9663	^^^^^^^^^^^^^^^^^^^^^^^^
				9664
				9665	Syntax:
				9666	"""""""
				9667
::

    <resultval> = va_arg <va_list*> <arglist>, <argty>
				9671
				9672	Overview:
				9673	"""""""""
				9674
				9675	The '``va_arg``' instruction is used to access arguments passed through
				9676	the "variable argument" area of a function call. It is used to implement
				9677	the ``va_arg`` macro in C.
				9678
				9679	Arguments:
				9680	""""""""""
				9681
				9682	This instruction takes a ``va_list*`` value and the type of the
				9683	argument. It returns a value of the specified argument type and
				9684	increments the ``va_list`` to point to the next argument. The actual
				9685	type of ``va_list`` is target specific.
				9686
				9687	Semantics:
				9688	""""""""""
				9689
				9690	The '``va_arg``' instruction loads an argument of the specified type
				9691	from the specified ``va_list`` and causes the ``va_list`` to point to
				9692	the next argument. For more information, see the variable argument
				9693	handling :ref:`Intrinsic Functions <int_varargs>`.
				9694
				9695	It is legal for this instruction to be called in a function which does
				9696	not take a variable number of arguments, for example, the ``vfprintf``
				9697	function.
				9698
				9699	``va_arg`` is an LLVM instruction instead of an :ref:`intrinsic
				9700	function <intrinsics>` because it takes a type as an argument.
				9701
				9702	Example:
				9703	""""""""
				9704
				9705	See the :ref:`variable argument processing <int_varargs>` section.
				9706
				9707	Note that the code generator does not yet fully support va\_arg on many
				9708	targets. Also, it does not currently support va\_arg with aggregate
				9709	types on any target.
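
For orientation, a minimal sketch of reading one variadic argument is shown
here. It assumes a target where ``va_list`` is a single ``i8*``; the actual
layout is target specific, and the function name ``@first_vararg`` is
illustrative only.

.. code-block:: llvm

    ; Assumed va_list layout: a single i8*. The real layout is target specific.
    %struct.va_list = type { i8* }

    declare void @llvm.va_start(i8*)
    declare void @llvm.va_end(i8*)

    define i32 @first_vararg(i32 %count, ...) {
      ; Initialize variable argument processing.
      %ap = alloca %struct.va_list
      %ap2 = bitcast %struct.va_list* %ap to i8*
      call void @llvm.va_start(i8* %ap2)

      ; Read one i32 and advance the va_list to the next argument.
      %val = va_arg i8* %ap2, i32

      ; Stop processing of arguments.
      call void @llvm.va_end(i8* %ap2)
      ret i32 %val
    }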
				9710
				9711	.. _i_landingpad:
				9712
				9713	'``landingpad``' Instruction
				9714	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				9715
				9716	Syntax:
				9717	"""""""
				9718
::

    <resultval> = landingpad <resultty> <clause>+
    <resultval> = landingpad <resultty> cleanup <clause>*

    <clause> := catch <type> <value>
    <clause> := filter <array constant type> <array constant>
				9726
				9727	Overview:
				9728	"""""""""
				9729
				9730	The '``landingpad``' instruction is used by `LLVM's exception handling
				9731	system <ExceptionHandling.html#overview>`_ to specify that a basic block
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	9732	is a landing pad --- one where the exception lands, and corresponds to the
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9733	code found in the ``catch`` portion of a ``try``/``catch`` sequence. It
David Majnemer	7fddecc	2015-06-17 20:52:32 +0000	[diff] [blame]	9734	defines values supplied by the :ref:`personality function <personalityfn>` upon
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9735	re-entry to the function. The ``resultval`` has the type ``resultty``.
				9736
				9737	Arguments:
				9738	""""""""""
				9739
The optional ``cleanup`` flag indicates that the landing pad block is a
cleanup.
				9742
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	9743	A ``clause`` begins with the clause type --- ``catch`` or ``filter`` --- and
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9744	contains the global variable representing the "type" that may be caught
				9745	or filtered respectively. Unlike the ``catch`` clause, the ``filter``
				9746	clause takes an array constant as its argument. Use
				9747	"``[0 x i8**] undef``" for a filter which cannot throw. The
				9748	'``landingpad``' instruction must contain at least one ``clause`` or
				9749	the ``cleanup`` flag.
				9750
				9751	Semantics:
				9752	""""""""""
				9753
				9754	The '``landingpad``' instruction defines the values which are set by the
David Majnemer	7fddecc	2015-06-17 20:52:32 +0000	[diff] [blame]	9755	:ref:`personality function <personalityfn>` upon re-entry to the function, and
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9756	therefore the "result type" of the ``landingpad`` instruction. As with
				9757	calling conventions, how the personality function results are
				9758	represented in LLVM IR is target specific.
				9759
				9760	The clauses are applied in order from top to bottom. If two
				9761	``landingpad`` instructions are merged together through inlining, the
				9762	clauses from the calling function are appended to the list of clauses.
				9763	When the call stack is being unwound due to an exception being thrown,
				9764	the exception is compared against each ``clause`` in turn. If it doesn't
				9765	match any of the clauses, and the ``cleanup`` flag is not set, then
				9766	unwinding continues further up the call stack.
				9767
				9768	The ``landingpad`` instruction has several restrictions:
				9769
				9770	- A landing pad block is a basic block which is the unwind destination
				9771	of an '``invoke``' instruction.
				9772	- A landing pad block must have a '``landingpad``' instruction as its
				9773	first non-PHI instruction.
				9774	- There can be only one '``landingpad``' instruction within the landing
				9775	pad block.
				9776	- A basic block that is not a landing pad block may not include a
				9777	'``landingpad``' instruction.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9778
				9779	Example:
				9780	""""""""
				9781
.. code-block:: llvm

    ;; A landing pad which can catch an integer.
    %res = landingpad { i8*, i32 }
             catch i8** @_ZTIi
    ;; A landing pad that is a cleanup.
    %res = landingpad { i8*, i32 }
             cleanup
    ;; A landing pad which can catch an integer and can only throw a double.
    %res = landingpad { i8*, i32 }
             catch i8** @_ZTIi
             filter [1 x i8**] [@_ZTId]
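
To show a landing pad in context, the sketch below pairs it with the
``invoke`` whose unwind destination it is. It assumes the Itanium C++
personality routine ``@__gxx_personality_v0``; ``@may_throw`` and ``@guarded``
are hypothetical names.

.. code-block:: llvm

    @_ZTIi = external constant i8*

    declare void @may_throw()
    declare i32 @__gxx_personality_v0(...)

    define i32 @guarded() personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) {
    entry:
      invoke void @may_throw()
              to label %ok unwind label %lpad

    ok:
      ret i32 0

    lpad:
      ; The landingpad is the first non-PHI instruction of the unwind
      ; destination block.
      %res = landingpad { i8*, i32 }
              cleanup
              catch i8** @_ZTIi
      ; This sketch does not handle the exception; it resumes unwinding.
      resume { i8*, i32 } %res
    }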
				9794
Joseph Tremoulet	2adaa98	2016-01-10 04:46:10 +0000	[diff] [blame]	9795	.. _i_catchpad:
				9796
				9797	'``catchpad``' Instruction
				9798	^^^^^^^^^^^^^^^^^^^^^^^^^^
				9799
				9800	Syntax:
				9801	"""""""
				9802
::

    <resultval> = catchpad within <catchswitch> [<args>*]
				9806
				9807	Overview:
				9808	"""""""""
				9809
				9810	The '``catchpad``' instruction is used by `LLVM's exception handling
				9811	system <ExceptionHandling.html#overview>`_ to specify that a basic block
				9812	begins a catch handler --- one where a personality routine attempts to transfer
				9813	control to catch an exception.
				9814
				9815	Arguments:
				9816	""""""""""
				9817
				9818	The ``catchswitch`` operand must always be a token produced by a
				9819	:ref:`catchswitch <i_catchswitch>` instruction in a predecessor block. This
				9820	ensures that each ``catchpad`` has exactly one predecessor block, and it always
				9821	terminates in a ``catchswitch``.
				9822
				9823	The ``args`` correspond to whatever information the personality routine
				9824	requires to know if this is an appropriate handler for the exception. Control
				9825	will transfer to the ``catchpad`` if this is the first appropriate handler for
				9826	the exception.
				9827
				9828	The ``resultval`` has the type :ref:`token <t_token>` and is used to match the
				9829	``catchpad`` to corresponding :ref:`catchrets <i_catchret>` and other nested EH
				9830	pads.
				9831
				9832	Semantics:
				9833	""""""""""
				9834
				9835	When the call stack is being unwound due to an exception being thrown, the
				9836	exception is compared against the ``args``. If it doesn't match, control will
				9837	not reach the ``catchpad`` instruction. The representation of ``args`` is
				9838	entirely target and personality function-specific.
				9839
				9840	Like the :ref:`landingpad <i_landingpad>` instruction, the ``catchpad``
				9841	instruction must be the first non-phi of its parent basic block.
				9842
				9843	The meaning of the tokens produced and consumed by ``catchpad`` and other "pad"
				9844	instructions is described in the
				9845	`Windows exception handling documentation\ <ExceptionHandling.html#wineh>`_.
				9846
				9847	When a ``catchpad`` has been "entered" but not yet "exited" (as
				9848	described in the `EH documentation\ <ExceptionHandling.html#wineh-constraints>`_),
				9849	it is undefined behavior to execute a :ref:`call <i_call>` or :ref:`invoke <i_invoke>`
				9850	that does not carry an appropriate :ref:`"funclet" bundle <ob_funclet>`.
				9851
				9852	Example:
				9853	""""""""
				9854
.. code-block:: text

    dispatch:
      %cs = catchswitch within none [label %handler0] unwind to caller
      ;; A catch block which can catch an integer.
    handler0:
      %tok = catchpad within %cs [i8** @_ZTIi]
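
A fuller sketch, including the ``invoke`` that unwinds to the dispatch block
and the ``catchret`` that leaves the handler, might look as follows. It
assumes the MSVC C++ personality ``@__CxxFrameHandler3``, and the ``args``
shown are an assumed encoding of a catch-all handler for that personality.
``@may_throw`` and ``@f`` are hypothetical names.

.. code-block:: llvm

    declare void @may_throw()
    declare i32 @__CxxFrameHandler3(...)

    define void @f() personality i8* bitcast (i32 (...)* @__CxxFrameHandler3 to i8*) {
    entry:
      invoke void @may_throw()
              to label %done unwind label %dispatch

    dispatch:
      %cs = catchswitch within none [label %handler0] unwind to caller

    handler0:
      ; The args are interpreted only by the personality routine.
      %tok = catchpad within %cs [i8* null, i32 64, i8* null]
      catchret from %tok to label %done

    done:
      ret void
    }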
				9862
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	9863	.. _i_cleanuppad:
				9864
				9865	'``cleanuppad``' Instruction
				9866	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				9867
				9868	Syntax:
				9869	"""""""
				9870
::

    <resultval> = cleanuppad within <parent> [<args>*]
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	9874
				9875	Overview:
				9876	"""""""""
				9877
				9878	The '``cleanuppad``' instruction is used by `LLVM's exception handling
				9879	system <ExceptionHandling.html#overview>`_ to specify that a basic block
				9880	is a cleanup block --- one where a personality routine attempts to
				9881	transfer control to run cleanup actions.
				9882	The ``args`` correspond to whatever additional
				9883	information the :ref:`personality function <personalityfn>` requires to
				9884	execute the cleanup.
Joseph Tremoulet	8220bcc	2015-08-23 00:26:33 +0000	[diff] [blame]	9885	The ``resultval`` has the type :ref:`token <t_token>` and is used to
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	9886	match the ``cleanuppad`` to corresponding :ref:`cleanuprets <i_cleanupret>`.
				9887	The ``parent`` argument is the token of the funclet that contains the
				9888	``cleanuppad`` instruction. If the ``cleanuppad`` is not inside a funclet,
				9889	this operand may be the token ``none``.
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	9890
				9891	Arguments:
				9892	""""""""""
				9893
				9894	The instruction takes a list of arbitrary values which are interpreted
				9895	by the :ref:`personality function <personalityfn>`.
				9896
				9897	Semantics:
				9898	""""""""""
				9899
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	9900	When the call stack is being unwound due to an exception being thrown,
				9901	the :ref:`personality function <personalityfn>` transfers control to the
				9902	``cleanuppad`` with the aid of the personality-specific arguments.
Joseph Tremoulet	9ce71f7	2015-09-03 09:09:43 +0000	[diff] [blame]	9903	As with calling conventions, how the personality function results are
				9904	represented in LLVM IR is target specific.
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	9905
				9906	The ``cleanuppad`` instruction has several restrictions:
				9907
				9908	- A cleanup block is a basic block which is the unwind destination of
				9909	an exceptional instruction.
				9910	- A cleanup block must have a '``cleanuppad``' instruction as its
				9911	first non-PHI instruction.
				9912	- There can be only one '``cleanuppad``' instruction within the
				9913	cleanup block.
				9914	- A basic block that is not a cleanup block may not include a
				9915	'``cleanuppad``' instruction.
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	9916
Joseph Tremoulet	e28885e	2016-01-10 04:28:38 +0000	[diff] [blame]	9917	When a ``cleanuppad`` has been "entered" but not yet "exited" (as
				9918	described in the `EH documentation\ <ExceptionHandling.html#wineh-constraints>`_),
				9919	it is undefined behavior to execute a :ref:`call <i_call>` or :ref:`invoke <i_invoke>`
				9920	that does not carry an appropriate :ref:`"funclet" bundle <ob_funclet>`.
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	9921
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	9922	Example:
				9923	""""""""
				9924
Renato Golin	124f259	2016-07-20 12:16:38 +0000	[diff] [blame]	9925	.. code-block:: text
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	9926
David Majnemer	8a1c45d	2015-12-12 05:38:55 +0000	[diff] [blame]	9927	%tok = cleanuppad within %cs []
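
As a rough sketch of how the pieces above fit together, the following
function places a ``cleanuppad`` at the unwind destination of an
``invoke``, calls a cleanup routine with a "funclet" bundle, and leaves
the funclet with a ``cleanupret``. The functions ``@may_throw`` and
``@run_cleanup`` are hypothetical, and the choice of
``@__CxxFrameHandler3`` as the personality is only an assumption made
for illustration.

.. code-block:: llvm

      declare void @may_throw()                ; hypothetical callee
      declare void @run_cleanup()              ; hypothetical cleanup routine
      declare i32 @__CxxFrameHandler3(...)     ; assumed personality function

      define void @f() personality i8* bitcast (i32 (...)* @__CxxFrameHandler3 to i8*) {
      entry:
        invoke void @may_throw()
                to label %done unwind label %cleanup

      done:
        ret void

      cleanup:
        ; First non-PHI instruction of the cleanup block; not nested in a funclet.
        %tok = cleanuppad within none []
        ; Calls made while the pad is entered carry a "funclet" bundle.
        call void @run_cleanup() [ "funclet"(token %tok) ]
        ; Exit the cleanup and continue unwinding into the caller.
        cleanupret from %tok unwind to caller
      }
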
David Majnemer	654e130	2015-07-31 17:58:14 +0000	[diff] [blame]	9928
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	9929	.. _intrinsics:
				9930
				9931	Intrinsic Functions
				9932	===================
				9933
				9934	LLVM supports the notion of an "intrinsic function". These functions
				9935	have well known names and semantics and are required to follow certain
				9936	restrictions. Overall, these intrinsics represent an extension mechanism
				9937	for the LLVM language that does not require changing all of the
				9938	transformations in LLVM when adding to the language (or the bitcode
				9939	reader/writer, the parser, etc...).
				9940
				9941	Intrinsic function names must all start with an "``llvm.``" prefix. This
				9942	prefix is reserved in LLVM for intrinsic names; thus, function names may
				9943	not begin with this prefix. Intrinsic functions must always be external
				9944	functions: you cannot define the body of intrinsic functions. Intrinsic
				9945	functions may only be used in call or invoke instructions: it is illegal
to take the address of an intrinsic function. Additionally, because
intrinsic functions are part of the LLVM language, any newly added
intrinsic must be documented here.
				9949
				9950	Some intrinsic functions can be overloaded, i.e., the intrinsic
				9951	represents a family of functions that perform the same operation but on
				9952	different data types. Because LLVM can represent over 8 million
				9953	different integer types, overloading is used commonly to allow an
				9954	intrinsic function to operate on any integer type. One or more of the
				9955	argument types or the result type can be overloaded to accept any
				9956	integer type. Argument types may also be defined as exactly matching a
				9957	previous argument's type or the result type. This allows an intrinsic
				9958	function which accepts multiple arguments, but needs all of them to be
				9959	of the same type, to only be overloaded with respect to a single
				9960	argument or the result.
				9961
An overloaded intrinsic has the names of its overloaded argument
types encoded into its function name, each preceded by a period. Only
				9964	those types which are overloaded result in a name suffix. Arguments
				9965	whose type is matched against another type do not. For example, the
				9966	``llvm.ctpop`` function can take an integer of any width and returns an
				9967	integer of exactly the same integer width. This leads to a family of
				9968	functions such as ``i8 @llvm.ctpop.i8(i8 %val)`` and
				9969	``i29 @llvm.ctpop.i29(i29 %val)``. Only one type, the return type, is
				9970	overloaded, and only one type suffix is required. Because the argument's
				9971	type is matched against the return type, it does not require its own
				9972	name suffix.
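
The naming rule can be illustrated with a few declarations of
``llvm.ctpop``. This is a small sketch; the wrapper function
``@popcount8`` is a hypothetical name used only for the example.

.. code-block:: llvm

      ; One overloaded type, so one suffix per declaration.
      declare i8 @llvm.ctpop.i8(i8)
      declare i29 @llvm.ctpop.i29(i29)
      declare <4 x i32> @llvm.ctpop.v4i32(<4 x i32>)

      define i8 @popcount8(i8 %x) {
        ; The argument type is matched against the return type,
        ; so it contributes no suffix of its own.
        %n = call i8 @llvm.ctpop.i8(i8 %x)
        ret i8 %n
      }
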
				9973
				9974	To learn how to add an intrinsic function, please see the `Extending
				9975	LLVM Guide <ExtendingLLVM.html>`_.
				9976
				9977	.. _int_varargs:
				9978
				9979	Variable Argument Handling Intrinsics
				9980	-------------------------------------
				9981
				9982	Variable argument support is defined in LLVM with the
				9983	:ref:`va_arg <i_va_arg>` instruction and these three intrinsic
				9984	functions. These functions are related to the similarly named macros
				9985	defined in the ``<stdarg.h>`` header file.
				9986
				9987	All of these functions operate on arguments that use a target-specific
				9988	value type "``va_list``". The LLVM assembly language reference manual
				9989	does not define what this type is, so all transformations should be
				9990	prepared to handle these functions regardless of the type used.
				9991
				9992	This example shows how the :ref:`va_arg <i_va_arg>` instruction and the
				9993	variable argument handling intrinsic functions are used.
				9994
				9995	.. code-block:: llvm
				9996
Tim Northover	ab60bb9	2014-11-02 01:21:51 +0000	[diff] [blame]	9997	; This struct is different for every platform. For most platforms,
				9998	; it is merely an i8*.
				9999	%struct.va_list = type { i8* }
				10000
				10001	; For Unix x86_64 platforms, va_list is the following struct:
; %struct.va_list = type { i32, i32, i8*, i8* }
				10003
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10004	define i32 @test(i32 %X, ...) {
				10005	; Initialize variable argument processing
Tim Northover	ab60bb9	2014-11-02 01:21:51 +0000	[diff] [blame]	10006	%ap = alloca %struct.va_list
				10007	%ap2 = bitcast %struct.va_list* %ap to i8*
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10008	call void @llvm.va_start(i8* %ap2)
				10009
				10010	; Read a single integer argument
Tim Northover	ab60bb9	2014-11-02 01:21:51 +0000	[diff] [blame]	10011	%tmp = va_arg i8* %ap2, i32
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10012
				10013	; Demonstrate usage of llvm.va_copy and llvm.va_end
				10014	%aq = alloca i8*
				10015	%aq2 = bitcast i8** %aq to i8*
				10016	call void @llvm.va_copy(i8* %aq2, i8* %ap2)
				10017	call void @llvm.va_end(i8* %aq2)
				10018
				10019	; Stop processing of arguments.
				10020	call void @llvm.va_end(i8* %ap2)
				10021	ret i32 %tmp
				10022	}
				10023
				10024	declare void @llvm.va_start(i8*)
declare void @llvm.va_copy(i8*, i8*)
				10026	declare void @llvm.va_end(i8*)
				10027
				10028	.. _int_va_start:
				10029
				10030	'``llvm.va_start``' Intrinsic
				10031	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10032
				10033	Syntax:
				10034	"""""""
				10035
				10036	::
				10037
Nick Lewycky	04f6de0	2013-09-11 22:04:52 +0000	[diff] [blame]	10038	declare void @llvm.va_start(i8* <arglist>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10039
				10040	Overview:
				10041	"""""""""
				10042
				10043	The '``llvm.va_start``' intrinsic initializes ``*<arglist>`` for
				10044	subsequent use by ``va_arg``.
				10045
				10046	Arguments:
				10047	""""""""""
				10048
				10049	The argument is a pointer to a ``va_list`` element to initialize.
				10050
				10051	Semantics:
				10052	""""""""""
				10053
				10054	The '``llvm.va_start``' intrinsic works just like the ``va_start`` macro
				10055	available in C. In a target-dependent way, it initializes the
				10056	``va_list`` element to which the argument points, so that the next call
				10057	to ``va_arg`` will produce the first variable argument passed to the
				10058	function. Unlike the C ``va_start`` macro, this intrinsic does not need
				10059	to know the last argument of the function as the compiler can figure
				10060	that out.
				10061
				10062	'``llvm.va_end``' Intrinsic
				10063	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10064
				10065	Syntax:
				10066	"""""""
				10067
				10068	::
				10069
				10070	declare void @llvm.va_end(i8* <arglist>)
				10071
				10072	Overview:
				10073	"""""""""
				10074
				10075	The '``llvm.va_end``' intrinsic destroys ``*<arglist>``, which has been
				10076	initialized previously with ``llvm.va_start`` or ``llvm.va_copy``.
				10077
				10078	Arguments:
				10079	""""""""""
				10080
				10081	The argument is a pointer to a ``va_list`` to destroy.
				10082
				10083	Semantics:
				10084	""""""""""
				10085
				10086	The '``llvm.va_end``' intrinsic works just like the ``va_end`` macro
				10087	available in C. In a target-dependent way, it destroys the ``va_list``
				10088	element to which the argument points. Calls to
				10089	:ref:`llvm.va_start <int_va_start>` and
				10090	:ref:`llvm.va_copy <int_va_copy>` must be matched exactly with calls to
				10091	``llvm.va_end``.
				10092
				10093	.. _int_va_copy:
				10094
				10095	'``llvm.va_copy``' Intrinsic
				10096	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10097
				10098	Syntax:
				10099	"""""""
				10100
				10101	::
				10102
				10103	declare void @llvm.va_copy(i8* <destarglist>, i8* <srcarglist>)
				10104
				10105	Overview:
				10106	"""""""""
				10107
				10108	The '``llvm.va_copy``' intrinsic copies the current argument position
				10109	from the source argument list to the destination argument list.
				10110
				10111	Arguments:
				10112	""""""""""
				10113
				10114	The first argument is a pointer to a ``va_list`` element to initialize.
				10115	The second argument is a pointer to a ``va_list`` element to copy from.
				10116
				10117	Semantics:
				10118	""""""""""
				10119
				10120	The '``llvm.va_copy``' intrinsic works just like the ``va_copy`` macro
				10121	available in C. In a target-dependent way, it copies the source
				10122	``va_list`` element into the destination ``va_list`` element. This
intrinsic is necessary because the ``llvm.va_start`` intrinsic may be
				10124	arbitrarily complex and require, for example, memory allocation.
				10125
				10126	Accurate Garbage Collection Intrinsics
				10127	--------------------------------------
				10128
Philip Reames	c5b0f56	2015-02-25 23:52:06 +0000	[diff] [blame]	10129	LLVM's support for `Accurate Garbage Collection <GarbageCollection.html>`_
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	10130	(GC) requires the frontend to generate code containing appropriate intrinsic
				10131	calls and select an appropriate GC strategy which knows how to lower these
Philip Reames	c5b0f56	2015-02-25 23:52:06 +0000	[diff] [blame]	10132	intrinsics in a manner which is appropriate for the target collector.
				10133
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10134	These intrinsics allow identification of :ref:`GC roots on the
				10135	stack <int_gcroot>`, as well as garbage collector implementations that
				10136	require :ref:`read <int_gcread>` and :ref:`write <int_gcwrite>` barriers.
Philip Reames	c5b0f56	2015-02-25 23:52:06 +0000	[diff] [blame]	10137	Frontends for type-safe garbage collected languages should generate
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10138	these intrinsics to make use of the LLVM garbage collectors. For more
Philip Reames	f80bbff	2015-02-25 23:45:20 +0000	[diff] [blame]	10139	details, see `Garbage Collection with LLVM <GarbageCollection.html>`_.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10140
Philip Reames	f80bbff	2015-02-25 23:45:20 +0000	[diff] [blame]	10141	Experimental Statepoint Intrinsics
				10142	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10143
LLVM provides a second, experimental set of intrinsics for describing garbage
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	10145	collection safepoints in compiled code. These intrinsics are an alternative
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	10146	to the ``llvm.gcroot`` intrinsics, but are compatible with the ones for
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	10147	:ref:`read <int_gcread>` and :ref:`write <int_gcwrite>` barriers. The
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	10148	differences in approach are covered in the `Garbage Collection with LLVM
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	10149	<GarbageCollection.html>`_ documentation. The intrinsics themselves are
Philip Reames	f80bbff	2015-02-25 23:45:20 +0000	[diff] [blame]	10150	described in :doc:`Statepoints`.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10151
				10152	.. _int_gcroot:
				10153
				10154	'``llvm.gcroot``' Intrinsic
				10155	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10156
				10157	Syntax:
				10158	"""""""
				10159
				10160	::
				10161
				10162	declare void @llvm.gcroot(i8** %ptrloc, i8* %metadata)
				10163
				10164	Overview:
				10165	"""""""""
				10166
				10167	The '``llvm.gcroot``' intrinsic declares the existence of a GC root to
				10168	the code generator, and allows some metadata to be associated with it.
				10169
				10170	Arguments:
				10171	""""""""""
				10172
				10173	The first argument specifies the address of a stack object that contains
the root pointer. The second argument (which must be either a constant or
a global value address) contains the metadata to be associated with the
				10176	root.
				10177
				10178	Semantics:
				10179	""""""""""
				10180
				10181	At runtime, a call to this intrinsic stores a null pointer into the
				10182	"ptrloc" location. At compile-time, the code generator generates
				10183	information to allow the runtime to find the pointer at GC safe points.
				10184	The '``llvm.gcroot``' intrinsic may only be used in a function which
				10185	:ref:`specifies a GC algorithm <gc>`.
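
A minimal sketch of registering a root might look as follows. It
assumes the built-in ``shadow-stack`` GC strategy and a hypothetical
runtime allocator ``@allocate``; no metadata is attached to the root.

.. code-block:: llvm

      declare void @llvm.gcroot(i8**, i8*)
      declare i8* @allocate(i64)               ; hypothetical runtime allocator

      define void @f() gc "shadow-stack" {
      entry:
        ; The root lives in a stack slot; gcroot stores null into it.
        %root = alloca i8*
        call void @llvm.gcroot(i8** %root, i8* null)

        ; Keep the new object reachable through the root.
        %obj = call i8* @allocate(i64 16)
        store i8* %obj, i8** %root
        ret void
      }
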
				10186
				10187	.. _int_gcread:
				10188
				10189	'``llvm.gcread``' Intrinsic
				10190	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10191
				10192	Syntax:
				10193	"""""""
				10194
				10195	::
				10196
				10197	declare i8* @llvm.gcread(i8* %ObjPtr, i8** %Ptr)
				10198
				10199	Overview:
				10200	"""""""""
				10201
				10202	The '``llvm.gcread``' intrinsic identifies reads of references from heap
				10203	locations, allowing garbage collector implementations that require read
				10204	barriers.
				10205
				10206	Arguments:
				10207	""""""""""
				10208
				10209	The second argument is the address to read from, which should be an
address allocated from the garbage collector. The first argument is a
				10211	pointer to the start of the referenced object, if needed by the language
				10212	runtime (otherwise null).
				10213
				10214	Semantics:
				10215	""""""""""
				10216
				10217	The '``llvm.gcread``' intrinsic has the same semantics as a load
				10218	instruction, but may be replaced with substantially more complex code by
				10219	the garbage collector runtime, as needed. The '``llvm.gcread``'
				10220	intrinsic may only be used in a function which :ref:`specifies a GC
				10221	algorithm <gc>`.
				10222
				10223	.. _int_gcwrite:
				10224
				10225	'``llvm.gcwrite``' Intrinsic
				10226	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10227
				10228	Syntax:
				10229	"""""""
				10230
				10231	::
				10232
				10233	declare void @llvm.gcwrite(i8* %P1, i8* %Obj, i8** %P2)
				10234
				10235	Overview:
				10236	"""""""""
				10237
				10238	The '``llvm.gcwrite``' intrinsic identifies writes of references to heap
				10239	locations, allowing garbage collector implementations that require write
				10240	barriers (such as generational or reference counting collectors).
				10241
				10242	Arguments:
				10243	""""""""""
				10244
				10245	The first argument is the reference to store, the second is the start of
				10246	the object to store it to, and the third is the address of the field of
				10247	Obj to store to. If the runtime does not require a pointer to the
				10248	object, Obj may be null.
				10249
				10250	Semantics:
				10251	""""""""""
				10252
				10253	The '``llvm.gcwrite``' intrinsic has the same semantics as a store
				10254	instruction, but may be replaced with substantially more complex code by
				10255	the garbage collector runtime, as needed. The '``llvm.gcwrite``'
				10256	intrinsic may only be used in a function which :ref:`specifies a GC
				10257	algorithm <gc>`.
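
The following sketch copies a reference from a field of one heap object
to a field of another using both barrier intrinsics. The function name
and its parameters are hypothetical, and the ``shadow-stack`` strategy
is only an assumption; with a strategy that does not customize
barriers, these calls are expected to behave like a plain load and
store.

.. code-block:: llvm

      declare i8* @llvm.gcread(i8*, i8**)
      declare void @llvm.gcwrite(i8*, i8*, i8**)

      define void @copy_field(i8* %src.obj, i8** %src.field,
                              i8* %dst.obj, i8** %dst.field) gc "shadow-stack" {
      entry:
        ; Read barrier: object start first, then the field address.
        %ref = call i8* @llvm.gcread(i8* %src.obj, i8** %src.field)
        ; Write barrier: value, then object start, then the field address.
        call void @llvm.gcwrite(i8* %ref, i8* %dst.obj, i8** %dst.field)
        ret void
      }
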
				10258
				10259	Code Generator Intrinsics
				10260	-------------------------
				10261
				10262	These intrinsics are provided by LLVM to expose special features that
				10263	may only be implemented with code generator support.
				10264
				10265	'``llvm.returnaddress``' Intrinsic
				10266	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10267
				10268	Syntax:
				10269	"""""""
				10270
				10271	::
				10272
George Burgess IV	fbc3498	2017-05-20 04:52:29 +0000	[diff] [blame]	10273	declare i8* @llvm.returnaddress(i32 <level>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10274
				10275	Overview:
				10276	"""""""""
				10277
				10278	The '``llvm.returnaddress``' intrinsic attempts to compute a
				10279	target-specific value indicating the return address of the current
				10280	function or one of its callers.
				10281
				10282	Arguments:
				10283	""""""""""
				10284
The argument to this intrinsic indicates which function to return the
address for. Zero indicates the current function (the one calling the
intrinsic), one indicates its caller, etc. The argument is required to
be a constant integer value.
				10289
				10290	Semantics:
				10291	""""""""""
				10292
				10293	The '``llvm.returnaddress``' intrinsic either returns a pointer
				10294	indicating the return address of the specified call frame, or zero if it
				10295	cannot be identified. The value returned by this intrinsic is likely to
				10296	be incorrect or 0 for arguments other than zero, so it should only be
				10297	used for debugging purposes.
				10298
				10299	Note that calling this intrinsic does not prevent function inlining or
				10300	other aggressive transformations, so the value returned may not be that
				10301	of the obvious source-language caller.
				10302
Albert Gutowski	795d7d6	2016-10-12 22:13:19 +0000	[diff] [blame]	10303	'``llvm.addressofreturnaddress``' Intrinsic
Albert Gutowski	57ad5fe	2016-10-12 23:10:02 +0000	[diff] [blame]	10304	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Albert Gutowski	795d7d6	2016-10-12 22:13:19 +0000	[diff] [blame]	10305
				10306	Syntax:
				10307	"""""""
				10308
				10309	::
				10310
George Burgess IV	fbc3498	2017-05-20 04:52:29 +0000	[diff] [blame]	10311	declare i8* @llvm.addressofreturnaddress()
Albert Gutowski	795d7d6	2016-10-12 22:13:19 +0000	[diff] [blame]	10312
				10313	Overview:
				10314	"""""""""
				10315
				10316	The '``llvm.addressofreturnaddress``' intrinsic returns a target-specific
				10317	pointer to the place in the stack frame where the return address of the
				10318	current function is stored.
				10319
				10320	Semantics:
				10321	""""""""""
				10322
				10323	Note that calling this intrinsic does not prevent function inlining or
				10324	other aggressive transformations, so the value returned may not be that
				10325	of the obvious source-language caller.
				10326
				10327	This intrinsic is only implemented for x86.
				10328
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10329	'``llvm.frameaddress``' Intrinsic
				10330	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10331
				10332	Syntax:
				10333	"""""""
				10334
				10335	::
				10336
				10337	declare i8* @llvm.frameaddress(i32 <level>)
				10338
				10339	Overview:
				10340	"""""""""
				10341
				10342	The '``llvm.frameaddress``' intrinsic attempts to return the
				10343	target-specific frame pointer value for the specified stack frame.
				10344
				10345	Arguments:
				10346	""""""""""
				10347
The argument to this intrinsic indicates which function to return the
frame pointer for. Zero indicates the current function (the one calling
the intrinsic), one indicates its caller, etc. The argument is required
to be a constant integer value.
				10352
				10353	Semantics:
				10354	""""""""""
				10355
				10356	The '``llvm.frameaddress``' intrinsic either returns a pointer
				10357	indicating the frame address of the specified call frame, or zero if it
				10358	cannot be identified. The value returned by this intrinsic is likely to
				10359	be incorrect or 0 for arguments other than zero, so it should only be
				10360	used for debugging purposes.
				10361
				10362	Note that calling this intrinsic does not prevent function inlining or
				10363	other aggressive transformations, so the value returned may not be that
				10364	of the obvious source-language caller.
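
As a small illustration of '``llvm.returnaddress``' and
'``llvm.frameaddress``', the sketch below queries level zero of each,
which is the only level likely to give a meaningful result. The
function names ``@my_return_address`` and ``@my_frame_address`` are
hypothetical.

.. code-block:: llvm

      declare i8* @llvm.returnaddress(i32)
      declare i8* @llvm.frameaddress(i32)

      define i8* @my_return_address() {
        ; The level must be a constant; 0 refers to this function.
        %ra = call i8* @llvm.returnaddress(i32 0)
        ret i8* %ra
      }

      define i8* @my_frame_address() {
        %fp = call i8* @llvm.frameaddress(i32 0)
        ret i8* %fp
      }
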
				10365
Reid Kleckner	6038179	2015-07-07 22:25:32 +0000	[diff] [blame]	10366	'``llvm.localescape``' and '``llvm.localrecover``' Intrinsics
Reid Kleckner	e9b8931	2015-01-13 00:48:10 +0000	[diff] [blame]	10367	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10368
				10369	Syntax:
				10370	"""""""
				10371
				10372	::
				10373
Reid Kleckner	6038179	2015-07-07 22:25:32 +0000	[diff] [blame]	10374	declare void @llvm.localescape(...)
				10375	declare i8* @llvm.localrecover(i8* %func, i8* %fp, i32 %idx)
Reid Kleckner	e9b8931	2015-01-13 00:48:10 +0000	[diff] [blame]	10376
				10377	Overview:
				10378	"""""""""
				10379
Reid Kleckner	6038179	2015-07-07 22:25:32 +0000	[diff] [blame]	10380	The '``llvm.localescape``' intrinsic escapes offsets of a collection of static
				10381	allocas, and the '``llvm.localrecover``' intrinsic applies those offsets to a
Reid Kleckner	cfb9ce5	2015-03-05 18:26:34 +0000	[diff] [blame]	10382	live frame pointer to recover the address of the allocation. The offset is
Reid Kleckner	6038179	2015-07-07 22:25:32 +0000	[diff] [blame]	10383	computed during frame layout of the caller of ``llvm.localescape``.
Reid Kleckner	e9b8931	2015-01-13 00:48:10 +0000	[diff] [blame]	10384
				10385	Arguments:
				10386	""""""""""
				10387
Reid Kleckner	6038179	2015-07-07 22:25:32 +0000	[diff] [blame]	10388	All arguments to '``llvm.localescape``' must be pointers to static allocas or
				10389	casts of static allocas. Each function can only call '``llvm.localescape``'
Reid Kleckner	cfb9ce5	2015-03-05 18:26:34 +0000	[diff] [blame]	10390	once, and it can only do so from the entry block.
Reid Kleckner	e9b8931	2015-01-13 00:48:10 +0000	[diff] [blame]	10391
Reid Kleckner	6038179	2015-07-07 22:25:32 +0000	[diff] [blame]	10392	The ``func`` argument to '``llvm.localrecover``' must be a constant
Reid Kleckner	e9b8931	2015-01-13 00:48:10 +0000	[diff] [blame]	10393	bitcasted pointer to a function defined in the current module. The code
				10394	generator cannot determine the frame allocation offset of functions defined in
				10395	other modules.
				10396
Reid Kleckner	d5afc62f	2015-07-07 23:23:03 +0000	[diff] [blame]	10397	The ``fp`` argument to '``llvm.localrecover``' must be a frame pointer of a
				10398	call frame that is currently live. The return value of '``llvm.localaddress``'
				10399	is one way to produce such a value, but various runtimes also expose a suitable
				10400	pointer in platform-specific ways.
Reid Kleckner	e9b8931	2015-01-13 00:48:10 +0000	[diff] [blame]	10401
Reid Kleckner	6038179	2015-07-07 22:25:32 +0000	[diff] [blame]	10402	The ``idx`` argument to '``llvm.localrecover``' indicates which alloca passed to
				10403	'``llvm.localescape``' to recover. It is zero-indexed.
Reid Kleckner	cfb9ce5	2015-03-05 18:26:34 +0000	[diff] [blame]	10404
Reid Kleckner	e9b8931	2015-01-13 00:48:10 +0000	[diff] [blame]	10405	Semantics:
				10406	""""""""""
				10407
Reid Kleckner	6038179	2015-07-07 22:25:32 +0000	[diff] [blame]	10408	These intrinsics allow a group of functions to share access to a set of local
stack allocations of one parent function. The parent function may call the
				10410	'``llvm.localescape``' intrinsic once from the function entry block, and the
				10411	child functions can use '``llvm.localrecover``' to access the escaped allocas.
				10412	The '``llvm.localescape``' intrinsic blocks inlining, as inlining changes where
				10413	the escaped allocas are allocated, which would break attempts to use
				10414	'``llvm.localrecover``'.
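
A sketch of the parent and child pattern described above is shown
below. The names ``@parent`` and ``@child`` are hypothetical, and the
example assumes a target on which these intrinsics are supported. The
parent escapes one alloca and passes its local address to the child,
which recovers the alloca at index zero.

.. code-block:: llvm

      declare void @llvm.localescape(...)
      declare i8* @llvm.localrecover(i8*, i8*, i32)
      declare i8* @llvm.localaddress()

      define void @parent() {
      entry:
        %x = alloca i32
        ; Called once, from the entry block, with static allocas only.
        call void (...) @llvm.localescape(i32* %x)
        store i32 42, i32* %x
        %fp = call i8* @llvm.localaddress()
        call void @child(i8* %fp)
        ret void
      }

      define internal void @child(i8* %fp) {
      entry:
        ; Recover escaped alloca number 0 of @parent from its live frame.
        %x.raw = call i8* @llvm.localrecover(i8* bitcast (void ()* @parent to i8*),
                                             i8* %fp, i32 0)
        %x = bitcast i8* %x.raw to i32*
        store i32 7, i32* %x
        ret void
      }
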
Reid Kleckner	e9b8931	2015-01-13 00:48:10 +0000	[diff] [blame]	10415
Renato Golin	c7aea40	2014-05-06 16:51:25 +0000	[diff] [blame]	10416	.. _int_read_register:
				10417	.. _int_write_register:
				10418
				10419	'``llvm.read_register``' and '``llvm.write_register``' Intrinsics
				10420	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10421
				10422	Syntax:
				10423	"""""""
				10424
				10425	::
				10426
				10427	declare i32 @llvm.read_register.i32(metadata)
				10428	declare i64 @llvm.read_register.i64(metadata)
declare void @llvm.write_register.i32(metadata, i32 <value>)
declare void @llvm.write_register.i64(metadata, i64 <value>)
Duncan P. N. Exon Smith	be7ea19	2014-12-15 19:07:53 +0000	[diff] [blame]	10431	!0 = !{!"sp\00"}
Renato Golin	c7aea40	2014-05-06 16:51:25 +0000	[diff] [blame]	10432
				10433	Overview:
				10434	"""""""""
				10435
The '``llvm.read_register``' and '``llvm.write_register``' intrinsics
provide access to the named register. The register must be valid on
the architecture being compiled for. The type needs to be compatible
with the register being accessed.
				10440
				10441	Semantics:
				10442	""""""""""
				10443
				10444	The '``llvm.read_register``' intrinsic returns the current value of the
				10445	register, where possible. The '``llvm.write_register``' intrinsic sets
				10446	the current value of the register, where possible.
				10447
				10448	This is useful to implement named register global variables that need
				10449	to always be mapped to a specific register, as is common practice on
				10450	bare-metal programs including OS kernels.
				10451
The compiler does not check whether the register is available or whether
surrounding code, including inline assembly, uses it. Because of that,
allocatable registers are not supported.
				10455
Warning: So far it only works with the stack pointer on selected
architectures (ARM, AArch64, PowerPC and x86_64). A significant amount of
work is needed to support other registers and, even more so, allocatable
registers.
				10460
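Reading the stack pointer could be sketched as follows. The example
assumes a 64-bit target whose stack pointer is named ``sp`` (for
instance AArch64); the register name differs on other targets, and the
function name ``@current_sp`` is hypothetical.

.. code-block:: llvm

      declare i64 @llvm.read_register.i64(metadata)

      define i64 @current_sp() {
        ; The metadata node names the register to read.
        %sp = call i64 @llvm.read_register.i64(metadata !0)
        ret i64 %sp
      }

      !0 = !{!"sp\00"}
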
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10461	.. _int_stacksave:
				10462
				10463	'``llvm.stacksave``' Intrinsic
				10464	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10465
				10466	Syntax:
				10467	"""""""
				10468
				10469	::
				10470
				10471	declare i8* @llvm.stacksave()
				10472
				10473	Overview:
				10474	"""""""""
				10475
				10476	The '``llvm.stacksave``' intrinsic is used to remember the current state
				10477	of the function stack, for use with
				10478	:ref:`llvm.stackrestore <int_stackrestore>`. This is useful for
				10479	implementing language features like scoped automatic variable sized
				10480	arrays in C99.
				10481
				10482	Semantics:
				10483	""""""""""
				10484
This intrinsic returns an opaque pointer value that can be passed to
				10486	:ref:`llvm.stackrestore <int_stackrestore>`. When an
				10487	``llvm.stackrestore`` intrinsic is executed with a value saved from
				10488	``llvm.stacksave``, it effectively restores the state of the stack to
				10489	the state it was in when the ``llvm.stacksave`` intrinsic executed. In
				10490	practice, this pops any :ref:`alloca <i_alloca>` blocks from the stack that
				10491	were allocated after the ``llvm.stacksave`` was executed.
				10492
				10493	.. _int_stackrestore:
				10494
				10495	'``llvm.stackrestore``' Intrinsic
				10496	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10497
				10498	Syntax:
				10499	"""""""
				10500
				10501	::
				10502
				10503	declare void @llvm.stackrestore(i8* %ptr)
				10504
				10505	Overview:
				10506	"""""""""
				10507
				10508	The '``llvm.stackrestore``' intrinsic is used to restore the state of
				10509	the function stack to the state it was in when the corresponding
				10510	:ref:`llvm.stacksave <int_stacksave>` intrinsic executed. This is
				10511	useful for implementing language features like scoped automatic variable
				10512	sized arrays in C99.
				10513
				10514	Semantics:
				10515	""""""""""
				10516
				10517	See the description for :ref:`llvm.stacksave <int_stacksave>`.
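
The save and restore pairing can be sketched for a scoped
variable-sized buffer as follows. The function name ``@scoped_buffer``
and the callee ``@use`` are hypothetical.

.. code-block:: llvm

      declare i8* @llvm.stacksave()
      declare void @llvm.stackrestore(i8*)
      declare void @use(i8*)                   ; hypothetical consumer

      define void @scoped_buffer(i32 %n) {
      entry:
        ; Remember the stack state before the dynamic allocation.
        %saved = call i8* @llvm.stacksave()
        %buf = alloca i8, i32 %n
        call void @use(i8* %buf)
        ; Pop %buf (and anything allocated after the save).
        call void @llvm.stackrestore(i8* %saved)
        ret void
      }
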
				10518
Yury Gribov	d7dbb66	2015-12-01 11:40:55 +0000	[diff] [blame]	10519	.. _int_get_dynamic_area_offset:
				10520
				10521	'``llvm.get.dynamic.area.offset``' Intrinsic
Yury Gribov	81f3f15	2015-12-01 13:24:48 +0000	[diff] [blame]	10522	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Yury Gribov	d7dbb66	2015-12-01 11:40:55 +0000	[diff] [blame]	10523
				10524	Syntax:
				10525	"""""""
				10526
				10527	::
				10528
				10529	declare i32 @llvm.get.dynamic.area.offset.i32()
				10530	declare i64 @llvm.get.dynamic.area.offset.i64()
				10531
Lang Hames	1023993	2016-10-08 00:20:42 +0000	[diff] [blame]	10532	Overview:
				10533	"""""""""
Yury Gribov	d7dbb66	2015-12-01 11:40:55 +0000	[diff] [blame]	10534
				10535	The '``llvm.get.dynamic.area.offset.*``' intrinsic family is used to
get the offset from the native stack pointer to the address of the most
recent dynamic alloca on the caller's stack. These intrinsics are
intended for use in combination with
				10539	:ref:`llvm.stacksave <int_stacksave>` to get a
				10540	pointer to the most recent dynamic alloca. This is useful, for example,
				10541	for AddressSanitizer's stack unpoisoning routines.
				10542
				10543	Semantics:
				10544	""""""""""
				10545
These intrinsics return a non-negative integer value that can be used to
get the address of the most recent dynamic alloca, allocated by :ref:`alloca <i_alloca>`
on the caller's stack. In particular, for targets where the stack grows downwards,
adding this offset to the native stack pointer would get the address of the most
recent dynamic alloca. For targets where the stack grows upwards, the situation is a bit more
complicated, because subtracting this value from the stack pointer would get the address
one past the end of the most recent dynamic alloca.
				10553
Although for most targets :ref:`llvm.get.dynamic.area.offset <int_get_dynamic_area_offset>`
				10555	returns just a zero, for others, such as PowerPC and PowerPC64, it returns a
				10556	compile-time-known constant value.
				10557
				10558	The return value type of :ref:`llvm.get.dynamic.area.offset <int_get_dynamic_area_offset>`
Matt Arsenault	c749bdc	2017-03-30 23:36:47 +0000	[diff] [blame]	10559	must match the target's default address space's (address space 0) pointer type.
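
Combining the offset with ``llvm.stacksave`` might look like the sketch
below. It assumes a 64-bit target whose stack grows downwards, so the
offset is added to the saved stack pointer; the names
``@latest_alloca`` and ``@use`` are hypothetical.

.. code-block:: llvm

      declare i8* @llvm.stacksave()
      declare i64 @llvm.get.dynamic.area.offset.i64()
      declare void @use(i8*)                   ; hypothetical consumer

      define void @latest_alloca(i64 %n) {
      entry:
        %buf = alloca i8, i64 %n
        ; Native stack pointer after the dynamic alloca.
        %sp = call i8* @llvm.stacksave()
        ; Distance from the stack pointer to the most recent dynamic alloca.
        %off = call i64 @llvm.get.dynamic.area.offset.i64()
        %addr = getelementptr i8, i8* %sp, i64 %off
        call void @use(i8* %addr)
        ret void
      }
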
Yury Gribov	d7dbb66	2015-12-01 11:40:55 +0000	[diff] [blame]	10560
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10561	'``llvm.prefetch``' Intrinsic
				10562	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10563
				10564	Syntax:
				10565	"""""""
				10566
				10567	::
				10568
				10569	declare void @llvm.prefetch(i8* <address>, i32 <rw>, i32 <locality>, i32 <cache type>)
				10570
				10571	Overview:
				10572	"""""""""
				10573
				10574	The '``llvm.prefetch``' intrinsic is a hint to the code generator to
				10575	insert a prefetch instruction if supported; otherwise, it is a noop.
				10576	Prefetches have no effect on the behavior of the program but can change
				10577	its performance characteristics.
				10578
				10579	Arguments:
				10580	""""""""""
				10581
				10582	``address`` is the address to be prefetched, ``rw`` is the specifier
				10583	determining if the fetch should be for a read (0) or write (1), and
				10584	``locality`` is a temporal locality specifier ranging from (0), no
				10585	locality, to (3), extremely local (keep in cache). The ``cache type``
				10586	specifies whether the prefetch is performed on the data (1) or
				10587	instruction (0) cache. The ``rw``, ``locality`` and ``cache type``
				10588	arguments must be constant integers.
				10589
				10590	Semantics:
				10591	""""""""""
				10592
				10593	This intrinsic does not modify the behavior of the program. In
				10594	particular, prefetches cannot trap and do not produce a value. On
				10595	targets that support this intrinsic, the prefetch can provide hints to
				10596	the processor cache for better performance.
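
For illustration, a read prefetch into the data cache with the highest
temporal locality could be written as follows. The function ``@warm`` is a
hypothetical wrapper, not part of LLVM.

.. code-block:: llvm

      declare void @llvm.prefetch(i8*, i32, i32, i32)

      define void @warm(i8* %p) {
        ; rw = 0 (read), locality = 3 (keep in cache), cache type = 1 (data)
        call void @llvm.prefetch(i8* %p, i32 0, i32 3, i32 1)
        ret void
      }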
				10597
				10598	'``llvm.pcmarker``' Intrinsic
				10599	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10600
				10601	Syntax:
				10602	"""""""
				10603
				10604	::
				10605
				10606	declare void @llvm.pcmarker(i32 <id>)
				10607
				10608	Overview:
				10609	"""""""""
				10610
				10611	The '``llvm.pcmarker``' intrinsic is a method to export a Program
				10612	Counter (PC) in a region of code to simulators and other tools. The
				10613	method is target specific, but it is expected that the marker will use
				10614	exported symbols to transmit the PC of the marker. The marker makes no
				10615	guarantees that it will remain with any specific instruction after
				10616	optimizations. It is possible that the presence of a marker will inhibit
				10617	optimizations. The intended use is to be inserted after optimizations to
				10618	allow correlations of simulation runs.
				10619
				10620	Arguments:
				10621	""""""""""
				10622
				10623	``id`` is a numerical id identifying the marker.
				10624
				10625	Semantics:
				10626	""""""""""
				10627
				10628	This intrinsic does not modify the behavior of the program. Backends
				10629	that do not support this intrinsic may ignore it.
				10630
				10631	'``llvm.readcyclecounter``' Intrinsic
				10632	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10633
				10634	Syntax:
				10635	"""""""
				10636
				10637	::
				10638
				10639	declare i64 @llvm.readcyclecounter()
				10640
				10641	Overview:
				10642	"""""""""
				10643
				10644	The '``llvm.readcyclecounter``' intrinsic provides access to the cycle
				10645	counter register (or similar low latency, high accuracy clocks) on those
				10646	targets that support it. On X86, it should map to RDTSC. On Alpha, it
				10647	should map to RPCC. As the backing counters overflow quickly (on the
				10648	order of 9 seconds on Alpha), this should only be used for small
				10649	timings.
				10650
				10651	Semantics:
				10652	""""""""""
				10653
				10654	When directly supported, reading the cycle counter should not modify any
				10655	memory. Implementations are allowed to either return an application
				10656	specific value or a system wide value. On backends without support, this
				10657	is lowered to a constant 0.
				10658
Tim Northover	bc93308	2013-05-23 19:11:20 +0000	[diff] [blame]	10659	Note that runtime support may be conditional on the privilege level the code is
				10660	running at and on the host platform.
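
A short timing measurement might be sketched as below. The function
``@elapsed`` is hypothetical, and the computed difference is only meaningful
if the counter does not overflow between the two reads.

.. code-block:: llvm

      declare i64 @llvm.readcyclecounter()

      define i64 @elapsed() {
        %start = call i64 @llvm.readcyclecounter()
        ; ... code under measurement goes here ...
        %end = call i64 @llvm.readcyclecounter()
        %delta = sub i64 %end, %start
        ret i64 %delta
      }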
				10661
Renato Golin	c0a3c1d	2014-03-26 12:52:28 +0000	[diff] [blame]	10662	'``llvm.clear_cache``' Intrinsic
				10663	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10664
				10665	Syntax:
				10666	"""""""
				10667
				10668	::
				10669
				10670	declare void @llvm.clear_cache(i8*, i8*)
				10671
				10672	Overview:
				10673	"""""""""
				10674
Joerg Sonnenberger	03014d6	2014-03-26 14:35:21 +0000	[diff] [blame]	10675	The '``llvm.clear_cache``' intrinsic ensures visibility of modifications
				10676	in the specified range to the execution unit of the processor. On
				10677	targets with non-unified instruction and data cache, the implementation
				10678	flushes the instruction cache.
Renato Golin	c0a3c1d	2014-03-26 12:52:28 +0000	[diff] [blame]	10679
				10680	Semantics:
				10681	""""""""""
				10682
Joerg Sonnenberger	03014d6	2014-03-26 14:35:21 +0000	[diff] [blame]	10683	On platforms with coherent instruction and data caches (e.g. x86), this
				10684	intrinsic is a nop. On platforms with non-coherent instruction and data
Alp Toker	16f98b2	2014-04-09 14:47:27 +0000	[diff] [blame]	10685	cache (e.g. ARM, MIPS), the intrinsic is lowered either to appropriate
Joerg Sonnenberger	03014d6	2014-03-26 14:35:21 +0000	[diff] [blame]	10686	instructions or a system call, if cache flushing requires special
				10687	privileges.
Renato Golin	c0a3c1d	2014-03-26 12:52:28 +0000	[diff] [blame]	10688
Sean Silva	d02bf3e	2014-04-07 22:29:53 +0000	[diff] [blame]	10689	The default behavior is to emit a call to ``__clear_cache`` from the run
Joerg Sonnenberger	03014d6	2014-03-26 14:35:21 +0000	[diff] [blame]	10690	time library.
Renato Golin	93010e6	2014-03-26 14:01:32 +0000	[diff] [blame]	10691
Joerg Sonnenberger	03014d6	2014-03-26 14:35:21 +0000	[diff] [blame]	10692	This intrinsic does not empty the instruction pipeline. Modifications
				10693	of the current function are outside the scope of the intrinsic.
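
A minimal sketch of a call, assuming the two operands are pointers delimiting
the modified range, is shown below. The function ``@publish_code`` and its
parameters are hypothetical.

.. code-block:: llvm

      declare void @llvm.clear_cache(i8*, i8*)

      define void @publish_code(i8* %begin, i8* %end) {
        ; Make code written to the range visible to instruction fetch
        ; before it is executed.
        call void @llvm.clear_cache(i8* %begin, i8* %end)
        ret void
      }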
Renato Golin	c0a3c1d	2014-03-26 12:52:28 +0000	[diff] [blame]	10694
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10695	'``llvm.instrprof.increment``' Intrinsic
Justin Bogner	61ba2e3	2014-12-08 18:02:35 +0000	[diff] [blame]	10696	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10697
				10698	Syntax:
				10699	"""""""
				10700
				10701	::
				10702
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10703	declare void @llvm.instrprof.increment(i8* <name>, i64 <hash>,
Justin Bogner	61ba2e3	2014-12-08 18:02:35 +0000	[diff] [blame]	10704	i32 <num-counters>, i32 <index>)
				10705
				10706	Overview:
				10707	"""""""""
				10708
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10709	The '``llvm.instrprof.increment``' intrinsic can be emitted by a
Justin Bogner	61ba2e3	2014-12-08 18:02:35 +0000	[diff] [blame]	10710	frontend for use with instrumentation based profiling. These will be
				10711	lowered by the ``-instrprof`` pass to generate execution counts of a
				10712	program at runtime.
				10713
				10714	Arguments:
				10715	""""""""""
				10716
				10717	The first argument is a pointer to a global variable containing the
				10718	name of the entity being instrumented. This should generally be the
				10719	(mangled) function name for a set of counters.
				10720
				10721	The second argument is a hash value that can be used by the consumer
				10722	of the profile data to detect changes to the instrumented source, and
				10723	the third is the number of counters associated with ``name``. It is an
				10724	error if ``hash`` or ``num-counters`` differ between two instances of
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10725	``instrprof.increment`` that refer to the same name.
Justin Bogner	61ba2e3	2014-12-08 18:02:35 +0000	[diff] [blame]	10726
				10727	The last argument refers to which of the counters for ``name`` should
				10728	be incremented. It should be a value between 0 and ``num-counters``.
				10729
				10730	Semantics:
				10731	""""""""""
				10732
				10733	This intrinsic represents an increment of a profiling counter. It will
				10734	cause the ``-instrprof`` pass to generate the appropriate data
				10735	structures and the code to increment the appropriate value, in a
				10736	format that can be written out by a compiler runtime and consumed via
				10737	the ``llvm-profdata`` tool.
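
As an illustrative sketch, a frontend might emit the following for a function
with a single counter. The global ``@__profn_foo`` and the hash value ``0`` are
placeholders chosen for this example.

.. code-block:: llvm

      @__profn_foo = private constant [3 x i8] c"foo"

      declare void @llvm.instrprof.increment(i8*, i64, i32, i32)

      define void @foo() {
        ; name = "foo", hash = 0 (placeholder), num-counters = 1, index = 0
        call void @llvm.instrprof.increment(
            i8* getelementptr inbounds ([3 x i8], [3 x i8]* @__profn_foo, i32 0, i32 0),
            i64 0, i32 1, i32 0)
        ret void
      }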
				10738
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10739	'``llvm.instrprof.increment.step``' Intrinsic
Xinliang David Li	e111710	2016-09-18 22:10:19 +0000	[diff] [blame]	10740	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Xinliang David Li	4ca1733	2016-09-18 18:34:07 +0000	[diff] [blame]	10741
				10742	Syntax:
				10743	"""""""
				10744
				10745	::
				10746
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10747	declare void @llvm.instrprof.increment.step(i8* <name>, i64 <hash>,
Xinliang David Li	4ca1733	2016-09-18 18:34:07 +0000	[diff] [blame]	10748	i32 <num-counters>,
				10749	i32 <index>, i64 <step>)
				10750
				10751	Overview:
				10752	"""""""""
				10753
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10754	The '``llvm.instrprof.increment.step``' intrinsic is an extension to
				10755	the '``llvm.instrprof.increment``' intrinsic with an additional fifth
Xinliang David Li	4ca1733	2016-09-18 18:34:07 +0000	[diff] [blame]	10756	argument to specify the step of the increment.
				10757
				10758	Arguments:
				10759	""""""""""
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10760	The first four arguments are the same as for the '``llvm.instrprof.increment``'
Pete Couperus	ed9569d	2017-08-23 20:58:22 +0000	[diff] [blame]	10761	intrinsic.
Xinliang David Li	4ca1733	2016-09-18 18:34:07 +0000	[diff] [blame]	10762
				10763	The last argument specifies the value of the increment of the counter variable.
				10764
				10765	Semantics:
				10766	""""""""""
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10767	See the description of the '``llvm.instrprof.increment``' intrinsic.
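
For example, a counter could be advanced by a runtime-computed amount in a
single call, as in this sketch. The names ``@__profn_bar``, ``@bar`` and
``%trips`` and the hash value ``0`` are hypothetical.

.. code-block:: llvm

      @__profn_bar = private constant [3 x i8] c"bar"

      declare void @llvm.instrprof.increment.step(i8*, i64, i32, i32, i64)

      define void @bar(i64 %trips) {
        ; Add %trips to counter 0 of "bar" in one step.
        call void @llvm.instrprof.increment.step(
            i8* getelementptr inbounds ([3 x i8], [3 x i8]* @__profn_bar, i32 0, i32 0),
            i64 0, i32 1, i32 0, i64 %trips)
        ret void
      }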
Xinliang David Li	4ca1733	2016-09-18 18:34:07 +0000	[diff] [blame]	10768
				10769
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10770	'``llvm.instrprof.value.profile``' Intrinsic
Betul Buyukkurt	6fac174	2015-11-18 18:14:55 +0000	[diff] [blame]	10771	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10772
				10773	Syntax:
				10774	"""""""
				10775
				10776	::
				10777
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10778	declare void @llvm.instrprof.value.profile(i8* <name>, i64 <hash>,
Betul Buyukkurt	6fac174	2015-11-18 18:14:55 +0000	[diff] [blame]	10779	i64 <value>, i32 <value_kind>,
				10780	i32 <index>)
				10781
				10782	Overview:
				10783	"""""""""
				10784
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10785	The '``llvm.instrprof.value.profile``' intrinsic can be emitted by a
Betul Buyukkurt	6fac174	2015-11-18 18:14:55 +0000	[diff] [blame]	10786	frontend for use with instrumentation based profiling. This will be
				10787	lowered by the ``-instrprof`` pass to find out the target values that
				10788	instrumented expressions take in a program at runtime.
				10789
				10790	Arguments:
				10791	""""""""""
				10792
				10793	The first argument is a pointer to a global variable containing the
				10794	name of the entity being instrumented. ``name`` should generally be the
				10795	(mangled) function name for a set of counters.
				10796
				10797	The second argument is a hash value that can be used by the consumer
				10798	of the profile data to detect changes to the instrumented source. It
				10799	is an error if ``hash`` differs between two instances of
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10800	``llvm.instrprof.*`` that refer to the same name.
Betul Buyukkurt	6fac174	2015-11-18 18:14:55 +0000	[diff] [blame]	10801
				10802	The third argument is the value of the expression being profiled. The profiled
				10803	expression's value should be representable as an unsigned 64-bit value. The
				10804	fourth argument represents the kind of value profiling that is being done. The
				10805	supported value profiling kinds are enumerated through the
				10806	``InstrProfValueKind`` type declared in the
				10807	``<include/llvm/ProfileData/InstrProf.h>`` header file. The last argument is the
				10808	index of the instrumented expression within ``name``. It should be >= 0.
				10809
				10810	Semantics:
				10811	""""""""""
				10812
				10813	This intrinsic represents the point where a call to a runtime routine
				10814	should be inserted for value profiling of target expressions. The ``-instrprof``
				10815	pass will generate the appropriate data structures and replace the
Vedant Kumar	51ce668	2018-01-26 23:54:25 +0000	[diff] [blame]	10816	``llvm.instrprof.value.profile`` intrinsic with the call to the profile
Betul Buyukkurt	6fac174	2015-11-18 18:14:55 +0000	[diff] [blame]	10817	runtime library with proper arguments.
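
A sketch of profiling an indirect call target is given below. It assumes that
value kind ``0`` denotes indirect call targets in ``InstrProfValueKind``; the
global ``@__profn_caller`` and the hash value ``0`` are placeholders.

.. code-block:: llvm

      @__profn_caller = private constant [6 x i8] c"caller"

      declare void @llvm.instrprof.value.profile(i8*, i64, i64, i32, i32)

      define void @caller(void ()* %callee) {
        ; The profiled value is the callee address as an unsigned 64-bit integer.
        %target = ptrtoint void ()* %callee to i64
        ; name = "caller", hash = 0, value = %target, value_kind = 0, index = 0
        call void @llvm.instrprof.value.profile(
            i8* getelementptr inbounds ([6 x i8], [6 x i8]* @__profn_caller, i32 0, i32 0),
            i64 0, i64 %target, i32 0, i32 0)
        call void %callee()
        ret void
      }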
				10818
Marcin Koscielnicki	3fdc257	2016-04-19 20:51:05 +0000	[diff] [blame]	10819	'``llvm.thread.pointer``' Intrinsic
				10820	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10821
				10822	Syntax:
				10823	"""""""
				10824
				10825	::
				10826
				10827	declare i8* @llvm.thread.pointer()
				10828
				10829	Overview:
				10830	"""""""""
				10831
				10832	The '``llvm.thread.pointer``' intrinsic returns the value of the thread
				10833	pointer.
				10834
				10835	Semantics:
				10836	""""""""""
				10837
				10838	The '``llvm.thread.pointer``' intrinsic returns a pointer to the TLS area
				10839	for the current thread. The exact semantics of this value are target
				10840	specific: it may point to the start of TLS area, to the end, or somewhere
				10841	in the middle. Depending on the target, this intrinsic may read a register,
				10842	call a helper function, read from an alternate memory space, or perform
				10843	other operations necessary to locate the TLS area. Not all targets support
				10844	this intrinsic.
				10845
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10846	Standard C Library Intrinsics
				10847	-----------------------------
				10848
				10849	LLVM provides intrinsics for a few important standard C library
				10850	functions. These intrinsics allow source-language front-ends to pass
				10851	information about the alignment of the pointer arguments to the code
				10852	generator, providing opportunity for more efficient code generation.
				10853
				10854	.. _int_memcpy:
				10855
				10856	'``llvm.memcpy``' Intrinsic
				10857	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10858
				10859	Syntax:
				10860	"""""""
				10861
				10862	This is an overloaded intrinsic. You can use ``llvm.memcpy`` on any
				10863	integer bit width and for different address spaces. Not all targets
				10864	support all bit widths however.
				10865
				10866	::
				10867
				10868	declare void @llvm.memcpy.p0i8.p0i8.i32(i8* <dest>, i8* <src>,
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10869	i32 <len>, i1 <isvolatile>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10870	declare void @llvm.memcpy.p0i8.p0i8.i64(i8* <dest>, i8* <src>,
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10871	i64 <len>, i1 <isvolatile>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10872
				10873	Overview:
				10874	"""""""""
				10875
				10876	The '``llvm.memcpy.*``' intrinsics copy a block of memory from the
				10877	source location to the destination location.
				10878
				10879	Note that, unlike the standard libc function, the ``llvm.memcpy.*``
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10880	intrinsics do not return a value, take an extra ``isvolatile``
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10881	argument, and the pointers can be in specified address spaces.
				10882
				10883	Arguments:
				10884	""""""""""
				10885
				10886	The first argument is a pointer to the destination, the second is a
				10887	pointer to the source. The third argument is an integer argument
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10888	specifying the number of bytes to copy, and the fourth is a
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10889	boolean indicating a volatile access.
				10890
Daniel Neilson	39eb6a5	2018-01-19 17:24:21 +0000	[diff] [blame]	10891	The :ref:`align <attr_align>` parameter attribute can be provided
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10892	for the first and second arguments.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10893
				10894	If the ``isvolatile`` parameter is ``true``, the ``llvm.memcpy`` call is
				10895	a :ref:`volatile operation <volatile>`. The detailed access behavior is not
				10896	very cleanly specified and it is unwise to depend on it.
				10897
				10898	Semantics:
				10899	""""""""""
				10900
				10901	The '``llvm.memcpy.*``' intrinsics copy a block of memory from the
				10902	source location to the destination location, which are not allowed to
				10903	overlap. It copies "len" bytes of memory over. If an argument is known
				10904	to be aligned to some boundary, this can be specified with the ``align``
Bill Wendling	6116315	2013-10-18 23:26:55 +0000	[diff] [blame]	10905	parameter attribute on that argument.
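
For illustration, a non-volatile 16-byte copy between two 8-byte-aligned,
non-overlapping buffers might look like this. The function ``@copy16`` is
hypothetical.

.. code-block:: llvm

      declare void @llvm.memcpy.p0i8.p0i8.i64(i8*, i8*, i64, i1)

      define void @copy16(i8* %dst, i8* %src) {
        ; Alignment is conveyed by the align parameter attributes;
        ; the final i1 is isvolatile.
        call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %src,
                                             i64 16, i1 false)
        ret void
      }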
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10906
Daniel Neilson	57226ef	2017-07-12 15:25:26 +0000	[diff] [blame]	10907	.. _int_memmove:
				10908
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10909	'``llvm.memmove``' Intrinsic
				10910	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10911
				10912	Syntax:
				10913	"""""""
				10914
				10915	This is an overloaded intrinsic. You can use ``llvm.memmove`` on any integer
				10916	bit width and for different address spaces. Not all targets support all
				10917	bit widths however.
				10918
				10919	::
				10920
				10921	declare void @llvm.memmove.p0i8.p0i8.i32(i8* <dest>, i8* <src>,
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10922	i32 <len>, i1 <isvolatile>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10923	declare void @llvm.memmove.p0i8.p0i8.i64(i8* <dest>, i8* <src>,
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10924	i64 <len>, i1 <isvolatile>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10925
				10926	Overview:
				10927	"""""""""
				10928
				10929	The '``llvm.memmove.*``' intrinsics move a block of memory from the
				10930	source location to the destination location. It is similar to the
				10931	'``llvm.memcpy``' intrinsic but allows the two memory locations to
				10932	overlap.
				10933
				10934	Note that, unlike the standard libc function, the ``llvm.memmove.*``
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10935	intrinsics do not return a value, take an extra ``isvolatile``
				10936	argument and the pointers can be in specified address spaces.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10937
				10938	Arguments:
				10939	""""""""""
				10940
				10941	The first argument is a pointer to the destination, the second is a
				10942	pointer to the source. The third argument is an integer argument
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10943	specifying the number of bytes to copy, and the fourth is a
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10944	boolean indicating a volatile access.
				10945
Daniel Neilson	aac0f8f	2018-01-19 17:32:33 +0000	[diff] [blame]	10946	The :ref:`align <attr_align>` parameter attribute can be provided
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10947	for the first and second arguments.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10948
				10949	If the ``isvolatile`` parameter is ``true``, the ``llvm.memmove`` call
				10950	is a :ref:`volatile operation <volatile>`. The detailed access behavior is
				10951	not very cleanly specified and it is unwise to depend on it.
				10952
				10953	Semantics:
				10954	""""""""""
				10955
				10956	The '``llvm.memmove.*``' intrinsics copy a block of memory from the
				10957	source location to the destination location, which may overlap. It
				10958	copies "len" bytes of memory over. If an argument is known to be
				10959	aligned to some boundary, this can be specified with the ``align``
Bill Wendling	6116315	2013-10-18 23:26:55 +0000	[diff] [blame]	10960	parameter attribute on that argument.
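
A sketch of an overlapping move, which ``llvm.memcpy`` would not permit, is
shown below. It assumes ``%buf`` points to at least 16 bytes; the function
``@shift_right`` is hypothetical.

.. code-block:: llvm

      declare void @llvm.memmove.p0i8.p0i8.i64(i8*, i8*, i64, i1)

      define void @shift_right(i8* %buf) {
        ; Move bytes [0, 15) to [1, 16); source and destination overlap.
        %dst = getelementptr inbounds i8, i8* %buf, i64 1
        call void @llvm.memmove.p0i8.p0i8.i64(i8* %dst, i8* %buf, i64 15, i1 false)
        ret void
      }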
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10961
Daniel Neilson	965613e	2017-07-12 21:57:23 +0000	[diff] [blame]	10962	.. _int_memset:
				10963
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10964	'``llvm.memset.*``' Intrinsics
				10965	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				10966
				10967	Syntax:
				10968	"""""""
				10969
				10970	This is an overloaded intrinsic. You can use ``llvm.memset`` on any integer
				10971	bit width and for different address spaces. However, not all targets
				10972	support all bit widths.
				10973
				10974	::
				10975
				10976	declare void @llvm.memset.p0i8.i32(i8* <dest>, i8 <val>,
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10977	i32 <len>, i1 <isvolatile>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10978	declare void @llvm.memset.p0i8.i64(i8* <dest>, i8 <val>,
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10979	i64 <len>, i1 <isvolatile>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10980
				10981	Overview:
				10982	"""""""""
				10983
				10984	The '``llvm.memset.*``' intrinsics fill a block of memory with a
				10985	particular byte value.
				10986
				10987	Note that, unlike the standard libc function, the ``llvm.memset``
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10988	intrinsic does not return a value and takes an extra volatile
				10989	argument. Also, the destination can be in an arbitrary address space.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10990
				10991	Arguments:
				10992	""""""""""
				10993
				10994	The first argument is a pointer to the destination to fill, the second
				10995	is the byte value with which to fill it, the third argument is an
				10996	integer argument specifying the number of bytes to fill, and the fourth
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	10997	is a boolean indicating a volatile access.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	10998
Daniel Neilson	aac0f8f	2018-01-19 17:32:33 +0000	[diff] [blame]	10999	The :ref:`align <attr_align>` parameter attribute can be provided
Daniel Neilson	1e68724	2018-01-19 17:13:12 +0000	[diff] [blame]	11000	for the first argument.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11001
				11002	If the ``isvolatile`` parameter is ``true``, the ``llvm.memset`` call is
				11003	a :ref:`volatile operation <volatile>`. The detailed access behavior is not
				11004	very cleanly specified and it is unwise to depend on it.
				11005
				11006	Semantics:
				11007	""""""""""
				11008
				11009	The '``llvm.memset.*``' intrinsics fill "len" bytes of memory starting
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11010	at the destination location.
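
For example, zeroing 32 bytes of a 4-byte-aligned buffer could be written as
follows. The function ``@zero32`` is hypothetical.

.. code-block:: llvm

      declare void @llvm.memset.p0i8.i64(i8*, i8, i64, i1)

      define void @zero32(i8* %dst) {
        ; dest = %dst, val = 0, len = 32, isvolatile = false
        call void @llvm.memset.p0i8.i64(i8* align 4 %dst, i8 0, i64 32, i1 false)
        ret void
      }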
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11011
				11012	'``llvm.sqrt.*``' Intrinsic
				11013	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11014
				11015	Syntax:
				11016	"""""""
				11017
				11018	This is an overloaded intrinsic. You can use ``llvm.sqrt`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11019	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11020	all types however.
				11021
				11022	::
				11023
				11024	declare float @llvm.sqrt.f32(float %Val)
				11025	declare double @llvm.sqrt.f64(double %Val)
				11026	declare x86_fp80 @llvm.sqrt.f80(x86_fp80 %Val)
				11027	declare fp128 @llvm.sqrt.f128(fp128 %Val)
				11028	declare ppc_fp128 @llvm.sqrt.ppcf128(ppc_fp128 %Val)
				11029
				11030	Overview:
				11031	"""""""""
				11032
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11033	The '``llvm.sqrt``' intrinsics return the square root of the specified value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11034
				11035	Arguments:
				11036	""""""""""
				11037
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11038	The argument and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11039
				11040	Semantics:
				11041	""""""""""
				11042
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11043	Return the same value as a corresponding libm '``sqrt``' function but without
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11044	trapping or setting ``errno``. For types specified by IEEE-754, the result
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11045	matches a conforming libm implementation.
				11046
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11047	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11048	using a less accurate calculation.
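
The difference between a plain call and one carrying the ``afn`` flag can be
sketched as below. The function names are hypothetical, and whether a target
actually uses an approximation is up to the code generator.

.. code-block:: llvm

      declare float @llvm.sqrt.f32(float)
      declare <4 x float> @llvm.sqrt.v4f32(<4 x float>)

      define float @root(float %x) {
        %r = call float @llvm.sqrt.f32(float %x)
        ret float %r
      }

      define <4 x float> @root_approx(<4 x float> %v) {
        ; afn permits a less accurate calculation for each element.
        %r = call afn <4 x float> @llvm.sqrt.v4f32(<4 x float> %v)
        ret <4 x float> %r
      }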
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11049
				11050	'``llvm.powi.*``' Intrinsic
				11051	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11052
				11053	Syntax:
				11054	"""""""
				11055
				11056	This is an overloaded intrinsic. You can use ``llvm.powi`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11057	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11058	all types however.
				11059
				11060	::
				11061
				11062	declare float @llvm.powi.f32(float %Val, i32 %power)
				11063	declare double @llvm.powi.f64(double %Val, i32 %power)
				11064	declare x86_fp80 @llvm.powi.f80(x86_fp80 %Val, i32 %power)
				11065	declare fp128 @llvm.powi.f128(fp128 %Val, i32 %power)
				11066	declare ppc_fp128 @llvm.powi.ppcf128(ppc_fp128 %Val, i32 %power)
				11067
				11068	Overview:
				11069	"""""""""
				11070
				11071	The '``llvm.powi.*``' intrinsics return the first operand raised to the
				11072	specified (positive or negative) power. The order of evaluation of
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11073	multiplications is not defined. When a vector of floating-point type is
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11074	used, the second argument remains a scalar integer value.
				11075
				11076	Arguments:
				11077	""""""""""
				11078
				11079	The second argument is an integer power, and the first is a value to
				11080	raise to that power.
				11081
				11082	Semantics:
				11083	""""""""""
				11084
				11085	This function returns the first value raised to the second power with an
				11086	unspecified sequence of rounding operations.
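
As a brief sketch, the scalar and vector forms might be used as follows. Note
that the power stays a scalar ``i32`` in the vector form. The function names
are hypothetical.

.. code-block:: llvm

      declare double @llvm.powi.f64(double, i32)
      declare <4 x float> @llvm.powi.v4f32(<4 x float>, i32)

      define double @reciprocal(double %x) {
        ; A negative power is allowed: computes %x raised to -1.
        %r = call double @llvm.powi.f64(double %x, i32 -1)
        ret double %r
      }

      define <4 x float> @cube_each(<4 x float> %v) {
        ; Each element is raised to the same scalar power.
        %r = call <4 x float> @llvm.powi.v4f32(<4 x float> %v, i32 3)
        ret <4 x float> %r
      }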
				11087
				11088	'``llvm.sin.*``' Intrinsic
				11089	^^^^^^^^^^^^^^^^^^^^^^^^^^
				11090
				11091	Syntax:
				11092	"""""""
				11093
				11094	This is an overloaded intrinsic. You can use ``llvm.sin`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11095	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11096	all types however.
				11097
				11098	::
				11099
				11100	declare float @llvm.sin.f32(float %Val)
				11101	declare double @llvm.sin.f64(double %Val)
				11102	declare x86_fp80 @llvm.sin.f80(x86_fp80 %Val)
				11103	declare fp128 @llvm.sin.f128(fp128 %Val)
				11104	declare ppc_fp128 @llvm.sin.ppcf128(ppc_fp128 %Val)
				11105
				11106	Overview:
				11107	"""""""""
				11108
				11109	The '``llvm.sin.*``' intrinsics return the sine of the operand.
				11110
				11111	Arguments:
				11112	""""""""""
				11113
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11114	The argument and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11115
				11116	Semantics:
				11117	""""""""""
				11118
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11119	Return the same value as a corresponding libm '``sin``' function but without
				11120	trapping or setting ``errno``.
				11121
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11122	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11123	using a less accurate calculation.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11124
				11125	'``llvm.cos.*``' Intrinsic
				11126	^^^^^^^^^^^^^^^^^^^^^^^^^^
				11127
				11128	Syntax:
				11129	"""""""
				11130
				11131	This is an overloaded intrinsic. You can use ``llvm.cos`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11132	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11133	all types however.
				11134
				11135	::
				11136
				11137	declare float @llvm.cos.f32(float %Val)
				11138	declare double @llvm.cos.f64(double %Val)
				11139	declare x86_fp80 @llvm.cos.f80(x86_fp80 %Val)
				11140	declare fp128 @llvm.cos.f128(fp128 %Val)
				11141	declare ppc_fp128 @llvm.cos.ppcf128(ppc_fp128 %Val)
				11142
				11143	Overview:
				11144	"""""""""
				11145
				11146	The '``llvm.cos.*``' intrinsics return the cosine of the operand.
				11147
				11148	Arguments:
				11149	""""""""""
				11150
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11151	The argument and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11152
				11153	Semantics:
				11154	""""""""""
				11155
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11156	Return the same value as a corresponding libm '``cos``' function but without
				11157	trapping or setting ``errno``.
				11158
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11159	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11160	using a less accurate calculation.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11161
				11162	'``llvm.pow.*``' Intrinsic
				11163	^^^^^^^^^^^^^^^^^^^^^^^^^^
				11164
				11165	Syntax:
				11166	"""""""
				11167
				11168	This is an overloaded intrinsic. You can use ``llvm.pow`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11169	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11170	all types however.
				11171
				11172	::
				11173
				11174	declare float @llvm.pow.f32(float %Val, float %Power)
				11175	declare double @llvm.pow.f64(double %Val, double %Power)
				11176	declare x86_fp80 @llvm.pow.f80(x86_fp80 %Val, x86_fp80 %Power)
				11177	declare fp128 @llvm.pow.f128(fp128 %Val, fp128 %Power)
				11178	declare ppc_fp128 @llvm.pow.ppcf128(ppc_fp128 %Val, ppc_fp128 %Power)
				11179
				11180	Overview:
				11181	"""""""""
				11182
				11183	The '``llvm.pow.*``' intrinsics return the first operand raised to the
				11184	specified (positive or negative) power.
				11185
				11186	Arguments:
				11187	""""""""""
				11188
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11189	The arguments and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11190
				11191	Semantics:
				11192	""""""""""
				11193
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11194	Return the same value as a corresponding libm '``pow``' function but without
				11195	trapping or setting ``errno``.
				11196
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11197	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11198	using a less accurate calculation.
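
Examples:
"""""""""

The following is a minimal sketch of how the intrinsic might be called. The
function names ``@cube`` and ``@cube_approx`` are hypothetical and chosen only
for illustration; the second variant assumes the caller is willing to accept a
less accurate result.

.. code-block:: llvm

    declare float @llvm.pow.f32(float, float)

    ; Computes %x raised to the power 3.0.
    define float @cube(float %x) {
      %r = call float @llvm.pow.f32(float %x, float 3.000000e+00)
      ret float %r
    }

    ; Same computation, but 'afn' permits an approximate implementation.
    define float @cube_approx(float %x) {
      %r = call afn float @llvm.pow.f32(float %x, float 3.000000e+00)
      ret float %r
    }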
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11199
				11200	'``llvm.exp.*``' Intrinsic
				11201	^^^^^^^^^^^^^^^^^^^^^^^^^^
				11202
				11203	Syntax:
				11204	"""""""
				11205
				11206	This is an overloaded intrinsic. You can use ``llvm.exp`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11207	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11208	all types however.
				11209
				11210	::
				11211
				11212	declare float @llvm.exp.f32(float %Val)
				11213	declare double @llvm.exp.f64(double %Val)
				11214	declare x86_fp80 @llvm.exp.f80(x86_fp80 %Val)
				11215	declare fp128 @llvm.exp.f128(fp128 %Val)
				11216	declare ppc_fp128 @llvm.exp.ppcf128(ppc_fp128 %Val)
				11217
				11218	Overview:
				11219	"""""""""
				11220
Andrew Kaylor	caf24d2	2017-04-11 21:52:40 +0000	[diff] [blame]	11221	The '``llvm.exp.*``' intrinsics compute the base-e exponential of the specified
				11222	value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11223
				11224	Arguments:
				11225	""""""""""
				11226
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11227	The argument and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11228
				11229	Semantics:
				11230	""""""""""
				11231
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11232	Return the same value as a corresponding libm '``exp``' function but without
				11233	trapping or setting ``errno``.
				11234
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11235	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11236	using a less accurate calculation.
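
Examples:
"""""""""

As a sketch of the vector overload mentioned above, the call below applies the
intrinsic to a ``<4 x float>`` value. The function name ``@exp_lanes`` is
hypothetical, and whether a given target supports this type is not guaranteed.

.. code-block:: llvm

    declare <4 x float> @llvm.exp.v4f32(<4 x float>)

    ; Applies the base-e exponential to each of the four lanes of %v.
    define <4 x float> @exp_lanes(<4 x float> %v) {
      %r = call <4 x float> @llvm.exp.v4f32(<4 x float> %v)
      ret <4 x float> %r
    }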
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11237
				11238	'``llvm.exp2.*``' Intrinsic
				11239	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11240
				11241	Syntax:
				11242	"""""""
				11243
				11244	This is an overloaded intrinsic. You can use ``llvm.exp2`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11245	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11246	all types however.
				11247
				11248	::
				11249
				11250	declare float @llvm.exp2.f32(float %Val)
				11251	declare double @llvm.exp2.f64(double %Val)
				11252	declare x86_fp80 @llvm.exp2.f80(x86_fp80 %Val)
				11253	declare fp128 @llvm.exp2.f128(fp128 %Val)
				11254	declare ppc_fp128 @llvm.exp2.ppcf128(ppc_fp128 %Val)
				11255
				11256	Overview:
				11257	"""""""""
				11258
Andrew Kaylor	caf24d2	2017-04-11 21:52:40 +0000	[diff] [blame]	11259	The '``llvm.exp2.*``' intrinsics compute the base-2 exponential of the
				11260	specified value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11261
				11262	Arguments:
				11263	""""""""""
				11264
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11265	The argument and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11266
				11267	Semantics:
				11268	""""""""""
				11269
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11270	Return the same value as a corresponding libm '``exp2``' function but without
				11271	trapping or setting ``errno``.
				11272
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11273	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11274	using a less accurate calculation.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11275
				11276	'``llvm.log.*``' Intrinsic
				11277	^^^^^^^^^^^^^^^^^^^^^^^^^^
				11278
				11279	Syntax:
				11280	"""""""
				11281
				11282	This is an overloaded intrinsic. You can use ``llvm.log`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11283	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11284	all types however.
				11285
				11286	::
				11287
				11288	declare float @llvm.log.f32(float %Val)
				11289	declare double @llvm.log.f64(double %Val)
				11290	declare x86_fp80 @llvm.log.f80(x86_fp80 %Val)
				11291	declare fp128 @llvm.log.f128(fp128 %Val)
				11292	declare ppc_fp128 @llvm.log.ppcf128(ppc_fp128 %Val)
				11293
				11294	Overview:
				11295	"""""""""
				11296
Andrew Kaylor	caf24d2	2017-04-11 21:52:40 +0000	[diff] [blame]	11297	The '``llvm.log.*``' intrinsics compute the base-e logarithm of the specified
				11298	value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11299
				11300	Arguments:
				11301	""""""""""
				11302
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11303	The argument and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11304
				11305	Semantics:
				11306	""""""""""
				11307
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11308	Return the same value as a corresponding libm '``log``' function but without
				11309	trapping or setting ``errno``.
				11310
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11311	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11312	using a less accurate calculation.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11313
				11314	'``llvm.log10.*``' Intrinsic
				11315	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11316
				11317	Syntax:
				11318	"""""""
				11319
				11320	This is an overloaded intrinsic. You can use ``llvm.log10`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11321	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11322	all types however.
				11323
				11324	::
				11325
				11326	declare float @llvm.log10.f32(float %Val)
				11327	declare double @llvm.log10.f64(double %Val)
				11328	declare x86_fp80 @llvm.log10.f80(x86_fp80 %Val)
				11329	declare fp128 @llvm.log10.f128(fp128 %Val)
				11330	declare ppc_fp128 @llvm.log10.ppcf128(ppc_fp128 %Val)
				11331
				11332	Overview:
				11333	"""""""""
				11334
Andrew Kaylor	caf24d2	2017-04-11 21:52:40 +0000	[diff] [blame]	11335	The '``llvm.log10.*``' intrinsics compute the base-10 logarithm of the
				11336	specified value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11337
				11338	Arguments:
				11339	""""""""""
				11340
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11341	The argument and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11342
				11343	Semantics:
				11344	""""""""""
				11345
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11346	Return the same value as a corresponding libm '``log10``' function but without
				11347	trapping or setting ``errno``.
				11348
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11349	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11350	using a less accurate calculation.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11351
				11352	'``llvm.log2.*``' Intrinsic
				11353	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11354
				11355	Syntax:
				11356	"""""""
				11357
				11358	This is an overloaded intrinsic. You can use ``llvm.log2`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11359	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11360	all types however.
				11361
				11362	::
				11363
				11364	declare float @llvm.log2.f32(float %Val)
				11365	declare double @llvm.log2.f64(double %Val)
				11366	declare x86_fp80 @llvm.log2.f80(x86_fp80 %Val)
				11367	declare fp128 @llvm.log2.f128(fp128 %Val)
				11368	declare ppc_fp128 @llvm.log2.ppcf128(ppc_fp128 %Val)
				11369
				11370	Overview:
				11371	"""""""""
				11372
Andrew Kaylor	caf24d2	2017-04-11 21:52:40 +0000	[diff] [blame]	11373	The '``llvm.log2.*``' intrinsics compute the base-2 logarithm of the specified
				11374	value.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11375
				11376	Arguments:
				11377	""""""""""
				11378
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11379	The argument and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11380
				11381	Semantics:
				11382	""""""""""
				11383
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11384	Return the same value as a corresponding libm '``log2``' function but without
				11385	trapping or setting ``errno``.
				11386
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11387	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11388	using a less accurate calculation.
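
Examples:
"""""""""

A minimal sketch of a scalar call with a constant operand. The function name
``@log2_of_eight`` is hypothetical and used only for illustration.

.. code-block:: llvm

    declare double @llvm.log2.f64(double)

    ; Mathematically, the base-2 logarithm of 8.0 is 3.0.
    define double @log2_of_eight() {
      %r = call double @llvm.log2.f64(double 8.000000e+00)
      ret double %r
    }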
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11389
				11390	'``llvm.fma.*``' Intrinsic
				11391	^^^^^^^^^^^^^^^^^^^^^^^^^^
				11392
				11393	Syntax:
				11394	"""""""
				11395
				11396	This is an overloaded intrinsic. You can use ``llvm.fma`` on any
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11397	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11398	all types however.
				11399
				11400	::
				11401
				11402	declare float @llvm.fma.f32(float %a, float %b, float %c)
				11403	declare double @llvm.fma.f64(double %a, double %b, double %c)
				11404	declare x86_fp80 @llvm.fma.f80(x86_fp80 %a, x86_fp80 %b, x86_fp80 %c)
				11405	declare fp128 @llvm.fma.f128(fp128 %a, fp128 %b, fp128 %c)
				11406	declare ppc_fp128 @llvm.fma.ppcf128(ppc_fp128 %a, ppc_fp128 %b, ppc_fp128 %c)
				11407
				11408	Overview:
				11409	"""""""""
				11410
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11411	The '``llvm.fma.*``' intrinsics perform the fused multiply-add operation.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11412
				11413	Arguments:
				11414	""""""""""
				11415
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11416	The arguments and return value are floating-point numbers of the same type.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11417
				11418	Semantics:
				11419	""""""""""
				11420
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11421	Return the same value as a corresponding libm '``fma``' function but without
				11422	trapping or setting ``errno``.
				11423
Elena Demikhovsky	945b7e5	2018-02-14 06:58:08 +0000	[diff] [blame]	11424	When specified with the fast-math-flag 'afn', the result may be approximated
Sanjay Patel	629c411	2017-11-06 16:27:15 +0000	[diff] [blame]	11425	using a less accurate calculation.
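
Examples:
"""""""""

A minimal sketch of a fused multiply-add call. The function name ``@muladd`` is
hypothetical. As with the libm ``fma`` function, the product and sum are
computed as one operation and rounded once.

.. code-block:: llvm

    declare double @llvm.fma.f64(double, double, double)

    ; Computes (%a * %b) + %c with a single rounding step.
    define double @muladd(double %a, double %b, double %c) {
      %r = call double @llvm.fma.f64(double %a, double %b, double %c)
      ret double %r
    }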
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11426
				11427	'``llvm.fabs.*``' Intrinsic
				11428	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11429
				11430	Syntax:
				11431	"""""""
				11432
				11433	This is an overloaded intrinsic. You can use ``llvm.fabs`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11434	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11435	all types however.
				11436
				11437	::
				11438
				11439	declare float @llvm.fabs.f32(float %Val)
				11440	declare double @llvm.fabs.f64(double %Val)
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11441	declare x86_fp80 @llvm.fabs.f80(x86_fp80 %Val)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11442	declare fp128 @llvm.fabs.f128(fp128 %Val)
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11443	declare ppc_fp128 @llvm.fabs.ppcf128(ppc_fp128 %Val)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11444
				11445	Overview:
				11446	"""""""""
				11447
				11448	The '``llvm.fabs.*``' intrinsics return the absolute value of the
				11449	operand.
				11450
				11451	Arguments:
				11452	""""""""""
				11453
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11454	The argument and return value are floating-point numbers of the same
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11455	type.
				11456
				11457	Semantics:
				11458	""""""""""
				11459
				11460	This function returns the same values as the libm ``fabs`` functions
				11461	would, and handles error conditions in the same way.
				11462
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11463	'``llvm.minnum.*``' Intrinsic
Matt Arsenault	9886b0d	2014-10-22 00:15:53 +0000	[diff] [blame]	11464	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11465
				11466	Syntax:
				11467	"""""""
				11468
				11469	This is an overloaded intrinsic. You can use ``llvm.minnum`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11470	floating-point or vector of floating-point type. Not all targets support
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11471	all types however.
				11472
				11473	::
				11474
Matt Arsenault	64313c9	2014-10-22 18:25:02 +0000	[diff] [blame]	11475	declare float @llvm.minnum.f32(float %Val0, float %Val1)
				11476	declare double @llvm.minnum.f64(double %Val0, double %Val1)
				11477	declare x86_fp80 @llvm.minnum.f80(x86_fp80 %Val0, x86_fp80 %Val1)
				11478	declare fp128 @llvm.minnum.f128(fp128 %Val0, fp128 %Val1)
				11479	declare ppc_fp128 @llvm.minnum.ppcf128(ppc_fp128 %Val0, ppc_fp128 %Val1)
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11480
				11481	Overview:
				11482	"""""""""
				11483
				11484	The '``llvm.minnum.*``' intrinsics return the minimum of the two
				11485	arguments.
				11486
				11487
				11488	Arguments:
				11489	""""""""""
				11490
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11491	The arguments and return value are floating-point numbers of the same
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11492	type.
				11493
				11494	Semantics:
				11495	""""""""""
				11496
Matt Arsenault	937003c	2018-08-27 17:40:07 +0000	[diff] [blame]	11497	Follows the IEEE-754 semantics for minNum, except for handling of
signaling NaNs. This matches the behavior of libm's fmin.
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11499
				11500	If either operand is a NaN, returns the other non-NaN operand. Returns
Matt Arsenault	937003c	2018-08-27 17:40:07 +0000	[diff] [blame]	11501	NaN only if both operands are NaN. The returned NaN is always
				11502	quiet. If the operands compare equal, returns a value that compares
				11503	equal to both operands. This means that fmin(+/-0.0, +/-0.0) could
				11504	return either -0.0 or 0.0.
				11505
				11506	Unlike the IEEE-754 2008 behavior, this does not distinguish between
				11507	signaling and quiet NaN inputs. If a target's implementation follows
				11508	the standard and returns a quiet NaN if either input is a signaling
				11509	NaN, the intrinsic lowering is responsible for quieting the inputs to
				11510	correctly return the non-NaN input (e.g. by using the equivalent of
				11511	``llvm.canonicalize``).
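
Examples:
"""""""""

The NaN handling described above can be sketched as follows. The function
names ``@min_with_nan`` and ``@cap_at_one`` are hypothetical; the constant
``0x7FF8000000000000`` is a quiet NaN.

.. code-block:: llvm

    declare float @llvm.minnum.f32(float, float)

    ; The NaN operand is ignored, so the result is %x whenever %x is not
    ; itself a NaN.
    define float @min_with_nan(float %x) {
      %r = call float @llvm.minnum.f32(float %x, float 0x7FF8000000000000)
      ret float %r
    }

    ; Limits %x to at most 1.0. A NaN input yields 1.0.
    define float @cap_at_one(float %x) {
      %r = call float @llvm.minnum.f32(float %x, float 1.000000e+00)
      ret float %r
    }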
				11512
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11513
				11514	'``llvm.maxnum.*``' Intrinsic
Matt Arsenault	9886b0d	2014-10-22 00:15:53 +0000	[diff] [blame]	11515	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11516
				11517	Syntax:
				11518	"""""""
				11519
				11520	This is an overloaded intrinsic. You can use ``llvm.maxnum`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11521	floating-point or vector of floating-point type. Not all targets support
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11522	all types however.
				11523
				11524	::
				11525
declare float @llvm.maxnum.f32(float %Val0, float %Val1)
				11527	declare double @llvm.maxnum.f64(double %Val0, double %Val1)
				11528	declare x86_fp80 @llvm.maxnum.f80(x86_fp80 %Val0, x86_fp80 %Val1)
				11529	declare fp128 @llvm.maxnum.f128(fp128 %Val0, fp128 %Val1)
				11530	declare ppc_fp128 @llvm.maxnum.ppcf128(ppc_fp128 %Val0, ppc_fp128 %Val1)
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11531
				11532	Overview:
				11533	"""""""""
				11534
				11535	The '``llvm.maxnum.*``' intrinsics return the maximum of the two
				11536	arguments.
				11537
				11538
				11539	Arguments:
				11540	""""""""""
				11541
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11542	The arguments and return value are floating-point numbers of the same
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11543	type.
				11544
Semantics:
""""""""""

Follows the IEEE-754 semantics for maxNum except for the handling of
				11548	signaling NaNs. This matches the behavior of libm's fmax.
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11549
				11550	If either operand is a NaN, returns the other non-NaN operand. Returns
Matt Arsenault	937003c	2018-08-27 17:40:07 +0000	[diff] [blame]	11551	NaN only if both operands are NaN. The returned NaN is always
				11552	quiet. If the operands compare equal, returns a value that compares
				11553	equal to both operands. This means that fmax(+/-0.0, +/-0.0) could
				11554	return either -0.0 or 0.0.
				11555
				11556	Unlike the IEEE-754 2008 behavior, this does not distinguish between
				11557	signaling and quiet NaN inputs. If a target's implementation follows
				11558	the standard and returns a quiet NaN if either input is a signaling
				11559	NaN, the intrinsic lowering is responsible for quieting the inputs to
				11560	correctly return the non-NaN input (e.g. by using the equivalent of
				11561	``llvm.canonicalize``).
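
Examples:
"""""""""

One possible use, sketched under the semantics above, is clamping a value to a
range by pairing this intrinsic with ``llvm.minnum``. The function name
``@clamp01`` is hypothetical.

.. code-block:: llvm

    declare float @llvm.maxnum.f32(float, float)
    declare float @llvm.minnum.f32(float, float)

    ; Clamps %x to the range [0.0, 1.0]. A NaN input yields 0.0, because
    ; maxnum returns the non-NaN operand.
    define float @clamp01(float %x) {
      %lo = call float @llvm.maxnum.f32(float %x, float 0.000000e+00)
      %r = call float @llvm.minnum.f32(float %lo, float 1.000000e+00)
      ret float %r
    }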
Matt Arsenault	d6511b4	2014-10-21 23:00:20 +0000	[diff] [blame]	11562
Hal Finkel	0c5c01aa	2013-08-19 23:35:46 +0000	[diff] [blame]	11563	'``llvm.copysign.*``' Intrinsic
				11564	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11565
				11566	Syntax:
				11567	"""""""
				11568
				11569	This is an overloaded intrinsic. You can use ``llvm.copysign`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11570	floating-point or vector of floating-point type. Not all targets support
Hal Finkel	0c5c01aa	2013-08-19 23:35:46 +0000	[diff] [blame]	11571	all types however.
				11572
				11573	::
				11574
				11575	declare float @llvm.copysign.f32(float %Mag, float %Sgn)
				11576	declare double @llvm.copysign.f64(double %Mag, double %Sgn)
				11577	declare x86_fp80 @llvm.copysign.f80(x86_fp80 %Mag, x86_fp80 %Sgn)
				11578	declare fp128 @llvm.copysign.f128(fp128 %Mag, fp128 %Sgn)
				11579	declare ppc_fp128 @llvm.copysign.ppcf128(ppc_fp128 %Mag, ppc_fp128 %Sgn)
				11580
				11581	Overview:
				11582	"""""""""
				11583
				11584	The '``llvm.copysign.*``' intrinsics return a value with the magnitude of the
				11585	first operand and the sign of the second operand.
				11586
				11587	Arguments:
				11588	""""""""""
				11589
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11590	The arguments and return value are floating-point numbers of the same
Hal Finkel	0c5c01aa	2013-08-19 23:35:46 +0000	[diff] [blame]	11591	type.
				11592
				11593	Semantics:
				11594	""""""""""
				11595
				11596	This function returns the same values as the libm ``copysign``
				11597	functions would, and handles error conditions in the same way.
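
Examples:
"""""""""

A minimal sketch of a call; the function name ``@with_sign_of`` is hypothetical.

.. code-block:: llvm

    declare double @llvm.copysign.f64(double, double)

    ; Returns the magnitude of %mag with the sign of %sgn. For example,
    ; %mag = 3.0 and %sgn = -0.0 gives -3.0.
    define double @with_sign_of(double %mag, double %sgn) {
      %r = call double @llvm.copysign.f64(double %mag, double %sgn)
      ret double %r
    }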
				11598
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11599	'``llvm.floor.*``' Intrinsic
				11600	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11601
				11602	Syntax:
				11603	"""""""
				11604
				11605	This is an overloaded intrinsic. You can use ``llvm.floor`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11606	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11607	all types however.
				11608
				11609	::
				11610
				11611	declare float @llvm.floor.f32(float %Val)
				11612	declare double @llvm.floor.f64(double %Val)
				11613	declare x86_fp80 @llvm.floor.f80(x86_fp80 %Val)
				11614	declare fp128 @llvm.floor.f128(fp128 %Val)
				11615	declare ppc_fp128 @llvm.floor.ppcf128(ppc_fp128 %Val)
				11616
				11617	Overview:
				11618	"""""""""
				11619
				11620	The '``llvm.floor.*``' intrinsics return the floor of the operand.
				11621
				11622	Arguments:
				11623	""""""""""
				11624
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11625	The argument and return value are floating-point numbers of the same
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11626	type.
				11627
				11628	Semantics:
				11629	""""""""""
				11630
				11631	This function returns the same values as the libm ``floor`` functions
				11632	would, and handles error conditions in the same way.
				11633
				11634	'``llvm.ceil.*``' Intrinsic
				11635	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11636
				11637	Syntax:
				11638	"""""""
				11639
				11640	This is an overloaded intrinsic. You can use ``llvm.ceil`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11641	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11642	all types however.
				11643
				11644	::
				11645
				11646	declare float @llvm.ceil.f32(float %Val)
				11647	declare double @llvm.ceil.f64(double %Val)
				11648	declare x86_fp80 @llvm.ceil.f80(x86_fp80 %Val)
				11649	declare fp128 @llvm.ceil.f128(fp128 %Val)
				11650	declare ppc_fp128 @llvm.ceil.ppcf128(ppc_fp128 %Val)
				11651
				11652	Overview:
				11653	"""""""""
				11654
				11655	The '``llvm.ceil.*``' intrinsics return the ceiling of the operand.
				11656
				11657	Arguments:
				11658	""""""""""
				11659
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11660	The argument and return value are floating-point numbers of the same
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11661	type.
				11662
				11663	Semantics:
				11664	""""""""""
				11665
				11666	This function returns the same values as the libm ``ceil`` functions
				11667	would, and handles error conditions in the same way.
				11668
				11669	'``llvm.trunc.*``' Intrinsic
				11670	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11671
				11672	Syntax:
				11673	"""""""
				11674
				11675	This is an overloaded intrinsic. You can use ``llvm.trunc`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11676	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11677	all types however.
				11678
				11679	::
				11680
				11681	declare float @llvm.trunc.f32(float %Val)
				11682	declare double @llvm.trunc.f64(double %Val)
				11683	declare x86_fp80 @llvm.trunc.f80(x86_fp80 %Val)
				11684	declare fp128 @llvm.trunc.f128(fp128 %Val)
				11685	declare ppc_fp128 @llvm.trunc.ppcf128(ppc_fp128 %Val)
				11686
				11687	Overview:
				11688	"""""""""
				11689
The '``llvm.trunc.*``' intrinsics return the operand rounded to the
				11691	nearest integer not larger in magnitude than the operand.
				11692
				11693	Arguments:
				11694	""""""""""
				11695
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11696	The argument and return value are floating-point numbers of the same
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11697	type.
				11698
				11699	Semantics:
				11700	""""""""""
				11701
				11702	This function returns the same values as the libm ``trunc`` functions
				11703	would, and handles error conditions in the same way.
				11704
				11705	'``llvm.rint.*``' Intrinsic
				11706	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11707
				11708	Syntax:
				11709	"""""""
				11710
				11711	This is an overloaded intrinsic. You can use ``llvm.rint`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11712	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11713	all types however.
				11714
				11715	::
				11716
				11717	declare float @llvm.rint.f32(float %Val)
				11718	declare double @llvm.rint.f64(double %Val)
				11719	declare x86_fp80 @llvm.rint.f80(x86_fp80 %Val)
				11720	declare fp128 @llvm.rint.f128(fp128 %Val)
				11721	declare ppc_fp128 @llvm.rint.ppcf128(ppc_fp128 %Val)
				11722
				11723	Overview:
				11724	"""""""""
				11725
The '``llvm.rint.*``' intrinsics return the operand rounded to the
nearest integer. They may raise an inexact floating-point exception if the
operand is not an integer.
				11729
				11730	Arguments:
				11731	""""""""""
				11732
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11733	The argument and return value are floating-point numbers of the same
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11734	type.
				11735
				11736	Semantics:
				11737	""""""""""
				11738
				11739	This function returns the same values as the libm ``rint`` functions
				11740	would, and handles error conditions in the same way.
				11741
				11742	'``llvm.nearbyint.*``' Intrinsic
				11743	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11744
				11745	Syntax:
				11746	"""""""
				11747
				11748	This is an overloaded intrinsic. You can use ``llvm.nearbyint`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11749	floating-point or vector of floating-point type. Not all targets support
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11750	all types however.
				11751
				11752	::
				11753
				11754	declare float @llvm.nearbyint.f32(float %Val)
				11755	declare double @llvm.nearbyint.f64(double %Val)
				11756	declare x86_fp80 @llvm.nearbyint.f80(x86_fp80 %Val)
				11757	declare fp128 @llvm.nearbyint.f128(fp128 %Val)
				11758	declare ppc_fp128 @llvm.nearbyint.ppcf128(ppc_fp128 %Val)
				11759
				11760	Overview:
				11761	"""""""""
				11762
The '``llvm.nearbyint.*``' intrinsics return the operand rounded to the
				11764	nearest integer.
				11765
				11766	Arguments:
				11767	""""""""""
				11768
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11769	The argument and return value are floating-point numbers of the same
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11770	type.
				11771
				11772	Semantics:
				11773	""""""""""
				11774
				11775	This function returns the same values as the libm ``nearbyint``
				11776	functions would, and handles error conditions in the same way.
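
Examples:
"""""""""

The sketch below places ``llvm.nearbyint`` next to ``llvm.rint`` for
comparison. It assumes the default round-to-nearest-even rounding mode and the
usual libm behavior, in which ``nearbyint`` does not raise the inexact
exception. The function names are hypothetical.

.. code-block:: llvm

    declare double @llvm.nearbyint.f64(double)
    declare double @llvm.rint.f64(double)

    ; For %x = 2.5 the result is 2.0; no inexact exception is raised.
    define double @nearest_quiet(double %x) {
      %r = call double @llvm.nearbyint.f64(double %x)
      ret double %r
    }

    ; For %x = 2.5 the result is also 2.0, but inexact may be raised.
    define double @nearest_signaling(double %x) {
      %r = call double @llvm.rint.f64(double %x)
      ret double %r
    }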
				11777
Hal Finkel	171817e	2013-08-07 22:49:12 +0000	[diff] [blame]	11778	'``llvm.round.*``' Intrinsic
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11780
				11781	Syntax:
				11782	"""""""
				11783
				11784	This is an overloaded intrinsic. You can use ``llvm.round`` on any
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11785	floating-point or vector of floating-point type. Not all targets support
Hal Finkel	171817e	2013-08-07 22:49:12 +0000	[diff] [blame]	11786	all types however.
				11787
				11788	::
				11789
				11790	declare float @llvm.round.f32(float %Val)
				11791	declare double @llvm.round.f64(double %Val)
				11792	declare x86_fp80 @llvm.round.f80(x86_fp80 %Val)
				11793	declare fp128 @llvm.round.f128(fp128 %Val)
				11794	declare ppc_fp128 @llvm.round.ppcf128(ppc_fp128 %Val)
				11795
				11796	Overview:
				11797	"""""""""
				11798
The '``llvm.round.*``' intrinsics return the operand rounded to the
nearest integer, with halfway cases rounded away from zero.
				11801
				11802	Arguments:
				11803	""""""""""
				11804
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	11805	The argument and return value are floating-point numbers of the same
Hal Finkel	171817e	2013-08-07 22:49:12 +0000	[diff] [blame]	11806	type.
				11807
				11808	Semantics:
				11809	""""""""""
				11810
				11811	This function returns the same values as the libm ``round``
				11812	functions would, and handles error conditions in the same way.
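
Examples:
"""""""""

To contrast this intrinsic with ``llvm.floor``, ``llvm.ceil`` and
``llvm.trunc``, the sketch below applies all four to the same operand. The
function name ``@rounding_modes`` is hypothetical, and the comments show the
results expected from the corresponding libm functions for ``%x = -2.5``.

.. code-block:: llvm

    declare float @llvm.floor.f32(float)
    declare float @llvm.ceil.f32(float)
    declare float @llvm.trunc.f32(float)
    declare float @llvm.round.f32(float)

    define float @rounding_modes(float %x) {
      %f = call float @llvm.floor.f32(float %x)  ; -3.0 (toward -infinity)
      %c = call float @llvm.ceil.f32(float %x)   ; -2.0 (toward +infinity)
      %t = call float @llvm.trunc.f32(float %x)  ; -2.0 (toward zero)
      %r = call float @llvm.round.f32(float %x)  ; -3.0 (halfway away from zero)
      ret float %r
    }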
				11813
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11814	Bit Manipulation Intrinsics
				11815	---------------------------
				11816
				11817	LLVM provides intrinsics for a few important bit manipulation
				11818	operations. These allow efficient code generation for some algorithms.
				11819
James Molloy	90111f7	2015-11-12 12:29:09 +0000	[diff] [blame]	11820	'``llvm.bitreverse.*``' Intrinsics
Akira Hatanaka	7f5562b	2015-11-13 21:09:57 +0000	[diff] [blame]	11821	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
James Molloy	90111f7	2015-11-12 12:29:09 +0000	[diff] [blame]	11822
				11823	Syntax:
				11824	"""""""
				11825
				11826	This is an overloaded intrinsic function. You can use bitreverse on any
				11827	integer type.
				11828
				11829	::
				11830
				11831	declare i16 @llvm.bitreverse.i16(i16 <id>)
				11832	declare i32 @llvm.bitreverse.i32(i32 <id>)
				11833	declare i64 @llvm.bitreverse.i64(i64 <id>)
				11834
				11835	Overview:
				11836	"""""""""
				11837
				11838	The '``llvm.bitreverse``' family of intrinsics is used to reverse the
Matt Arsenault	de2d6a3	2016-03-07 21:54:52 +0000	[diff] [blame]	11839	bitpattern of an integer value; for example ``0b10110110`` becomes
				11840	``0b01101101``.
James Molloy	90111f7	2015-11-12 12:29:09 +0000	[diff] [blame]	11841
				11842	Semantics:
				11843	""""""""""
				11844
The ``llvm.bitreverse.iN`` intrinsic returns an iN value that has bit
``M`` in the input moved to bit ``N-M-1`` in the output.
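
Examples:
"""""""""

A minimal sketch using the ``i16`` overload with a constant operand. The
function name ``@reverse_example`` is hypothetical.

.. code-block:: llvm

    declare i16 @llvm.bitreverse.i16(i16)

    ; 182 is 0b0000000010110110; the result is 0b0110110100000000 (27904).
    define i16 @reverse_example() {
      %r = call i16 @llvm.bitreverse.i16(i16 182)
      ret i16 %r
    }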
				11847
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11848	'``llvm.bswap.*``' Intrinsics
				11849	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11850
				11851	Syntax:
				11852	"""""""
				11853
				11854	This is an overloaded intrinsic function. You can use bswap on any
				11855	integer type that is an even number of bytes (i.e. BitWidth % 16 == 0).
				11856
				11857	::
				11858
				11859	declare i16 @llvm.bswap.i16(i16 <id>)
				11860	declare i32 @llvm.bswap.i32(i32 <id>)
				11861	declare i64 @llvm.bswap.i64(i64 <id>)
				11862
				11863	Overview:
				11864	"""""""""
				11865
				11866	The '``llvm.bswap``' family of intrinsics is used to byte swap integer
				11867	values with an even number of bytes (positive multiple of 16 bits).
				11868	These are useful for performing operations on data that is not in the
				11869	target's native byte order.
				11870
				11871	Semantics:
				11872	""""""""""
				11873
				11874	The ``llvm.bswap.i16`` intrinsic returns an i16 value that has the high
				11875	and low byte of the input i16 swapped. Similarly, the ``llvm.bswap.i32``
				11876	intrinsic returns an i32 value that has the four bytes of the input i32
				11877	swapped, so that if the input bytes are numbered 0, 1, 2, 3 then the
				11878	returned i32 will have its bytes in 3, 2, 1, 0 order. The
				11879	``llvm.bswap.i48``, ``llvm.bswap.i64`` and other intrinsics extend this
				11880	concept to additional even-byte lengths (6 bytes, 8 bytes and more,
				11881	respectively).
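
Examples:
"""""""""

As a small sketch of the byte ordering described above (the function name
``@bswap_example`` is hypothetical, and the hexadecimal values in the comments
are only annotations for the decimal constants):

.. code-block:: llvm

      declare i16 @llvm.bswap.i16(i16)
      declare i32 @llvm.bswap.i32(i32)

      ; @bswap_example is a hypothetical function used only for illustration.
      define i32 @bswap_example() {
        %h = call i16 @llvm.bswap.i16(i16 4660)       ; 0x1234     -> 0x3412     (13330)
        %w = call i32 @llvm.bswap.i32(i32 287454020)  ; 0x11223344 -> 0x44332211 (1144201745)
        ret i32 %w
      }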
				11882
				11883	'``llvm.ctpop.*``' Intrinsic
				11884	^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11885
				11886	Syntax:
				11887	"""""""
				11888
This is an overloaded intrinsic. You can use ``llvm.ctpop`` on any integer
bit width, or on any vector with integer elements. Not all targets
support all bit widths or vector types, however.
				11892
				11893	::
				11894
				11895	declare i8 @llvm.ctpop.i8(i8 <src>)
				11896	declare i16 @llvm.ctpop.i16(i16 <src>)
				11897	declare i32 @llvm.ctpop.i32(i32 <src>)
				11898	declare i64 @llvm.ctpop.i64(i64 <src>)
				11899	declare i256 @llvm.ctpop.i256(i256 <src>)
				11900	declare <2 x i32> @llvm.ctpop.v2i32(<2 x i32> <src>)
				11901
				11902	Overview:
				11903	"""""""""
				11904
				11905	The '``llvm.ctpop``' family of intrinsics counts the number of bits set
				11906	in a value.
				11907
				11908	Arguments:
				11909	""""""""""
				11910
				11911	The only argument is the value to be counted. The argument may be of any
				11912	integer type, or a vector with integer elements. The return type must
				11913	match the argument type.
				11914
				11915	Semantics:
				11916	""""""""""
				11917
The '``llvm.ctpop``' intrinsic counts the 1 bits in a value, or within
each element of a vector.
				11920
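Examples:
"""""""""

A minimal sketch of the scalar and vector forms; the function name
``@ctpop_example`` is hypothetical and the results in the comments were
counted by hand:

.. code-block:: llvm

      declare i32 @llvm.ctpop.i32(i32)
      declare <2 x i32> @llvm.ctpop.v2i32(<2 x i32>)

      ; @ctpop_example is a hypothetical function used only for illustration.
      define <2 x i32> @ctpop_example() {
        %s = call i32 @llvm.ctpop.i32(i32 255)                           ; 0xFF has 8 bits set -> 8
        %v = call <2 x i32> @llvm.ctpop.v2i32(<2 x i32> <i32 7, i32 8>)  ; <i32 3, i32 1>
        ret <2 x i32> %v
      }
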
				11921	'``llvm.ctlz.*``' Intrinsic
				11922	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11923
				11924	Syntax:
				11925	"""""""
				11926
				11927	This is an overloaded intrinsic. You can use ``llvm.ctlz`` on any
				11928	integer bit width, or any vector whose elements are integers. Not all
				11929	targets support all bit widths or vector types, however.
				11930
				11931	::
				11932
      declare i8 @llvm.ctlz.i8(i8 <src>, i1 <is_zero_undef>)
      declare i16 @llvm.ctlz.i16(i16 <src>, i1 <is_zero_undef>)
      declare i32 @llvm.ctlz.i32(i32 <src>, i1 <is_zero_undef>)
      declare i64 @llvm.ctlz.i64(i64 <src>, i1 <is_zero_undef>)
      declare i256 @llvm.ctlz.i256(i256 <src>, i1 <is_zero_undef>)
      declare <2 x i32> @llvm.ctlz.v2i32(<2 x i32> <src>, i1 <is_zero_undef>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11939
				11940	Overview:
				11941	"""""""""
				11942
				11943	The '``llvm.ctlz``' family of intrinsic functions counts the number of
				11944	leading zeros in a variable.
				11945
				11946	Arguments:
				11947	""""""""""
				11948
				11949	The first argument is the value to be counted. This argument may be of
Hal Finkel	5dd8278	2015-01-05 04:05:21 +0000	[diff] [blame]	11950	any integer type, or a vector with integer element type. The return
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11951	type must match the first argument type.
				11952
The second argument must be a constant ``i1`` flag. If it is ``false``, a
zero first argument produces a defined result; if it is ``true``, the
result for a zero first argument is ``undef``. Historically some
architectures did not provide a defined result for zero values as
efficiently, and many algorithms are now predicated on avoiding
zero-value inputs.
				11958
				11959	Semantics:
				11960	""""""""""
				11961
				11962	The '``llvm.ctlz``' intrinsic counts the leading (most significant)
				11963	zeros in a variable, or within each element of the vector. If
				11964	``src == 0`` then the result is the size in bits of the type of ``src``
				11965	if ``is_zero_undef == 0`` and ``undef`` otherwise. For example,
				11966	``llvm.ctlz(i32 2) = 30``.
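
Examples:
"""""""""

The sketch below illustrates the effect of the ``is_zero_undef`` flag on a
zero input, following the semantics stated above. The function name
``@ctlz_example`` is hypothetical:

.. code-block:: llvm

      declare i32 @llvm.ctlz.i32(i32, i1)

      ; @ctlz_example is a hypothetical function used only for illustration.
      define i32 @ctlz_example() {
        %a = call i32 @llvm.ctlz.i32(i32 2, i1 false)  ; 30
        %b = call i32 @llvm.ctlz.i32(i32 0, i1 false)  ; 32, the bit width of i32
        %c = call i32 @llvm.ctlz.i32(i32 0, i1 true)   ; undef
        ret i32 %a
      }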
				11967
				11968	'``llvm.cttz.*``' Intrinsic
				11969	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				11970
				11971	Syntax:
				11972	"""""""
				11973
				11974	This is an overloaded intrinsic. You can use ``llvm.cttz`` on any
				11975	integer bit width, or any vector of integer elements. Not all targets
				11976	support all bit widths or vector types, however.
				11977
				11978	::
				11979
      declare i8 @llvm.cttz.i8(i8 <src>, i1 <is_zero_undef>)
      declare i16 @llvm.cttz.i16(i16 <src>, i1 <is_zero_undef>)
      declare i32 @llvm.cttz.i32(i32 <src>, i1 <is_zero_undef>)
      declare i64 @llvm.cttz.i64(i64 <src>, i1 <is_zero_undef>)
      declare i256 @llvm.cttz.i256(i256 <src>, i1 <is_zero_undef>)
      declare <2 x i32> @llvm.cttz.v2i32(<2 x i32> <src>, i1 <is_zero_undef>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11986
				11987	Overview:
				11988	"""""""""
				11989
				11990	The '``llvm.cttz``' family of intrinsic functions counts the number of
				11991	trailing zeros.
				11992
				11993	Arguments:
				11994	""""""""""
				11995
				11996	The first argument is the value to be counted. This argument may be of
Hal Finkel	5dd8278	2015-01-05 04:05:21 +0000	[diff] [blame]	11997	any integer type, or a vector with integer element type. The return
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	11998	type must match the first argument type.
				11999
The second argument must be a constant ``i1`` flag. If it is ``false``, a
zero first argument produces a defined result; if it is ``true``, the
result for a zero first argument is ``undef``. Historically some
architectures did not provide a defined result for zero values as
efficiently, and many algorithms are now predicated on avoiding
zero-value inputs.
				12005
				12006	Semantics:
				12007	""""""""""
				12008
				12009	The '``llvm.cttz``' intrinsic counts the trailing (least significant)
				12010	zeros in a variable, or within each element of a vector. If ``src == 0``
				12011	then the result is the size in bits of the type of ``src`` if
				12012	``is_zero_undef == 0`` and ``undef`` otherwise. For example,
				12013	``llvm.cttz(2) = 1``.
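
Examples:
"""""""""

A minimal sketch of a non-zero and a zero input with ``is_zero_undef`` set to
``false``; the function name ``@cttz_example`` is hypothetical:

.. code-block:: llvm

      declare i32 @llvm.cttz.i32(i32, i1)

      ; @cttz_example is a hypothetical function used only for illustration.
      define i32 @cttz_example() {
        %a = call i32 @llvm.cttz.i32(i32 8, i1 false)  ; 0b1000 -> 3
        %b = call i32 @llvm.cttz.i32(i32 0, i1 false)  ; 32, the bit width of i32
        ret i32 %a
      }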
				12014
Philip Reames	34843ae	2015-03-05 05:55:55 +0000	[diff] [blame]	12015	.. _int_overflow:
				12016
Sanjay Patel	c71adc8	2018-07-16 22:59:31 +0000	[diff] [blame]	12017	'``llvm.fshl.*``' Intrinsic
				12018	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12019
				12020	Syntax:
				12021	"""""""
				12022
				12023	This is an overloaded intrinsic. You can use ``llvm.fshl`` on any
				12024	integer bit width or any vector of integer elements. Not all targets
				12025	support all bit widths or vector types, however.
				12026
				12027	::
				12028
				12029	declare i8 @llvm.fshl.i8 (i8 %a, i8 %b, i8 %c)
				12030	declare i67 @llvm.fshl.i67(i67 %a, i67 %b, i67 %c)
				12031	declare <2 x i32> @llvm.fshl.v2i32(<2 x i32> %a, <2 x i32> %b, <2 x i32> %c)
				12032
				12033	Overview:
				12034	"""""""""
				12035
				12036	The '``llvm.fshl``' family of intrinsic functions performs a funnel shift left:
				12037	the first two values are concatenated as { %a : %b } (%a is the most significant
				12038	bits of the wide value), the combined value is shifted left, and the most
				12039	significant bits are extracted to produce a result that is the same size as the
				12040	original arguments. If the first 2 arguments are identical, this is equivalent
				12041	to a rotate left operation. For vector types, the operation occurs for each
				12042	element of the vector. The shift argument is treated as an unsigned amount
				12043	modulo the element size of the arguments.
				12044
				12045	Arguments:
				12046	""""""""""
				12047
				12048	The first two arguments are the values to be concatenated. The third
				12049	argument is the shift amount. The arguments may be any integer type or a
				12050	vector with integer element type. All arguments and the return value must
				12051	have the same type.
				12052
				12053	Example:
				12054	""""""""
				12055
				12056	.. code-block:: text
				12057
				12058	%r = call i8 @llvm.fshl.i8(i8 %x, i8 %y, i8 %z) ; %r = i8: msb_extract((concat(x, y) << (z % 8)), 8)
				12059	%r = call i8 @llvm.fshl.i8(i8 255, i8 0, i8 15) ; %r = i8: 128 (0b10000000)
				12060	%r = call i8 @llvm.fshl.i8(i8 15, i8 15, i8 11) ; %r = i8: 120 (0b01111000)
				12061	%r = call i8 @llvm.fshl.i8(i8 0, i8 255, i8 8) ; %r = i8: 0 (0b00000000)
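
As the overview notes, passing the same value as the first two arguments
gives a rotate left. One way this might be wrapped is sketched below; the
helper name ``@rotl8`` is hypothetical and not part of LLVM:

.. code-block:: llvm

      declare i8 @llvm.fshl.i8(i8, i8, i8)

      ; Hypothetical helper: rotate %x left by %n (the amount is taken modulo 8).
      define i8 @rotl8(i8 %x, i8 %n) {
        %r = call i8 @llvm.fshl.i8(i8 %x, i8 %x, i8 %n)
        ret i8 %r
      }
      ; For example, @rotl8(i8 129, i8 1) yields 3 (0b10000001 -> 0b00000011).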
				12062
				12063	'``llvm.fshr.*``' Intrinsic
				12064	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12065
				12066	Syntax:
				12067	"""""""
				12068
				12069	This is an overloaded intrinsic. You can use ``llvm.fshr`` on any
				12070	integer bit width or any vector of integer elements. Not all targets
				12071	support all bit widths or vector types, however.
				12072
				12073	::
				12074
				12075	declare i8 @llvm.fshr.i8 (i8 %a, i8 %b, i8 %c)
				12076	declare i67 @llvm.fshr.i67(i67 %a, i67 %b, i67 %c)
				12077	declare <2 x i32> @llvm.fshr.v2i32(<2 x i32> %a, <2 x i32> %b, <2 x i32> %c)
				12078
				12079	Overview:
				12080	"""""""""
				12081
				12082	The '``llvm.fshr``' family of intrinsic functions performs a funnel shift right:
				12083	the first two values are concatenated as { %a : %b } (%a is the most significant
				12084	bits of the wide value), the combined value is shifted right, and the least
				12085	significant bits are extracted to produce a result that is the same size as the
				12086	original arguments. If the first 2 arguments are identical, this is equivalent
				12087	to a rotate right operation. For vector types, the operation occurs for each
				12088	element of the vector. The shift argument is treated as an unsigned amount
				12089	modulo the element size of the arguments.
				12090
				12091	Arguments:
				12092	""""""""""
				12093
				12094	The first two arguments are the values to be concatenated. The third
				12095	argument is the shift amount. The arguments may be any integer type or a
				12096	vector with integer element type. All arguments and the return value must
				12097	have the same type.
				12098
				12099	Example:
				12100	""""""""
				12101
				12102	.. code-block:: text
				12103
				12104	%r = call i8 @llvm.fshr.i8(i8 %x, i8 %y, i8 %z) ; %r = i8: lsb_extract((concat(x, y) >> (z % 8)), 8)
				12105	%r = call i8 @llvm.fshr.i8(i8 255, i8 0, i8 15) ; %r = i8: 254 (0b11111110)
				12106	%r = call i8 @llvm.fshr.i8(i8 15, i8 15, i8 11) ; %r = i8: 225 (0b11100001)
				12107	%r = call i8 @llvm.fshr.i8(i8 0, i8 255, i8 8) ; %r = i8: 255 (0b11111111)
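
As the overview notes, passing the same value as the first two arguments
gives a rotate right. One way this might be wrapped is sketched below; the
helper name ``@rotr8`` is hypothetical and not part of LLVM:

.. code-block:: llvm

      declare i8 @llvm.fshr.i8(i8, i8, i8)

      ; Hypothetical helper: rotate %x right by %n (the amount is taken modulo 8).
      define i8 @rotr8(i8 %x, i8 %n) {
        %r = call i8 @llvm.fshr.i8(i8 %x, i8 %x, i8 %n)
        ret i8 %r
      }
      ; For example, @rotr8(i8 129, i8 1) yields 192 (0b10000001 -> 0b11000000).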
				12108
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12109	Arithmetic with Overflow Intrinsics
				12110	-----------------------------------
				12111
John Regehr	6a493f2	2016-05-12 20:55:09 +0000	[diff] [blame]	12112	LLVM provides intrinsics for fast arithmetic overflow checking.
				12113
				12114	Each of these intrinsics returns a two-element struct. The first
				12115	element of this struct contains the result of the corresponding
				12116	arithmetic operation modulo 2\ :sup:`n`\ , where n is the bit width of
				12117	the result. Therefore, for example, the first element of the struct
				12118	returned by ``llvm.sadd.with.overflow.i32`` is always the same as the
				12119	result of a 32-bit ``add`` instruction with the same operands, where
				12120	the ``add`` is not modified by an ``nsw`` or ``nuw`` flag.
				12121
The second element of the result is an ``i1`` that is 1 if the
arithmetic operation overflowed and 0 otherwise. For given operand
values ``A`` and ``B``, the operation overflows if, for any ``N`` larger
than the operands' width, ``ext(A op B) to iN`` is not equal to
``(ext(A) to iN) op (ext(B) to iN)``, where ``ext`` is ``sext`` for
signed overflow and ``zext`` for unsigned overflow, and ``op`` is the
underlying arithmetic operation.
				12129
				12130	The behavior of these intrinsics is well-defined for all argument
				12131	values.
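
To make the two result elements concrete, the following sketch uses ``i8``
operands. The function name ``@overflow_examples`` is hypothetical, and the
values in the comments were worked out by hand from the definition above:

.. code-block:: llvm

      declare {i8, i1} @llvm.sadd.with.overflow.i8(i8, i8)
      declare {i8, i1} @llvm.uadd.with.overflow.i8(i8, i8)
      declare {i8, i1} @llvm.usub.with.overflow.i8(i8, i8)
      declare {i8, i1} @llvm.smul.with.overflow.i8(i8, i8)

      ; @overflow_examples is a hypothetical function used only for illustration.
      define void @overflow_examples() {
        %s = call {i8, i1} @llvm.sadd.with.overflow.i8(i8 127, i8 1)   ; { i8 -128, i1 true }
        %n = call {i8, i1} @llvm.sadd.with.overflow.i8(i8 100, i8 27)  ; { i8 127, i1 false }
        %u = call {i8, i1} @llvm.uadd.with.overflow.i8(i8 255, i8 1)   ; { i8 0, i1 true }
        %d = call {i8, i1} @llvm.usub.with.overflow.i8(i8 0, i8 1)     ; { i8 -1 (255 unsigned), i1 true }
        %m = call {i8, i1} @llvm.smul.with.overflow.i8(i8 16, i8 8)    ; { i8 -128, i1 true }
        ret void
      }

In each case the first element is the wrapped result that the plain ``add``,
``sub`` or ``mul`` instruction would produce for the same operands.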
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12132
				12133	'``llvm.sadd.with.overflow.*``' Intrinsics
				12134	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12135
				12136	Syntax:
				12137	"""""""
				12138
				12139	This is an overloaded intrinsic. You can use ``llvm.sadd.with.overflow``
				12140	on any integer bit width.
				12141
				12142	::
				12143
				12144	declare {i16, i1} @llvm.sadd.with.overflow.i16(i16 %a, i16 %b)
				12145	declare {i32, i1} @llvm.sadd.with.overflow.i32(i32 %a, i32 %b)
				12146	declare {i64, i1} @llvm.sadd.with.overflow.i64(i64 %a, i64 %b)
				12147
				12148	Overview:
				12149	"""""""""
				12150
				12151	The '``llvm.sadd.with.overflow``' family of intrinsic functions perform
				12152	a signed addition of the two arguments, and indicate whether an overflow
				12153	occurred during the signed summation.
				12154
				12155	Arguments:
				12156	""""""""""
				12157
				12158	The arguments (%a and %b) and the first element of the result structure
				12159	may be of integer types of any bit width, but they must have the same
				12160	bit width. The second element of the result structure must be of type
				12161	``i1``. ``%a`` and ``%b`` are the two values that will undergo signed
				12162	addition.
				12163
				12164	Semantics:
				12165	""""""""""
				12166
The '``llvm.sadd.with.overflow``' family of intrinsic functions perform
a signed addition of the two arguments. They return a structure --- the
first element of which is the signed sum, and the second element
of which is a bit specifying if the signed addition resulted in an
overflow.
				12172
				12173	Examples:
				12174	"""""""""
				12175
				12176	.. code-block:: llvm
				12177
				12178	%res = call {i32, i1} @llvm.sadd.with.overflow.i32(i32 %a, i32 %b)
				12179	%sum = extractvalue {i32, i1} %res, 0
				12180	%obit = extractvalue {i32, i1} %res, 1
				12181	br i1 %obit, label %overflow, label %normal
				12182
				12183	'``llvm.uadd.with.overflow.*``' Intrinsics
				12184	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12185
				12186	Syntax:
				12187	"""""""
				12188
				12189	This is an overloaded intrinsic. You can use ``llvm.uadd.with.overflow``
				12190	on any integer bit width.
				12191
				12192	::
				12193
				12194	declare {i16, i1} @llvm.uadd.with.overflow.i16(i16 %a, i16 %b)
				12195	declare {i32, i1} @llvm.uadd.with.overflow.i32(i32 %a, i32 %b)
				12196	declare {i64, i1} @llvm.uadd.with.overflow.i64(i64 %a, i64 %b)
				12197
				12198	Overview:
				12199	"""""""""
				12200
				12201	The '``llvm.uadd.with.overflow``' family of intrinsic functions perform
				12202	an unsigned addition of the two arguments, and indicate whether a carry
				12203	occurred during the unsigned summation.
				12204
				12205	Arguments:
				12206	""""""""""
				12207
				12208	The arguments (%a and %b) and the first element of the result structure
				12209	may be of integer types of any bit width, but they must have the same
				12210	bit width. The second element of the result structure must be of type
				12211	``i1``. ``%a`` and ``%b`` are the two values that will undergo unsigned
				12212	addition.
				12213
				12214	Semantics:
				12215	""""""""""
				12216
				12217	The '``llvm.uadd.with.overflow``' family of intrinsic functions perform
Dmitri Gribenko	e813112	2013-01-19 20:34:20 +0000	[diff] [blame]	12218	an unsigned addition of the two arguments. They return a structure --- the
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12219	first element of which is the sum, and the second element of which is a
				12220	bit specifying if the unsigned summation resulted in a carry.
				12221
				12222	Examples:
				12223	"""""""""
				12224
				12225	.. code-block:: llvm
				12226
				12227	%res = call {i32, i1} @llvm.uadd.with.overflow.i32(i32 %a, i32 %b)
				12228	%sum = extractvalue {i32, i1} %res, 0
				12229	%obit = extractvalue {i32, i1} %res, 1
				12230	br i1 %obit, label %carry, label %normal
				12231
				12232	'``llvm.ssub.with.overflow.*``' Intrinsics
				12233	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12234
				12235	Syntax:
				12236	"""""""
				12237
				12238	This is an overloaded intrinsic. You can use ``llvm.ssub.with.overflow``
				12239	on any integer bit width.
				12240
				12241	::
				12242
				12243	declare {i16, i1} @llvm.ssub.with.overflow.i16(i16 %a, i16 %b)
				12244	declare {i32, i1} @llvm.ssub.with.overflow.i32(i32 %a, i32 %b)
				12245	declare {i64, i1} @llvm.ssub.with.overflow.i64(i64 %a, i64 %b)
				12246
				12247	Overview:
				12248	"""""""""
				12249
				12250	The '``llvm.ssub.with.overflow``' family of intrinsic functions perform
				12251	a signed subtraction of the two arguments, and indicate whether an
				12252	overflow occurred during the signed subtraction.
				12253
				12254	Arguments:
				12255	""""""""""
				12256
				12257	The arguments (%a and %b) and the first element of the result structure
				12258	may be of integer types of any bit width, but they must have the same
				12259	bit width. The second element of the result structure must be of type
				12260	``i1``. ``%a`` and ``%b`` are the two values that will undergo signed
				12261	subtraction.
				12262
				12263	Semantics:
				12264	""""""""""
				12265
The '``llvm.ssub.with.overflow``' family of intrinsic functions perform
a signed subtraction of the two arguments. They return a structure --- the
first element of which is the difference, and the second element of
which is a bit specifying if the signed subtraction resulted in an
overflow.
				12271
				12272	Examples:
				12273	"""""""""
				12274
				12275	.. code-block:: llvm
				12276
      %res = call {i32, i1} @llvm.ssub.with.overflow.i32(i32 %a, i32 %b)
      %diff = extractvalue {i32, i1} %res, 0
      %obit = extractvalue {i32, i1} %res, 1
      br i1 %obit, label %overflow, label %normal
				12281
				12282	'``llvm.usub.with.overflow.*``' Intrinsics
				12283	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12284
				12285	Syntax:
				12286	"""""""
				12287
				12288	This is an overloaded intrinsic. You can use ``llvm.usub.with.overflow``
				12289	on any integer bit width.
				12290
				12291	::
				12292
				12293	declare {i16, i1} @llvm.usub.with.overflow.i16(i16 %a, i16 %b)
				12294	declare {i32, i1} @llvm.usub.with.overflow.i32(i32 %a, i32 %b)
				12295	declare {i64, i1} @llvm.usub.with.overflow.i64(i64 %a, i64 %b)
				12296
				12297	Overview:
				12298	"""""""""
				12299
				12300	The '``llvm.usub.with.overflow``' family of intrinsic functions perform
				12301	an unsigned subtraction of the two arguments, and indicate whether an
				12302	overflow occurred during the unsigned subtraction.
				12303
				12304	Arguments:
				12305	""""""""""
				12306
				12307	The arguments (%a and %b) and the first element of the result structure
				12308	may be of integer types of any bit width, but they must have the same
				12309	bit width. The second element of the result structure must be of type
				12310	``i1``. ``%a`` and ``%b`` are the two values that will undergo unsigned
				12311	subtraction.
				12312
				12313	Semantics:
				12314	""""""""""
				12315
The '``llvm.usub.with.overflow``' family of intrinsic functions perform
an unsigned subtraction of the two arguments. They return a structure ---
the first element of which is the difference, and the second element of
which is a bit specifying if the unsigned subtraction resulted in an
overflow.
				12321
				12322	Examples:
				12323	"""""""""
				12324
				12325	.. code-block:: llvm
				12326
      %res = call {i32, i1} @llvm.usub.with.overflow.i32(i32 %a, i32 %b)
      %diff = extractvalue {i32, i1} %res, 0
      %obit = extractvalue {i32, i1} %res, 1
      br i1 %obit, label %overflow, label %normal
				12331
				12332	'``llvm.smul.with.overflow.*``' Intrinsics
				12333	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12334
				12335	Syntax:
				12336	"""""""
				12337
				12338	This is an overloaded intrinsic. You can use ``llvm.smul.with.overflow``
				12339	on any integer bit width.
				12340
				12341	::
				12342
				12343	declare {i16, i1} @llvm.smul.with.overflow.i16(i16 %a, i16 %b)
				12344	declare {i32, i1} @llvm.smul.with.overflow.i32(i32 %a, i32 %b)
				12345	declare {i64, i1} @llvm.smul.with.overflow.i64(i64 %a, i64 %b)
				12346
				12347	Overview:
				12348	"""""""""
				12349
				12350	The '``llvm.smul.with.overflow``' family of intrinsic functions perform
				12351	a signed multiplication of the two arguments, and indicate whether an
				12352	overflow occurred during the signed multiplication.
				12353
				12354	Arguments:
				12355	""""""""""
				12356
				12357	The arguments (%a and %b) and the first element of the result structure
				12358	may be of integer types of any bit width, but they must have the same
				12359	bit width. The second element of the result structure must be of type
				12360	``i1``. ``%a`` and ``%b`` are the two values that will undergo signed
				12361	multiplication.
				12362
				12363	Semantics:
				12364	""""""""""
				12365
The '``llvm.smul.with.overflow``' family of intrinsic functions perform
a signed multiplication of the two arguments. They return a structure ---
the first element of which is the product, and the second element
of which is a bit specifying if the signed multiplication resulted in an
overflow.
				12371
				12372	Examples:
				12373	"""""""""
				12374
				12375	.. code-block:: llvm
				12376
      %res = call {i32, i1} @llvm.smul.with.overflow.i32(i32 %a, i32 %b)
      %prod = extractvalue {i32, i1} %res, 0
      %obit = extractvalue {i32, i1} %res, 1
      br i1 %obit, label %overflow, label %normal
				12381
				12382	'``llvm.umul.with.overflow.*``' Intrinsics
				12383	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12384
				12385	Syntax:
				12386	"""""""
				12387
				12388	This is an overloaded intrinsic. You can use ``llvm.umul.with.overflow``
				12389	on any integer bit width.
				12390
				12391	::
				12392
				12393	declare {i16, i1} @llvm.umul.with.overflow.i16(i16 %a, i16 %b)
				12394	declare {i32, i1} @llvm.umul.with.overflow.i32(i32 %a, i32 %b)
				12395	declare {i64, i1} @llvm.umul.with.overflow.i64(i64 %a, i64 %b)
				12396
				12397	Overview:
				12398	"""""""""
				12399
The '``llvm.umul.with.overflow``' family of intrinsic functions perform
an unsigned multiplication of the two arguments, and indicate whether an
overflow occurred during the unsigned multiplication.
				12403
				12404	Arguments:
				12405	""""""""""
				12406
				12407	The arguments (%a and %b) and the first element of the result structure
				12408	may be of integer types of any bit width, but they must have the same
				12409	bit width. The second element of the result structure must be of type
				12410	``i1``. ``%a`` and ``%b`` are the two values that will undergo unsigned
				12411	multiplication.
				12412
				12413	Semantics:
				12414	""""""""""
				12415
The '``llvm.umul.with.overflow``' family of intrinsic functions perform
an unsigned multiplication of the two arguments. They return a structure ---
the first element of which is the product, and the second
element of which is a bit specifying if the unsigned multiplication
resulted in an overflow.
				12421
				12422	Examples:
				12423	"""""""""
				12424
				12425	.. code-block:: llvm
				12426
      %res = call {i32, i1} @llvm.umul.with.overflow.i32(i32 %a, i32 %b)
      %prod = extractvalue {i32, i1} %res, 0
      %obit = extractvalue {i32, i1} %res, 1
      br i1 %obit, label %overflow, label %normal
				12431
				12432	Specialised Arithmetic Intrinsics
				12433	---------------------------------
				12434
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12435	'``llvm.canonicalize.*``' Intrinsic
				12436	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12437
				12438	Syntax:
				12439	"""""""
				12440
				12441	::
				12442
				12443	declare float @llvm.canonicalize.f32(float %a)
				12444	declare double @llvm.canonicalize.f64(double %b)
				12445
				12446	Overview:
				12447	"""""""""
				12448
The '``llvm.canonicalize.*``' intrinsic returns the platform-specific canonical
encoding of a floating-point number. This canonicalization is useful for
implementing certain numeric primitives such as ``frexp``. The canonical encoding is
defined by IEEE-754-2008 to be:
				12453
				12454	::
				12455
				12456	2.1.8 canonical encoding: The preferred encoding of a floating-point
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	12457	representation in a format. Applied to declets, significands of finite
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12458	numbers, infinities, and NaNs, especially in decimal formats.
				12459
				12460	This operation can also be considered equivalent to the IEEE-754-2008
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	12461	conversion of a floating-point value to the same format. NaNs are handled
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12462	according to section 6.2.
				12463
				12464	Examples of non-canonical encodings:
				12465
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	12466	- x87 pseudo denormals, pseudo NaNs, pseudo Infinity, Unnormals. These are
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12467	converted to a canonical representation per hardware-specific protocol.
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12468	- Many normal decimal floating-point numbers have non-canonical alternative
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12469	encodings.
				12470	- Some machines, like GPUs or ARMv7 NEON, do not support subnormal values.
Sanjay Patel	cc33096	2016-02-24 23:44:19 +0000	[diff] [blame]	12471	These are treated as non-canonical encodings of zero and will be flushed to
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12472	a zero of the same sign by this operation.
				12473
				12474	Note that per IEEE-754-2008 6.2, systems that support signaling NaNs with
				12475	default exception handling must signal an invalid exception, and produce a
				12476	quiet NaN result.
				12477
				12478	This function should always be implementable as multiplication by 1.0, provided
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	12479	that the compiler does not constant fold the operation. Likewise, division by
				12480	1.0 and ``llvm.minnum(x, x)`` are possible implementations. Addition with
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12481	-0.0 is also sufficient provided that the rounding mode is not -Infinity.
				12482
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	12483	``@llvm.canonicalize`` must preserve the equality relation. That is:
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12484
				12485	- ``(@llvm.canonicalize(x) == x)`` is equivalent to ``(x == x)``
- ``(@llvm.canonicalize(x) == @llvm.canonicalize(y))`` is equivalent to
  ``(x == y)``
				12488
				12489	Additionally, the sign of zero must be conserved:
				12490	``@llvm.canonicalize(-0.0) = -0.0`` and ``@llvm.canonicalize(+0.0) = +0.0``
				12491
				12492	The payload bits of a NaN must be conserved, with two exceptions.
				12493	First, environments which use only a single canonical representation of NaN
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	12494	must perform said canonicalization. Second, SNaNs must be quieted per the
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12495	usual methods.
				12496
				12497	The canonicalization operation may be optimized away if:
				12498
Sean Silva	a119032	2015-08-06 22:56:48 +0000	[diff] [blame]	12499	- The input is known to be canonical. For example, it was produced by a
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12500	floating-point operation that is required by the standard to be canonical.
				12501	- The result is consumed only by (or fused with) other floating-point
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12502	operations. That is, the bits of the floating-point value are not examined.
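
Examples:
"""""""""

A minimal sketch of a call to the intrinsic; the function name
``@canonicalize_example`` is hypothetical. Per the description above, on a
target that does not support subnormal values a subnormal ``%x`` would be
returned as a zero of the same sign:

.. code-block:: llvm

      declare float @llvm.canonicalize.f32(float)

      ; @canonicalize_example is a hypothetical function used only for illustration.
      define float @canonicalize_example(float %x) {
        %c = call float @llvm.canonicalize.f32(float %x)
        ret float %c
      }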
Owen Anderson	1056a92	2015-07-11 07:01:27 +0000	[diff] [blame]	12503
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12504	'``llvm.fmuladd.*``' Intrinsic
				12505	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12506
				12507	Syntax:
				12508	"""""""
				12509
				12510	::
				12511
				12512	declare float @llvm.fmuladd.f32(float %a, float %b, float %c)
				12513	declare double @llvm.fmuladd.f64(double %a, double %b, double %c)
				12514
				12515	Overview:
				12516	"""""""""
				12517
				12518	The '``llvm.fmuladd.*``' intrinsic functions represent multiply-add
Lang Hames	045f439	2013-01-17 00:00:49 +0000	[diff] [blame]	12519	expressions that can be fused if the code generator determines that (a) the
				12520	target instruction set has support for a fused operation, and (b) that the
				12521	fused operation is more efficient than the equivalent, separate pair of mul
				12522	and add instructions.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12523
				12524	Arguments:
				12525	""""""""""
				12526
				12527	The '``llvm.fmuladd.*``' intrinsics each take three arguments: two
				12528	multiplicands, a and b, and an addend c.
				12529
				12530	Semantics:
				12531	""""""""""
				12532
				12533	The expression:
				12534
				12535	::
				12536
      %0 = call float @llvm.fmuladd.f32(float %a, float %b, float %c)
				12538
is equivalent to the expression ``a * b + c``, except that rounding will
not be performed between the multiplication and addition steps if the
code generator fuses the operations. Fusion is not guaranteed, even if
the target platform supports it. If a fused multiply-add is required, the
corresponding '``llvm.fma.*``' intrinsic function should be used
instead. Like '``llvm.fma.*``', this intrinsic never sets errno.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12545
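As a sketch of the two forms a call to ``llvm.fmuladd.f32`` may behave like,
the functions below spell out the unfused and the fused computation
explicitly. The function names ``@muladd_unfused`` and ``@muladd_fused`` are
hypothetical; which form the code generator picks is target dependent:

.. code-block:: llvm

      declare float @llvm.fma.f32(float, float, float)

      ; Unfused form: the product is rounded before the addition.
      define float @muladd_unfused(float %a, float %b, float %c) {
        %mul = fmul float %a, %b
        %sum = fadd float %mul, %c
        ret float %sum
      }

      ; Fused form: a * b + c is computed with a single rounding.
      define float @muladd_fused(float %a, float %b, float %c) {
        %r = call float @llvm.fma.f32(float %a, float %b, float %c)
        ret float %r
      }
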
				12546	Examples:
				12547	"""""""""
				12548
				12549	.. code-block:: llvm
				12550
Tim Northover	675a096	2014-06-13 14:24:23 +0000	[diff] [blame]	12551	%r2 = call float @llvm.fmuladd.f32(float %a, float %b, float %c) ; yields float:r2 = (a * b) + c
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12552
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12553
				12554	Experimental Vector Reduction Intrinsics
				12555	----------------------------------------
				12556
				12557	Horizontal reductions of vectors can be expressed using the following
				12558	intrinsics. Each one takes a vector operand as an input and applies its
				12559	respective operation across all elements of the vector, returning a single
				12560	scalar result of the same element type.
				12561
				12562
				12563	'``llvm.experimental.vector.reduce.add.*``' Intrinsic
				12564	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12565
				12566	Syntax:
				12567	"""""""
				12568
				12569	::
				12570
				12571	declare i32 @llvm.experimental.vector.reduce.add.i32.v4i32(<4 x i32> %a)
				12572	declare i64 @llvm.experimental.vector.reduce.add.i64.v2i64(<2 x i64> %a)
				12573
				12574	Overview:
				12575	"""""""""
				12576
				12577	The '``llvm.experimental.vector.reduce.add.*``' intrinsics do an integer ``ADD``
				12578	reduction of a vector, returning the result as a scalar. The return type matches
				12579	the element-type of the vector input.
				12580
				12581	Arguments:
				12582	""""""""""
				12583	The argument to this intrinsic must be a vector of integer values.
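
Examples:
"""""""""

A minimal sketch of a call, followed by one possible scalarized equivalent.
The value names are hypothetical. Because integer addition wraps and is
associative, the order in which the lanes are combined does not affect the
result.

.. code-block:: llvm

      %sum = call i32 @llvm.experimental.vector.reduce.add.i32.v4i32(<4 x i32> %v)

      ; One possible scalarized equivalent:
      %e0 = extractelement <4 x i32> %v, i32 0
      %e1 = extractelement <4 x i32> %v, i32 1
      %e2 = extractelement <4 x i32> %v, i32 2
      %e3 = extractelement <4 x i32> %v, i32 3
      %s01 = add i32 %e0, %e1
      %s23 = add i32 %e2, %e3
      %sum.scalar = add i32 %s01, %s23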
				12584
				12585	'``llvm.experimental.vector.reduce.fadd.*``' Intrinsic
				12586	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12587
				12588	Syntax:
				12589	"""""""
				12590
				12591	::
				12592
				12593	declare float @llvm.experimental.vector.reduce.fadd.f32.v4f32(float %acc, <4 x float> %a)
				12594	declare double @llvm.experimental.vector.reduce.fadd.f64.v2f64(double %acc, <2 x double> %a)
				12595
				12596	Overview:
				12597	"""""""""
				12598
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12599	The '``llvm.experimental.vector.reduce.fadd.*``' intrinsics do a floating-point
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12600	``ADD`` reduction of a vector, returning the result as a scalar. The return type
				12601	matches the element-type of the vector input.
				12602
				12603	If the intrinsic call has fast-math flags, then the reduction will not preserve
				12604	the associativity of an equivalent scalarized counterpart. If it does not have
				12605	fast-math flags, then the reduction will be ordered, implying that the
				12606	operation respects the associativity of a scalarized reduction.
				12607
				12608
				12609	Arguments:
				12610	""""""""""
				12611	The first argument to this intrinsic is a scalar accumulator value, which is
				12612	only used when there are no fast-math flags attached. This argument may be undef
				12613	when fast-math flags are used.
				12614
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12615	The second argument must be a vector of floating-point values.
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12616
				12617	Examples:
				12618	"""""""""
				12619
				12620	.. code-block:: llvm
				12621
				12622	%fast = call fast float @llvm.experimental.vector.reduce.fadd.f32.v4f32(float undef, <4 x float> %input) ; fast reduction
				12623	%ord = call float @llvm.experimental.vector.reduce.fadd.f32.v4f32(float %acc, <4 x float> %input) ; ordered reduction
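
The ordered reduction above can be pictured as the following scalar sequence.
This is a sketch only: it assumes that the lanes are folded into ``%acc`` one
at a time from the lowest index to the highest, and the intermediate value
names are hypothetical.

.. code-block:: llvm

      ; Assumed ordered evaluation: (((%acc + e0) + e1) + e2) + e3
      %e0 = extractelement <4 x float> %input, i32 0
      %t0 = fadd float %acc, %e0
      %e1 = extractelement <4 x float> %input, i32 1
      %t1 = fadd float %t0, %e1
      %e2 = extractelement <4 x float> %input, i32 2
      %t2 = fadd float %t1, %e2
      %e3 = extractelement <4 x float> %input, i32 3
      %ord.scalar = fadd float %t2, %e3

With fast-math flags the lanes may be combined in any order, for example
pairwise, so the result can differ from this sequence in the low-order bits.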
				12624
				12625
				12626	'``llvm.experimental.vector.reduce.mul.*``' Intrinsic
				12627	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12628
				12629	Syntax:
				12630	"""""""
				12631
				12632	::
				12633
				12634	declare i32 @llvm.experimental.vector.reduce.mul.i32.v4i32(<4 x i32> %a)
				12635	declare i64 @llvm.experimental.vector.reduce.mul.i64.v2i64(<2 x i64> %a)
				12636
				12637	Overview:
				12638	"""""""""
				12639
				12640	The '``llvm.experimental.vector.reduce.mul.*``' intrinsics do an integer ``MUL``
				12641	reduction of a vector, returning the result as a scalar. The return type matches
				12642	the element-type of the vector input.
				12643
				12644	Arguments:
				12645	""""""""""
				12646	The argument to this intrinsic must be a vector of integer values.
				12647
				12648	'``llvm.experimental.vector.reduce.fmul.*``' Intrinsic
				12649	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12650
				12651	Syntax:
				12652	"""""""
				12653
				12654	::
				12655
				12656	declare float @llvm.experimental.vector.reduce.fmul.f32.v4f32(float %acc, <4 x float> %a)
				12657	declare double @llvm.experimental.vector.reduce.fmul.f64.v2f64(double %acc, <2 x double> %a)
				12658
				12659	Overview:
				12660	"""""""""
				12661
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12662	The '``llvm.experimental.vector.reduce.fmul.*``' intrinsics do a floating-point
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12663	``MUL`` reduction of a vector, returning the result as a scalar. The return type
				12664	matches the element-type of the vector input.
				12665
				12666	If the intrinsic call has fast-math flags, then the reduction will not preserve
				12667	the associativity of an equivalent scalarized counterpart. If it does not have
				12668	fast-math flags, then the reduction will be ordered, implying that the
				12669	operation respects the associativity of a scalarized reduction.
				12670
				12671
				12672	Arguments:
				12673	""""""""""
				12674	The first argument to this intrinsic is a scalar accumulator value, which is
				12675	only used when there are no fast-math flags attached. This argument may be undef
				12676	when fast-math flags are used.
				12677
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12678	The second argument must be a vector of floating-point values.
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12679
				12680	Examples:
				12681	"""""""""
				12682
				12683	.. code-block:: llvm
				12684
				12685	%fast = call fast float @llvm.experimental.vector.reduce.fmul.f32.v4f32(float undef, <4 x float> %input) ; fast reduction
				12686	%ord = call float @llvm.experimental.vector.reduce.fmul.f32.v4f32(float %acc, <4 x float> %input) ; ordered reduction
				12687
				12688	'``llvm.experimental.vector.reduce.and.*``' Intrinsic
				12689	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12690
				12691	Syntax:
				12692	"""""""
				12693
				12694	::
				12695
				12696	declare i32 @llvm.experimental.vector.reduce.and.i32.v4i32(<4 x i32> %a)
				12697
				12698	Overview:
				12699	"""""""""
				12700
				12701	The '``llvm.experimental.vector.reduce.and.*``' intrinsics do a bitwise ``AND``
				12702	reduction of a vector, returning the result as a scalar. The return type matches
				12703	the element-type of the vector input.
				12704
				12705	Arguments:
				12706	""""""""""
				12707	The argument to this intrinsic must be a vector of integer values.
				12708
				12709	'``llvm.experimental.vector.reduce.or.*``' Intrinsic
				12710	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12711
				12712	Syntax:
				12713	"""""""
				12714
				12715	::
				12716
				12717	declare i32 @llvm.experimental.vector.reduce.or.i32.v4i32(<4 x i32> %a)
				12718
				12719	Overview:
				12720	"""""""""
				12721
				12722	The '``llvm.experimental.vector.reduce.or.*``' intrinsics do a bitwise ``OR`` reduction
				12723	of a vector, returning the result as a scalar. The return type matches the
				12724	element-type of the vector input.
				12725
				12726	Arguments:
				12727	""""""""""
				12728	The argument to this intrinsic must be a vector of integer values.
				12729
				12730	'``llvm.experimental.vector.reduce.xor.*``' Intrinsic
				12731	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12732
				12733	Syntax:
				12734	"""""""
				12735
				12736	::
				12737
				12738	declare i32 @llvm.experimental.vector.reduce.xor.i32.v4i32(<4 x i32> %a)
				12739
				12740	Overview:
				12741	"""""""""
				12742
				12743	The '``llvm.experimental.vector.reduce.xor.*``' intrinsics do a bitwise ``XOR``
				12744	reduction of a vector, returning the result as a scalar. The return type matches
				12745	the element-type of the vector input.
				12746
				12747	Arguments:
				12748	""""""""""
				12749	The argument to this intrinsic must be a vector of integer values.
				12750
				12751	'``llvm.experimental.vector.reduce.smax.*``' Intrinsic
				12752	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12753
				12754	Syntax:
				12755	"""""""
				12756
				12757	::
				12758
				12759	declare i32 @llvm.experimental.vector.reduce.smax.i32.v4i32(<4 x i32> %a)
				12760
				12761	Overview:
				12762	"""""""""
				12763
				12764	The '``llvm.experimental.vector.reduce.smax.*``' intrinsics do a signed integer
				12765	``MAX`` reduction of a vector, returning the result as a scalar. The return type
				12766	matches the element-type of the vector input.
				12767
				12768	Arguments:
				12769	""""""""""
				12770	The argument to this intrinsic must be a vector of integer values.
				12771
				12772	'``llvm.experimental.vector.reduce.smin.*``' Intrinsic
				12773	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12774
				12775	Syntax:
				12776	"""""""
				12777
				12778	::
				12779
				12780	declare i32 @llvm.experimental.vector.reduce.smin.i32.v4i32(<4 x i32> %a)
				12781
				12782	Overview:
				12783	"""""""""
				12784
				12785	The '``llvm.experimental.vector.reduce.smin.*``' intrinsics do a signed integer
				12786	``MIN`` reduction of a vector, returning the result as a scalar. The return type
				12787	matches the element-type of the vector input.
				12788
				12789	Arguments:
				12790	""""""""""
				12791	The argument to this intrinsic must be a vector of integer values.
				12792
				12793	'``llvm.experimental.vector.reduce.umax.*``' Intrinsic
				12794	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12795
				12796	Syntax:
				12797	"""""""
				12798
				12799	::
				12800
				12801	declare i32 @llvm.experimental.vector.reduce.umax.i32.v4i32(<4 x i32> %a)
				12802
				12803	Overview:
				12804	"""""""""
				12805
				12806	The '``llvm.experimental.vector.reduce.umax.*``' intrinsics do an unsigned
				12807	integer ``MAX`` reduction of a vector, returning the result as a scalar. The
				12808	return type matches the element-type of the vector input.
				12809
				12810	Arguments:
				12811	""""""""""
				12812	The argument to this intrinsic must be a vector of integer values.
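
Examples:
"""""""""

A small sketch contrasting the unsigned reduction with its signed counterpart
on the same constant input. The value names are hypothetical. The lane holding
``-1`` has all bits set, so it is the largest element when interpreted as
unsigned but not when interpreted as signed.

.. code-block:: llvm

      %umax = call i32 @llvm.experimental.vector.reduce.umax.i32.v4i32(<4 x i32> <i32 -1, i32 2, i32 0, i32 1>) ; yields i32 -1 (0xFFFFFFFF)
      %smax = call i32 @llvm.experimental.vector.reduce.smax.i32.v4i32(<4 x i32> <i32 -1, i32 2, i32 0, i32 1>) ; yields i32 2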
				12813
				12814	'``llvm.experimental.vector.reduce.umin.*``' Intrinsic
				12815	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12816
				12817	Syntax:
				12818	"""""""
				12819
				12820	::
				12821
				12822	declare i32 @llvm.experimental.vector.reduce.umin.i32.v4i32(<4 x i32> %a)
				12823
				12824	Overview:
				12825	"""""""""
				12826
				12827	The '``llvm.experimental.vector.reduce.umin.*``' intrinsics do an unsigned
				12828	integer ``MIN`` reduction of a vector, returning the result as a scalar. The
				12829	return type matches the element-type of the vector input.
				12830
				12831	Arguments:
				12832	""""""""""
				12833	The argument to this intrinsic must be a vector of integer values.
				12834
				12835	'``llvm.experimental.vector.reduce.fmax.*``' Intrinsic
				12836	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12837
				12838	Syntax:
				12839	"""""""
				12840
				12841	::
				12842
				12843	declare float @llvm.experimental.vector.reduce.fmax.f32.v4f32(<4 x float> %a)
				12844	declare double @llvm.experimental.vector.reduce.fmax.f64.v2f64(<2 x double> %a)
				12845
				12846	Overview:
				12847	"""""""""
				12848
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12849	The '``llvm.experimental.vector.reduce.fmax.*``' intrinsics do a floating-point
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12850	``MAX`` reduction of a vector, returning the result as a scalar. The return type
				12851	matches the element-type of the vector input.
				12852
				12853	If the intrinsic call has the ``nnan`` fast-math flag then the operation can
				12854	assume that NaNs are not present in the input vector.
				12855
				12856	Arguments:
				12857	""""""""""
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12858	The argument to this intrinsic must be a vector of floating-point values.
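
Examples:
"""""""""

A minimal sketch of the call with and without the ``nnan`` flag. The value
names are hypothetical.

.. code-block:: llvm

      %max = call float @llvm.experimental.vector.reduce.fmax.f32.v4f32(<4 x float> %v)
      %max.nnan = call nnan float @llvm.experimental.vector.reduce.fmax.f32.v4f32(<4 x float> %v) ; may assume %v holds no NaNs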
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12859
				12860	'``llvm.experimental.vector.reduce.fmin.*``' Intrinsic
				12861	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12862
				12863	Syntax:
				12864	"""""""
				12865
				12866	::
				12867
				12868	declare float @llvm.experimental.vector.reduce.fmin.f32.v4f32(<4 x float> %a)
				12869	declare double @llvm.experimental.vector.reduce.fmin.f64.v2f64(<2 x double> %a)
				12870
				12871	Overview:
				12872	"""""""""
				12873
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12874	The '``llvm.experimental.vector.reduce.fmin.*``' intrinsics do a floating-point
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12875	``MIN`` reduction of a vector, returning the result as a scalar. The return type
				12876	matches the element-type of the vector input.
				12877
				12878	If the intrinsic call has the ``nnan`` fast-math flag then the operation can
				12879	assume that NaNs are not present in the input vector.
				12880
				12881	Arguments:
				12882	""""""""""
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12883	The argument to this intrinsic must be a vector of floating-point values.
Amara Emerson	cf9daa3	2017-05-09 10:43:25 +0000	[diff] [blame]	12884
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12885	Half Precision Floating-Point Intrinsics
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12886	----------------------------------------
				12887
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12888	For most target platforms, half precision floating-point is a
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12889	storage-only format. This means that it is a dense encoding (in memory)
				12890	but does not support computation in the format.
				12891
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12892	Code must therefore first load the half-precision floating-point
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12893	value as an ``i16``, then convert it to float with
				12894	:ref:`llvm.convert.from.fp16 <int_convert_from_fp16>`. Computation can
				12895	then be performed on the float value (including extending it to double,
				12896	etc.). To store the value back to memory, it is first converted to float
				12897	if needed, then converted to ``i16`` with
				12898	:ref:`llvm.convert.to.fp16 <int_convert_to_fp16>`, and finally stored as an
				12899	``i16`` value.
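
The load, convert, compute, convert, store sequence described above can be
sketched as a small module. The global ``@x`` and the function
``@increment_half`` are hypothetical names chosen for illustration.

.. code-block:: llvm

      @x = global i16 0, align 2              ; hypothetical half-precision storage

      declare float @llvm.convert.from.fp16.f32(i16)
      declare i16 @llvm.convert.to.fp16.f32(float)

      define void @increment_half() {
        %h = load i16, i16* @x, align 2       ; load the raw 16-bit encoding
        %f = call float @llvm.convert.from.fp16.f32(i16 %h)
        %sum = fadd float %f, 1.000000e+00    ; compute in single precision
        %r = call i16 @llvm.convert.to.fp16.f32(float %sum)
        store i16 %r, i16* @x, align 2        ; store the 16-bit encoding back
        ret void
      }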
				12900
				12901	.. _int_convert_to_fp16:
				12902
				12903	'``llvm.convert.to.fp16``' Intrinsic
				12904	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12905
				12906	Syntax:
				12907	"""""""
				12908
				12909	::
				12910
Tim Northover	fd7e424	2014-07-17 10:51:23 +0000	[diff] [blame]	12911	declare i16 @llvm.convert.to.fp16.f32(float %a)
				12912	declare i16 @llvm.convert.to.fp16.f64(double %a)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12913
				12914	Overview:
				12915	"""""""""
				12916
Tim Northover	fd7e424	2014-07-17 10:51:23 +0000	[diff] [blame]	12917	The '``llvm.convert.to.fp16``' intrinsic function performs a conversion from a
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12918	conventional floating-point type to half precision floating-point format.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12919
				12920	Arguments:
				12921	""""""""""
				12922
				12923	The intrinsic function takes a single argument: the value to be
				12924	converted.
				12925
				12926	Semantics:
				12927	""""""""""
				12928
Tim Northover	fd7e424	2014-07-17 10:51:23 +0000	[diff] [blame]	12929	The '``llvm.convert.to.fp16``' intrinsic function performs a conversion from a
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12930	conventional floating-point format to half precision floating-point format. The
Tim Northover	fd7e424	2014-07-17 10:51:23 +0000	[diff] [blame]	12931	return value is an ``i16`` which contains the converted number.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12932
				12933	Examples:
				12934	"""""""""
				12935
				12936	.. code-block:: llvm
				12937
Tim Northover	fd7e424	2014-07-17 10:51:23 +0000	[diff] [blame]	12938	%res = call i16 @llvm.convert.to.fp16.f32(float %a)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12939	store i16 %res, i16* @x, align 2
				12940
				12941	.. _int_convert_from_fp16:
				12942
				12943	'``llvm.convert.from.fp16``' Intrinsic
				12944	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				12945
				12946	Syntax:
				12947	"""""""
				12948
				12949	::
				12950
Tim Northover	fd7e424	2014-07-17 10:51:23 +0000	[diff] [blame]	12951	declare float @llvm.convert.from.fp16.f32(i16 %a)
				12952	declare double @llvm.convert.from.fp16.f64(i16 %a)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12953
				12954	Overview:
				12955	"""""""""
				12956
				12957	The '``llvm.convert.from.fp16``' intrinsic function performs a
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12958	conversion from half precision floating-point format to a conventional
				12959	floating-point format.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12960
				12961	Arguments:
				12962	""""""""""
				12963
				12964	The intrinsic function takes a single argument: the value to be
				12965	converted.
				12966
				12967	Semantics:
				12968	""""""""""
				12969
				12970	The '``llvm.convert.from.fp16``' intrinsic function performs a
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	12971	conversion from half precision floating-point format to a conventional
				12972	floating-point format. The input half-float value is
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12973	represented by an ``i16`` value.
				12974
				12975	Examples:
				12976	"""""""""
				12977
				12978	.. code-block:: llvm
				12979
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	12980	%a = load i16, i16* @x, align 2
Matt Arsenault	3e3ddda	2014-07-10 03:22:16 +0000	[diff] [blame]	12981	%res = call float @llvm.convert.from.fp16.f32(i16 %a)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12982
Duncan P. N. Exon Smith	e274180	2015-03-03 17:24:31 +0000	[diff] [blame]	12983	.. _dbg_intrinsics:
				12984
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12985	Debugger Intrinsics
				12986	-------------------
				12987
				12988	The LLVM debugger intrinsics (which all start with the ``llvm.dbg.``
				12989	prefix) are described in the `LLVM Source Level
Hans Wennborg	6519562	2017-09-28 15:16:37 +0000	[diff] [blame]	12990	Debugging <SourceLevelDebugging.html#format-common-intrinsics>`_
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12991	document.
				12992
				12993	Exception Handling Intrinsics
				12994	-----------------------------
				12995
				12996	The LLVM exception handling intrinsics (which all start with the
				12997	``llvm.eh.`` prefix) are described in the `LLVM Exception
Hans Wennborg	6519562	2017-09-28 15:16:37 +0000	[diff] [blame]	12998	Handling <ExceptionHandling.html#format-common-intrinsics>`_ document.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	12999
				13000	.. _int_trampoline:
				13001
				13002	Trampoline Intrinsics
				13003	---------------------
				13004
				13005	These intrinsics make it possible to excise one parameter, marked with
				13006	the :ref:`nest <nest>` attribute, from a function. The result is a
				13007	callable function pointer lacking the nest parameter - the caller does
				13008	not need to provide a value for it. Instead, the value to use is stored
				13009	in advance in a "trampoline", a block of memory usually allocated on the
				13010	stack, which also contains code to splice the nest value into the
				13011	argument list. This is used to implement the GCC nested function address
				13012	extension.
				13013
				13014	For example, if the function is ``i32 f(i8* nest %c, i32 %x, i32 %y)``
				13015	then the resulting function pointer has signature ``i32 (i32, i32)*``.
				13016	It can be created as follows:
				13017
				13018	.. code-block:: llvm
				13019
				13020	%tramp = alloca [10 x i8], align 4 ; size and alignment only correct for X86
David Blaikie	16a97eb	2015-03-04 22:02:58 +0000	[diff] [blame]	13021	%tramp1 = getelementptr [10 x i8], [10 x i8]* %tramp, i32 0, i32 0
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13022	call void @llvm.init.trampoline(i8* %tramp1, i8* bitcast (i32 (i8*, i32, i32)* @f to i8*), i8* %nval)
				13023	%p = call i8* @llvm.adjust.trampoline(i8* %tramp1)
				13024	%fp = bitcast i8* %p to i32 (i32, i32)*
				13025
				13026	The call ``%val = call i32 %fp(i32 %x, i32 %y)`` is then equivalent to
				13027	``%val = call i32 @f(i8* %nval, i32 %x, i32 %y)``.
				13028
				13029	.. _int_it:
				13030
				13031	'``llvm.init.trampoline``' Intrinsic
				13032	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13033
				13034	Syntax:
				13035	"""""""
				13036
				13037	::
				13038
				13039	declare void @llvm.init.trampoline(i8* <tramp>, i8* <func>, i8* <nval>)
				13040
				13041	Overview:
				13042	"""""""""
				13043
				13044	This fills the memory pointed to by ``tramp`` with executable code,
				13045	turning it into a trampoline.
				13046
				13047	Arguments:
				13048	""""""""""
				13049
				13050	The ``llvm.init.trampoline`` intrinsic takes three arguments, all
				13051	pointers. The ``tramp`` argument must point to a sufficiently large and
				13052	sufficiently aligned block of memory; this memory is written to by the
				13053	intrinsic. Note that the size and the alignment are target-specific -
				13054	LLVM currently provides no portable way of determining them, so a
				13055	front-end that generates this intrinsic needs to have some
				13056	target-specific knowledge. The ``func`` argument must hold a function
				13057	bitcast to an ``i8*``.
				13058
				13059	Semantics:
				13060	""""""""""
				13061
				13062	The block of memory pointed to by ``tramp`` is filled with target
				13063	dependent code, turning it into a function. Then ``tramp`` needs to be
				13064	passed to :ref:`llvm.adjust.trampoline <int_at>` to get a pointer which can
				13065	be :ref:`bitcast (to a new function) and called <int_trampoline>`. The new
				13066	function's signature is the same as that of ``func`` with any arguments
				13067	marked with the ``nest`` attribute removed. At most one such ``nest``
				13068	argument is allowed, and it must be of pointer type. Calling the new
				13069	function is equivalent to calling ``func`` with the same argument list,
				13070	but with ``nval`` used for the missing ``nest`` argument. If, after
				13071	calling ``llvm.init.trampoline``, the memory pointed to by ``tramp`` is
				13072	modified, then the effect of any later call to the returned function
				13073	pointer is undefined.
				13074
				13075	.. _int_at:
				13076
				13077	'``llvm.adjust.trampoline``' Intrinsic
				13078	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13079
				13080	Syntax:
				13081	"""""""
				13082
				13083	::
				13084
				13085	declare i8* @llvm.adjust.trampoline(i8* <tramp>)
				13086
				13087	Overview:
				13088	"""""""""
				13089
				13090	This performs any required machine-specific adjustment to the address of
				13091	a trampoline (passed as ``tramp``).
				13092
				13093	Arguments:
				13094	""""""""""
				13095
				13096	``tramp`` must point to a block of memory which already has trampoline
				13097	code filled in by a previous call to
				13098	:ref:`llvm.init.trampoline <int_it>`.
				13099
				13100	Semantics:
				13101	""""""""""
				13102
				13103	On some architectures the address of the code to be executed needs to be
Sanjay Patel	69bf48e	2014-07-04 19:40:43 +0000	[diff] [blame]	13104	different than the address where the trampoline is actually stored. This
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13105	intrinsic returns the executable address corresponding to ``tramp``
				13106	after performing the required machine specific adjustments. The pointer
				13107	returned can then be :ref:`bitcast and executed <int_trampoline>`.
				13108
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13109	.. _int_mload_mstore:
				13110
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13111	Masked Vector Load and Store Intrinsics
				13112	---------------------------------------
				13113
				13114	LLVM provides intrinsics for predicated vector load and store operations. The predicate is specified by a mask operand, which holds one bit per vector element, switching the associated vector lane on or off. The memory addresses corresponding to the "off" lanes are not accessed. When all bits of the mask are on, the intrinsic is identical to a regular vector load or store. When all bits are off, no memory is accessed.
				13115
				13116	.. _int_mload:
				13117
				13118	'``llvm.masked.load.*``' Intrinsics
				13119	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13120
				13121	Syntax:
				13122	"""""""
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13123	This is an overloaded intrinsic. The loaded data is a vector of any integer, floating-point or pointer data type.
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13124
				13125	::
				13126
Artur Pilipenko	7ad95ec	2016-06-28 18:27:25 +0000	[diff] [blame]	13127	declare <16 x float> @llvm.masked.load.v16f32.p0v16f32 (<16 x float>* <ptr>, i32 <alignment>, <16 x i1> <mask>, <16 x float> <passthru>)
				13128	declare <2 x double> @llvm.masked.load.v2f64.p0v2f64 (<2 x double>* <ptr>, i32 <alignment>, <2 x i1> <mask>, <2 x double> <passthru>)
Elena Demikhovsky	1ca72e1	2015-11-19 07:17:16 +0000	[diff] [blame]	13129	;; The data is a vector of pointers to double
Artur Pilipenko	7ad95ec	2016-06-28 18:27:25 +0000	[diff] [blame]	13130	declare <8 x double*> @llvm.masked.load.v8p0f64.p0v8p0f64 (<8 x double*>* <ptr>, i32 <alignment>, <8 x i1> <mask>, <8 x double*> <passthru>)
Elena Demikhovsky	1ca72e1	2015-11-19 07:17:16 +0000	[diff] [blame]	13131	;; The data is a vector of function pointers
Artur Pilipenko	7ad95ec	2016-06-28 18:27:25 +0000	[diff] [blame]	13132	declare <8 x i32 ()*> @llvm.masked.load.v8p0f_i32f.p0v8p0f_i32f (<8 x i32 ()*>* <ptr>, i32 <alignment>, <8 x i1> <mask>, <8 x i32 ()*> <passthru>)
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13133
				13134	Overview:
				13135	"""""""""
				13136
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13137	Reads a vector from memory according to the provided mask. The mask holds a bit for each vector lane, and is used to prevent memory accesses to the masked-off lanes. The masked-off lanes in the result vector are taken from the corresponding lanes of the '``passthru``' operand.
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13138
				13139
				13140	Arguments:
				13141	""""""""""
				13142
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13143	The first operand is the base pointer for the load. The second operand is the alignment of the source location. It must be a constant integer value. The third operand, mask, is a vector of boolean values with the same number of elements as the return type. The fourth is a pass-through value that is used to fill the masked-off lanes of the result. The return type, underlying type of the base pointer and the type of the '``passthru``' operand are the same vector types.
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13144
				13145
				13146	Semantics:
				13147	""""""""""
				13148
				13149	The '``llvm.masked.load``' intrinsic is designed for conditional reading of selected vector elements in a single IR operation. It is useful for targets that support vector masked loads and allows vectorizing predicated basic blocks on these targets. Other targets may support this intrinsic differently, for example by lowering it into a sequence of branches that guard scalar load operations.
				13150	The result of this operation is equivalent to a regular vector load instruction followed by a 'select' between the loaded and the passthru values, predicated on the same mask. However, using this intrinsic prevents exceptions on memory access to masked-off lanes.
				13151
				13152
				13153	::
				13154
Artur Pilipenko	7ad95ec	2016-06-28 18:27:25 +0000	[diff] [blame]	13155	%res = call <16 x float> @llvm.masked.load.v16f32.p0v16f32 (<16 x float>* %ptr, i32 4, <16 x i1> %mask, <16 x float> %passthru)
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	13156
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13157	;; The result of the two following instructions is identical aside from potential memory access exception
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	13158	%loadlal = load <16 x float>, <16 x float>* %ptr, align 4
Elena Demikhovsky	e86c8c8	2014-12-29 09:47:51 +0000	[diff] [blame]	13159	%res = select <16 x i1> %mask, <16 x float> %loadlal, <16 x float> %passthru
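
The branch-based lowering mentioned above can be sketched for a two-element
vector as follows. This is an illustrative expansion only, not the code that
any particular target emits; the function name and all value names are
hypothetical.

.. code-block:: llvm

      define <2 x i32> @masked_load_v2i32_lowered(<2 x i32>* %ptr, <2 x i1> %mask, <2 x i32> %passthru) {
      entry:
        %base = bitcast <2 x i32>* %ptr to i32*
        %m0 = extractelement <2 x i1> %mask, i32 0
        br i1 %m0, label %load0, label %lane1

      load0:                                  ; lane 0 is on: load it
        %e0 = load i32, i32* %base, align 4
        %r0 = insertelement <2 x i32> %passthru, i32 %e0, i32 0
        br label %lane1

      lane1:                                  ; lane 0 keeps passthru if it was off
        %acc = phi <2 x i32> [ %r0, %load0 ], [ %passthru, %entry ]
        %m1 = extractelement <2 x i1> %mask, i32 1
        br i1 %m1, label %load1, label %done

      load1:                                  ; lane 1 is on: load it
        %p1 = getelementptr i32, i32* %base, i32 1
        %e1 = load i32, i32* %p1, align 4
        %r1 = insertelement <2 x i32> %acc, i32 %e1, i32 1
        br label %done

      done:
        %res = phi <2 x i32> [ %r1, %load1 ], [ %acc, %lane1 ]
        ret <2 x i32> %res
      }

Each scalar load executes only when its mask bit is set, so the memory behind
a masked-off lane is never touched.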
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13160
				13161	.. _int_mstore:
				13162
				13163	'``llvm.masked.store.*``' Intrinsics
				13164	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13165
				13166	Syntax:
				13167	"""""""
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13168	This is an overloaded intrinsic. The data stored in memory is a vector of any integer, floating-point or pointer data type.
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13169
				13170	::
				13171
Artur Pilipenko	7ad95ec	2016-06-28 18:27:25 +0000	[diff] [blame]	13172	declare void @llvm.masked.store.v8i32.p0v8i32 (<8 x i32> <value>, <8 x i32>* <ptr>, i32 <alignment>, <8 x i1> <mask>)
				13173	declare void @llvm.masked.store.v16f32.p0v16f32 (<16 x float> <value>, <16 x float>* <ptr>, i32 <alignment>, <16 x i1> <mask>)
Elena Demikhovsky	1ca72e1	2015-11-19 07:17:16 +0000	[diff] [blame]	13174	;; The data is a vector of pointers to double
Artur Pilipenko	7ad95ec	2016-06-28 18:27:25 +0000	[diff] [blame]	13175	declare void @llvm.masked.store.v8p0f64.p0v8p0f64 (<8 x double*> <value>, <8 x double*>* <ptr>, i32 <alignment>, <8 x i1> <mask>)
Elena Demikhovsky	1ca72e1	2015-11-19 07:17:16 +0000	[diff] [blame]	13176	;; The data is a vector of function pointers
Artur Pilipenko	7ad95ec	2016-06-28 18:27:25 +0000	[diff] [blame]	13177	declare void @llvm.masked.store.v4p0f_i32f.p0v4p0f_i32f (<4 x i32 ()*> <value>, <4 x i32 ()*>* <ptr>, i32 <alignment>, <4 x i1> <mask>)
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13178
				13179	Overview:
				13180	"""""""""
				13181
				13182	Writes a vector to memory according to the provided mask. The mask holds a bit for each vector lane, and is used to prevent memory accesses to the masked-off lanes.
				13183
				13184	Arguments:
				13185	""""""""""
				13186
				13187	The first operand is the vector value to be written to memory. The second operand is the base pointer for the store; it has the same underlying type as the value operand. The third operand is the alignment of the destination location. The fourth operand, mask, is a vector of boolean values. The types of the mask and the value operand must have the same number of vector elements.
				13188
				13189
				13190	Semantics:
				13191	""""""""""
				13192
				13193	The '``llvm.masked.store``' intrinsic is designed for conditional writing of selected vector elements in a single IR operation. It is useful for targets that support vector masked stores and allows vectorizing predicated basic blocks on these targets. Other targets may support this intrinsic differently, for example by lowering it into a sequence of branches that guard scalar store operations.
				13194	The result of this operation is equivalent to a load-modify-store sequence. However, using this intrinsic prevents exceptions and data races on memory access to masked-off lanes.
				13195
				13196	::
				13197
Artur Pilipenko	7ad95ec	2016-06-28 18:27:25 +0000	[diff] [blame]	13198	call void @llvm.masked.store.v16f32.p0v16f32(<16 x float> %value, <16 x float>* %ptr, i32 4, <16 x i1> %mask)
Mehdi Amini	4a121fa	2015-03-14 22:04:06 +0000	[diff] [blame]	13199
Elena Demikhovsky	e86c8c8	2014-12-29 09:47:51 +0000	[diff] [blame]	13200	;; The result of the following instructions is identical aside from potential data races and memory access exceptions
David Blaikie	c7aabbb	2015-03-04 22:06:14 +0000	[diff] [blame]	13201	%oldval = load <16 x float>, <16 x float>* %ptr, align 4
Elena Demikhovsky	3d13f1c	2014-12-25 09:29:13 +0000	[diff] [blame]	13202	%res = select <16 x i1> %mask, <16 x float> %value, <16 x float> %oldval
				13203	store <16 x float> %res, <16 x float>* %ptr, align 4
				13204
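The per-lane behavior of a masked store can also be sketched as a scalar reference model in C. This is only an illustration of the semantics, not how any target lowers the intrinsic; the helper name ``masked_store_f32`` and its signature are hypothetical.

.. code-block:: c

    #include <stdbool.h>
    #include <stddef.h>

    /* Hypothetical helper: scalar model of a masked store of n float lanes.
       A lane whose mask bit is false never touches memory, so no fault or
       data race can originate from that lane. */
    static void masked_store_f32(const float *value, float *ptr,
                                 const bool *mask, size_t n) {
        for (size_t i = 0; i < n; ++i) {
            if (mask[i])
                ptr[i] = value[i];
        }
    }

Unlike the load-select-store sequence, this model performs no access at all for masked-off lanes, which is the property the intrinsic is meant to preserve.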
				13205
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13206	Masked Vector Gather and Scatter Intrinsics
				13207	-------------------------------------------
				13208
				13209	LLVM provides intrinsics for vector gather and scatter operations. They are similar to :ref:`Masked Vector Load and Store <int_mload_mstore>`, except they are designed for arbitrary memory accesses, rather than sequential memory accesses. Gather and scatter also employ a mask operand, which holds one bit per vector element, switching the associated vector lane on or off. The memory addresses corresponding to the "off" lanes are not accessed. When all bits are off, no memory is accessed.
				13210
				13211	.. _int_mgather:
				13212
				13213	'``llvm.masked.gather.*``' Intrinsics
				13214	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13215
				13216	Syntax:
				13217	"""""""
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13218	This is an overloaded intrinsic. The loaded data are multiple scalar values of any integer, floating-point or pointer data type gathered together into one vector.
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13219
::

      declare <16 x float> @llvm.masked.gather.v16f32.v16p0f32 (<16 x float*> <ptrs>, i32 <alignment>, <16 x i1> <mask>, <16 x float> <passthru>)
      declare <2 x double> @llvm.masked.gather.v2f64.v2p1f64 (<2 x double addrspace(1)*> <ptrs>, i32 <alignment>, <2 x i1> <mask>, <2 x double> <passthru>)
      declare <8 x float*> @llvm.masked.gather.v8p0f32.v8p0p0f32 (<8 x float**> <ptrs>, i32 <alignment>, <8 x i1> <mask>, <8 x float*> <passthru>)
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13225
				13226	Overview:
				13227	"""""""""
				13228
				13229	Reads scalar values from arbitrary memory locations and gathers them into one vector. The memory locations are provided in the vector of pointers '``ptrs``'. The memory is accessed according to the provided mask. The mask holds a bit for each vector lane, and is used to prevent memory accesses to the masked-off lanes. The masked-off lanes in the result vector are taken from the corresponding lanes of the '``passthru``' operand.
				13230
				13231
				13232	Arguments:
				13233	""""""""""
				13234
The first operand is a vector of pointers which holds all memory addresses to read. The second operand is the alignment of the source addresses. It must be a constant integer value. The third operand, mask, is a vector of boolean values with the same number of elements as the return type. The fourth operand is a pass-through value that is used to fill the masked-off lanes of the result. The return type, the underlying type of the vector of pointers, and the type of the '``passthru``' operand are the same vector types.
				13236
				13237
				13238	Semantics:
				13239	""""""""""
				13240
The '``llvm.masked.gather``' intrinsic is designed for conditional reading of multiple scalar values from arbitrary memory locations in a single IR operation. It is useful for targets that support vector masked gathers and allows vectorizing basic blocks with data and control divergence. Other targets may support this intrinsic differently, for example by lowering it into a sequence of scalar load operations.
The semantics of this operation are equivalent to a sequence of conditional scalar loads, followed by gathering all loaded values into a single vector. The mask restricts memory access to certain lanes and facilitates vectorization of predicated basic blocks.
				13243
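The sequence of conditional scalar loads can be sketched in C as follows. This is a minimal reference model under the assumption of ``double`` elements; the helper name ``masked_gather_f64`` and its parameters are hypothetical and only illustrate the semantics.

.. code-block:: c

    #include <stdbool.h>
    #include <stddef.h>

    /* Hypothetical helper: scalar model of a masked gather of n double lanes.
       An enabled lane loads through its own pointer; a masked-off lane takes
       the corresponding passthru element and its pointer is never
       dereferenced. */
    static void masked_gather_f64(double *result, double *const *ptrs,
                                  const bool *mask, const double *passthru,
                                  size_t n) {
        for (size_t i = 0; i < n; ++i)
            result[i] = mask[i] ? *ptrs[i] : passthru[i];
    }

Because a masked-off lane is never dereferenced, its pointer may be null or otherwise invalid without affecting the result.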
				13244
::

      %res = call <4 x double> @llvm.masked.gather.v4f64.v4p0f64 (<4 x double*> %ptrs, i32 8, <4 x i1> <i1 true, i1 true, i1 true, i1 true>, <4 x double> undef)

      ;; The gather with all-true mask is equivalent to the following instruction sequence
      %ptr0 = extractelement <4 x double*> %ptrs, i32 0
      %ptr1 = extractelement <4 x double*> %ptrs, i32 1
      %ptr2 = extractelement <4 x double*> %ptrs, i32 2
      %ptr3 = extractelement <4 x double*> %ptrs, i32 3

      %val0 = load double, double* %ptr0, align 8
      %val1 = load double, double* %ptr1, align 8
      %val2 = load double, double* %ptr2, align 8
      %val3 = load double, double* %ptr3, align 8

      %vec0    = insertelement <4 x double> undef, double %val0, i32 0
      %vec01   = insertelement <4 x double> %vec0, double %val1, i32 1
      %vec012  = insertelement <4 x double> %vec01, double %val2, i32 2
      %vec0123 = insertelement <4 x double> %vec012, double %val3, i32 3
				13264
				13265	.. _int_mscatter:
				13266
				13267	'``llvm.masked.scatter.*``' Intrinsics
				13268	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13269
				13270	Syntax:
				13271	"""""""
This is an overloaded intrinsic. The data stored in memory is a vector of any integer, floating-point or pointer data type. Each vector element is stored at an arbitrary memory address. Scatter with overlapping addresses is guaranteed to be ordered from least-significant to most-significant element.
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13273
::

      declare void @llvm.masked.scatter.v8i32.v8p0i32 (<8 x i32> <value>, <8 x i32*> <ptrs>, i32 <alignment>, <8 x i1> <mask>)
      declare void @llvm.masked.scatter.v16f32.v16p1f32 (<16 x float> <value>, <16 x float addrspace(1)*> <ptrs>, i32 <alignment>, <16 x i1> <mask>)
      declare void @llvm.masked.scatter.v4p0f64.v4p0p0f64 (<4 x double*> <value>, <4 x double**> <ptrs>, i32 <alignment>, <4 x i1> <mask>)
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13279
				13280	Overview:
				13281	"""""""""
				13282
				13283	Writes each element from the value vector to the corresponding memory address. The memory addresses are represented as a vector of pointers. Writing is done according to the provided mask. The mask holds a bit for each vector lane, and is used to prevent memory accesses to the masked-off lanes.
				13284
				13285	Arguments:
				13286	""""""""""
				13287
The first operand is a vector value to be written to memory. The second operand is a vector of pointers, pointing to where the value elements should be stored. It has the same underlying type as the value operand. The third operand is the alignment of the destination addresses. The fourth operand, mask, is a vector of boolean values. The types of the mask and the value operand must have the same number of vector elements.
				13289
				13290
				13291	Semantics:
				13292	""""""""""
				13293
The '``llvm.masked.scatter``' intrinsic is designed for writing selected vector elements to arbitrary memory addresses in a single IR operation. The operation may be conditional, when not all bits in the mask are switched on. It is useful for targets that support vector masked scatter and allows vectorizing basic blocks with data and control divergence. Other targets may support this intrinsic differently, for example by lowering it into a sequence of branches that guard scalar store operations.
Elena Demikhovsky	82cdd65	2015-05-07 12:25:11 +0000	[diff] [blame]	13295
::

      ;; This instruction unconditionally stores the data vector to multiple addresses
      call void @llvm.masked.scatter.v8i32.v8p0i32 (<8 x i32> %value, <8 x i32*> %ptrs, i32 4, <8 x i1> <true, true, .. true>)

      ;; It is equivalent to a list of scalar stores
      %val0 = extractelement <8 x i32> %value, i32 0
      %val1 = extractelement <8 x i32> %value, i32 1
      ..
      %val7 = extractelement <8 x i32> %value, i32 7
      %ptr0 = extractelement <8 x i32*> %ptrs, i32 0
      %ptr1 = extractelement <8 x i32*> %ptrs, i32 1
      ..
      %ptr7 = extractelement <8 x i32*> %ptrs, i32 7
      ;; Note: the order of the following stores is important when they overlap:
      store i32 %val0, i32* %ptr0, align 4
      store i32 %val1, i32* %ptr1, align 4
      ..
      store i32 %val7, i32* %ptr7, align 4
				13315
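The ordering guarantee for overlapping addresses can be illustrated with a scalar reference model in C. This is a sketch of the semantics only; the helper name ``masked_scatter_i32`` and its signature are hypothetical.

.. code-block:: c

    #include <stdbool.h>
    #include <stddef.h>
    #include <stdint.h>

    /* Hypothetical helper: scalar model of a masked scatter of n i32 lanes.
       Lanes are processed from lane 0 upward, so when two enabled lanes
       target the same address the higher-numbered lane's value is the one
       that remains in memory. */
    static void masked_scatter_i32(const int32_t *value, int32_t *const *ptrs,
                                   const bool *mask, size_t n) {
        for (size_t i = 0; i < n; ++i) {
            if (mask[i])
                *ptrs[i] = value[i];
        }
    }

For example, if lanes 0 and 2 are both enabled and point to the same location, that location holds the lane 2 value after the call.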
				13316
Elena Demikhovsky	0ef2ce3	2018-06-06 09:11:46 +0000	[diff] [blame]	13317	Masked Vector Expanding Load and Compressing Store Intrinsics
				13318	-------------------------------------------------------------
				13319
LLVM provides intrinsics for expanding load and compressing store operations. Data selected from a vector according to a mask is stored in consecutive memory addresses (compressed store), and vice-versa (expanding load). These operations effectively map to "if (cond.i) a[j++] = v.i" and "if (cond.i) v.i = a[j++]" patterns, respectively. Note that when the mask starts with '1' bits followed by '0' bits, these operations are identical to :ref:`llvm.masked.store <int_mstore>` and :ref:`llvm.masked.load <int_mload>`.
				13321
				13322	.. _int_expandload:
				13323
				13324	'``llvm.masked.expandload.*``' Intrinsics
				13325	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13326
				13327	Syntax:
				13328	"""""""
				13329	This is an overloaded intrinsic. Several values of integer, floating point or pointer data type are loaded from consecutive memory addresses and stored into the elements of a vector according to the mask.
				13330
				13331	::
				13332
				13333	declare <16 x float> @llvm.masked.expandload.v16f32 (float* <ptr>, <16 x i1> <mask>, <16 x float> <passthru>)
				13334	declare <2 x i64> @llvm.masked.expandload.v2i64 (i64* <ptr>, <2 x i1> <mask>, <2 x i64> <passthru>)
				13335
				13336	Overview:
				13337	"""""""""
				13338
Reads a number of scalar values sequentially from the memory location provided in '``ptr``' and spreads them in a vector. The '``mask``' holds a bit for each vector lane. The number of elements read from memory is equal to the number of '1' bits in the mask. The loaded elements are positioned in the destination vector according to the sequence of '1' and '0' bits in the mask. E.g., if the mask vector is '10010001', "expandload" reads 3 values from memory addresses ptr, ptr+1, ptr+2 and places them in lanes 0, 3 and 7 accordingly. The masked-off lanes are filled by elements from the corresponding lanes of the '``passthru``' operand.
				13340
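The lane placement described above can be sketched as a scalar reference model in C. The helper name ``expandload_f64`` is hypothetical, and returning the number of consumed elements is an assumption made for illustration; the intrinsic itself returns only the vector.

.. code-block:: c

    #include <stdbool.h>
    #include <stddef.h>

    /* Hypothetical helper: scalar model of an expanding load of n double
       lanes. Enabled lanes consume consecutive elements starting at ptr;
       masked-off lanes take the corresponding passthru element.
       Returns the number of elements read from memory. */
    static size_t expandload_f64(double *result, const double *ptr,
                                 const bool *mask, const double *passthru,
                                 size_t n) {
        size_t j = 0;
        for (size_t i = 0; i < n; ++i)
            result[i] = mask[i] ? ptr[j++] : passthru[i];
        return j;
    }

With the mask '10010001' (lanes 0, 3 and 7 enabled), this model reads ``ptr[0]``, ``ptr[1]`` and ``ptr[2]`` and places them in lanes 0, 3 and 7.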
				13341
				13342	Arguments:
				13343	""""""""""
				13344
				13345	The first operand is the base pointer for the load. It has the same underlying type as the element of the returned vector. The second operand, mask, is a vector of boolean values with the same number of elements as the return type. The third is a pass-through value that is used to fill the masked-off lanes of the result. The return type and the type of the '``passthru``' operand have the same vector type.
				13346
				13347	Semantics:
				13348	""""""""""
				13349
The '``llvm.masked.expandload``' intrinsic is designed for reading multiple scalar values from adjacent memory addresses into possibly non-adjacent vector lanes. It is useful for targets that support vector expanding loads and allows vectorizing loops with cross-iteration dependences, as in the following example:

.. code-block:: c

    // In this loop we load from B and spread the elements into array A.
    double *A, *B; int *C;
    for (int i = 0; i < size; ++i) {
      if (C[i] != 0)
        A[i] = B[j++];
    }
				13360
				13361
				13362	.. code-block:: llvm
				13363
				13364	; Load several elements from array B and expand them in a vector.
				13365	; The number of loaded elements is equal to the number of '1' elements in the Mask.
				13366	%Tmp = call <8 x double> @llvm.masked.expandload.v8f64(double* %Bptr, <8 x i1> %Mask, <8 x double> undef)
				13367	; Store the result in A
				13368	call void @llvm.masked.store.v8f64.p0v8f64(<8 x double> %Tmp, <8 x double>* %Aptr, i32 8, <8 x i1> %Mask)
				13369
				13370	; %Bptr should be increased on each iteration according to the number of '1' elements in the Mask.
				13371	%MaskI = bitcast <8 x i1> %Mask to i8
				13372	%MaskIPopcnt = call i8 @llvm.ctpop.i8(i8 %MaskI)
				13373	%MaskI64 = zext i8 %MaskIPopcnt to i64
				13374	%BNextInd = add i64 %BInd, %MaskI64
				13375
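The relationship between the scalar loop and its vectorized form can be sketched in C by processing the arrays in chunks. This is an illustrative model only: the function names, the vector width ``VF`` and the restriction that ``size`` is a multiple of ``VF`` are assumptions made to keep the sketch short.

.. code-block:: c

    #include <stddef.h>

    enum { VF = 8 };  /* assumed vector width, matching <8 x double> */

    /* The scalar loop: B is consumed sequentially, A is written sparsely. */
    static size_t expand_scalar(double *A, const double *B, const int *C,
                                size_t size) {
        size_t j = 0;
        for (size_t i = 0; i < size; ++i)
            if (C[i] != 0)
                A[i] = B[j++];
        return j;
    }

    /* Chunked form: each outer iteration models one expanding load followed
       by a masked store, then advances the index into B by the population
       count of the mask. Assumes size is a multiple of VF. */
    static size_t expand_chunked(double *A, const double *B, const int *C,
                                 size_t size) {
        size_t b_ind = 0;
        for (size_t base = 0; base < size; base += VF) {
            size_t popcnt = 0;
            for (size_t lane = 0; lane < VF; ++lane) {
                if (C[base + lane] != 0) {               /* mask bit is '1' */
                    A[base + lane] = B[b_ind + popcnt];  /* expand + store */
                    ++popcnt;
                }
            }
            b_ind += popcnt;  /* models %BNextInd = %BInd + ctpop(mask) */
        }
        return b_ind;
    }

Both functions write the same elements of ``A`` and consume the same number of elements of ``B``; the cross-iteration dependence on ``j`` is carried between chunks only through the population count.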
				13376
				13377	Other targets may support this intrinsic differently, for example, by lowering it into a sequence of conditional scalar load operations and shuffles.
				13378	If all mask elements are '1', the intrinsic behavior is equivalent to the regular unmasked vector load.
				13379
				13380	.. _int_compressstore:
				13381
				13382	'``llvm.masked.compressstore.*``' Intrinsics
				13383	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13384
				13385	Syntax:
				13386	"""""""
				13387	This is an overloaded intrinsic. A number of scalar values of integer, floating point or pointer data type are collected from an input vector and stored into adjacent memory addresses. A mask defines which elements to collect from the vector.
				13388
				13389	::
				13390
				13391	declare void @llvm.masked.compressstore.v8i32 (<8 x i32> <value>, i32* <ptr>, <8 x i1> <mask>)
				13392	declare void @llvm.masked.compressstore.v16f32 (<16 x float> <value>, float* <ptr>, <16 x i1> <mask>)
				13393
				13394	Overview:
				13395	"""""""""
				13396
Selects elements from the input vector '``value``' according to the '``mask``'. All selected elements are written into adjacent memory addresses starting at address '``ptr``', from lower to higher. The mask holds a bit for each vector lane, and is used to select elements to be stored. The number of elements to be stored is equal to the number of active bits in the mask.
				13398
				13399	Arguments:
				13400	""""""""""
				13401
The first operand is the input vector, from which elements are collected and written to memory. The second operand is the base pointer for the store; it has the same underlying type as the element of the input vector operand. The third operand is the mask, a vector of boolean values. The mask and the input vector must have the same number of vector elements.
				13403
				13404
				13405	Semantics:
				13406	""""""""""
				13407
The '``llvm.masked.compressstore``' intrinsic is designed for compressing data in memory. It allows collecting elements from possibly non-adjacent lanes of a vector and storing them contiguously in memory in one IR operation. It is useful for targets that support compressing store operations and allows vectorizing loops with cross-iteration dependences, as in the following example:

.. code-block:: c

    // In this loop we load elements from A and store them consecutively in B
    double *A, *B; int *C;
    for (int i = 0; i < size; ++i) {
      if (C[i] != 0)
        B[j++] = A[i];
    }
				13418
				13419
.. code-block:: llvm

    ; Load elements from A.
    %Tmp = call <8 x double> @llvm.masked.load.v8f64.p0v8f64(<8 x double>* %Aptr, i32 8, <8 x i1> %Mask, <8 x double> undef)
    ; Store all selected elements consecutively in array B
    call void @llvm.masked.compressstore.v8f64(<8 x double> %Tmp, double* %Bptr, <8 x i1> %Mask)

    ; %Bptr should be increased on each iteration according to the number of '1' elements in the Mask.
    %MaskI = bitcast <8 x i1> %Mask to i8
    %MaskIPopcnt = call i8 @llvm.ctpop.i8(i8 %MaskI)
    %MaskI64 = zext i8 %MaskIPopcnt to i64
    %BNextInd = add i64 %BInd, %MaskI64
				13432
				13433
				13434	Other targets may support this intrinsic differently, for example, by lowering it into a sequence of branches that guard scalar store operations.
				13435
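The behavior of a compressing store can be sketched as a scalar reference model in C. The helper name ``compressstore_f64`` is hypothetical, and returning the number of stored elements is an assumption made for illustration; the intrinsic itself returns nothing.

.. code-block:: c

    #include <stdbool.h>
    #include <stddef.h>

    /* Hypothetical helper: scalar model of a compressing store of n double
       lanes. Elements of enabled lanes are written to consecutive addresses
       starting at ptr, from lower to higher lanes; memory beyond the last
       written element is left untouched.
       Returns the number of elements written. */
    static size_t compressstore_f64(const double *value, double *ptr,
                                    const bool *mask, size_t n) {
        size_t j = 0;
        for (size_t i = 0; i < n; ++i) {
            if (mask[i])
                ptr[j++] = value[i];
        }
        return j;
    }

The returned count corresponds to the population count of the mask, which is the amount by which the base pointer advances between iterations of a vectorized loop.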
				13436
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13437	Memory Use Markers
				13438	------------------
				13439
Sanjay Patel	69bf48e	2014-07-04 19:40:43 +0000	[diff] [blame]	13440	This class of intrinsics provides information about the lifetime of
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13441	memory objects and ranges where variables are immutable.
				13442
Reid Kleckner	a534a38	2013-12-19 02:14:12 +0000	[diff] [blame]	13443	.. _int_lifestart:
				13444
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13445	'``llvm.lifetime.start``' Intrinsic
				13446	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13447
				13448	Syntax:
				13449	"""""""
				13450
				13451	::
				13452
				13453	declare void @llvm.lifetime.start(i64 <size>, i8* nocapture <ptr>)
				13454
				13455	Overview:
				13456	"""""""""
				13457
				13458	The '``llvm.lifetime.start``' intrinsic specifies the start of a memory
				13459	object's lifetime.
				13460
				13461	Arguments:
				13462	""""""""""
				13463
				13464	The first argument is a constant integer representing the size of the
				13465	object, or -1 if it is variable sized. The second argument is a pointer
				13466	to the object.
				13467
				13468	Semantics:
				13469	""""""""""
				13470
				13471	This intrinsic indicates that before this point in the code, the value
				13472	of the memory pointed to by ``ptr`` is dead. This means that it is known
				13473	to never be used and has an undefined value. A load from the pointer
				13474	that precedes this intrinsic can be replaced with ``'undef'``.
				13475
Reid Kleckner	a534a38	2013-12-19 02:14:12 +0000	[diff] [blame]	13476	.. _int_lifeend:
				13477
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13478	'``llvm.lifetime.end``' Intrinsic
				13479	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13480
				13481	Syntax:
				13482	"""""""
				13483
				13484	::
				13485
				13486	declare void @llvm.lifetime.end(i64 <size>, i8* nocapture <ptr>)
				13487
				13488	Overview:
				13489	"""""""""
				13490
				13491	The '``llvm.lifetime.end``' intrinsic specifies the end of a memory
				13492	object's lifetime.
				13493
				13494	Arguments:
				13495	""""""""""
				13496
				13497	The first argument is a constant integer representing the size of the
				13498	object, or -1 if it is variable sized. The second argument is a pointer
				13499	to the object.
				13500
				13501	Semantics:
				13502	""""""""""
				13503
				13504	This intrinsic indicates that after this point in the code, the value of
				13505	the memory pointed to by ``ptr`` is dead. This means that it is known to
				13506	never be used and has an undefined value. Any stores into the memory
				13507	object following this intrinsic may be removed as dead.
				13508
				13509	'``llvm.invariant.start``' Intrinsic
				13510	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13511
				13512	Syntax:
				13513	"""""""
Mehdi Amini	8c629ec	2016-08-13 23:31:24 +0000	[diff] [blame]	13514	This is an overloaded intrinsic. The memory object can belong to any address space.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13515
				13516	::
				13517
Mehdi Amini	8c629ec	2016-08-13 23:31:24 +0000	[diff] [blame]	13518	declare {}* @llvm.invariant.start.p0i8(i64 <size>, i8* nocapture <ptr>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13519
				13520	Overview:
				13521	"""""""""
				13522
				13523	The '``llvm.invariant.start``' intrinsic specifies that the contents of
				13524	a memory object will not change.
				13525
				13526	Arguments:
				13527	""""""""""
				13528
				13529	The first argument is a constant integer representing the size of the
				13530	object, or -1 if it is variable sized. The second argument is a pointer
				13531	to the object.
				13532
				13533	Semantics:
				13534	""""""""""
				13535
				13536	This intrinsic indicates that until an ``llvm.invariant.end`` that uses
				13537	the return value, the referenced memory location is constant and
				13538	unchanging.
				13539
				13540	'``llvm.invariant.end``' Intrinsic
				13541	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13542
				13543	Syntax:
				13544	"""""""
Mehdi Amini	8c629ec	2016-08-13 23:31:24 +0000	[diff] [blame]	13545	This is an overloaded intrinsic. The memory object can belong to any address space.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13546
				13547	::
				13548
Mehdi Amini	8c629ec	2016-08-13 23:31:24 +0000	[diff] [blame]	13549	declare void @llvm.invariant.end.p0i8({}* <start>, i64 <size>, i8* nocapture <ptr>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	13550
				13551	Overview:
				13552	"""""""""
				13553
				13554	The '``llvm.invariant.end``' intrinsic specifies that the contents of a
				13555	memory object are mutable.
				13556
				13557	Arguments:
				13558	""""""""""
				13559
The first argument is the matching ``llvm.invariant.start`` intrinsic.
The second argument is a constant integer representing the size of the
object, or -1 if it is variable sized. The third argument is a pointer
to the object.
				13564
				13565	Semantics:
				13566	""""""""""
				13567
				13568	This intrinsic indicates that the memory is mutable again.
				13569
Piotr Padlewski	5dde809	2018-05-03 11:03:01 +0000	[diff] [blame]	13570	'``llvm.launder.invariant.group``' Intrinsic
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	13571	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13572
				13573	Syntax:
				13574	"""""""
Yaxun Liu	407ca36	2017-11-16 16:32:16 +0000	[diff] [blame]	13575	This is an overloaded intrinsic. The memory object can belong to any address
				13576	space. The returned pointer must belong to the same address space as the
				13577	argument.
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	13578
				13579	::
				13580
Piotr Padlewski	5dde809	2018-05-03 11:03:01 +0000	[diff] [blame]	13581	declare i8* @llvm.launder.invariant.group.p0i8(i8* <ptr>)
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	13582
				13583	Overview:
				13584	"""""""""
				13585
Piotr Padlewski	5dde809	2018-05-03 11:03:01 +0000	[diff] [blame]	13586	The '``llvm.launder.invariant.group``' intrinsic can be used when an invariant
Piotr Padlewski	5b3db45	2018-07-02 04:49:30 +0000	[diff] [blame]	13587	established by ``invariant.group`` metadata no longer holds, to obtain a new
				13588	pointer value that carries fresh invariant group information. It is an
				13589	experimental intrinsic, which means that its semantics might change in the
				13590	future.
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	13591
				13592
				13593	Arguments:
				13594	""""""""""
				13595
The ``llvm.launder.invariant.group`` intrinsic takes only one argument, which
is a pointer to the memory.
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	13598
				13599	Semantics:
				13600	""""""""""
				13601
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	13602	Returns another pointer that aliases its argument but which is considered different
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	13603	for the purposes of ``load``/``store`` ``invariant.group`` metadata.
Piotr Padlewski	5dde809	2018-05-03 11:03:01 +0000	[diff] [blame]	13604	It does not read any accessible memory and the execution can be speculated.
Piotr Padlewski	6c15ec4	2015-09-15 18:32:14 +0000	[diff] [blame]	13605
Piotr Padlewski	5b3db45	2018-07-02 04:49:30 +0000	[diff] [blame]	13606	'``llvm.strip.invariant.group``' Intrinsic
				13607	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13608
				13609	Syntax:
				13610	"""""""
				13611	This is an overloaded intrinsic. The memory object can belong to any address
				13612	space. The returned pointer must belong to the same address space as the
				13613	argument.
				13614
				13615	::
				13616
				13617	declare i8* @llvm.strip.invariant.group.p0i8(i8* <ptr>)
				13618
				13619	Overview:
				13620	"""""""""
				13621
				13622	The '``llvm.strip.invariant.group``' intrinsic can be used when an invariant
				13623	established by ``invariant.group`` metadata no longer holds, to obtain a new pointer
				13624	value that does not carry the invariant information. It is an experimental
				13625	intrinsic, which means that its semantics might change in the future.
				13626
				13627
				13628	Arguments:
				13629	""""""""""
				13630
The ``llvm.strip.invariant.group`` intrinsic takes only one argument, which
is a pointer to the memory.
				13633
				13634	Semantics:
				13635	""""""""""
				13636
				13637	Returns another pointer that aliases its argument but which has no associated
				13638	``invariant.group`` metadata.
				13639	It does not read any memory and can be speculated.
				13640
				13641
				13642
Sanjay Patel	54b161e	2018-03-20 16:38:22 +0000	[diff] [blame]	13643	.. _constrainedfp:
				13644
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13645	Constrained Floating-Point Intrinsics
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13646	-------------------------------------
				13647
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13648	These intrinsics are used to provide special handling of floating-point
				13649	operations when specific rounding mode or floating-point exception behavior is
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13650	required. By default, LLVM optimization passes assume that the rounding mode is
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13651	round-to-nearest and that floating-point exceptions will not be monitored.
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13652	Constrained FP intrinsics are used to support non-default rounding modes and
				13653	accurately preserve exception behavior without compromising LLVM's ability to
				13654	optimize FP code when the default behavior is used.
				13655
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13656	Each of these intrinsics corresponds to a normal floating-point operation. The
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13657	first two arguments and the return value are the same as the corresponding FP
				13658	operation.
				13659
				13660	The third argument is a metadata argument specifying the rounding mode to be
				13661	assumed. This argument must be one of the following strings:
				13662
				13663	::
Andrew Kaylor	73b4a9a	2017-04-20 18:18:36 +0000	[diff] [blame]	13664
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13665	"round.dynamic"
				13666	"round.tonearest"
				13667	"round.downward"
				13668	"round.upward"
				13669	"round.towardzero"
				13670
				13671	If this argument is "round.dynamic" optimization passes must assume that the
				13672	rounding mode is unknown and may change at runtime. No transformations that
				13673	depend on rounding mode may be performed in this case.
				13674
				13675	The other possible values for the rounding mode argument correspond to the
				13676	similarly named IEEE rounding modes. If the argument is any of these values
				13677	optimization passes may perform transformations as long as they are consistent
				13678	with the specified rounding mode.
				13679
				13680	For example, 'x-0'->'x' is not a valid transformation if the rounding mode is
				13681	"round.downward" or "round.dynamic" because if the value of 'x' is +0 then
				13682	'x-0' should evaluate to '-0' when rounding downward. However, this
				13683	transformation is legal for all other rounding modes.
				13684
				13685	For values other than "round.dynamic" optimization passes may assume that the
				13686	actual runtime rounding mode (as defined in a target-specific manner) matches
				13687	the specified rounding mode, but this is not guaranteed. Using a specific
				13688	non-dynamic rounding mode which does not match the actual rounding mode at
				13689	runtime results in undefined behavior.
				13690
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13691	The fourth argument to the constrained floating-point intrinsics specifies the
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13692	required exception behavior. This argument must be one of the following
				13693	strings:
				13694
				13695	::
Andrew Kaylor	73b4a9a	2017-04-20 18:18:36 +0000	[diff] [blame]	13696
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13697	"fpexcept.ignore"
				13698	"fpexcept.maytrap"
				13699	"fpexcept.strict"
				13700
				13701	If this argument is "fpexcept.ignore" optimization passes may assume that the
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13702	exception status flags will not be read and that floating-point exceptions will
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13703	be masked. This allows transformations to be performed that may change the
				13704	exception semantics of the original code. For example, FP operations may be
				13705	speculatively executed in this case whereas they must not be for either of the
				13706	other possible values of this argument.
				13707
				13708	If the exception behavior argument is "fpexcept.maytrap" optimization passes
				13709	must avoid transformations that may raise exceptions that would not have been
				13710	raised by the original code (such as speculatively executing FP operations), but
				13711	passes are not required to preserve all exceptions that are implied by the
				13712	original code. For example, exceptions may be potentially hidden by constant
				13713	folding.
				13714
				13715	If the exception behavior argument is "fpexcept.strict" all transformations must
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13716	strictly preserve the floating-point exception semantics of the original code.
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13717	Any FP exception that would have been raised by the original code must be raised
				13718	by the transformed code, and the transformed code must not raise any FP
				13719	exceptions that would not have been raised by the original code. This is the
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	13720	exception behavior argument that will be used if the code being compiled reads
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13721	the FP exception status flags, but this mode can also be used with code that
				13722	unmasks FP exceptions.
				13723
The number and order of floating-point exceptions are NOT guaranteed. For
example, a series of FP operations that each may raise exceptions may be
vectorized into a single instruction that raises each unique exception a single
time.
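
The exception behavior values described above can be contrasted on a single
operation. The following is a minimal sketch, not taken from any particular
front end: the ``.f64`` overload suffix and the function names
(``@div_ignore``, ``@div_maytrap``, ``@div_strict``) are assumptions chosen for
illustration. It shows only how the argument is spelled, not what any given
pass will actually do.

.. code-block:: llvm

      declare double @llvm.experimental.constrained.fdiv.f64(double, double,
                                                             metadata, metadata)

      ; Exceptions are assumed masked and unobserved, so this call may be
      ; speculatively executed.
      define double @div_ignore(double %a, double %b) {
        %q = call double @llvm.experimental.constrained.fdiv.f64(
                             double %a, double %b,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.ignore")
        ret double %q
      }

      ; No new exceptions may be introduced (no speculation), but an
      ; exception implied by the original code may disappear.
      define double @div_maytrap(double %a, double %b) {
        %q = call double @llvm.experimental.constrained.fdiv.f64(
                             double %a, double %b,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.maytrap")
        ret double %q
      }

      ; Exceptions raised by the original code must be raised by the
      ; transformed code, and no others.
      define double @div_strict(double %a, double %b) {
        %q = call double @llvm.experimental.constrained.fdiv.f64(
                             double %a, double %b,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.strict")
        ret double %q
      }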
				13728
				13729
				13730	'``llvm.experimental.constrained.fadd``' Intrinsic
				13731	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13732
				13733	Syntax:
				13734	"""""""
				13735
::

      declare <type>
      @llvm.experimental.constrained.fadd(<type> <op1>, <type> <op2>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13742
				13743	Overview:
				13744	"""""""""
				13745
				13746	The '``llvm.experimental.constrained.fadd``' intrinsic returns the sum of its
				13747	two operands.
				13748
				13749
				13750	Arguments:
				13751	""""""""""
				13752
				13753	The first two arguments to the '``llvm.experimental.constrained.fadd``'
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13754	intrinsic must be :ref:`floating-point <t_floating>` or :ref:`vector <t_vector>`
				13755	of floating-point values. Both arguments must have identical types.
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13756
				13757	The third and fourth arguments specify the rounding mode and exception
				13758	behavior as described above.
				13759
				13760	Semantics:
				13761	""""""""""
				13762
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13763	The value produced is the floating-point sum of the two value operands and has
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13764	the same type as the operands.
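
As an illustration, the sketch below calls the intrinsic on a scalar and on a
vector. The ``.f32`` and ``.v4f32`` overload suffixes and the function names
``@add_scalar`` and ``@add_vector`` are assumptions made for this example.

.. code-block:: llvm

      declare float @llvm.experimental.constrained.fadd.f32(float, float,
                                                            metadata, metadata)
      declare <4 x float> @llvm.experimental.constrained.fadd.v4f32(
                              <4 x float>, <4 x float>, metadata, metadata)

      define float @add_scalar(float %a, float %b) {
        ; Both value operands and the result have type float.
        %sum = call float @llvm.experimental.constrained.fadd.f32(
                              float %a, float %b,
                              metadata !"round.dynamic",
                              metadata !"fpexcept.strict")
        ret float %sum
      }

      define <4 x float> @add_vector(<4 x float> %a, <4 x float> %b) {
        ; The vector form adds the operands element by element.
        %sum = call <4 x float> @llvm.experimental.constrained.fadd.v4f32(
                                    <4 x float> %a, <4 x float> %b,
                                    metadata !"round.dynamic",
                                    metadata !"fpexcept.strict")
        ret <4 x float> %sum
      }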
				13765
				13766
				13767	'``llvm.experimental.constrained.fsub``' Intrinsic
				13768	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13769
				13770	Syntax:
				13771	"""""""
				13772
::

      declare <type>
      @llvm.experimental.constrained.fsub(<type> <op1>, <type> <op2>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13779
				13780	Overview:
				13781	"""""""""
				13782
				13783	The '``llvm.experimental.constrained.fsub``' intrinsic returns the difference
				13784	of its two operands.
				13785
				13786
				13787	Arguments:
				13788	""""""""""
				13789
				13790	The first two arguments to the '``llvm.experimental.constrained.fsub``'
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13791	intrinsic must be :ref:`floating-point <t_floating>` or :ref:`vector <t_vector>`
				13792	of floating-point values. Both arguments must have identical types.
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13793
				13794	The third and fourth arguments specify the rounding mode and exception
				13795	behavior as described above.
				13796
				13797	Semantics:
				13798	""""""""""
				13799
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13800	The value produced is the floating-point difference of the two value operands
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13801	and has the same type as the operands.
				13802
				13803
				13804	'``llvm.experimental.constrained.fmul``' Intrinsic
				13805	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13806
				13807	Syntax:
				13808	"""""""
				13809
::

      declare <type>
      @llvm.experimental.constrained.fmul(<type> <op1>, <type> <op2>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13816
				13817	Overview:
				13818	"""""""""
				13819
				13820	The '``llvm.experimental.constrained.fmul``' intrinsic returns the product of
				13821	its two operands.
				13822
				13823
				13824	Arguments:
				13825	""""""""""
				13826
				13827	The first two arguments to the '``llvm.experimental.constrained.fmul``'
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13828	intrinsic must be :ref:`floating-point <t_floating>` or :ref:`vector <t_vector>`
				13829	of floating-point values. Both arguments must have identical types.
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13830
				13831	The third and fourth arguments specify the rounding mode and exception
				13832	behavior as described above.
				13833
				13834	Semantics:
				13835	""""""""""
				13836
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13837	The value produced is the floating-point product of the two value operands and
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13838	has the same type as the operands.
				13839
				13840
				13841	'``llvm.experimental.constrained.fdiv``' Intrinsic
				13842	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13843
				13844	Syntax:
				13845	"""""""
				13846
::

      declare <type>
      @llvm.experimental.constrained.fdiv(<type> <op1>, <type> <op2>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13853
				13854	Overview:
				13855	"""""""""
				13856
				13857	The '``llvm.experimental.constrained.fdiv``' intrinsic returns the quotient of
				13858	its two operands.
				13859
				13860
				13861	Arguments:
				13862	""""""""""
				13863
				13864	The first two arguments to the '``llvm.experimental.constrained.fdiv``'
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13865	intrinsic must be :ref:`floating-point <t_floating>` or :ref:`vector <t_vector>`
				13866	of floating-point values. Both arguments must have identical types.
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13867
				13868	The third and fourth arguments specify the rounding mode and exception
				13869	behavior as described above.
				13870
				13871	Semantics:
				13872	""""""""""
				13873
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13874	The value produced is the floating-point quotient of the two value operands and
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13875	has the same type as the operands.
				13876
				13877
				13878	'``llvm.experimental.constrained.frem``' Intrinsic
				13879	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13880
				13881	Syntax:
				13882	"""""""
				13883
::

      declare <type>
      @llvm.experimental.constrained.frem(<type> <op1>, <type> <op2>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13890
				13891	Overview:
				13892	"""""""""
				13893
				13894	The '``llvm.experimental.constrained.frem``' intrinsic returns the remainder
				13895	from the division of its two operands.
				13896
				13897
				13898	Arguments:
				13899	""""""""""
				13900
				13901	The first two arguments to the '``llvm.experimental.constrained.frem``'
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13902	intrinsic must be :ref:`floating-point <t_floating>` or :ref:`vector <t_vector>`
				13903	of floating-point values. Both arguments must have identical types.
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13904
				13905	The third and fourth arguments specify the rounding mode and exception
				13906	behavior as described above. The rounding mode argument has no effect, since
				13907	the result of frem is never rounded, but the argument is included for
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13908	consistency with the other constrained floating-point intrinsics.
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13909
				13910	Semantics:
				13911	""""""""""
				13912
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13913	The value produced is the floating-point remainder from the division of the two
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13914	value operands and has the same type as the operands. The remainder has the
Jonas Devlieghere	aaecdc4	2017-11-06 11:47:24 +0000	[diff] [blame]	13915	same sign as the dividend.
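
The sign rule can be seen with constant operands. This is a minimal sketch; the
``.f64`` overload suffix and the function name ``@rem_example`` are assumptions
made for illustration.

.. code-block:: llvm

      declare double @llvm.experimental.constrained.frem.f64(double, double,
                                                             metadata, metadata)

      define double @rem_example() {
        ; -7.0 = 3.0 * -2.0 + -1.0, so the remainder is -1.0: it takes the
        ; sign of the dividend (-7.0), not of the divisor.
        %r = call double @llvm.experimental.constrained.frem.f64(
                             double -7.0, double 3.0,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.strict")
        ret double %r
      }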
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13916
Wei Ding	a131d3f	2017-08-24 04:18:24 +0000	[diff] [blame]	13917	'``llvm.experimental.constrained.fma``' Intrinsic
				13918	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13919
				13920	Syntax:
				13921	"""""""
				13922
::

      declare <type>
      @llvm.experimental.constrained.fma(<type> <op1>, <type> <op2>, <type> <op3>,
                                         metadata <rounding mode>,
                                         metadata <exception behavior>)
				13929
				13930	Overview:
				13931	"""""""""
				13932
				13933	The '``llvm.experimental.constrained.fma``' intrinsic returns the result of a
				13934	fused-multiply-add operation on its operands.
				13935
				13936	Arguments:
				13937	""""""""""
				13938
				13939	The first three arguments to the '``llvm.experimental.constrained.fma``'
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13940	intrinsic must be :ref:`floating-point <t_floating>` or :ref:`vector
				13941	<t_vector>` of floating-point values. All arguments must have identical types.
Wei Ding	a131d3f	2017-08-24 04:18:24 +0000	[diff] [blame]	13942
				13943	The fourth and fifth arguments specify the rounding mode and exception behavior
				13944	as described above.
				13945
				13946	Semantics:
				13947	""""""""""
				13948
				13949	The result produced is the product of the first two operands added to the third
				13950	operand computed with infinite precision, and then rounded to the target
				13951	precision.
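
The single rounding step is what distinguishes the fused operation from a
separate multiply and add. The sketch below places the two forms side by side;
the ``.f32`` overload suffix and the function names ``@fused`` and ``@unfused``
are assumptions made for illustration.

.. code-block:: llvm

      declare float @llvm.experimental.constrained.fma.f32(float, float, float,
                                                           metadata, metadata)
      declare float @llvm.experimental.constrained.fmul.f32(float, float,
                                                            metadata, metadata)
      declare float @llvm.experimental.constrained.fadd.f32(float, float,
                                                            metadata, metadata)

      define float @fused(float %a, float %b, float %c) {
        ; (a * b) + c is computed exactly and rounded once.
        %r = call float @llvm.experimental.constrained.fma.f32(
                            float %a, float %b, float %c,
                            metadata !"round.tonearest",
                            metadata !"fpexcept.strict")
        ret float %r
      }

      define float @unfused(float %a, float %b, float %c) {
        ; The product is rounded first, then the sum is rounded again,
        ; so the result may differ from @fused in the last bit.
        %p = call float @llvm.experimental.constrained.fmul.f32(
                            float %a, float %b,
                            metadata !"round.tonearest",
                            metadata !"fpexcept.strict")
        %s = call float @llvm.experimental.constrained.fadd.f32(
                            float %p, float %c,
                            metadata !"round.tonearest",
                            metadata !"fpexcept.strict")
        ret float %s
      }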
Andrew Kaylor	a0a1164	2017-01-26 23:27:59 +0000	[diff] [blame]	13952
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	13953	Constrained libm-equivalent Intrinsics
				13954	--------------------------------------
				13955
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13956	In addition to the basic floating-point operations for which constrained
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	13957	intrinsics are described above, there are constrained versions of various
				13958	operations which provide equivalent behavior to a corresponding libm function.
				13959	These intrinsics allow the precise behavior of these operations with respect to
				13960	rounding mode and exception behavior to be controlled.
				13961
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13962	As with the basic constrained floating-point intrinsics, the rounding mode
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	13963	and exception behavior arguments only control the behavior of the optimizer.
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	13964	They do not change the runtime floating-point environment.
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	13965
				13966
				13967	'``llvm.experimental.constrained.sqrt``' Intrinsic
				13968	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				13969
				13970	Syntax:
				13971	"""""""
				13972
::

      declare <type>
      @llvm.experimental.constrained.sqrt(<type> <op1>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
				13979
				13980	Overview:
				13981	"""""""""
				13982
				13983	The '``llvm.experimental.constrained.sqrt``' intrinsic returns the square root
				13984	of the specified value, returning the same value as the libm '``sqrt``'
				13985	functions would, but without setting ``errno``.
				13986
				13987	Arguments:
				13988	""""""""""
				13989
The first argument and the return value are floating-point numbers of the same
type.
				13992
				13993	The second and third arguments specify the rounding mode and exception
				13994	behavior as described above.
				13995
				13996	Semantics:
				13997	""""""""""
				13998
				13999	This function returns the nonnegative square root of the specified value.
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14000	If the value is less than negative zero, a floating-point exception occurs
Hiroshi Inoue	760c0c9	2018-01-16 13:19:48 +0000	[diff] [blame]	14001	and the return value is architecture specific.
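
A minimal sketch of a call follows. The ``.f64`` overload suffix and the
function name ``@root`` are assumptions made for illustration.

.. code-block:: llvm

      declare double @llvm.experimental.constrained.sqrt.f64(double,
                                                             metadata, metadata)

      define double @root(double %x) {
        ; With "fpexcept.strict", the exception raised for a negative %x
        ; must be preserved, so this call cannot be hoisted above a guard
        ; that tests the sign of %x.
        %r = call double @llvm.experimental.constrained.sqrt.f64(
                             double %x,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.strict")
        ret double %r
      }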
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14002
				14003
				14004	'``llvm.experimental.constrained.pow``' Intrinsic
				14005	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14006
				14007	Syntax:
				14008	"""""""
				14009
::

      declare <type>
      @llvm.experimental.constrained.pow(<type> <op1>, <type> <op2>,
                                         metadata <rounding mode>,
                                         metadata <exception behavior>)
				14016
				14017	Overview:
				14018	"""""""""
				14019
				14020	The '``llvm.experimental.constrained.pow``' intrinsic returns the first operand
				14021	raised to the (positive or negative) power specified by the second operand.
				14022
				14023	Arguments:
				14024	""""""""""
				14025
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14026	The first two arguments and the return value are floating-point numbers of the
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14027	same type. The second argument specifies the power to which the first argument
				14028	should be raised.
				14029
				14030	The third and fourth arguments specify the rounding mode and exception
				14031	behavior as described above.
				14032
				14033	Semantics:
				14034	""""""""""
				14035
This function returns the first value raised to the power given by the second
value, returning the same values as the libm ``pow`` functions would, and
handles error conditions in the same way.
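
A minimal sketch of a call, assuming the ``.f64`` overload suffix and a
hypothetical function name ``@power``:

.. code-block:: llvm

      declare double @llvm.experimental.constrained.pow.f64(double, double,
                                                            metadata, metadata)

      define double @power(double %base, double %exponent) {
        ; Both value operands and the result share the type double.
        %r = call double @llvm.experimental.constrained.pow.f64(
                             double %base, double %exponent,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.maytrap")
        ret double %r
      }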
				14039
				14040
				14041	'``llvm.experimental.constrained.powi``' Intrinsic
				14042	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14043
				14044	Syntax:
				14045	"""""""
				14046
::

      declare <type>
      @llvm.experimental.constrained.powi(<type> <op1>, i32 <op2>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
				14053
				14054	Overview:
				14055	"""""""""
				14056
				14057	The '``llvm.experimental.constrained.powi``' intrinsic returns the first operand
				14058	raised to the (positive or negative) power specified by the second operand. The
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14059	order of evaluation of multiplications is not defined. When a vector of
				14060	floating-point type is used, the second argument remains a scalar integer value.
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14061
				14062
				14063	Arguments:
				14064	""""""""""
				14065
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14066	The first argument and the return value are floating-point numbers of the same
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14067	type. The second argument is a 32-bit signed integer specifying the power to
				14068	which the first argument should be raised.
				14069
				14070	The third and fourth arguments specify the rounding mode and exception
				14071	behavior as described above.
				14072
				14073	Semantics:
				14074	""""""""""
				14075
This function returns the first value raised to the power given by the second
value, with an unspecified sequence of rounding operations.
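
The sketch below shows that the exponent stays a scalar ``i32`` even when the
base is a vector. The ``.f64`` and ``.v2f64`` overload suffixes and the
function names ``@cube`` and ``@cube_vec`` are assumptions made for
illustration.

.. code-block:: llvm

      declare double @llvm.experimental.constrained.powi.f64(double, i32,
                                                             metadata, metadata)
      declare <2 x double> @llvm.experimental.constrained.powi.v2f64(
                               <2 x double>, i32, metadata, metadata)

      define double @cube(double %x) {
        ; The multiplications used to form %x * %x * %x may be evaluated
        ; in any order.
        %r = call double @llvm.experimental.constrained.powi.f64(
                             double %x, i32 3,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.strict")
        ret double %r
      }

      define <2 x double> @cube_vec(<2 x double> %v) {
        ; Vector base, scalar integer exponent.
        %r = call <2 x double> @llvm.experimental.constrained.powi.v2f64(
                                   <2 x double> %v, i32 3,
                                   metadata !"round.dynamic",
                                   metadata !"fpexcept.strict")
        ret <2 x double> %r
      }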
				14078
				14079
				14080	'``llvm.experimental.constrained.sin``' Intrinsic
				14081	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14082
				14083	Syntax:
				14084	"""""""
				14085
::

      declare <type>
      @llvm.experimental.constrained.sin(<type> <op1>,
                                         metadata <rounding mode>,
                                         metadata <exception behavior>)
				14092
				14093	Overview:
				14094	"""""""""
				14095
				14096	The '``llvm.experimental.constrained.sin``' intrinsic returns the sine of the
				14097	first operand.
				14098
				14099	Arguments:
				14100	""""""""""
				14101
The first argument and the return value are floating-point numbers of the same
type.
				14104
				14105	The second and third arguments specify the rounding mode and exception
				14106	behavior as described above.
				14107
				14108	Semantics:
				14109	""""""""""
				14110
				14111	This function returns the sine of the specified operand, returning the
				14112	same values as the libm ``sin`` functions would, and handles error
				14113	conditions in the same way.
				14114
				14115
				14116	'``llvm.experimental.constrained.cos``' Intrinsic
				14117	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14118
				14119	Syntax:
				14120	"""""""
				14121
::

      declare <type>
      @llvm.experimental.constrained.cos(<type> <op1>,
                                         metadata <rounding mode>,
                                         metadata <exception behavior>)
				14128
				14129	Overview:
				14130	"""""""""
				14131
				14132	The '``llvm.experimental.constrained.cos``' intrinsic returns the cosine of the
				14133	first operand.
				14134
				14135	Arguments:
				14136	""""""""""
				14137
The first argument and the return value are floating-point numbers of the same
type.
				14140
				14141	The second and third arguments specify the rounding mode and exception
				14142	behavior as described above.
				14143
				14144	Semantics:
				14145	""""""""""
				14146
				14147	This function returns the cosine of the specified operand, returning the
				14148	same values as the libm ``cos`` functions would, and handles error
				14149	conditions in the same way.
				14150
				14151
				14152	'``llvm.experimental.constrained.exp``' Intrinsic
				14153	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14154
				14155	Syntax:
				14156	"""""""
				14157
::

      declare <type>
      @llvm.experimental.constrained.exp(<type> <op1>,
                                         metadata <rounding mode>,
                                         metadata <exception behavior>)
				14164
				14165	Overview:
				14166	"""""""""
				14167
				14168	The '``llvm.experimental.constrained.exp``' intrinsic computes the base-e
				14169	exponential of the specified value.
				14170
				14171	Arguments:
				14172	""""""""""
				14173
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14174	The first argument and the return value are floating-point numbers of the same
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14175	type.
				14176
				14177	The second and third arguments specify the rounding mode and exception
				14178	behavior as described above.
				14179
				14180	Semantics:
				14181	""""""""""
				14182
				14183	This function returns the same values as the libm ``exp`` functions
				14184	would, and handles error conditions in the same way.
				14185
				14186
				14187	'``llvm.experimental.constrained.exp2``' Intrinsic
				14188	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14189
				14190	Syntax:
				14191	"""""""
				14192
::

      declare <type>
      @llvm.experimental.constrained.exp2(<type> <op1>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
				14199
				14200	Overview:
				14201	"""""""""
				14202
				14203	The '``llvm.experimental.constrained.exp2``' intrinsic computes the base-2
				14204	exponential of the specified value.
				14205
				14206
				14207	Arguments:
				14208	""""""""""
				14209
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14210	The first argument and the return value are floating-point numbers of the same
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14211	type.
				14212
				14213	The second and third arguments specify the rounding mode and exception
				14214	behavior as described above.
				14215
				14216	Semantics:
				14217	""""""""""
				14218
				14219	This function returns the same values as the libm ``exp2`` functions
				14220	would, and handles error conditions in the same way.
				14221
				14222
				14223	'``llvm.experimental.constrained.log``' Intrinsic
				14224	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14225
				14226	Syntax:
				14227	"""""""
				14228
::

      declare <type>
      @llvm.experimental.constrained.log(<type> <op1>,
                                         metadata <rounding mode>,
                                         metadata <exception behavior>)
				14235
				14236	Overview:
				14237	"""""""""
				14238
				14239	The '``llvm.experimental.constrained.log``' intrinsic computes the base-e
				14240	logarithm of the specified value.
				14241
				14242	Arguments:
				14243	""""""""""
				14244
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14245	The first argument and the return value are floating-point numbers of the same
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14246	type.
				14247
				14248	The second and third arguments specify the rounding mode and exception
				14249	behavior as described above.
				14250
				14251
				14252	Semantics:
				14253	""""""""""
				14254
				14255	This function returns the same values as the libm ``log`` functions
				14256	would, and handles error conditions in the same way.
				14257
				14258
				14259	'``llvm.experimental.constrained.log10``' Intrinsic
				14260	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14261
				14262	Syntax:
				14263	"""""""
				14264
::

      declare <type>
      @llvm.experimental.constrained.log10(<type> <op1>,
                                           metadata <rounding mode>,
                                           metadata <exception behavior>)
				14271
				14272	Overview:
				14273	"""""""""
				14274
				14275	The '``llvm.experimental.constrained.log10``' intrinsic computes the base-10
				14276	logarithm of the specified value.
				14277
				14278	Arguments:
				14279	""""""""""
				14280
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14281	The first argument and the return value are floating-point numbers of the same
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14282	type.
				14283
				14284	The second and third arguments specify the rounding mode and exception
				14285	behavior as described above.
				14286
				14287	Semantics:
				14288	""""""""""
				14289
				14290	This function returns the same values as the libm ``log10`` functions
				14291	would, and handles error conditions in the same way.
				14292
				14293
				14294	'``llvm.experimental.constrained.log2``' Intrinsic
				14295	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14296
				14297	Syntax:
				14298	"""""""
				14299
::

      declare <type>
      @llvm.experimental.constrained.log2(<type> <op1>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
				14306
				14307	Overview:
				14308	"""""""""
				14309
				14310	The '``llvm.experimental.constrained.log2``' intrinsic computes the base-2
				14311	logarithm of the specified value.
				14312
				14313	Arguments:
				14314	""""""""""
				14315
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14316	The first argument and the return value are floating-point numbers of the same
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14317	type.
				14318
				14319	The second and third arguments specify the rounding mode and exception
				14320	behavior as described above.
				14321
				14322	Semantics:
				14323	""""""""""
				14324
				14325	This function returns the same values as the libm ``log2`` functions
				14326	would, and handles error conditions in the same way.
				14327
				14328
				14329	'``llvm.experimental.constrained.rint``' Intrinsic
				14330	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14331
				14332	Syntax:
				14333	"""""""
				14334
::

      declare <type>
      @llvm.experimental.constrained.rint(<type> <op1>,
                                          metadata <rounding mode>,
                                          metadata <exception behavior>)
				14341
				14342	Overview:
				14343	"""""""""
				14344
				14345	The '``llvm.experimental.constrained.rint``' intrinsic returns the first
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14346	operand rounded to the nearest integer. It may raise an inexact floating-point
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14347	exception if the operand is not an integer.
				14348
				14349	Arguments:
				14350	""""""""""
				14351
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14352	The first argument and the return value are floating-point numbers of the same
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14353	type.
				14354
				14355	The second and third arguments specify the rounding mode and exception
				14356	behavior as described above.
				14357
				14358	Semantics:
				14359	""""""""""
				14360
				14361	This function returns the same values as the libm ``rint`` functions
				14362	would, and handles error conditions in the same way. The rounding mode is
				14363	described, not determined, by the rounding mode argument. The actual rounding
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14364	mode is determined by the runtime floating-point environment. The rounding
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14365	mode argument is only intended as information to the compiler.
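
The distinction between describing and determining the rounding mode can be
sketched as follows. The ``.f64`` overload suffix and the function names
``@rint_unknown`` and ``@rint_nearest`` are assumptions made for illustration.

.. code-block:: llvm

      declare double @llvm.experimental.constrained.rint.f64(double,
                                                             metadata, metadata)

      define double @rint_unknown(double %x) {
        ; "round.dynamic": the compiler is told nothing about the mode; the
        ; result depends on whatever mode the environment has at run time.
        %r = call double @llvm.experimental.constrained.rint.f64(
                             double %x,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.strict")
        ret double %r
      }

      define double @rint_nearest(double %x) {
        ; "round.tonearest": the compiler is told that round-to-nearest is in
        ; effect here. The call does not switch the environment to that mode.
        %r = call double @llvm.experimental.constrained.rint.f64(
                             double %x,
                             metadata !"round.tonearest",
                             metadata !"fpexcept.strict")
        ret double %r
      }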
				14366
				14367
				14368	'``llvm.experimental.constrained.nearbyint``' Intrinsic
				14369	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14370
				14371	Syntax:
				14372	"""""""
				14373
::

      declare <type>
      @llvm.experimental.constrained.nearbyint(<type> <op1>,
                                               metadata <rounding mode>,
                                               metadata <exception behavior>)
				14380
				14381	Overview:
				14382	"""""""""
				14383
				14384	The '``llvm.experimental.constrained.nearbyint``' intrinsic returns the first
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14385	operand rounded to the nearest integer. It will not raise an inexact
				14386	floating-point exception if the operand is not an integer.
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14387
				14388
				14389	Arguments:
				14390	""""""""""
				14391
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14392	The first argument and the return value are floating-point numbers of the same
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14393	type.
				14394
				14395	The second and third arguments specify the rounding mode and exception
				14396	behavior as described above.
				14397
				14398	Semantics:
				14399	""""""""""
				14400
				14401	This function returns the same values as the libm ``nearbyint`` functions
				14402	would, and handles error conditions in the same way. The rounding mode is
				14403	described, not determined, by the rounding mode argument. The actual rounding
Sanjay Patel	85fa9ef	2018-03-21 14:15:33 +0000	[diff] [blame]	14404	mode is determined by the runtime floating-point environment. The rounding
Andrew Kaylor	f466001	2017-05-25 21:31:00 +0000	[diff] [blame]	14405	mode argument is only intended as information to the compiler.
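
A minimal sketch of a call follows, assuming the ``.f64`` overload suffix and
a hypothetical function name ``@to_integral``. The difference from
'``llvm.experimental.constrained.rint``' lies only in the inexact exception.

.. code-block:: llvm

      declare double @llvm.experimental.constrained.nearbyint.f64(
                         double, metadata, metadata)

      define double @to_integral(double %x) {
        ; Unlike the rint intrinsic, this call does not raise the inexact
        ; exception when %x is not already an integer, so code that later
        ; reads the status flags sees no inexact flag from it.
        %r = call double @llvm.experimental.constrained.nearbyint.f64(
                             double %x,
                             metadata !"round.dynamic",
                             metadata !"fpexcept.strict")
        ret double %r
      }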
				14406
				14407
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14408	General Intrinsics
				14409	------------------
				14410
				14411	This class of intrinsics is designed to be generic and has no specific
				14412	purpose.
				14413
				14414	'``llvm.var.annotation``' Intrinsic
				14415	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14416
				14417	Syntax:
				14418	"""""""
				14419
::

      declare void @llvm.var.annotation(i8* <val>, i8* <str>, i8* <str>, i32 <int>)
				14423
				14424	Overview:
				14425	"""""""""
				14426
The '``llvm.var.annotation``' intrinsic annotates a local variable with an
arbitrary string.
				14428
				14429	Arguments:
				14430	""""""""""
				14431
				14432	The first argument is a pointer to a value, the second is a pointer to a
				14433	global string, the third is a pointer to a global string which is the
				14434	source file name, and the last argument is the line number.
				14435
				14436	Semantics:
				14437	""""""""""
				14438
				14439	This intrinsic allows annotation of local variables with arbitrary
				14440	strings. This can be useful for special purpose optimizations that want
				14441	to look for these annotations. These have no other defined use; they are
				14442	ignored by code generation and optimization.
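
As an illustration, the sketch below annotates a stack slot. The global names
``@.note`` and ``@.file``, the string contents, the line number, and the
function name ``@f`` are all hypothetical.

.. code-block:: llvm

      @.note = private unnamed_addr constant [8 x i8] c"my_note\00"
      @.file = private unnamed_addr constant [7 x i8] c"file.c\00"

      declare void @llvm.var.annotation(i8*, i8*, i8*, i32)

      define void @f() {
        %x = alloca i32
        ; The first argument is an i8*, so the variable's address is cast.
        %x.i8 = bitcast i32* %x to i8*
        call void @llvm.var.annotation(
            i8* %x.i8,
            i8* getelementptr inbounds ([8 x i8], [8 x i8]* @.note, i32 0, i32 0),
            i8* getelementptr inbounds ([7 x i8], [7 x i8]* @.file, i32 0, i32 0),
            i32 3)
        ret void
      }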
				14443
Michael Gottesman	88d1883	2013-03-26 00:34:27 +0000	[diff] [blame]	14444	'``llvm.ptr.annotation.*``' Intrinsic
				14445	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14446
				14447	Syntax:
				14448	"""""""
				14449
This is an overloaded intrinsic. You can use '``llvm.ptr.annotation``' on a
pointer to an integer of any width. Note that you must specify an address space
for the pointer. The identifier for the default address space is the integer
'``0``'.
				14454
::

      declare i8*   @llvm.ptr.annotation.p<address space>i8(i8* <val>, i8* <str>, i8* <str>, i32 <int>)
      declare i16*  @llvm.ptr.annotation.p<address space>i16(i16* <val>, i8* <str>, i8* <str>, i32 <int>)
      declare i32*  @llvm.ptr.annotation.p<address space>i32(i32* <val>, i8* <str>, i8* <str>, i32 <int>)
      declare i64*  @llvm.ptr.annotation.p<address space>i64(i64* <val>, i8* <str>, i8* <str>, i32 <int>)
      declare i256* @llvm.ptr.annotation.p<address space>i256(i256* <val>, i8* <str>, i8* <str>, i32 <int>)
				14462
				14463	Overview:
				14464	"""""""""
				14465
The '``llvm.ptr.annotation``' intrinsic attaches an arbitrary string
annotation to a pointer to an integer.
				14467
				14468	Arguments:
				14469	""""""""""
				14470
				14471	The first argument is a pointer to an integer value of arbitrary bitwidth
				14472	(result of some expression), the second is a pointer to a global string, the
				14473	third is a pointer to a global string which is the source file name, and the
				14474	last argument is the line number. It returns the value of the first argument.
				14475
				14476	Semantics:
				14477	""""""""""
				14478
				14479	This intrinsic allows annotation of a pointer to an integer with arbitrary
				14480	strings. This can be useful for special purpose optimizations that want to look
				14481	for these annotations. These have no other defined use; they are ignored by code
				14482	generation and optimization.
				14483
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14484	'``llvm.annotation.*``' Intrinsic
				14485	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14486
				14487	Syntax:
				14488	"""""""
				14489
				14490	This is an overloaded intrinsic. You can use '``llvm.annotation``' on
				14491	any integer bit width.
				14492
				14493	::
				14494
				14495	declare i8 @llvm.annotation.i8(i8 <val>, i8* <str>, i8* <str>, i32 <int>)
				14496	declare i16 @llvm.annotation.i16(i16 <val>, i8* <str>, i8* <str>, i32 <int>)
				14497	declare i32 @llvm.annotation.i32(i32 <val>, i8* <str>, i8* <str>, i32 <int>)
				14498	declare i64 @llvm.annotation.i64(i64 <val>, i8* <str>, i8* <str>, i32 <int>)
				14499	declare i256 @llvm.annotation.i256(i256 <val>, i8* <str>, i8* <str>, i32 <int>)
				14500
				14501	Overview:
				14502	"""""""""
				14503
The '``llvm.annotation``' intrinsic attaches an arbitrary string annotation
to an integer expression.
				14505
				14506	Arguments:
				14507	""""""""""
				14508
				14509	The first argument is an integer value (result of some expression), the
				14510	second is a pointer to a global string, the third is a pointer to a
				14511	global string which is the source file name, and the last argument is
				14512	the line number. It returns the value of the first argument.
				14513
				14514	Semantics:
				14515	""""""""""
				14516
				14517	This intrinsic allows annotations to be put on arbitrary expressions
				14518	with arbitrary strings. This can be useful for special purpose
				14519	optimizations that want to look for these annotations. These have no
				14520	other defined use; they are ignored by code generation and optimization.
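
The following sketch shows one way an ``i32`` expression might be annotated.
The globals ``@.note`` and ``@.file``, the function ``@g``, and the line number
are hypothetical.

.. code-block:: llvm

      ; Hypothetical annotation string and source file name.
      @.note = private unnamed_addr constant [4 x i8] c"hot\00", section "llvm.metadata"
      @.file = private unnamed_addr constant [7 x i8] c"demo.c\00", section "llvm.metadata"

      declare i32 @llvm.annotation.i32(i32, i8*, i8*, i32)

      define i32 @g(i32 %a, i32 %b) {
      entry:
        %sum = add i32 %a, %b
        ; %tagged has the same value as %sum; the call only attaches the strings.
        %tagged = call i32 @llvm.annotation.i32(i32 %sum,
            i8* getelementptr inbounds ([4 x i8], [4 x i8]* @.note, i32 0, i32 0),
            i8* getelementptr inbounds ([7 x i8], [7 x i8]* @.file, i32 0, i32 0),
            i32 7)
        ret i32 %tagged
      }

Later uses should refer to ``%tagged`` rather than ``%sum`` if the annotation is
meant to stay associated with the value.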
				14521
Reid Kleckner	e33c94f	2017-09-05 20:14:58 +0000	[diff] [blame]	14522	'``llvm.codeview.annotation``' Intrinsic
Reid Kleckner	d452368	2017-09-05 20:26:25 +0000	[diff] [blame]	14523	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Reid Kleckner	e33c94f	2017-09-05 20:14:58 +0000	[diff] [blame]	14524
				14525	Syntax:
				14526	"""""""
				14527
				14528	This annotation emits a label at its program point and an associated
				14529	``S_ANNOTATION`` codeview record with some additional string metadata. This is
				14530	used to implement MSVC's ``__annotation`` intrinsic. It is marked
				14531	``noduplicate``, so calls to this intrinsic prevent inlining and should be
				14532	considered expensive.
				14533
				14534	::
				14535
				14536	declare void @llvm.codeview.annotation(metadata)
				14537
				14538	Arguments:
				14539	""""""""""
				14540
				14541	The argument should be an MDTuple containing any number of MDStrings.
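
A minimal sketch of a call, assuming two illustrative strings; the function
name ``@h`` and the string contents are hypothetical.

.. code-block:: llvm

      declare void @llvm.codeview.annotation(metadata)

      define void @h() {
      entry:
        ; The operand is an MDTuple of MDStrings.
        call void @llvm.codeview.annotation(metadata !0)
        ret void
      }

      !0 = !{!"category", !"message"}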
				14542
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14543	'``llvm.trap``' Intrinsic
				14544	^^^^^^^^^^^^^^^^^^^^^^^^^
				14545
				14546	Syntax:
				14547	"""""""
				14548
				14549	::
				14550
				14551	declare void @llvm.trap() noreturn nounwind
				14552
				14553	Overview:
				14554	"""""""""
				14555
The '``llvm.trap``' intrinsic causes an execution trap and does not return.
				14557
				14558	Arguments:
				14559	""""""""""
				14560
				14561	None.
				14562
				14563	Semantics:
				14564	""""""""""
				14565
				14566	This intrinsic is lowered to the target dependent trap instruction. If
				14567	the target does not have a trap instruction, this intrinsic will be
				14568	lowered to a call of the ``abort()`` function.
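
For illustration only, a frontend might guard an operation with a trap as
sketched below. The function ``@checked_div`` is hypothetical.

.. code-block:: llvm

      declare void @llvm.trap() noreturn nounwind

      define i32 @checked_div(i32 %n, i32 %d) {
      entry:
        %is.zero = icmp eq i32 %d, 0
        br i1 %is.zero, label %fail, label %ok

      fail:
        ; The intrinsic is noreturn, so the block ends in unreachable.
        call void @llvm.trap()
        unreachable

      ok:
        %q = udiv i32 %n, %d
        ret i32 %q
      }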
				14569
				14570	'``llvm.debugtrap``' Intrinsic
				14571	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14572
				14573	Syntax:
				14574	"""""""
				14575
				14576	::
				14577
				14578	declare void @llvm.debugtrap() nounwind
				14579
				14580	Overview:
				14581	"""""""""
				14582
The '``llvm.debugtrap``' intrinsic causes an execution trap in order to
request the attention of a debugger.
				14584
				14585	Arguments:
				14586	""""""""""
				14587
				14588	None.
				14589
				14590	Semantics:
				14591	""""""""""
				14592
				14593	This intrinsic is lowered to code which is intended to cause an
				14594	execution trap with the intention of requesting the attention of a
				14595	debugger.
				14596
				14597	'``llvm.stackprotector``' Intrinsic
				14598	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14599
				14600	Syntax:
				14601	"""""""
				14602
				14603	::
				14604
				14605	declare void @llvm.stackprotector(i8* <guard>, i8** <slot>)
				14606
				14607	Overview:
				14608	"""""""""
				14609
				14610	The ``llvm.stackprotector`` intrinsic takes the ``guard`` and stores it
				14611	onto the stack at ``slot``. The stack slot is adjusted to ensure that it
				14612	is placed on the stack before local variables.
				14613
				14614	Arguments:
				14615	""""""""""
				14616
				14617	The ``llvm.stackprotector`` intrinsic requires two pointer arguments.
				14618	The first argument is the value loaded from the stack guard
``@__stack_chk_guard``. The second argument is an ``alloca`` that has
enough space to hold the value of the guard.
				14621
				14622	Semantics:
				14623	""""""""""
				14624
Michael Gottesman	dafc7d9	2013-08-12 18:35:32 +0000	[diff] [blame]	14625	This intrinsic causes the prologue/epilogue inserter to force the position of
				14626	the ``AllocaInst`` stack slot to be before local variables on the stack. This is
				14627	to ensure that if a local variable on the stack is overwritten, it will destroy
				14628	the value of the guard. When the function exits, the guard on the stack is
				14629	checked against the original guard by ``llvm.stackprotectorcheck``. If they are
				14630	different, then ``llvm.stackprotectorcheck`` causes the program to abort by
				14631	calling the ``__stack_chk_fail()`` function.
				14632
Tim Shen	e885d5e	2016-04-19 19:40:37 +0000	[diff] [blame]	14633	'``llvm.stackguard``' Intrinsic
				14634	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14635
				14636	Syntax:
				14637	"""""""
				14638
				14639	::
				14640
				14641	declare i8* @llvm.stackguard()
				14642
				14643	Overview:
				14644	"""""""""
				14645
				14646	The ``llvm.stackguard`` intrinsic returns the system stack guard value.
				14647
It should not be generated by frontends, since it is intended only for internal
use. The intrinsic exists because the IR form of the stack protector is still
supported in FastISel.
				14651
				14652	Arguments:
				14653	""""""""""
				14654
				14655	None.
				14656
				14657	Semantics:
				14658	""""""""""
				14659
				14660	On some platforms, the value returned by this intrinsic remains unchanged
				14661	between loads in the same thread. On other platforms, it returns the same
				14662	global variable value, if any, e.g. ``@__stack_chk_guard``.
				14663
Currently, some platforms (e.g. X86 Linux) have customized IR-level stack guard
loading that is not handled by ``llvm.stackguard()``, although it should be
in the future.
				14667
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14668	'``llvm.objectsize``' Intrinsic
				14669	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14670
				14671	Syntax:
				14672	"""""""
				14673
				14674	::
				14675
George Burgess IV	56c7e88	2017-03-21 20:08:59 +0000	[diff] [blame]	14676	declare i32 @llvm.objectsize.i32(i8* <object>, i1 <min>, i1 <nullunknown>)
				14677	declare i64 @llvm.objectsize.i64(i8* <object>, i1 <min>, i1 <nullunknown>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14678
				14679	Overview:
				14680	"""""""""
				14681
				14682	The ``llvm.objectsize`` intrinsic is designed to provide information to
				14683	the optimizers to determine at compile time whether a) an operation
				14684	(like memcpy) will overflow a buffer that corresponds to an object, or
				14685	b) that a runtime check for overflow isn't necessary. An object in this
				14686	context means an allocation of a specific class, structure, array, or
				14687	other object.
				14688
				14689	Arguments:
				14690	""""""""""
				14691
George Burgess IV	56c7e88	2017-03-21 20:08:59 +0000	[diff] [blame]	14692	The ``llvm.objectsize`` intrinsic takes three arguments. The first argument is
				14693	a pointer to or into the ``object``. The second argument determines whether
				14694	``llvm.objectsize`` returns 0 (if true) or -1 (if false) when the object size
				14695	is unknown. The third argument controls how ``llvm.objectsize`` acts when
George Burgess IV	3fbfa9c4	2018-07-09 22:21:16 +0000	[diff] [blame]	14696	``null`` in address space 0 is used as its pointer argument. If it's ``false``,
				14697	``llvm.objectsize`` reports 0 bytes available when given ``null``. Otherwise, if
				14698	the ``null`` is in a non-zero address space or if ``true`` is given for the
				14699	third argument of ``llvm.objectsize``, we assume its size is unknown.
George Burgess IV	56c7e88	2017-03-21 20:08:59 +0000	[diff] [blame]	14700
				14701	The second and third arguments only accept constants.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14702
				14703	Semantics:
				14704	""""""""""
				14705
				14706	The ``llvm.objectsize`` intrinsic is lowered to a constant representing
				14707	the size of the object concerned. If the size cannot be determined at
				14708	compile time, ``llvm.objectsize`` returns ``i32/i64 -1 or 0`` (depending
				14709	on the ``min`` argument).
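
As a sketch, the call below queries the size of a 16-byte stack object. The
function ``@size_of_local`` is hypothetical. One would expect the call to be
lowered to the constant ``16`` here, but that expectation is an assumption about
the optimizer and is not stated by the text above.

.. code-block:: llvm

      declare i64 @llvm.objectsize.i64(i8*, i1, i1)

      define i64 @size_of_local() {
      entry:
        %buf = alloca [16 x i8], align 1
        %p = getelementptr inbounds [16 x i8], [16 x i8]* %buf, i64 0, i64 0
        ; min = false: return -1 if the size cannot be determined.
        ; nullunknown = true: a null pointer is treated as having unknown size.
        %size = call i64 @llvm.objectsize.i64(i8* %p, i1 false, i1 true)
        ret i64 %size
      }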
				14710
				14711	'``llvm.expect``' Intrinsic
				14712	^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14713
				14714	Syntax:
				14715	"""""""
				14716
Duncan P. N. Exon Smith	1ff08e3	2014-02-02 22:43:55 +0000	[diff] [blame]	14717	This is an overloaded intrinsic. You can use ``llvm.expect`` on any
				14718	integer bit width.
				14719
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14720	::
				14721
Duncan P. N. Exon Smith	1ff08e3	2014-02-02 22:43:55 +0000	[diff] [blame]	14722	declare i1 @llvm.expect.i1(i1 <val>, i1 <expected_val>)
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14723	declare i32 @llvm.expect.i32(i32 <val>, i32 <expected_val>)
				14724	declare i64 @llvm.expect.i64(i64 <val>, i64 <expected_val>)
				14725
				14726	Overview:
				14727	"""""""""
				14728
				14729	The ``llvm.expect`` intrinsic provides information about expected (the
				14730	most probable) value of ``val``, which can be used by optimizers.
				14731
				14732	Arguments:
				14733	""""""""""
				14734
The ``llvm.expect`` intrinsic takes two arguments. The first argument is
a value. The second argument is the expected value; it must be a
constant, and variables are not allowed.
				14738
				14739	Semantics:
				14740	""""""""""
				14741
This intrinsic is lowered to ``val``.
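
A minimal sketch of a typical use, hinting that a null check is expected to
fail. The function ``@lookup`` and the block names are hypothetical.

.. code-block:: llvm

      declare i1 @llvm.expect.i1(i1, i1)

      define i32 @lookup(i32* %p) {
      entry:
        %is.null = icmp eq i32* %p, null
        ; Hint that %is.null is most likely false; %hint has the value of %is.null.
        %hint = call i1 @llvm.expect.i1(i1 %is.null, i1 false)
        br i1 %hint, label %slow, label %fast

      slow:
        ret i32 -1

      fast:
        %v = load i32, i32* %p, align 4
        ret i32 %v
      }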
				14743
Philip Reames	e0e9083	2015-04-26 22:23:12 +0000	[diff] [blame]	14744	.. _int_assume:
				14745
Hal Finkel	9304691	2014-07-25 21:13:35 +0000	[diff] [blame]	14746	'``llvm.assume``' Intrinsic
				14747	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14748
				14749	Syntax:
				14750	"""""""
				14751
				14752	::
				14753
				14754	declare void @llvm.assume(i1 %cond)
				14755
				14756	Overview:
				14757	"""""""""
				14758
The ``llvm.assume`` intrinsic allows the optimizer to assume that the provided
condition is true. This information can then be used in simplifying other parts
of the code.
				14762
				14763	Arguments:
				14764	""""""""""
				14765
				14766	The condition which the optimizer may assume is always true.
				14767
				14768	Semantics:
				14769	""""""""""
				14770
				14771	The intrinsic allows the optimizer to assume that the provided condition is
				14772	always true whenever the control flow reaches the intrinsic call. No code is
				14773	generated for this intrinsic, and instructions that contribute only to the
				14774	provided condition are not used for code generation. If the condition is
				14775	violated during execution, the behavior is undefined.
				14776
Sanjay Patel	1ed2bb5	2015-01-14 16:03:58 +0000	[diff] [blame]	14777	Note that the optimizer might limit the transformations performed on values
Hal Finkel	9304691	2014-07-25 21:13:35 +0000	[diff] [blame]	14778	used by the ``llvm.assume`` intrinsic in order to preserve the instructions
				14779	only used to form the intrinsic's input argument. This might prove undesirable
Sanjay Patel	1ed2bb5	2015-01-14 16:03:58 +0000	[diff] [blame]	14780	if the extra information provided by the ``llvm.assume`` intrinsic does not cause
Hal Finkel	9304691	2014-07-25 21:13:35 +0000	[diff] [blame]	14781	sufficient overall improvement in code quality. For this reason,
				14782	``llvm.assume`` should not be used to document basic mathematical invariants
				14783	that the optimizer can otherwise deduce or facts that are of little use to the
				14784	optimizer.
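
As an illustrative sketch, the following tells the optimizer that a divisor is
non-zero. The function ``@div_nonzero`` is hypothetical; whether the assumption
improves the generated code depends on the optimizer.

.. code-block:: llvm

      declare void @llvm.assume(i1)

      define i32 @div_nonzero(i32 %n, i32 %d) {
      entry:
        ; %nonzero is used only by the assume, so no code is generated for it.
        %nonzero = icmp ne i32 %d, 0
        call void @llvm.assume(i1 %nonzero)
        %q = udiv i32 %n, %d
        ret i32 %q
      }

If ``%d`` is zero when control reaches the call, the behavior is undefined.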
				14785
Daniel Berlin	2c438a3	2017-02-07 19:29:25 +0000	[diff] [blame]	14786	.. _int_ssa_copy:
				14787
				14788	'``llvm.ssa_copy``' Intrinsic
				14789	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14790
				14791	Syntax:
				14792	"""""""
				14793
				14794	::
				14795
				14796	declare type @llvm.ssa_copy(type %operand) returned(1) readnone
				14797
				14798	Arguments:
				14799	""""""""""
				14800
				14801	The first argument is an operand which is used as the returned value.
				14802
				14803	Overview:
				14804	""""""""""
				14805
				14806	The ``llvm.ssa_copy`` intrinsic can be used to attach information to
				14807	operations by copying them and giving them new names. For example,
				14808	the PredicateInfo utility uses it to build Extended SSA form, and
				14809	attach various forms of information to operands that dominate specific
				14810	uses. It is not meant for general use, only for building temporary
				14811	renaming forms that require value splits at certain points.
				14812
Peter Collingbourne	7efd750	2016-06-24 21:21:32 +0000	[diff] [blame]	14813	.. _type.test:
Peter Collingbourne	e6909c8	2015-02-20 20:30:47 +0000	[diff] [blame]	14814
Peter Collingbourne	7efd750	2016-06-24 21:21:32 +0000	[diff] [blame]	14815	'``llvm.type.test``' Intrinsic
Peter Collingbourne	e6909c8	2015-02-20 20:30:47 +0000	[diff] [blame]	14816	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14817
				14818	Syntax:
				14819	"""""""
				14820
				14821	::
				14822
Peter Collingbourne	7efd750	2016-06-24 21:21:32 +0000	[diff] [blame]	14823	declare i1 @llvm.type.test(i8* %ptr, metadata %type) nounwind readnone
Peter Collingbourne	e6909c8	2015-02-20 20:30:47 +0000	[diff] [blame]	14824
				14825
				14826	Arguments:
				14827	""""""""""
				14828
				14829	The first argument is a pointer to be tested. The second argument is a
Peter Collingbourne	7efd750	2016-06-24 21:21:32 +0000	[diff] [blame]	14830	metadata object representing a :doc:`type identifier <TypeMetadata>`.
Peter Collingbourne	e6909c8	2015-02-20 20:30:47 +0000	[diff] [blame]	14831
				14832	Overview:
				14833	"""""""""
				14834
Peter Collingbourne	7efd750	2016-06-24 21:21:32 +0000	[diff] [blame]	14835	The ``llvm.type.test`` intrinsic tests whether the given pointer is associated
				14836	with the given type identifier.
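
A sketch of how the result might be consumed, trapping when the test fails.
The function ``@call_checked`` and the type identifier ``!"typeid.A"`` are
hypothetical.

.. code-block:: llvm

      declare i1 @llvm.type.test(i8*, metadata) nounwind readnone
      declare void @llvm.trap() noreturn nounwind

      define void @call_checked(i8* %obj) {
      entry:
        ; Test whether %obj is associated with the (hypothetical) type identifier.
        %ok = call i1 @llvm.type.test(i8* %obj, metadata !"typeid.A")
        br i1 %ok, label %cont, label %fail

      fail:
        call void @llvm.trap()
        unreachable

      cont:
        ret void
      }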
Peter Collingbourne	e6909c8	2015-02-20 20:30:47 +0000	[diff] [blame]	14837
Peter Collingbourne	0312f61	2016-06-25 00:23:04 +0000	[diff] [blame]	14838	'``llvm.type.checked.load``' Intrinsic
				14839	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14840
				14841	Syntax:
				14842	"""""""
				14843
				14844	::
				14845
      declare {i8*, i1} @llvm.type.checked.load(i8* %ptr, i32 %offset, metadata %type) argmemonly nounwind readonly
				14847
				14848
				14849	Arguments:
				14850	""""""""""
				14851
				14852	The first argument is a pointer from which to load a function pointer. The
				14853	second argument is the byte offset from which to load the function pointer. The
				14854	third argument is a metadata object representing a :doc:`type identifier
				14855	<TypeMetadata>`.
				14856
				14857	Overview:
				14858	"""""""""
				14859
				14860	The ``llvm.type.checked.load`` intrinsic safely loads a function pointer from a
				14861	virtual table pointer using type metadata. This intrinsic is used to implement
				14862	control flow integrity in conjunction with virtual call optimization. The
				14863	virtual call optimization pass will optimize away ``llvm.type.checked.load``
				14864	intrinsics associated with devirtualized calls, thereby removing the type
				14865	check in cases where it is not needed to enforce the control flow integrity
				14866	constraint.
				14867
				14868	If the given pointer is associated with a type metadata identifier, this
				14869	function returns true as the second element of its return value. (Note that
				14870	the function may also return true if the given pointer is not associated
				14871	with a type metadata identifier.) If the function's return value's second
				14872	element is true, the following rules apply to the first element:
				14873
- If the given pointer is associated with the given type metadata identifier,
  it is the function pointer loaded from the given byte offset from the given
  pointer.

- If the given pointer is not associated with the given type metadata
  identifier, it is one of the following (the choice of which is unspecified):

  1. The function pointer that would have been loaded from an arbitrarily chosen
     (through an unspecified mechanism) pointer associated with the type
     metadata.

  2. If the function has a non-void return type, a pointer to a function that
     returns an unspecified value without causing side effects.
				14887
				14888	If the function's return value's second element is false, the value of the
				14889	first element is undefined.
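
The rules above can be sketched as follows, assuming the first argument and
the first result element are ``i8*`` pointers as the Arguments section
describes. The function ``@vcall``, the type identifier ``!"typeid.A"``, and the
``void ()`` callee type are hypothetical.

.. code-block:: llvm

      declare {i8*, i1} @llvm.type.checked.load(i8*, i32, metadata) argmemonly nounwind readonly
      declare void @llvm.trap() noreturn nounwind

      define void @vcall(i8* %vtable) {
      entry:
        %pair = call {i8*, i1} @llvm.type.checked.load(i8* %vtable, i32 0, metadata !"typeid.A")
        ; The second element says whether the first element may be used.
        %ok = extractvalue {i8*, i1} %pair, 1
        br i1 %ok, label %cont, label %fail

      fail:
        call void @llvm.trap()
        unreachable

      cont:
        %fptr.i8 = extractvalue {i8*, i1} %pair, 0
        %fptr = bitcast i8* %fptr.i8 to void ()*
        call void %fptr()
        ret void
      }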
				14890
				14891
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14892	'``llvm.donothing``' Intrinsic
				14893	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14894
				14895	Syntax:
				14896	"""""""
				14897
				14898	::
				14899
				14900	declare void @llvm.donothing() nounwind readnone
				14901
				14902	Overview:
				14903	"""""""""
				14904
Juergen Ributzka	c916119	2014-10-23 22:36:13 +0000	[diff] [blame]	14905	The ``llvm.donothing`` intrinsic doesn't perform any operation. It's one of only
Sanjoy Das	7a4c94d	2016-02-26 03:33:59 +0000	[diff] [blame]	14906	three intrinsics (besides ``llvm.experimental.patchpoint`` and
				14907	``llvm.experimental.gc.statepoint``) that can be called with an invoke
				14908	instruction.
Sean Silva	b084af4	2012-12-07 10:36:55 +0000	[diff] [blame]	14909
				14910	Arguments:
				14911	""""""""""
				14912
				14913	None.
				14914
				14915	Semantics:
				14916	""""""""""
				14917
				14918	This intrinsic does nothing, and it's removed by optimizers and ignored
				14919	by codegen.
Andrew Trick	5e029ce	2013-12-24 02:57:25 +0000	[diff] [blame]	14920
Sanjoy Das	b51325d	2016-03-11 19:08:34 +0000	[diff] [blame]	14921	'``llvm.experimental.deoptimize``' Intrinsic
				14922	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14923
				14924	Syntax:
				14925	"""""""
				14926
				14927	::
				14928
				14929	declare type @llvm.experimental.deoptimize(...) [ "deopt"(...) ]
				14930
				14931	Overview:
				14932	"""""""""
				14933
This intrinsic, together with :ref:`deoptimization operand bundles
<deopt_opbundles>`, allows frontends to express transfer of control and
frame-local state from the currently executing (typically more specialized,
hence faster) version of a function into another (typically more generic, hence
slower) version.
				14939
				14940	In languages with a fully integrated managed runtime like Java and JavaScript
				14941	this intrinsic can be used to implement "uncommon trap" or "side exit" like
				14942	functionality. In unmanaged languages like C and C++, this intrinsic can be
				14943	used to represent the slow paths of specialized functions.
				14944
				14945
				14946	Arguments:
				14947	""""""""""
				14948
				14949	The intrinsic takes an arbitrary number of arguments, whose meaning is
				14950	decided by the :ref:`lowering strategy<deoptimize_lowering>`.
				14951
				14952	Semantics:
				14953	""""""""""
				14954
				14955	The ``@llvm.experimental.deoptimize`` intrinsic executes an attached
				14956	deoptimization continuation (denoted using a :ref:`deoptimization
				14957	operand bundle <deopt_opbundles>`) and returns the value returned by
				14958	the deoptimization continuation. Defining the semantic properties of
				14959	the continuation itself is out of scope of the language reference --
				14960	as far as LLVM is concerned, the deoptimization continuation can
				14961	invoke arbitrary side effects, including reading from and writing to
				14962	the entire heap.
				14963
				14964	Deoptimization continuations expressed using ``"deopt"`` operand bundles always
				14965	continue execution to the end of the physical frame containing them, so all
				14966	calls to ``@llvm.experimental.deoptimize`` must be in "tail position":
				14967
- ``@llvm.experimental.deoptimize`` cannot be invoked.
- The call must immediately precede a :ref:`ret <i_ret>` instruction.
- The ``ret`` instruction must return the value produced by the
  ``@llvm.experimental.deoptimize`` call if there is one, or void.
				14972
				14973	Note that the above restrictions imply that the return type for a call to
				14974	``@llvm.experimental.deoptimize`` will match the return type of its immediate
				14975	caller.
				14976
				14977	The inliner composes the ``"deopt"`` continuations of the caller into the
				14978	``"deopt"`` continuations present in the inlinee, and also updates calls to this
				14979	intrinsic to return directly from the frame of the function it inlined into.
				14980
Sanjoy Das	e0aa414	2016-05-12 01:17:38 +0000	[diff] [blame]	14981	All declarations of ``@llvm.experimental.deoptimize`` must share the
				14982	same calling convention.
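
The tail-position rules can be illustrated with the sketch below. It assumes
the ``type`` overload is spelled with an ``.i32`` suffix in the intrinsic name;
the function ``@fast_path``, the threshold, and the operands passed in the
``"deopt"`` bundle are hypothetical.

.. code-block:: llvm

      declare i32 @llvm.experimental.deoptimize.i32(...)

      define i32 @fast_path(i32 %x) {
      entry:
        %small = icmp ult i32 %x, 100
        br i1 %small, label %fast, label %deopt

      fast:
        %r = add i32 %x, 1
        ret i32 %r

      deopt:
        ; Tail position: the call immediately precedes the ret, and the ret
        ; returns the value produced by the call.
        %v = call i32 (...) @llvm.experimental.deoptimize.i32(i32 %x) [ "deopt"(i32 %x) ]
        ret i32 %v
      }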
				14983
Sanjoy Das	b51325d	2016-03-11 19:08:34 +0000	[diff] [blame]	14984	.. _deoptimize_lowering:
				14985
				14986	Lowering:
				14987	"""""""""
				14988
Sanjoy Das	df9ae70	2016-03-24 20:23:29 +0000	[diff] [blame]	14989	Calls to ``@llvm.experimental.deoptimize`` are lowered to calls to the
				14990	symbol ``__llvm_deoptimize`` (it is the frontend's responsibility to
				14991	ensure that this symbol is defined). The call arguments to
				14992	``@llvm.experimental.deoptimize`` are lowered as if they were formal
				14993	arguments of the specified types, and not as varargs.
				14994
Sanjoy Das	b51325d	2016-03-11 19:08:34 +0000	[diff] [blame]	14995
Sanjoy Das	021de05	2016-03-31 00:18:46 +0000	[diff] [blame]	14996	'``llvm.experimental.guard``' Intrinsic
				14997	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				14998
				14999	Syntax:
				15000	"""""""
				15001
				15002	::
				15003
				15004	declare void @llvm.experimental.guard(i1, ...) [ "deopt"(...) ]
				15005
				15006	Overview:
				15007	"""""""""
				15008
				15009	This intrinsic, together with :ref:`deoptimization operand bundles
				15010	<deopt_opbundles>`, allows frontends to express guards or checks on
				15011	optimistic assumptions made during compilation. The semantics of
				15012	``@llvm.experimental.guard`` is defined in terms of
				15013	``@llvm.experimental.deoptimize`` -- its body is defined to be
				15014	equivalent to:
				15015
.. code-block:: text

      define void @llvm.experimental.guard(i1 %pred, <args...>) {
        %realPred = and i1 %pred, undef
        br i1 %realPred, label %continue, label %leave [, !make.implicit !{}]

      leave:
        call void @llvm.experimental.deoptimize(<args...>) [ "deopt"() ]
        ret void

      continue:
        ret void
      }
Sanjoy Das	021de05	2016-03-31 00:18:46 +0000	[diff] [blame]	15029
Sanjoy Das	47cf2af	2016-04-30 00:55:59 +0000	[diff] [blame]	15030
				15031	with the optional ``[, !make.implicit !{}]`` present if and only if it
				15032	is present on the call site. For more details on ``!make.implicit``,
				15033	see :doc:`FaultMaps`.
				15034
In words, ``@llvm.experimental.guard`` executes the attached
``"deopt"`` continuation if (but not only if) its first argument
is ``false``. Since the optimizer is allowed to replace the ``undef``
with an arbitrary value, it can optimize a guard to fail "spuriously",
i.e. without the original condition being false (hence the "not only
if"); this allows for "check widening" type optimizations.
				15041
				15042	``@llvm.experimental.guard`` cannot be invoked.
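
A sketch of a call site, guarding a load with a bounds check. The function
``@guarded_load`` and the values recorded in the ``"deopt"`` bundle are
hypothetical.

.. code-block:: llvm

      declare void @llvm.experimental.guard(i1, ...)

      define i32 @guarded_load(i32* %p, i32 %idx, i32 %len) {
      entry:
        %in.bounds = icmp ult i32 %idx, %len
        ; If %in.bounds is false (or the guard fails spuriously), the attached
        ; "deopt" continuation runs instead of the code below.
        call void (i1, ...) @llvm.experimental.guard(i1 %in.bounds) [ "deopt"(i32 %idx, i32 %len) ]
        %idx.ext = zext i32 %idx to i64
        %addr = getelementptr inbounds i32, i32* %p, i64 %idx.ext
        %v = load i32, i32* %addr, align 4
        ret i32 %v
      }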
				15043
				15044
Peter Collingbourne	7dd8dbf	2016-04-22 21:18:02 +0000	[diff] [blame]	15045	'``llvm.load.relative``' Intrinsic
				15046	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				15047
				15048	Syntax:
				15049	"""""""
				15050
				15051	::
				15052
				15053	declare i8* @llvm.load.relative.iN(i8* %ptr, iN %offset) argmemonly nounwind readonly
				15054
				15055	Overview:
				15056	"""""""""
				15057
				15058	This intrinsic loads a 32-bit value from the address ``%ptr + %offset``,
				15059	adds ``%ptr`` to that value and returns it. The constant folder specifically
				15060	recognizes the form of this intrinsic and the constant initializers it may
				15061	load from; if a loaded constant initializer is known to have the form
				15062	``i32 trunc(x - %ptr)``, the intrinsic call is folded to ``x``.
				15063
				15064	LLVM provides that the calculation of such a constant initializer will
				15065	not overflow at link time under the medium code model if ``x`` is an
				15066	``unnamed_addr`` function. However, it does not provide this guarantee for
				15067	a constant initializer folded into a function body. This intrinsic can be
				15068	used to avoid the possibility of overflows when loading from such a constant.
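
As a sketch, a one-entry relative table and a load from it might look like the
following. The names ``@target``, ``@table`` and ``@resolve`` are hypothetical,
and a 64-bit pointer width is assumed for the ``ptrtoint`` expressions.

.. code-block:: llvm

      declare i8* @llvm.load.relative.i32(i8*, i32) argmemonly nounwind readonly

      define void @target() unnamed_addr {
        ret void
      }

      ; The entry holds the 32-bit distance from @table to @target.
      @table = private constant [1 x i32] [
        i32 trunc (i64 sub (i64 ptrtoint (void ()* @target to i64),
                            i64 ptrtoint ([1 x i32]* @table to i64)) to i32)
      ]

      define i8* @resolve() {
      entry:
        %base = bitcast [1 x i32]* @table to i8*
        ; Loads the i32 at %base + 0 and adds %base to it.
        %fn = call i8* @llvm.load.relative.i32(i8* %base, i32 0)
        ret i8* %fn
      }

Since the initializer has the form ``i32 trunc(x - %ptr)`` with ``x`` being
``@target``, this call is a candidate for the folding described above.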
				15069
Dan Gohman	2c74fe9	2017-11-08 21:59:51 +0000	[diff] [blame]	15070	'``llvm.sideeffect``' Intrinsic
				15071	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
				15072
				15073	Syntax:
				15074	"""""""
				15075
				15076	::
				15077
				15078	declare void @llvm.sideeffect() inaccessiblememonly nounwind
				15079
				15080	Overview:
				15081	"""""""""
				15082
				15083	The ``llvm.sideeffect`` intrinsic doesn't perform any operation. Optimizers
				15084	treat it as having side effects, so it can be inserted into a loop to
				15085	indicate that the loop shouldn't be assumed to terminate (which could
				15086	potentially lead to the loop being optimized away entirely), even if it's
				15087	an infinite loop with no other side effects.
				15088
				15089	Arguments:
				15090	""""""""""
				15091
				15092	None.
				15093
				15094	Semantics:
				15095	""""""""""
				15096
				15097	This intrinsic actually does nothing, but optimizers must assume that it
				15098	has externally observable side effects.
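
A minimal sketch of the loop use case described in the overview; the function
``@spin`` is hypothetical.

.. code-block:: llvm

      declare void @llvm.sideeffect() inaccessiblememonly nounwind

      define void @spin() {
      entry:
        br label %loop

      loop:
        ; Without this call the loop has no side effects of its own.
        call void @llvm.sideeffect()
        br label %loop
      }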
				15099
Andrew Trick	5e029ce	2013-12-24 02:57:25 +0000	[diff] [blame]	15100	Stack Map Intrinsics
				15101	--------------------
				15102
				15103	LLVM provides experimental intrinsics to support runtime patching
				15104	mechanisms commonly desired in dynamic language JITs. These intrinsics
				15105	are described in :doc:`StackMaps`.
Igor Laevsky	4f31e52	2016-12-29 14:31:07 +0000	[diff] [blame]	15106
				15107	Element Wise Atomic Memory Intrinsics
Igor Laevsky	fedab15	2016-12-29 15:08:57 +0000	[diff] [blame]	15108	-------------------------------------
Igor Laevsky	4f31e52	2016-12-29 14:31:07 +0000	[diff] [blame]	15109
				15110	These intrinsics are similar to the standard library memory intrinsics except
				15111	that they perform memory transfer as a sequence of atomic memory accesses.
				15112
Daniel Neilson	3faabbb	2017-06-16 14:43:59 +0000	[diff] [blame]	15113	.. _int_memcpy_element_unordered_atomic:
Igor Laevsky	4f31e52	2016-12-29 14:31:07 +0000	[diff] [blame]	15114
Daniel Neilson	3faabbb	2017-06-16 14:43:59 +0000	[diff] [blame]	15115	'``llvm.memcpy.element.unordered.atomic``' Intrinsic
				15116	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Igor Laevsky	4f31e52	2016-12-29 14:31:07 +0000	[diff] [blame]	15117
				15118	Syntax:
				15119	"""""""
				15120
This is an overloaded intrinsic. You can use ``llvm.memcpy.element.unordered.atomic`` on
any integer bit width and for different address spaces. Not all targets
support all bit widths, however.
				15124
				15125	::
				15126
      declare void @llvm.memcpy.element.unordered.atomic.p0i8.p0i8.i32(i8* <dest>,
              i8* <src>,
              i32 <len>,
              i32 <element_size>)
      declare void @llvm.memcpy.element.unordered.atomic.p0i8.p0i8.i64(i8* <dest>,
              i8* <src>,
              i64 <len>,
              i32 <element_size>)
Igor Laevsky	4f31e52	2016-12-29 14:31:07 +0000	[diff] [blame]	15135
				15136	Overview:
				15137	"""""""""
				15138
Daniel Neilson	3faabbb	2017-06-16 14:43:59 +0000	[diff] [blame]	15139	The '``llvm.memcpy.element.unordered.atomic.*``' intrinsic is a specialization of the
				15140	'``llvm.memcpy.*``' intrinsic. It differs in that the ``dest`` and ``src`` are treated
				15141	as arrays with elements that are exactly ``element_size`` bytes, and the copy between
				15142	buffers uses a sequence of :ref:`unordered atomic <ordering>` load/store operations
				15143	that are a positive integer multiple of the ``element_size`` in size.
Igor Laevsky	4f31e52	2016-12-29 14:31:07 +0000	[diff] [blame]	15144
				15145	Arguments:
				15146	""""""""""
				15147
Daniel Neilson	3faabbb	2017-06-16 14:43:59 +0000	[diff] [blame]	15148	The first three arguments are the same as they are in the :ref:`@llvm.memcpy <int_memcpy>`
				15149	intrinsic, with the added constraint that ``len`` is required to be a positive integer
				15150	multiple of the ``element_size``. If ``len`` is not a positive integer multiple of
				15151	``element_size``, then the behaviour of the intrinsic is undefined.
Igor Laevsky	4f31e52	2016-12-29 14:31:07 +0000	[diff] [blame]	15152
Daniel Neilson	3faabbb	2017-06-16 14:43:59 +0000	[diff] [blame]	15153	``element_size`` must be a compile-time constant positive power of two no greater than
				15154	target-specific atomic access size limit.
Igor Laevsky	4f31e52	2016-12-29 14:31:07 +0000	[diff] [blame]	15155
Daniel Neilson	3faabbb	2017-06-16 14:43:59 +0000	[diff] [blame]	15156	For each of the input pointers ``align`` parameter attribute must be specified. It
				15157	must be a power of two no less than the ``element_size``. Caller guarantees that
				15158	both the source and destination pointers are aligned to that boundary.
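
As an illustration of the constraints above, the following is a minimal sketch of a call
that copies sixteen 4-byte elements. The function name ``@copy_16_words`` and the concrete
sizes are hypothetical; the sketch assumes that the target supports 4-byte atomic accesses
and that the caller has already established 4-byte alignment for both pointers.

.. code-block:: llvm

      declare void @llvm.memcpy.element.unordered.atomic.p0i8.p0i8.i32(i8*, i8*, i32, i32)

      ; Hypothetical caller: copies 16 elements of 4 bytes each (64 bytes in total).
      define void @copy_16_words(i8* %dst, i8* %src) {
        ; len = 64 is a positive multiple of element_size = 4, and both pointers
        ; carry an align attribute that is no less than element_size.
        call void @llvm.memcpy.element.unordered.atomic.p0i8.p0i8.i32(
            i8* align 4 %dst, i8* align 4 %src, i32 64, i32 4)
        ret void
      }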

Semantics:
""""""""""

The '``llvm.memcpy.element.unordered.atomic.*``' intrinsic copies ``len`` bytes of
memory from the source location to the destination location. These locations are not
allowed to overlap. The memory copy is performed as a sequence of load/store operations
where each access is guaranteed to be a multiple of ``element_size`` bytes wide and
aligned at an ``element_size`` boundary.

The order of the copy is unspecified. The same value may be read from the source
buffer many times, but only one write is issued to the destination buffer per
element. It is well defined to have concurrent reads and writes to both source and
destination provided those reads and writes are unordered atomic when specified.

This intrinsic does not provide any additional ordering guarantees over those
provided by a set of unordered loads from the source location and stores to the
destination.
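
To make the element-wise behaviour concrete, the sketch below shows one possible
expansion of a copy with ``len`` equal to 8 and ``element_size`` equal to 4. It is
only an illustration of the semantics described above, not necessarily the code the
optimizer or a runtime library produces; the function name ``@copy_two_words`` is
hypothetical, and the two elements could equally well be copied in the opposite order.

.. code-block:: llvm

      ; Hypothetical expansion of a two-element copy (len = 8, element_size = 4).
      define void @copy_two_words(i8* %dst, i8* %src) {
        %d0 = bitcast i8* %dst to i32*
        %s0 = bitcast i8* %src to i32*
        %d1 = getelementptr i32, i32* %d0, i64 1
        %s1 = getelementptr i32, i32* %s0, i64 1
        ; Each element is read with an unordered atomic load and written
        ; exactly once with an unordered atomic store.
        %v0 = load atomic i32, i32* %s0 unordered, align 4
        store atomic i32 %v0, i32* %d0 unordered, align 4
        %v1 = load atomic i32, i32* %s1 unordered, align 4
        store atomic i32 %v1, i32* %d1 unordered, align 4
        ret void
      }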

Lowering:
"""""""""

In the most general case, a call to the '``llvm.memcpy.element.unordered.atomic.*``'
intrinsic is lowered to a call to the symbol ``__llvm_memcpy_element_unordered_atomic_*``,
where '*' is replaced with the actual element size.

The optimizer is allowed to inline the memory copy when it's profitable to do so.
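
As a rough sketch of what the general-case lowering might look like for 4-byte
elements, the call below targets the runtime symbol with the element size as its
suffix. The parameter list ``(dest, src, len)`` and the ``i64`` length type are
assumptions made for a 64-bit target, not something this section specifies; the
function name ``@copy_lowered`` is hypothetical.

.. code-block:: llvm

      ; Assumed declaration of the runtime entry point for element_size = 4.
      ; The exact signature is an assumption, see the note above.
      declare void @__llvm_memcpy_element_unordered_atomic_4(i8*, i8*, i64)

      ; Hypothetical result of lowering a 64-byte copy with element_size = 4.
      define void @copy_lowered(i8* %dst, i8* %src) {
        call void @__llvm_memcpy_element_unordered_atomic_4(i8* %dst, i8* %src, i64 64)
        ret void
      }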

'``llvm.memmove.element.unordered.atomic``' Intrinsic
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Syntax:
"""""""

This is an overloaded intrinsic. You can use
``llvm.memmove.element.unordered.atomic`` on any integer bit width and for
different address spaces. Not all targets support all bit widths, however.

::

      declare void @llvm.memmove.element.unordered.atomic.p0i8.p0i8.i32(i8* <dest>,
                                                                        i8* <src>,
                                                                        i32 <len>,
                                                                        i32 <element_size>)
      declare void @llvm.memmove.element.unordered.atomic.p0i8.p0i8.i64(i8* <dest>,
                                                                        i8* <src>,
                                                                        i64 <len>,
                                                                        i32 <element_size>)

Overview:
"""""""""

The '``llvm.memmove.element.unordered.atomic.*``' intrinsic is a specialization
of the '``llvm.memmove.*``' intrinsic. It differs in that the ``dest`` and
``src`` are treated as arrays with elements that are exactly ``element_size``
bytes, and the copy between buffers uses a sequence of
:ref:`unordered atomic <ordering>` load/store operations that are a positive
integer multiple of the ``element_size`` in size.

Arguments:
""""""""""

The first three arguments are the same as they are in the
:ref:`@llvm.memmove <int_memmove>` intrinsic, with the added constraint that
``len`` is required to be a positive integer multiple of the ``element_size``.
If ``len`` is not a positive integer multiple of ``element_size``, then the
behaviour of the intrinsic is undefined.

``element_size`` must be a compile-time constant positive power of two no
greater than a target-specific atomic access size limit.

For each of the input pointers, the ``align`` parameter attribute must be
specified. It must be a power of two no less than the ``element_size``. The caller
guarantees that both the source and destination pointers are aligned to that
boundary.

Semantics:
""""""""""

The '``llvm.memmove.element.unordered.atomic.*``' intrinsic copies ``len`` bytes
of memory from the source location to the destination location. These locations
are allowed to overlap. The memory copy is performed as a sequence of load/store
operations where each access is guaranteed to be a multiple of ``element_size``
bytes wide and aligned at an ``element_size`` boundary.

The order of the copy is unspecified. The same value may be read from the source
buffer many times, but only one write is issued to the destination buffer per
element. It is well defined to have concurrent reads and writes to both source
and destination provided those reads and writes are unordered atomic when
specified.

This intrinsic does not provide any additional ordering guarantees over those
provided by a set of unordered loads from the source location and stores to the
destination.
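
Because the source and destination may overlap, the intrinsic can be used to shift
elements within a single buffer. The sketch below moves eight 4-byte elements up by
one slot; the function name ``@shift_up_one`` and the sizes are hypothetical, and the
caller is assumed to guarantee that ``%buf`` is 4-byte aligned and refers to at least
nine elements.

.. code-block:: llvm

      declare void @llvm.memmove.element.unordered.atomic.p0i8.p0i8.i32(i8*, i8*, i32, i32)

      ; Hypothetical caller: moves buf[0..7] to buf[1..8].
      define void @shift_up_one(i32* %buf) {
        %src = bitcast i32* %buf to i8*
        %next = getelementptr i32, i32* %buf, i64 1
        %dst = bitcast i32* %next to i8*
        ; The byte ranges [buf, buf+32) and [buf+4, buf+36) overlap, which is
        ; permitted for memmove but not for the memcpy variant.
        call void @llvm.memmove.element.unordered.atomic.p0i8.p0i8.i32(
            i8* align 4 %dst, i8* align 4 %src, i32 32, i32 4)
        ret void
      }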

Lowering:
"""""""""

In the most general case, a call to the
'``llvm.memmove.element.unordered.atomic.*``' intrinsic is lowered to a call to the
symbol ``__llvm_memmove_element_unordered_atomic_*``, where '*' is replaced with the
actual element size.

The optimizer is allowed to inline the memory copy when it's profitable to do so.

.. _int_memset_element_unordered_atomic:

'``llvm.memset.element.unordered.atomic``' Intrinsic
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Syntax:
"""""""

This is an overloaded intrinsic. You can use ``llvm.memset.element.unordered.atomic`` on
any integer bit width and for different address spaces. Not all targets
support all bit widths, however.

::

      declare void @llvm.memset.element.unordered.atomic.p0i8.i32(i8* <dest>,
                                                                  i8 <value>,
                                                                  i32 <len>,
                                                                  i32 <element_size>)
      declare void @llvm.memset.element.unordered.atomic.p0i8.i64(i8* <dest>,
                                                                  i8 <value>,
                                                                  i64 <len>,
                                                                  i32 <element_size>)

Overview:
"""""""""

The '``llvm.memset.element.unordered.atomic.*``' intrinsic is a specialization of the
'``llvm.memset.*``' intrinsic. It differs in that the ``dest`` is treated as an array
with elements that are exactly ``element_size`` bytes, and the assignment to that array
uses a sequence of :ref:`unordered atomic <ordering>` store operations
that are a positive integer multiple of the ``element_size`` in size.

Arguments:
""""""""""

The first three arguments are the same as they are in the :ref:`@llvm.memset <int_memset>`
intrinsic, with the added constraint that ``len`` is required to be a positive integer
multiple of the ``element_size``. If ``len`` is not a positive integer multiple of
``element_size``, then the behaviour of the intrinsic is undefined.

``element_size`` must be a compile-time constant positive power of two no greater than
a target-specific atomic access size limit.

The ``dest`` input pointer must have the ``align`` parameter attribute specified. It
must be a power of two no less than the ``element_size``. The caller guarantees that
the destination pointer is aligned to that boundary.
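
A minimal sketch of a call that satisfies these constraints is shown below. It zeroes
eight 8-byte elements; the function name ``@zero_8_dwords`` and the sizes are
hypothetical, and the sketch assumes that the target supports 8-byte atomic stores.

.. code-block:: llvm

      declare void @llvm.memset.element.unordered.atomic.p0i8.i32(i8*, i8, i32, i32)

      ; Hypothetical caller: zeroes 8 elements of 8 bytes each (64 bytes in total).
      define void @zero_8_dwords(i8* %dst) {
        ; len = 64 is a positive multiple of element_size = 8, and the align
        ; attribute on the destination is no less than element_size.
        call void @llvm.memset.element.unordered.atomic.p0i8.i32(
            i8* align 8 %dst, i8 0, i32 64, i32 8)
        ret void
      }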

Semantics:
""""""""""

The '``llvm.memset.element.unordered.atomic.*``' intrinsic sets the ``len`` bytes of
memory starting at the destination location to the given ``value``. The memory is
set with a sequence of store operations where each access is guaranteed to be a
multiple of ``element_size`` bytes wide and aligned at an ``element_size`` boundary.

The order of the assignment is unspecified. Only one write is issued to the
destination buffer per element. It is well defined to have concurrent reads and
writes to the destination provided those reads and writes are unordered atomic
when specified.

This intrinsic does not provide any additional ordering guarantees over those
provided by a set of unordered stores to the destination.
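
As a sketch of these semantics, one possible expansion of a call with ``value`` equal
to 1, ``len`` equal to 4 and ``element_size`` equal to 2 is shown below. This is an
illustration only, not necessarily what the optimizer or a runtime library emits; the
function name ``@set_two_halfwords`` is hypothetical, and the two stores may be issued
in either order.

.. code-block:: llvm

      ; Hypothetical expansion of a two-element set (value = 1, len = 4, element_size = 2).
      define void @set_two_halfwords(i8* %dst) {
        %d0 = bitcast i8* %dst to i16*
        %d1 = getelementptr i16, i16* %d0, i64 1
        ; 257 is 0x0101, the byte value 1 replicated across a 2-byte element.
        ; Each element is written exactly once with an unordered atomic store.
        store atomic i16 257, i16* %d0 unordered, align 2
        store atomic i16 257, i16* %d1 unordered, align 2
        ret void
      }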

Lowering:
"""""""""

In the most general case, a call to the '``llvm.memset.element.unordered.atomic.*``'
intrinsic is lowered to a call to the symbol ``__llvm_memset_element_unordered_atomic_*``,
where '*' is replaced with the actual element size.

The optimizer is allowed to inline the memory assignment when it's profitable to do so.