===============
LLVM Extensions
===============

.. contents::
   :local:

.. toctree::
   :hidden:

Introduction
============

This document describes extensions to tools and formats LLVM seeks compatibility
with.

General Assembly Syntax
=======================

C99-style Hexadecimal Floating-point Constants
----------------------------------------------

LLVM's assemblers allow floating-point constants to be written in C99's
hexadecimal format instead of decimal if desired.

.. code-block:: gas

  .section .data
  .float 0x1c2.2ap3

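As a quick check of what such a constant denotes, the sketch below decodes the
literal from the example with Python's ``float.fromhex``, which accepts the same
C99 hexadecimal notation. The helper names are hypothetical, and the
little-endian single-precision encoding is an assumption about the target, not
something the assembler documentation specifies here.

.. code-block:: python

  import struct

  def decode_hex_float(literal):
      """Return the value of a C99-style hexadecimal floating-point literal."""
      return float.fromhex(literal)

  def single_precision_bytes(literal):
      """Encode the literal as a little-endian IEEE 754 single (assumed layout)."""
      return struct.pack("<f", decode_hex_float(literal))

  # 0x1c2.2a is 450 + 42/256, and the p3 suffix scales it by 2**3.
  print(decode_hex_float("0x1c2.2ap3"))  # 3601.3125

The value is exactly representable in single precision, so packing and
unpacking it round-trips without loss.
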
Machine-specific Assembly Syntax
================================

X86/COFF-Dependent
------------------

Relocations
^^^^^^^^^^^

The following additional relocation types are supported:

**@IMGREL** (AT&T syntax only) generates an image-relative relocation that
corresponds to the COFF relocation types ``IMAGE_REL_I386_DIR32NB`` (32-bit) or
``IMAGE_REL_AMD64_ADDR32NB`` (64-bit).

.. code-block:: text

  .text
  fun:
    mov foo@IMGREL(%ebx, %ecx, 4), %eax

  .section .pdata
    .long fun@IMGREL
    .long (fun@imgrel + 0x3F)
    .long $unwind$fun@imgrel

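An image-relative value is the target's address with the image base subtracted,
stored in a 32-bit field. The following sketch illustrates that arithmetic for
the ``fun@IMGREL`` and ``(fun@imgrel + 0x3F)`` operands above. The function name
and both addresses are hypothetical and chosen only for illustration; a real
linker resolves these values from its own layout.

.. code-block:: python

  def image_relative(symbol_va, image_base, addend=0):
      """Return the 32-bit image-relative value for a symbol plus an addend."""
      rva = symbol_va + addend - image_base
      if not 0 <= rva < 2**32:
          raise ValueError("value does not fit in an unsigned 32-bit field")
      return rva

  # Hypothetical addresses, not taken from any real image.
  IMAGE_BASE = 0x140000000
  FUN_VA = 0x140001000

  print(hex(image_relative(FUN_VA, IMAGE_BASE)))        # 0x1000
  print(hex(image_relative(FUN_VA, IMAGE_BASE, 0x3F)))  # 0x103f
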
**.secrel32** generates a relocation that corresponds to the COFF relocation
types ``IMAGE_REL_I386_SECREL`` (32-bit) or ``IMAGE_REL_AMD64_SECREL`` (64-bit).

**.secidx** generates a relocation that resolves to the index of the section
that contains the target. It corresponds to the COFF relocation types
``IMAGE_REL_I386_SECTION`` (32-bit) or ``IMAGE_REL_AMD64_SECTION`` (64-bit).

.. code-block:: gas

  .section .debug$S,"rn"
    .long 4
    .long 242
    .long 40
    .secrel32 _function_name
    .secidx   _function_name
    ...

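Used together, the two directives describe a target as a section index plus an
offset within that section. The sketch below models that pairing under
simplifying assumptions: sections are given as hypothetical ``(start, size)``
pairs in section-table order, indices are one-based as in COFF section
numbering, and the pair is packed as a 32-bit offset followed by a 16-bit index
in little-endian order.

.. code-block:: python

  import struct

  def section_and_offset(sections, target_va):
      """Return (one-based section index, offset within that section)."""
      for index, (start, size) in enumerate(sections, start=1):
          if start <= target_va < start + size:
              return index, target_va - start
      raise ValueError("target is not inside any section")

  def encode_secrel32_secidx(sections, target_va):
      """Pack the offset (4 bytes) then the section index (2 bytes)."""
      index, offset = section_and_offset(sections, target_va)
      return struct.pack("<IH", offset, index)

  # Hypothetical layout: two sections and a target 0x10 bytes into the second.
  layout = [(0x1000, 0x200), (0x2000, 0x100)]
  print(section_and_offset(layout, 0x2010))  # (2, 16)
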
``.linkonce`` Directive
^^^^^^^^^^^^^^^^^^^^^^^

Syntax:

   ``.linkonce [ comdat type ]``

Supported COMDAT types:

``discard``
   Discards duplicate sections with the same COMDAT symbol. This is the default
   if no type is specified.

``one_only``
   If the symbol is defined multiple times, the linker issues an error.

``same_size``
   Duplicates are discarded, but the linker issues an error if any have
   different sizes.

``same_contents``
   Duplicates are discarded, but the linker issues an error if any duplicates
   do not have exactly the same content.

``largest``
   Links the largest section from among the duplicates.

``newest``
   Links the newest section from among the duplicates.


.. code-block:: gas

  .section .text$foo
  .linkonce
    ...

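The selection rules above can be modelled in a few lines. This is only an
illustrative sketch, not how any particular linker is implemented: the function
name is hypothetical, each duplicate is represented by its raw contents, and
keeping the first duplicate is an assumption, since the rules do not say which
copy survives. ``newest`` is left out because it depends on information this
model does not carry.

.. code-block:: python

  def select_comdat(kind, duplicates):
      """Pick the surviving section contents from a non-empty list of duplicates."""
      if not duplicates:
          raise ValueError("no sections to choose from")
      first = duplicates[0]
      if kind == "discard":
          return first  # assumption: the first copy is the one kept
      if kind == "one_only":
          if len(duplicates) > 1:
              raise ValueError("symbol is defined multiple times")
          return first
      if kind == "same_size":
          if any(len(d) != len(first) for d in duplicates):
              raise ValueError("duplicates differ in size")
          return first
      if kind == "same_contents":
          if any(d != first for d in duplicates):
              raise ValueError("duplicates differ in contents")
          return first
      if kind == "largest":
          return max(duplicates, key=len)
      raise ValueError("selection type not modelled: " + kind)

  print(select_comdat("largest", [b"\x01", b"\x01\x02"]))  # b'\x01\x02'
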
``.section`` Directive
^^^^^^^^^^^^^^^^^^^^^^

MC supports passing the information in ``.linkonce`` at the end of
``.section``. For example, these two snippets are equivalent:

.. code-block:: gas

  .section secName, "dr", discard, "Symbol1"
  .globl Symbol1
  Symbol1:
  .long 1

.. code-block:: gas

  .section secName, "dr"
  .linkonce discard
  .globl Symbol1
  Symbol1:
  .long 1

Note that in the combined form the COMDAT symbol is explicit. This
extension exists to support multiple sections with the same name in
different COMDATs:


.. code-block:: gas

  .section secName, "dr", discard, "Symbol1"
  .globl Symbol1
  Symbol1:
  .long 1

  .section secName, "dr", discard, "Symbol2"
  .globl Symbol2
  Symbol2:
  .long 1

In addition to the types allowed with ``.linkonce``, ``.section`` also accepts
``associative``. The meaning is that the section is linked if a certain other
COMDAT section is linked. This other section is indicated by the comdat symbol
in this directive. It can be any symbol defined in the associated section, but
is usually the associated section's comdat.

The following restrictions apply to the associated section:

1. It must be a COMDAT section.
2. It cannot be another associative COMDAT section.

In the following example the symbol ``sym`` is the comdat symbol of ``.foo``
and ``.bar`` is associated to ``.foo``.

.. code-block:: gas

  .section .foo,"bw",discard, "sym"
  .section .bar,"rd",associative, "sym"

MC supports these flags in the COFF ``.section`` directive:

- ``b``: BSS section (``IMAGE_SCN_CNT_UNINITIALIZED_DATA``)
- ``d``: Data section (``IMAGE_SCN_CNT_INITIALIZED_DATA``)
- ``n``: Section is not loaded (``IMAGE_SCN_LNK_REMOVE``)
- ``r``: Read-only
- ``s``: Shared section
- ``w``: Writable
- ``x``: Executable section
- ``y``: Not readable
- ``D``: Discardable (``IMAGE_SCN_MEM_DISCARDABLE``)

These flags are all compatible with gas, with the exception of the ``D`` flag,
which GNU as does not support. For gas compatibility, sections with a name
starting with ".debug" are implicitly discardable.


ELF-Dependent
-------------

``.section`` Directive
^^^^^^^^^^^^^^^^^^^^^^

In order to support creating multiple sections with the same name and comdat,
it is possible to add a unique number at the end of the ``.section`` directive.
For example, the following code creates two sections named ``.text``.

.. code-block:: gas

  .section .text,"ax",@progbits,unique,1
  nop

  .section .text,"ax",@progbits,unique,2
  nop


The unique number is not present in the resulting object at all. It is just used
in the assembler to differentiate the sections.

Target Specific Behaviour
=========================

Windows on ARM
--------------

Stack Probe Emission
^^^^^^^^^^^^^^^^^^^^

The reference implementation (Microsoft Visual Studio 2012) emits stack probes
in the following fashion:

.. code-block:: gas

  movw r4, #constant
  bl __chkstk
  sub.w sp, sp, r4

However, the direct ``bl`` call limits the reachable range to 32 MiB (±16 MiB).
In order to accommodate larger binaries, LLVM supports the use of
``-mcmodel=large`` to allow a 4 GiB range via a slight deviation. It will
generate an indirect call as follows:

.. code-block:: gas

  movw r4, #constant
  movw r12, :lower16:__chkstk
  movt r12, :upper16:__chkstk
  blx r12
  sub.w sp, sp, r4

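The two sequences differ in how far the call can reach. As a rough
illustration, the sketch below checks whether a target lies within ±16 MiB of a
call site and splits a 32-bit address into the halves that ``:lower16:`` and
``:upper16:`` select. The function names and addresses are hypothetical, and the
range check is simplified: it ignores the exact program-counter bias that the
instruction encoding applies.

.. code-block:: python

  BL_RANGE = 16 * 1024 * 1024  # ±16 MiB, as stated above

  def reachable_by_bl(call_site, target):
      """Simplified check that a direct call can reach the target."""
      offset = target - call_site
      return -BL_RANGE <= offset < BL_RANGE

  def movw_movt_halves(address):
      """Return the (lower16, upper16) halves of a 32-bit address."""
      return address & 0xFFFF, (address >> 16) & 0xFFFF

  # Hypothetical address of __chkstk, chosen only for illustration.
  lower, upper = movw_movt_halves(0x12345678)
  print(hex(lower), hex(upper))  # 0x5678 0x1234

Loading both halves into a register makes any address in the 4 GiB space
reachable, at the cost of two extra instructions per probe.
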
Variable Length Arrays
^^^^^^^^^^^^^^^^^^^^^^

The reference implementation (Microsoft Visual Studio 2012) does not permit the
emission of Variable Length Arrays (VLAs).

The Windows ARM Itanium ABI extends the base ABI by adding support for emitting
a dynamic stack allocation. When emitting a variable stack allocation, a call
to ``__chkstk`` is emitted unconditionally to ensure that guard pages are set up
properly. This stack probe is emitted in the same way as the standard stack
probe.

The MSVC environment does not emit code for VLAs currently.
