blob: 99fdcb979c17b0d353382e5e765e6978ea96a1ff [file] [log] [blame]
Reid Spencere26ed7a2004-11-11 09:21:18 +00001=pod
2
3=head1 NAME
4
5llvm-ar - LLVM archiver
6
7=head1 SYNOPSIS
8
Reid Spencerbd336192004-11-15 19:12:06 +00009B<llvm-ar> [-]{dmpqrtx}[Rabfikouz] [relpos] [count] <archive> [files...]
Reid Spencere26ed7a2004-11-11 09:21:18 +000010
11
12=head1 DESCRIPTION
13
14The B<llvm-ar> command is similar to the common Unix utility, C<ar>. It
15archives several files together into a single file. The intent for this is
16to produce archive libraries by LLVM bytecode that can be linked into an
Reid Spencerbd336192004-11-15 19:12:06 +000017LLVM program. However, the archive can contain any kind of file. By default,
18B<llvm-ar> generates a symbol table that makes linking faster because
Reid Spencere26ed7a2004-11-11 09:21:18 +000019only the symbol table needs to be consulted, not each individual file member
20of the archive.
21
Reid Spencerbd336192004-11-15 19:12:06 +000022The B<llvm-ar> command can be used to I<read> both SVR4 and BSD style archive
23files. However, it cannot be used to write them. While the B<llvm-ar> command
24produces files that are I<almost> identical to the format used by other C<ar>
25implementations, it has two significant departures in order to make the
26archive appropriate for LLVM. There are first departure is that B<llvm-ar> only
27uses BSD4.4 style long path names (stored immediately after the header) and
28never contains a string table for long names. The second departure is that the
29symbol table is formated for efficient construction of an in-memory data
30structure that permits rapid (red-black tree) lookups. Consequently, archives
31produced with B<llvm-ar> usually won't be readable or editable with any
32C<ar> implementation or useful for linking. Using the C<f> modifier to flatten
33file names will make the archive readable by other C<ar> implementations
34but not for linking because the symbol table format for LLVM is unique. If an
35SVR4 or BSD style archive is used with the C<r> (replace) or C<q> (quick
36update) operations, the archive will be reconstructed in LLVM format. This
37means that the string table will be dropped (in deference to BSD 4.4 long names)
38and an LLVM symbol table will be added (by default). The system symbol table
39will be retained.
Reid Spencere26ed7a2004-11-11 09:21:18 +000040
41Here's where B<llvm-ar> departs from previous C<ar> implementations:
42
43=over
44
45=item I<Symbol Table>
46
47Since B<llvm-ar> is intended to archive bytecode files, the symbol table
48won't make much sense to anything but LLVM. Consequently, the symbol table's
49format has been simplified. It consists simply of a sequence of pairs
50of a file member index number as an LSB 4byte integer and a null-terminated
51string.
52
53=item I<Long Paths>
54
55Some C<ar> implementations (SVR4) use a separate file member to record long
56path names (> 15 characters). B<llvm-ar> takes the BSD 4.4 and Mac OS X
57approach which is to simply store the full path name immediately preceding
58the data for the file. The path name is null terminated and may contain the
59slash (/) character.
60
61=item I<Compression>
62
63B<llvm-ar> can compress the members of an archive to save space. The
Reid Spencerbd336192004-11-15 19:12:06 +000064compression used depends on what's available on the platform and what choices
65the LLVM Compressor utility makes. It generally favors bzip2 but will select
66between "no compression", bzip2 or zlib depending on what makes sense for the
67file's content.
Reid Spencere26ed7a2004-11-11 09:21:18 +000068
69=item I<Directory Recursion>
70
71Most C<ar> implementations do not recurse through directories but simply
72ignore directories if they are presented to the program in the F<files>
73option. B<llvm-ar>, however, can recurse through directory structures and
74add all the files under a directory, if requested.
75
76=item I<TOC Verbose Output>
77
78When B<llvm-ar> prints out the verbose table of contents (C<tv> option), it
79precedes the usual output with a character indicating the basic kind of
80content in the file. A blank means the file is a regular file. A 'Z' means
81the file is compressed. A 'B' means the file is an LLVM bytecode file. An
82'S' means the file is the symbol table.
83
84=back
85
86=head1 OPTIONS
87
88The options to B<llvm-ar> are compatible with other C<ar> implementations.
89However, there are a few modifiers (F<zR>) that are not found in other
90C<ar>s. The options to B<llvm-ar> specify a single basic operation to
91perform on the archive, a variety of modifiers for that operation, the
92name of the archive file, and an optional list of file names. These options
93are used to determine how B<llvm-ar> should process the archive file.
94
95The Operations and Modifiers are explained in the sections below. The minimal
96set of options is at least one operator and the name of the archive. Typically
97archive files end with a C<.a> suffix, but this is not required. Following
Reid Spencerbd336192004-11-15 19:12:06 +000098the F<archive-name> comes a list of F<files> that indicate the specific members
Reid Spencere26ed7a2004-11-11 09:21:18 +000099of the archive to operate on. If the F<files> option is not specified, it
100generally means either "none" or "all" members, depending on the operation.
101
102=head2 Operations
103
104=over
105
106=item d
107
108Delete files from the archive. No modifiers are applicable to this operation.
109The F<files> options specify which members should be removed from the
110archive. It is not an error if a specified file does not appear in the archive.
111If no F<files> are specified, the archive is not modified.
112
113=item m[abi]
114
115Move files from one location in the archive to another. The F<a>, F<b>, and
116F<i> modifiers apply to this operation. The F<files> will all be moved
117to the location given by the modifiers. If no modifiers are used, the files
118will be moved to the end of the archive. If no F<files> are specified, the
119archive is not modified.
120
Reid Spencer742ecbc2004-11-12 00:15:43 +0000121=item p[k]
Reid Spencere26ed7a2004-11-11 09:21:18 +0000122
Reid Spencer742ecbc2004-11-12 00:15:43 +0000123Print files to the standard output. The F<k> modifier applies to this
Reid Spencere26ed7a2004-11-11 09:21:18 +0000124operation. This operation simply prints the F<files> indicated to the
125standard output. If no F<files> are specified, the entire archive is printed.
126Printing bytecode files is ill-advised as they might confuse your terminal
127settings. The F<p> operation never modifies the archive.
128
129=item q[Rfz]
130
131Quickly append files to the end of the archive. The F<R>, F<f>, and F<z>
132modifiers apply to this operation. This operation quickly adds the
Reid Spencerbd336192004-11-15 19:12:06 +0000133F<files> to the archive without checking for duplicates that should be
Reid Spencere26ed7a2004-11-11 09:21:18 +0000134removed first. If no F<files> are specified, the archive is not modified.
Reid Spencerbd336192004-11-15 19:12:06 +0000135Because of the way that B<llvm-ar> constructs the archive file, its dubious
Reid Spencere26ed7a2004-11-11 09:21:18 +0000136whether the F<q> operation is any faster than the F<r> operation.
137
138=item r[Rabfuz]
139
140Replace or insert file members. The F<R>, F<a>, F<b>, F<f>, F<u>, and F<z>
141modifiers apply to this operation. This operation will replace existing
142F<files> or insert them at the end of the archive if they do not exist. If no
143F<files> are specified, the archive is not modified.
144
145=item t[v]
146
147Print the table of contents. Without any modifiers, this operation just prints
148the names of the members to the standard output. With the F<v> modifier,
149B<llvm-ar> also prints out the file type (B=bytecode, Z=compressed, S=symbol
150table, blank=regular file), the permission mode, the owner and group, the
151size, and the date. If any F<files> are specified, the listing is only for
152those files. If no F<files> are specified, the table of contents for the
153whole archive is printed.
154
Reid Spencer742ecbc2004-11-12 00:15:43 +0000155=item x[oP]
Reid Spencere26ed7a2004-11-11 09:21:18 +0000156
157Extract archive members back to files. The F<o> modifier applies to this
158operation. This operation retrieves the indicated F<files> from the archive
159and writes them back to the operating system's file system. If no
160F<files> are specified, the entire archive is extract.
161
162=back
163
164=head2 Modifiers (operation specific)
165
Reid Spencer742ecbc2004-11-12 00:15:43 +0000166The modifiers below are specific to certain operations. See the Operations
167section (above) to determine which modifiers are applicable to which operations.
168
Reid Spencere26ed7a2004-11-11 09:21:18 +0000169=over
170
171=item [a]
172
Reid Spencer742ecbc2004-11-12 00:15:43 +0000173When inserting or moving member files, this option specifies the destination of
174the new files as being C<a>fter the F<relpos> member. If F<relpos> is not found,
175the files are placed at the end of the archive.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000176
177=item [b]
178
Reid Spencer742ecbc2004-11-12 00:15:43 +0000179When inserting or moving member files, this option specifies the destination of
180the new files as being C<b>efore the F<relpos> member. If F<relpos> is not
181found, the files are placed at the end of the archive. This modifier is
182identical to the the F<i> modifier.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000183
184=item [f]
185
Reid Spencer742ecbc2004-11-12 00:15:43 +0000186Normally, B<llvm-ar> stores the full path name to a file as presented to it on
187the command line. With this option, truncated (15 characters max) names are
188used. This ensures name compatibility with older versions of C<ar> but may also
189thwart correct extraction of the files (duplicates may overwrite). If used with
190the F<R> option, the directory recursion will be performed but the file names
191will all be C<f>lattened to simple file names.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000192
193=item [i]
194
Reid Spencer742ecbc2004-11-12 00:15:43 +0000195A synonym for the F<b> option.
196
197=item [k]
198
199Normally, B<llvm-ar> will not print the contents of bytecode files when the
200F<p> operation is used. This modifier defeats the default and allows the
201bytecode members to be printed.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000202
203=item [N]
204
Reid Spencer742ecbc2004-11-12 00:15:43 +0000205This option is ignored by B<llvm-ar> but provided for compatibility.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000206
207=item [o]
208
Reid Spencer742ecbc2004-11-12 00:15:43 +0000209When extracting files, this option will cause B<llvm-ar> to preserve the
210original modification times of the files it writes.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000211
212=item [P]
213
214use full path names when matching
215
Reid Spencer742ecbc2004-11-12 00:15:43 +0000216=item [R]
Reid Spencere26ed7a2004-11-11 09:21:18 +0000217
Reid Spencer742ecbc2004-11-12 00:15:43 +0000218This modifier instructions the F<r> option to recursively process directories.
219Without F<R>, directories are ignored and only those F<files> that refer to
220files will be added to the archive. When F<R> is used, any directories specified
221with F<files> will be scanned (recursively) to find files to be added to the
222archive. Any file whose name begins with a dot will not be added.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000223
224=item [u]
225
Reid Spencer742ecbc2004-11-12 00:15:43 +0000226When replacing existing files in the archive, only replace those files that have
Reid Spencerbd336192004-11-15 19:12:06 +0000227a time stamp than the time stamp of the member in the archive.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000228
229=item [z]
230
Reid Spencer742ecbc2004-11-12 00:15:43 +0000231When inserting or replacing any file in the archive, compress the file first.
232The compression will attempt to use the zlib compression algorithm. This
233modifier is safe to use when (previously) compressed bytecode files are added to
234the archive; the compress bytecode files will not be doubly compressed.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000235
236=back
237
238=head2 Modifiers (generic)
239
Reid Spencer742ecbc2004-11-12 00:15:43 +0000240The modifiers below may be applied to any operation.
241
Reid Spencere26ed7a2004-11-11 09:21:18 +0000242=over
243
244=item [c]
245
Reid Spencer742ecbc2004-11-12 00:15:43 +0000246For all operations, B<llvm-ar> will always create the archive if it doesn't
247exist. Normally, B<llvm-ar> will print a warning message indicating that the
248archive is being created. Using this modifier turns off that warning.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000249
250=item [s]
251
Reid Spencer742ecbc2004-11-12 00:15:43 +0000252This modifier requests that an archive index (or symbol table) be added to the
253archive. This is the default mode of operation. The symbol table will contain
254all the externally visible functions and global variables defined by all the
Reid Spencerbd336192004-11-15 19:12:06 +0000255bytecode files in the archive. Using this modifier is more efficient that using
Reid Spencer742ecbc2004-11-12 00:15:43 +0000256L<llvm-ranlib|llvm-ranlib> which also creates the symbol table.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000257
258=item [S]
259
Reid Spencer742ecbc2004-11-12 00:15:43 +0000260This modifier is the opposite of the F<s> modifier. It instructs B<llvm-ar> to
261not build the symbol table. If both F<s> and F<S> are used, the last modifier to
262occur in the options will prevail.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000263
264=item [v]
265
Reid Spencer742ecbc2004-11-12 00:15:43 +0000266This modifier instructs B<llvm-ar> to be verbose about what it is doing. Each
Reid Spencerbd336192004-11-15 19:12:06 +0000267editing operation taken against the archive will produce a line of output saying
Reid Spencer742ecbc2004-11-12 00:15:43 +0000268what is being done.
Reid Spencere26ed7a2004-11-11 09:21:18 +0000269
270=back
271
Reid Spencera1c15bd2004-11-17 20:19:18 +0000272=head1 STANDARDS
273
274The B<llvm-ar> utility is intended to provide a superset of the IEEE Std 1003.2
275(POSIX.2) functionality for C<ar>. B<llvm-ar> can read both SVR4 and BSD4.4 (or
276Mac OS X) archives. If the C<f> modifier is given to the C<x> or C<r> operations
277then B<llvm-ar> will write SVR4 compatible archives. Without this modifier,
278B<llvm-ar> will write BSD4.4 compatible archives that have long names
279immediately after the header and indicated using the "#1/ddd" notation for the
280name in the header.
281
Reid Spencer742ecbc2004-11-12 00:15:43 +0000282=head1 FILE FORMAT
283
284The file format for LLVM Archive files is similar to that of BSD 4.4 or Mac OSX
285archive files. In fact, except for the symbol table, the C<ar> commands on those
286operating systems should be able to read LLVM archive files. The details of the
287file format follow.
288
289Each archive begins with the archive magic number which is the eight printable
Reid Spencerbd336192004-11-15 19:12:06 +0000290characters "!<arch>\n" where \n represents the newline character (0x0A).
291Following the magic number, the file is composed of even length members that
292begin with an archive header and end with a \n padding character if necessary
293(to make the length even). Each file member is composed of a header (defined
294below), an optional newline-terminated "long file name" and the contents of
295the file.
Reid Spencer742ecbc2004-11-12 00:15:43 +0000296
297The fields of the header are described in the items below. All fields of the
298header contain only ASCII characters, are left justified and are right padded
299with space characters.
300
301=over
302
303=item name - char[16]
304
305This field of the header provides the name of the archive member. If the name is
306longer than 15 characters or contains a slash (/) character, then this field
307contains C<#1/nnn> where C<nnn> provides the length of the name and the C<#1/>
308is literal. In this case, the actual name of the file is provided in the C<nnn>
309bytes immediately following the header. If the name is 15 characters or less, it
310is contained directly in this field and terminated with a slash (/) character.
311
312=item date - char[12]
313
314This field provides the date of modification of the file in the form of a
315decimal encoded number that provides the number of seconds since the epoch
316(since 00:00:00 Jan 1, 1970) per Posix specifications.
317
318=item uid - char[6]
319
Reid Spencerbd336192004-11-15 19:12:06 +0000320This field provides the user id of the file encoded as a decimal ASCII string.
Reid Spencer742ecbc2004-11-12 00:15:43 +0000321This field might not make much sense on non-Unix systems. On Unix, it is the
322same value as the st_uid field of the stat structure returned by the stat(2)
323operating system call.
324
325=item gid - char[6]
326
Reid Spencerbd336192004-11-15 19:12:06 +0000327This field provides the group id of the file encoded as a decimal ASCII string.
Reid Spencer742ecbc2004-11-12 00:15:43 +0000328This field might not make much sense on non-Unix systems. On Unix, it is the
329same value as the st_gid field of the stat structure returned by the stat(2)
330operating system call.
331
332=item mode - char[8]
333
Reid Spencerbd336192004-11-15 19:12:06 +0000334This field provides the access mode of the file encoded as an octal ASCII
Reid Spencer742ecbc2004-11-12 00:15:43 +0000335string. This field might not make much sense on non-Unix systems. On Unix, it
336is the same value as the st_mode field of the stat structure returned by the
337stat(2) operating system call.
338
339=item size - char[10]
340
Reid Spencerbd336192004-11-15 19:12:06 +0000341This field provides the size of the file, in bytes, encoded as a decimal ASCII
Reid Spencer742ecbc2004-11-12 00:15:43 +0000342string. If the size field is negative (starts with a minus sign, 0x02D), then
343the archive member is stored in compressed form. The first byte of the archive
344member's data indicates the compression type used. A value of 0 (0x30) indicates
345that no compression was used. A value of 1 (0x31) indicates that zlib
346compression was used. A value of 2 (0x32) indicates that bzip2 compression was
347used.
348
349=item fmag - char[2]
350
351This field is the archive file member magic number. Its content is always the
Reid Spencerbd336192004-11-15 19:12:06 +0000352two characters back tick (0x60) and newline (0x0A). This provides some measure
Reid Spencer742ecbc2004-11-12 00:15:43 +0000353utility in identifying archive files that have been corrupted.
354
Reid Spencer2152cca2004-11-12 00:16:51 +0000355=back
356
Reid Spencerbd336192004-11-15 19:12:06 +0000357The LLVM symbol table has the special name "#_LLVM_SYM_TAB_#". It is presumed
358that no regular archive member file will want this name. The LLVM symbol table
359is simply composed of a sequence of triplets: byte offset, length of symbol,
360and the symbol itself. Symbols are not null or newline terminated. Here are
361the details on each of these items:
362
363=over
364
365=item offset - vbr encoded 32-bit integer
366
367The offset item provides the offset into the archive file where the bytecode
368member is stored that is associated with the symbol. The offset value is 0
369based at the start of the first "normal" file member. To derive the actual
370file offset of the member, you must add the number of bytes occupied by the file
371signature (8 bytes) and the symbol tables. The value of this item is encoded
372using variable bit rate encoding to reduce the size of the symbol table.
373Variable bit rate encoding uses the high bit (0x80) of each byte to indicate
374if there are more bytes to follow. The remaining 7 bits in each byte carry bits
375from the value. The final byte does not have the high bit set.
376
377=item length - vbr encoded 32-bit integer
378
379The length item provides the length of the symbol that follows. Like this
380I<offset> item, the length is variable bit rate encoded.
381
382=item symbol - character array
383
384The symbol item provides the text of the symbol that is associated with the
385I<offset>. The symbol is not terminated by any character. Its length is provided
386by the I<length> field. Note that is allowed (but unwise) to use non-printing
387characters (even 0x00) in the symbol. This allows for multiple encodings of
388symbol names.
389
Reid Spencere8ae2d72004-11-15 19:45:29 +0000390=back
391
Reid Spencere26ed7a2004-11-11 09:21:18 +0000392=head1 EXIT STATUS
393
Misha Brukmane2ca54f2004-11-15 19:50:15 +0000394If B<llvm-ar> succeeds, it will exit with 0. A usage error, results
Reid Spencere26ed7a2004-11-11 09:21:18 +0000395in an exit code of 1. A hard (file system typically) error results in an
396exit code of 2. Miscellaneous or unknown errors result in an
397exit code of 3.
398
399=head1 SEE ALSO
400
Reid Spencere8ae2d72004-11-15 19:45:29 +0000401L<llvm-ranlib|llvm-ranlib>
Reid Spencere26ed7a2004-11-11 09:21:18 +0000402
403=head1 AUTHORS
404
405Maintained by the LLVM Team (L<http://llvm.cs.uiuc.edu>).
406
407=cut