blob: 647232020e4b1b1f69deef49b93b66a1e405116b [file] [log] [blame]
Alex Lorenzbf4508b2014-07-30 20:30:11 +00001llvm-profdata - Profile data tool
2=================================
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +00003
James Hendersona0566842019-06-27 13:24:46 +00004.. program:: llvm-profdata
5
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +00006SYNOPSIS
7--------
8
Alex Lorenzbf4508b2014-07-30 20:30:11 +00009:program:`llvm-profdata` *command* [*args...*]
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +000010
11DESCRIPTION
12-----------
13
Alex Lorenzbf4508b2014-07-30 20:30:11 +000014The :program:`llvm-profdata` tool is a small utility for working with profile
15data files.
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +000016
Alex Lorenzbf4508b2014-07-30 20:30:11 +000017COMMANDS
18--------
19
Justin Bogner22b9f6a2015-03-12 01:38:50 +000020* :ref:`merge <profdata-merge>`
21* :ref:`show <profdata-show>`
Rong Xu998b97f2019-04-30 21:19:12 +000022* :ref:`overlap <profdata-overlap>`
Alex Lorenzbf4508b2014-07-30 20:30:11 +000023
24.. program:: llvm-profdata merge
25
Justin Bogner22b9f6a2015-03-12 01:38:50 +000026.. _profdata-merge:
Alex Lorenzbf4508b2014-07-30 20:30:11 +000027
28MERGE
29-----
30
31SYNOPSIS
32^^^^^^^^
33
Nathan Slingerland7f5b47d2015-12-15 17:37:09 +000034:program:`llvm-profdata merge` [*options*] [*filename...*]
Alex Lorenzbf4508b2014-07-30 20:30:11 +000035
36DESCRIPTION
37^^^^^^^^^^^
38
39:program:`llvm-profdata merge` takes several profile data files
40generated by PGO instrumentation and merges them together into a single
41indexed profile data file.
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +000042
Nathan Slingerland7f5b47d2015-12-15 17:37:09 +000043By default profile data is merged without modification. This means that the
44relative importance of each input file is proportional to the number of samples
45or counts it contains. In general, the input from a longer training run will be
46interpreted as relatively more important than a shorter run. Depending on the
47nature of the training runs it may be useful to adjust the weight given to each
48input file by using the ``-weighted-input`` option.
49
Vedant Kumarcef43602016-06-07 22:47:31 +000050Profiles passed in via ``-weighted-input``, ``-input-files``, or via positional
51arguments are processed once for each time they are seen.
52
Nathan Slingerland7f5b47d2015-12-15 17:37:09 +000053
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +000054OPTIONS
Alex Lorenzbf4508b2014-07-30 20:30:11 +000055^^^^^^^
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +000056
Alex Lorenzbf4508b2014-07-30 20:30:11 +000057.. option:: -help
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +000058
Alex Lorenzbf4508b2014-07-30 20:30:11 +000059 Print a summary of command line options.
60
61.. option:: -output=output, -o=output
62
63 Specify the output file name. *Output* cannot be ``-`` as the resulting
64 indexed profile data can't be written to standard output.
65
Nathan Slingerland7f5b47d2015-12-15 17:37:09 +000066.. option:: -weighted-input=weight,filename
67
Sean Silva84d19222016-05-28 01:03:36 +000068 Specify an input file name along with a weight. The profile counts of the
69 supplied ``filename`` will be scaled (multiplied) by the supplied
SCOTT-HAMILTON4d62c342020-04-13 08:39:58 +020070 ``weight``, where ``weight`` is a decimal integer >= 1.
Sean Silva84d19222016-05-28 01:03:36 +000071 Input files specified without using this option are assigned a default
72 weight of 1. Examples are shown below.
Nathan Slingerland7f5b47d2015-12-15 17:37:09 +000073
Vedant Kumarcef43602016-06-07 22:47:31 +000074.. option:: -input-files=path, -f=path
75
76 Specify a file which contains a list of files to merge. The entries in this
77 file are newline-separated. Lines starting with '#' are skipped. Entries may
78 be of the form <filename> or <weight>,<filename>.
79
Richard Smith3164fcf2018-09-13 20:22:02 +000080.. option:: -remapping-file=path, -r=path
81
82 Specify a file which contains a remapping from symbol names in the input
83 profile to the symbol names that should be used in the output profile. The
84 file should consist of lines of the form ``<input-symbol> <output-symbol>``.
85 Blank lines and lines starting with ``#`` are skipped.
86
87 The :doc:`llvm-cxxmap <llvm-cxxmap>` tool can be used to generate the symbol
88 remapping file.
89
Diego Novillo6555adb2015-05-28 21:57:17 +000090.. option:: -instr (default)
91
Xinliang David Li2fc25152015-11-24 20:48:25 +000092 Specify that the input profile is an instrumentation-based profile.
Diego Novillo6555adb2015-05-28 21:57:17 +000093
94.. option:: -sample
95
Xinliang David Li2fc25152015-11-24 20:48:25 +000096 Specify that the input profile is a sample-based profile.
97
98 The format of the generated file can be generated in one of three ways:
Diego Novillo6555adb2015-05-28 21:57:17 +000099
100 .. option:: -binary (default)
101
Xinliang David Li2fc25152015-11-24 20:48:25 +0000102 Emit the profile using a binary encoding. For instrumentation-based profile
103 the output format is the indexed binary format.
Diego Novillo6555adb2015-05-28 21:57:17 +0000104
Wei Mi67bb1602020-05-13 15:11:49 -0700105 .. option:: -extbinary
106
107 Emit the profile using an extensible binary encoding. This option can only
108 be used with sample-based profile. The extensible binary encoding can be
109 more compact with compression enabled and can be loaded faster than the
110 default binary encoding.
111
Diego Novillo6555adb2015-05-28 21:57:17 +0000112 .. option:: -text
113
Xinliang David Li2fc25152015-11-24 20:48:25 +0000114 Emit the profile in text mode. This option can also be used with both
115 sample-based and instrumentation-based profile. When this option is used
116 the profile will be dumped in the text format that is parsable by the profile
117 reader.
Diego Novillo6555adb2015-05-28 21:57:17 +0000118
119 .. option:: -gcc
120
121 Emit the profile using GCC's gcov format (Not yet supported).
122
Vedant Kumaree2ce4a52016-06-09 21:09:54 +0000123.. option:: -sparse[=true|false]
Vedant Kumar00dab222016-01-29 22:54:45 +0000124
125 Do not emit function records with 0 execution count. Can only be used in
126 conjunction with -instr. Defaults to false, since it can inhibit compiler
127 optimization during PGO.
128
Vedant Kumare3a0bf52016-07-19 01:17:20 +0000129.. option:: -num-threads=N, -j=N
130
131 Use N threads to perform profile merging. When N=0, llvm-profdata auto-detects
132 an appropriate number of threads to use. This is the default.
133
Vedant Kumar0fcfe892019-09-03 22:23:16 +0000134.. option:: -failure-mode=[any|all]
135
136 Set the failure mode. There are two options: 'any' causes the merge command to
137 fail if any profiles are invalid, and 'all' causes the merge command to fail
138 only if all profiles are invalid. If 'all' is set, information from any
139 invalid profiles is excluded from the final merged product. The default
140 failure mode is 'any'.
141
Wei Mi67bb1602020-05-13 15:11:49 -0700142.. option:: -prof-sym-list=path
143
144 Specify a file which contains a list of symbols to generate profile symbol
145 list in the profile. This option can only be used with sample-based profile
146 in extbinary format. The entries in this file are newline-separated.
147
148.. option:: -compress-all-sections=[true|false]
149
150 Compress all sections when writing the profile. This option can only be used
151 with sample-based profile in extbinary format.
152
153.. option:: -use-md5=[true|false]
154
155 Use MD5 to represent string in name table when writing the profile.
156 This option can only be used with sample-based profile in extbinary format.
157
158.. option:: -gen-partial-profile=[true|false]
159
160 Mark the profile to be a partial profile which only provides partial profile
161 coverage for the optimized target. This option can only be used with
162 sample-based profile in extbinary format.
163
Wei Mia23f6232020-07-08 15:19:44 -0700164.. option:: -supplement-instr-with-sample=path_to_sample_profile
165
166 Supplement an instrumentation profile with sample profile. The sample profile
167 is the input of the flag. Output will be in instrumentation format (only works
168 with -instr).
169
170.. option:: -zero-counter-threshold=threshold_float_number
171
172 For the function which is cold in instr profile but hot in sample profile, if
173 the ratio of the number of zero counters divided by the the total number of
174 counters is above the threshold, the profile of the function will be regarded
175 as being harmful for performance and will be dropped.
176
177.. option:: -instr-prof-cold-threshold=threshold_int_number
178
179 User specified cold threshold for instr profile which will override the cold
180 threshold got from profile summary.
181
182.. option:: -suppl-min-size-threshold=threshold_int_number
183
184 If the size of a function is smaller than the threshold, assume it can be
185 inlined by PGO early inliner and it will not be adjusted based on sample
186 profile.
187
Nathan Slingerland7f5b47d2015-12-15 17:37:09 +0000188EXAMPLES
189^^^^^^^^
190Basic Usage
191+++++++++++
192Merge three profiles:
193
194::
195
196 llvm-profdata merge foo.profdata bar.profdata baz.profdata -output merged.profdata
197
198Weighted Input
199++++++++++++++
200The input file `foo.profdata` is especially important, multiply its counts by 10:
201
202::
203
204 llvm-profdata merge -weighted-input=10,foo.profdata bar.profdata baz.profdata -output merged.profdata
205
206Exactly equivalent to the previous invocation (explicit form; useful for programmatic invocation):
207
208::
209
210 llvm-profdata merge -weighted-input=10,foo.profdata -weighted-input=1,bar.profdata -weighted-input=1,baz.profdata -output merged.profdata
211
Alex Lorenzbf4508b2014-07-30 20:30:11 +0000212.. program:: llvm-profdata show
213
Justin Bogner22b9f6a2015-03-12 01:38:50 +0000214.. _profdata-show:
Alex Lorenzbf4508b2014-07-30 20:30:11 +0000215
216SHOW
217----
218
219SYNOPSIS
220^^^^^^^^
221
222:program:`llvm-profdata show` [*options*] [*filename*]
223
224DESCRIPTION
225^^^^^^^^^^^
226
227:program:`llvm-profdata show` takes a profile data file and displays the
228information about the profile counters for this file and
229for any of the specified function(s).
230
231If *filename* is omitted or is ``-``, then **llvm-profdata show** reads its
232input from standard input.
233
234OPTIONS
235^^^^^^^
236
237.. option:: -all-functions
238
239 Print details for every function.
240
241.. option:: -counts
242
243 Print the counter values for the displayed functions.
244
245.. option:: -function=string
246
247 Print details for a function if the function's name contains the given string.
248
249.. option:: -help
250
251 Print a summary of command line options.
252
253.. option:: -output=output, -o=output
254
255 Specify the output file name. If *output* is ``-`` or it isn't specified,
256 then the output is sent to standard output.
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +0000257
Diego Novillo6555adb2015-05-28 21:57:17 +0000258.. option:: -instr (default)
259
260 Specify that the input profile is an instrumentation-based profile.
261
Xinliang David Li6f7c19a2015-11-23 20:47:38 +0000262.. option:: -text
263
264 Instruct the profile dumper to show profile counts in the text format of the
265 instrumentation-based profile data representation. By default, the profile
266 information is dumped in a more human readable form (also in text) with
267 annotations.
268
Xinliang David Li801b5312017-07-11 20:30:43 +0000269.. option:: -topn=n
Rong Xu52aa2242019-01-08 22:41:48 +0000270
Xinliang David Li801b5312017-07-11 20:30:43 +0000271 Instruct the profile dumper to show the top ``n`` functions with the
272 hottest basic blocks in the summary section. By default, the topn functions
273 are not dumped.
274
Diego Novillo6555adb2015-05-28 21:57:17 +0000275.. option:: -sample
276
277 Specify that the input profile is a sample-based profile.
278
Rong Xu60faea12017-03-16 21:15:48 +0000279.. option:: -memop-sizes
280
281 Show the profiled sizes of the memory intrinsic calls for shown functions.
282
Rong Xu52aa2242019-01-08 22:41:48 +0000283.. option:: -value-cutoff=n
284
285 Show only those functions whose max count values are greater or equal to ``n``.
286 By default, the value-cutoff is set to 0.
287
288.. option:: -list-below-cutoff
289
290 Only output names of functions whose max count value are below the cutoff
291 value.
292
Rong Xua6ff69f2019-02-28 19:55:07 +0000293.. option:: -showcs
Rong Xu4f471ee2019-04-18 07:11:05 +0000294
Rong Xua6ff69f2019-02-28 19:55:07 +0000295 Only show context sensitive profile counts. The default is to filter all
296 context sensitive profile counts.
297
Wei Mi67bb1602020-05-13 15:11:49 -0700298.. option:: -show-prof-sym-list=[true|false]
299
300 Show profile symbol list if it exists in the profile. This option is only
301 meaningful for sample-based profile in extbinary format.
302
303.. option:: -show-sec-info-only=[true|false]
304
305 Show basic information about each section in the profile. This option is
306 only meaningful for sample-based profile in extbinary format.
307
Rong Xu998b97f2019-04-30 21:19:12 +0000308.. program:: llvm-profdata overlap
309
310.. _profdata-overlap:
311
312OVERLAP
313-------
314
315SYNOPSIS
316^^^^^^^^
317
318:program:`llvm-profdata overlap` [*options*] [*base profile file*] [*test profile file*]
319
320DESCRIPTION
321^^^^^^^^^^^
322
323:program:`llvm-profdata overlap` takes two profile data files and displays the
324*overlap* of counter distribution between the whole files and between any of the
325specified functions.
326
327In this command, *overlap* is defined as follows:
328Suppose *base profile file* has the following counts:
329{c1_1, c1_2, ..., c1_n, c1_u_1, c2_u_2, ..., c2_u_s},
330and *test profile file* has
331{c2_1, c2_2, ..., c2_n, c2_v_1, c2_v_2, ..., c2_v_t}.
332Here c{1|2}_i (i = 1 .. n) are matched counters and c1_u_i (i = 1 .. s) and
333c2_v_i (i = 1 .. v) are unmatched counters (or counters only existing in)
334*base profile file* and *test profile file*, respectively.
335Let sum_1 = c1_1 + c1_2 + ... + c1_n + c1_u_1 + c2_u_2 + ... + c2_u_s, and
336sum_2 = c2_1 + c2_2 + ... + c2_n + c2_v_1 + c2_v_2 + ... + c2_v_t.
337*overlap* = min(c1_1/sum_1, c2_1/sum_2) + min(c1_2/sum_1, c2_2/sum_2) + ...
Rong Xub1f95772019-04-30 22:35:35 +0000338+ min(c1_n/sum_1, c2_n/sum_2).
Rong Xu998b97f2019-04-30 21:19:12 +0000339
340The result overlap distribution is a percentage number, ranging from 0.0% to
341100.0%, where 0.0% means there is no overlap and 100.0% means a perfect
342overlap.
343
344Here is an example, if *base profile file* has counts of {400, 600}, and
345*test profile file* has matched counts of {60000, 40000}. The *overlap* is 80%.
346
Rong Xu998b97f2019-04-30 21:19:12 +0000347OPTIONS
348^^^^^^^
349
350.. option:: -function=string
351
352 Print details for a function if the function's name contains the given string.
353
354.. option:: -help
355
356 Print a summary of command line options.
357
358.. option:: -o=output or -o output
359
360 Specify the output file name. If *output* is ``-`` or it isn't specified,
361 then the output is sent to standard output.
362
363.. option:: -value-cutoff=n
364
365 Show only those functions whose max count values are greater or equal to ``n``.
366 By default, the value-cutoff is set to max of unsigned long long.
367
368.. option:: -cs
369
370 Only show overlap for the context sensitive profile counts. The default is to show
371 non-context sensitive profile counts.
372
Duncan P. N. Exon Smith846a6272014-02-17 23:22:49 +0000373EXIT STATUS
374-----------
375
Alex Lorenzbf4508b2014-07-30 20:30:11 +0000376:program:`llvm-profdata` returns 1 if the command is omitted or is invalid,
377if it cannot read input files, or if there is a mismatch between their data.