Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 1 | perf-report(1) |
Ingo Molnar | c1c2365 | 2009-05-30 12:38:51 +0200 | [diff] [blame] | 2 | ============== |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 3 | |
| 4 | NAME |
| 5 | ---- |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 6 | perf-report - Read perf.data (created by perf record) and display the profile |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 7 | |
| 8 | SYNOPSIS |
| 9 | -------- |
| 10 | [verse] |
| 11 | 'perf report' [-i <file> | --input=file] |
| 12 | |
| 13 | DESCRIPTION |
| 14 | ----------- |
| 15 | This command displays the performance counter profile information recorded |
Ingo Molnar | b0a2858 | 2009-06-23 16:39:53 +0200 | [diff] [blame] | 16 | via perf record. |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 17 | |
| 18 | OPTIONS |
| 19 | ------- |
| 20 | -i:: |
| 21 | --input=:: |
Robert Richter | efad141 | 2011-12-07 10:02:54 +0100 | [diff] [blame] | 22 | Input file name. (default: perf.data unless stdin is a fifo) |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 23 | |
| 24 | -v:: |
| 25 | --verbose:: |
| 26 | Be more verbose. (show symbol address, etc) |
| 27 | |
Namhyung Kim | 27fafab | 2017-02-17 17:17:39 +0900 | [diff] [blame] | 28 | -q:: |
| 29 | --quiet:: |
| 30 | Do not show any message. (Suppress -v) |
| 31 | |
Zeev Tarantov | ca2b900 | 2009-11-09 13:26:13 +0200 | [diff] [blame] | 32 | -n:: |
| 33 | --show-nr-samples:: |
Arnaldo Carvalho de Melo | e3d7e18 | 2009-07-11 12:18:37 -0300 | [diff] [blame] | 34 | Show the number of samples for each symbol |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 35 | |
Namhyung Kim | b272a59 | 2015-10-25 00:49:25 +0900 | [diff] [blame] | 36 | --show-cpu-utilization:: |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 37 | Show sample percentage for different cpu modes. |
| 38 | |
Zeev Tarantov | ca2b900 | 2009-11-09 13:26:13 +0200 | [diff] [blame] | 39 | -T:: |
| 40 | --threads:: |
Namhyung Kim | 1f91d5f | 2015-05-10 00:19:42 +0900 | [diff] [blame] | 41 | Show per-thread event counters. The input data file should be recorded |
| 42 | with -s option. |
David Ahern | c8e6672 | 2011-11-13 11:30:08 -0700 | [diff] [blame] | 43 | -c:: |
Arnaldo Carvalho de Melo | cc8b88b | 2009-06-30 19:01:21 -0300 | [diff] [blame] | 44 | --comms=:: |
| 45 | Only consider symbols in these comms. CSV that understands |
Namhyung Kim | f214833 | 2014-01-14 11:52:48 +0900 | [diff] [blame] | 46 | file://filename entries. This option will affect the percentage of |
| 47 | the overhead column. See --percentage for more info. |
David Ahern | e03eaa4 | 2015-03-24 09:52:41 -0600 | [diff] [blame] | 48 | --pid=:: |
| 49 | Only show events for given process ID (comma separated list). |
| 50 | |
| 51 | --tid=:: |
| 52 | Only show events for given thread ID (comma separated list). |
Namhyung Kim | f214833 | 2014-01-14 11:52:48 +0900 | [diff] [blame] | 53 | -d:: |
| 54 | --dsos=:: |
| 55 | Only consider symbols in these dsos. CSV that understands |
| 56 | file://filename entries. This option will affect the percentage of |
| 57 | the overhead column. See --percentage for more info. |
Arnaldo Carvalho de Melo | 7bec7a9 | 2009-06-30 19:01:22 -0300 | [diff] [blame] | 58 | -S:: |
| 59 | --symbols=:: |
| 60 | Only consider these symbols. CSV that understands |
Namhyung Kim | f214833 | 2014-01-14 11:52:48 +0900 | [diff] [blame] | 61 | file://filename entries. This option will affect the percentage of |
| 62 | the overhead column. See --percentage for more info. |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 63 | |
Namhyung Kim | fde0eea | 2012-03-19 11:53:48 +0900 | [diff] [blame] | 64 | --symbol-filter=:: |
| 65 | Only show symbols that match (partially) with this filter. |
| 66 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 67 | -U:: |
| 68 | --hide-unresolved:: |
| 69 | Only display entries resolved to a symbol. |
| 70 | |
Arnaldo Carvalho de Melo | c351c28 | 2009-12-16 13:49:27 -0200 | [diff] [blame] | 71 | -s:: |
| 72 | --sort=:: |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 73 | Sort histogram entries by given key(s) - multiple keys can be specified |
| 74 | in CSV format. Following sort keys are available: |
Hari Bathini | d890a98 | 2017-03-08 02:12:13 +0530 | [diff] [blame] | 75 | pid, comm, dso, symbol, parent, cpu, socket, srcline, weight, |
| 76 | local_weight, cgroup_id. |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 77 | |
| 78 | Each key has following meaning: |
| 79 | |
| 80 | - comm: command (name) of the task which can be read via /proc/<pid>/comm |
| 81 | - pid: command and tid of the task |
| 82 | - dso: name of library or module executed at the time of sample |
Kim Phillips | b74d12d | 2018-03-27 06:09:56 -0500 | [diff] [blame] | 83 | - dso_size: size of library or module executed at the time of sample |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 84 | - symbol: name of function executed at the time of sample |
Charles Baylis | 7768f8d | 2017-02-24 13:32:56 +0000 | [diff] [blame] | 85 | - symbol_size: size of function executed at the time of sample |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 86 | - parent: name of function matched to the parent regex filter. Unmatched |
| 87 | entries are displayed as "[other]". |
| 88 | - cpu: cpu number the task ran at the time of sample |
Kan Liang | 2e7ea3a | 2015-09-04 10:45:43 -0400 | [diff] [blame] | 89 | - socket: processor socket number the task ran at the time of sample |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 90 | - srcline: filename and line number executed at the time of sample. The |
Andi Kleen | f5d05bc | 2013-09-20 07:40:41 -0700 | [diff] [blame] | 91 | DWARF debugging info must be provided. |
Andi Kleen | 31191a8 | 2015-08-07 15:54:24 -0700 | [diff] [blame] | 92 | - srcfile: file name of the source file of the same. Requires dwarf |
| 93 | information. |
Andi Kleen | 475eeab | 2013-09-20 07:40:43 -0700 | [diff] [blame] | 94 | - weight: Event specific weight, e.g. memory latency or transaction |
| 95 | abort cost. This is the global weight. |
| 96 | - local_weight: Local weight version of the weight above. |
Hari Bathini | d890a98 | 2017-03-08 02:12:13 +0530 | [diff] [blame] | 97 | - cgroup_id: ID derived from cgroup namespace device and inode numbers. |
Andi Kleen | 475eeab | 2013-09-20 07:40:43 -0700 | [diff] [blame] | 98 | - transaction: Transaction abort flags. |
Namhyung Kim | a2ce067 | 2014-03-04 09:06:42 +0900 | [diff] [blame] | 99 | - overhead: Overhead percentage of sample |
| 100 | - overhead_sys: Overhead percentage of sample running in system mode |
| 101 | - overhead_us: Overhead percentage of sample running in user mode |
| 102 | - overhead_guest_sys: Overhead percentage of sample running in system mode |
| 103 | on guest machine |
| 104 | - overhead_guest_us: Overhead percentage of sample running in user mode on |
| 105 | guest machine |
| 106 | - sample: Number of sample |
| 107 | - period: Raw number of event count of sample |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 108 | |
| 109 | By default, comm, dso and symbol keys are used. |
| 110 | (i.e. --sort comm,dso,symbol) |
| 111 | |
| 112 | If --branch-stack option is used, following sort keys are also |
| 113 | available: |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 114 | |
| 115 | - dso_from: name of library or module branched from |
| 116 | - dso_to: name of library or module branched to |
| 117 | - symbol_from: name of function branched from |
| 118 | - symbol_to: name of function branched to |
Andi Kleen | 508be0d | 2016-05-20 13:15:08 -0700 | [diff] [blame] | 119 | - srcline_from: source file and line branched from |
| 120 | - srcline_to: source file and line branched to |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 121 | - mispredict: "N" for predicted branch, "Y" for mispredicted branch |
Andi Kleen | f5d05bc | 2013-09-20 07:40:41 -0700 | [diff] [blame] | 122 | - in_tx: branch in TSX transaction |
| 123 | - abort: TSX transaction abort. |
Andi Kleen | 0e332f0 | 2015-07-18 08:24:46 -0700 | [diff] [blame] | 124 | - cycles: Cycles in basic block |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 125 | |
| 126 | And default sort keys are changed to comm, dso_from, symbol_from, dso_to |
| 127 | and symbol_to, see '--branch-stack'. |
Arnaldo Carvalho de Melo | c351c28 | 2009-12-16 13:49:27 -0200 | [diff] [blame] | 128 | |
Namhyung Kim | c6f5f6b | 2016-02-03 23:11:20 +0900 | [diff] [blame] | 129 | If the --mem-mode option is used, the following sort keys are also available |
| 130 | (incompatible with --branch-stack): |
| 131 | symbol_daddr, dso_daddr, locked, tlb, mem, snoop, dcacheline. |
| 132 | |
| 133 | - symbol_daddr: name of data symbol being executed on at the time of sample |
| 134 | - dso_daddr: name of library or module containing the data being executed |
| 135 | on at the time of the sample |
| 136 | - locked: whether the bus was locked at the time of the sample |
| 137 | - tlb: type of tlb access for the data at the time of the sample |
| 138 | - mem: type of memory access for the data at the time of the sample |
| 139 | - snoop: type of snoop (if any) for the data at the time of the sample |
| 140 | - dcacheline: the cacheline the data address is on at the time of the sample |
Kan Liang | 8780fb2 | 2017-08-29 13:11:09 -0400 | [diff] [blame] | 141 | - phys_daddr: physical address of data being executed on at the time of sample |
Namhyung Kim | c6f5f6b | 2016-02-03 23:11:20 +0900 | [diff] [blame] | 142 | |
| 143 | And the default sort keys are changed to local_weight, mem, sym, dso, |
| 144 | symbol_daddr, dso_daddr, snoop, tlb, locked, see '--mem-mode'. |
| 145 | |
Namhyung Kim | 4c96bee | 2016-01-05 19:58:36 +0900 | [diff] [blame] | 146 | If the data file has tracepoint event(s), following (dynamic) sort keys |
| 147 | are also available: |
| 148 | trace, trace_fields, [<event>.]<field>[/raw] |
| 149 | |
| 150 | - trace: pretty printed trace output in a single column |
| 151 | - trace_fields: fields in tracepoints in separate columns |
| 152 | - <field name>: optional event and field name for a specific field |
| 153 | |
| 154 | The last form consists of event and field names. If event name is |
| 155 | omitted, it searches all events for matching field name. The matched |
| 156 | field will be shown only for the event has the field. The event name |
| 157 | supports substring match so user doesn't need to specify full subsystem |
| 158 | and event name everytime. For example, 'sched:sched_switch' event can |
| 159 | be shortened to 'switch' as long as it's not ambiguous. Also event can |
| 160 | be specified by its index (starting from 1) preceded by the '%'. |
| 161 | So '%1' is the first event, '%2' is the second, and so on. |
| 162 | |
| 163 | The field name can have '/raw' suffix which disables pretty printing |
| 164 | and shows raw field value like hex numbers. The --raw-trace option |
| 165 | has the same effect for all dynamic sort keys. |
| 166 | |
| 167 | The default sort keys are changed to 'trace' if all events in the data |
| 168 | file are tracepoint. |
| 169 | |
Namhyung Kim | a7d945b | 2014-03-04 10:46:34 +0900 | [diff] [blame] | 170 | -F:: |
| 171 | --fields=:: |
| 172 | Specify output field - multiple keys can be specified in CSV format. |
| 173 | Following fields are available: |
Namhyung Kim | 793aaaa | 2013-10-30 17:05:55 +0900 | [diff] [blame] | 174 | overhead, overhead_sys, overhead_us, overhead_children, sample and period. |
Namhyung Kim | a7d945b | 2014-03-04 10:46:34 +0900 | [diff] [blame] | 175 | Also it can contain any sort key(s). |
| 176 | |
| 177 | By default, every sort keys not specified in -F will be appended |
| 178 | automatically. |
| 179 | |
Changbin Du | d35fa1e | 2017-03-13 16:32:52 +0800 | [diff] [blame] | 180 | If the keys starts with a prefix '+', then it will append the specified |
| 181 | field(s) to the default field order. For example: perf report -F +period,sample. |
| 182 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 183 | -p:: |
| 184 | --parent=<regex>:: |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 185 | A regex filter to identify parent. The parent is a caller of this |
| 186 | function and searched through the callchain, thus it requires callchain |
Kim Phillips | 1291927 | 2017-05-03 13:13:50 +0100 | [diff] [blame] | 187 | information recorded. The pattern is in the extended regex format and |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 188 | defaults to "\^sys_|^do_page_fault", see '--sort parent'. |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 189 | |
| 190 | -x:: |
| 191 | --exclude-other:: |
| 192 | Only display entries with parent-match. |
| 193 | |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 194 | -w:: |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 195 | --column-widths=<width[,width...]>:: |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 196 | Force each column width to the provided list, for large terminal |
Namhyung Kim | cf59002 | 2014-07-31 14:47:39 +0900 | [diff] [blame] | 197 | readability. 0 means no limit (default behavior). |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 198 | |
| 199 | -t:: |
| 200 | --field-separator=:: |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 201 | Use a special separator character and don't pad with spaces, replacing |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 202 | all occurrences of this separator in symbol names (and other output) |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 203 | with a '.' character, that thus it's the only non valid separator. |
| 204 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 205 | -D:: |
| 206 | --dump-raw-trace:: |
| 207 | Dump raw trace in ASCII. |
| 208 | |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 209 | -g:: |
Namhyung Kim | f2af008 | 2015-11-09 14:45:41 +0900 | [diff] [blame] | 210 | --call-graph=<print_type,threshold[,print_limit],order,sort_key[,branch],value>:: |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 211 | Display call chains using type, min percent threshold, print limit, |
Kim Phillips | 1291927 | 2017-05-03 13:13:50 +0100 | [diff] [blame] | 212 | call order, sort key, optional branch and value. Note that ordering |
| 213 | is not fixed so any parameter can be given in an arbitrary order. |
Namhyung Kim | f2af008 | 2015-11-09 14:45:41 +0900 | [diff] [blame] | 214 | One exception is the print_limit which should be preceded by threshold. |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 215 | |
| 216 | print_type can be either: |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 217 | - flat: single column, linear exposure of call chains. |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 218 | - graph: use a graph tree, displaying absolute overhead rates. (default) |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 219 | - fractal: like graph, but displays relative rates. Each branch of |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 220 | the tree is considered as a new profiled object. |
Namhyung Kim | 26e7792 | 2015-11-09 14:45:37 +0900 | [diff] [blame] | 221 | - folded: call chains are displayed in a line, separated by semicolons |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 222 | - none: disable call chain display. |
| 223 | |
| 224 | threshold is a percentage value which specifies a minimum percent to be |
| 225 | included in the output call graph. Default is 0.5 (%). |
| 226 | |
| 227 | print_limit is only applied when stdio interface is used. It's to limit |
| 228 | number of call graph entries in a single hist entry. Note that it needs |
| 229 | to be given after threshold (but not necessarily consecutive). |
| 230 | Default is 0 (unlimited). |
Sam Liao | d797fdc | 2011-06-07 23:49:46 +0800 | [diff] [blame] | 231 | |
| 232 | order can be either: |
| 233 | - callee: callee based call graph. |
| 234 | - caller: inverted caller based call graph. |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 235 | Default is 'caller' when --children is used, otherwise 'callee'. |
Sam Liao | d797fdc | 2011-06-07 23:49:46 +0800 | [diff] [blame] | 236 | |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 237 | sort_key can be: |
| 238 | - function: compare on functions (default) |
Andi Kleen | 99571ab | 2013-07-18 15:33:57 -0700 | [diff] [blame] | 239 | - address: compare on individual code addresses |
Milian Wolff | 5dfa210 | 2017-03-18 22:49:28 +0100 | [diff] [blame] | 240 | - srcline: compare on source filename and line number |
Andi Kleen | 99571ab | 2013-07-18 15:33:57 -0700 | [diff] [blame] | 241 | |
Andi Kleen | 8b7bad5 | 2014-11-12 18:05:20 -0800 | [diff] [blame] | 242 | branch can be: |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 243 | - branch: include last branch information in callgraph when available. |
| 244 | Usually more convenient to use --branch-history for this. |
Sam Liao | d797fdc | 2011-06-07 23:49:46 +0800 | [diff] [blame] | 245 | |
Namhyung Kim | f2af008 | 2015-11-09 14:45:41 +0900 | [diff] [blame] | 246 | value can be: |
| 247 | - percent: diplay overhead percent (default) |
| 248 | - period: display event period |
| 249 | - count: display event count |
| 250 | |
Namhyung Kim | 793aaaa | 2013-10-30 17:05:55 +0900 | [diff] [blame] | 251 | --children:: |
| 252 | Accumulate callchain of children to parent entry so that then can |
| 253 | show up in the output. The output will have a new "Children" column |
| 254 | and will be sorted on the data. It requires callchains are recorded. |
Yannick Brosseau | 108a7c1 | 2016-12-02 11:07:32 -0500 | [diff] [blame] | 255 | See the `overhead calculation' section for more details. Enabled by |
| 256 | default, disable with --no-children. |
Namhyung Kim | 793aaaa | 2013-10-30 17:05:55 +0900 | [diff] [blame] | 257 | |
Waiman Long | 91e9561 | 2013-10-18 10:38:48 -0400 | [diff] [blame] | 258 | --max-stack:: |
| 259 | Set the stack depth limit when parsing the callchain, anything |
| 260 | beyond the specified depth will be ignored. This is a trade-off |
| 261 | between information loss and faster processing especially for |
| 262 | workloads that can have a very long callchain stack. |
Adrian Hunter | 40862a7 | 2015-09-29 11:52:37 +0300 | [diff] [blame] | 263 | Note that when using the --itrace option the synthesized callchain size |
| 264 | will override this value if the synthesized callchain size is bigger. |
Waiman Long | 91e9561 | 2013-10-18 10:38:48 -0400 | [diff] [blame] | 265 | |
Arnaldo Carvalho de Melo | fe17608 | 2016-05-19 11:34:06 -0300 | [diff] [blame] | 266 | Default: 127 |
Waiman Long | 91e9561 | 2013-10-18 10:38:48 -0400 | [diff] [blame] | 267 | |
Sam Liao | d797fdc | 2011-06-07 23:49:46 +0800 | [diff] [blame] | 268 | -G:: |
| 269 | --inverted:: |
| 270 | alias for inverted caller based call graph. |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 271 | |
Greg Price | b21484f | 2012-12-06 21:48:05 -0800 | [diff] [blame] | 272 | --ignore-callees=<regex>:: |
| 273 | Ignore callees of the function(s) matching the given regex. |
| 274 | This has the effect of collecting the callers of each such |
| 275 | function into one place in the call-graph tree. |
| 276 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 277 | --pretty=<key>:: |
| 278 | Pretty printing style. key: normal, raw |
| 279 | |
Arnaldo Carvalho de Melo | 8b9e74e | 2010-08-21 10:38:16 -0300 | [diff] [blame] | 280 | --stdio:: Use the stdio interface. |
| 281 | |
Arnaldo Carvalho de Melo | 175b968 | 2016-07-05 11:14:38 -0300 | [diff] [blame] | 282 | --stdio-color:: |
| 283 | 'always', 'never' or 'auto', allowing configuring color output |
| 284 | via the command line, in addition to via "color.ui" .perfconfig. |
| 285 | Use '--stdio-color always' to generate color even when redirecting |
| 286 | to a pipe or file. Using just '--stdio-color' is equivalent to |
| 287 | using 'always'. |
| 288 | |
Arnaldo Carvalho de Melo | 8b9e74e | 2010-08-21 10:38:16 -0300 | [diff] [blame] | 289 | --tui:: Use the TUI interface, that is integrated with annotate and allows |
| 290 | zooming into DSOs or threads, among other features. Use of --tui |
| 291 | requires a tty, if one is not present, as when piping to other |
| 292 | commands, the stdio interface is used. |
| 293 | |
Pekka Enberg | c31a945 | 2012-03-19 15:13:29 -0300 | [diff] [blame] | 294 | --gtk:: Use the GTK2 interface. |
| 295 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 296 | -k:: |
| 297 | --vmlinux=<file>:: |
| 298 | vmlinux pathname |
| 299 | |
Arnaldo Carvalho de Melo | 91340c51 | 2018-03-16 16:27:04 -0300 | [diff] [blame] | 300 | --ignore-vmlinux:: |
| 301 | Ignore vmlinux files. |
| 302 | |
David Ahern | b226a5a7 | 2010-12-07 19:39:46 -0700 | [diff] [blame] | 303 | --kallsyms=<file>:: |
| 304 | kallsyms pathname |
| 305 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 306 | -m:: |
| 307 | --modules:: |
| 308 | Load module symbols. WARNING: This should only be used with -k and |
| 309 | a LIVE kernel. |
| 310 | |
| 311 | -f:: |
| 312 | --force:: |
Jiri Olsa | e0be62c | 2016-03-24 13:52:19 +0100 | [diff] [blame] | 313 | Don't do ownership validation. |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 314 | |
David Ahern | ec5761e | 2010-12-09 13:27:07 -0700 | [diff] [blame] | 315 | --symfs=<directory>:: |
| 316 | Look for files with symbols relative to this directory. |
| 317 | |
David Ahern | c8e6672 | 2011-11-13 11:30:08 -0700 | [diff] [blame] | 318 | -C:: |
Anton Blanchard | 5d67be9 | 2011-07-04 21:57:50 +1000 | [diff] [blame] | 319 | --cpu:: Only report samples for the list of CPUs provided. Multiple CPUs can |
| 320 | be provided as a comma-separated list with no space: 0,1. Ranges of |
| 321 | CPUs are specified with -: 0-2. Default is to report samples on all |
| 322 | CPUs. |
| 323 | |
Andi Kleen | f69b64f | 2011-09-15 14:31:41 -0700 | [diff] [blame] | 324 | -M:: |
| 325 | --disassembler-style=:: Set disassembler style for objdump. |
| 326 | |
Arnaldo Carvalho de Melo | 64c6f0c | 2011-10-06 12:48:31 -0300 | [diff] [blame] | 327 | --source:: |
| 328 | Interleave source code with assembly code. Enabled by default, |
| 329 | disable with --no-source. |
| 330 | |
| 331 | --asm-raw:: |
| 332 | Show raw instruction encoding of assembly instructions. |
| 333 | |
Arnaldo Carvalho de Melo | 3f2728b | 2011-10-05 16:10:06 -0300 | [diff] [blame] | 334 | --show-total-period:: Show a column with the sum of periods. |
| 335 | |
Stephane Eranian | fbe96f2 | 2011-09-30 15:40:40 +0200 | [diff] [blame] | 336 | -I:: |
| 337 | --show-info:: |
| 338 | Display extended information about the perf.data file. This adds |
| 339 | information which may be very large and thus may clutter the display. |
| 340 | It currently includes: cpu and numa topology of the host system. |
| 341 | |
Roberto Agostino Vitillo | b50311d | 2012-02-09 23:21:03 +0100 | [diff] [blame] | 342 | -b:: |
| 343 | --branch-stack:: |
| 344 | Use the addresses of sampled taken branches instead of the instruction |
| 345 | address to build the histograms. To generate meaningful output, the |
Stephane Eranian | 993ac88 | 2012-03-08 23:47:47 +0100 | [diff] [blame] | 346 | perf.data file must have been obtained using perf record -b or |
| 347 | perf record --branch-filter xxx where xxx is a branch filter option. |
| 348 | perf report is able to auto-detect whether a perf.data file contains |
| 349 | branch stacks and it will automatically switch to the branch view mode, |
| 350 | unless --no-branch-stack is used. |
Roberto Agostino Vitillo | b50311d | 2012-02-09 23:21:03 +0100 | [diff] [blame] | 351 | |
Andi Kleen | fa94c36 | 2014-11-12 18:05:22 -0800 | [diff] [blame] | 352 | --branch-history:: |
| 353 | Add the addresses of sampled taken branches to the callstack. |
| 354 | This allows to examine the path the program took to each sample. |
| 355 | The data collection must have used -b (or -j) and -g. |
| 356 | |
Maciek Borzecki | 7a4ec93 | 2012-09-04 12:32:30 +0200 | [diff] [blame] | 357 | --objdump=<path>:: |
| 358 | Path to objdump binary. |
| 359 | |
Namhyung Kim | 01d14f1 | 2013-01-22 18:09:45 +0900 | [diff] [blame] | 360 | --group:: |
Jiri Olsa | ad52b8c | 2018-02-09 10:27:34 +0100 | [diff] [blame] | 361 | Show event group information together. It forces group output also |
| 362 | if there are no groups defined in data file. |
Namhyung Kim | 01d14f1 | 2013-01-22 18:09:45 +0900 | [diff] [blame] | 363 | |
Namhyung Kim | 328ccda | 2013-03-25 18:18:18 +0900 | [diff] [blame] | 364 | --demangle:: |
| 365 | Demangle symbol names to human readable form. It's enabled by default, |
| 366 | disable with --no-demangle. |
| 367 | |
Avi Kivity | 763122a | 2014-09-13 07:15:05 +0300 | [diff] [blame] | 368 | --demangle-kernel:: |
| 369 | Demangle kernel symbol names to human readable form (for C++ kernels). |
| 370 | |
Don Zickus | 75e906c | 2014-05-23 18:41:23 +0200 | [diff] [blame] | 371 | --mem-mode:: |
| 372 | Use the data addresses of samples in addition to instruction addresses |
| 373 | to build the histograms. To generate meaningful output, the perf.data |
| 374 | file must have been obtained using perf record -d -W and using a |
Andi Kleen | fc2f523 | 2018-02-09 08:39:09 -0800 | [diff] [blame] | 375 | special event -e cpu/mem-loads/p or -e cpu/mem-stores/p. See |
Don Zickus | 75e906c | 2014-05-23 18:41:23 +0200 | [diff] [blame] | 376 | 'perf mem' for simpler access. |
| 377 | |
Namhyung Kim | 064f198 | 2013-05-14 11:09:04 +0900 | [diff] [blame] | 378 | --percent-limit:: |
| 379 | Do not show entries which have an overhead under that percent. |
Namhyung Kim | 1ba2fc6 | 2016-02-03 23:11:21 +0900 | [diff] [blame] | 380 | (Default: 0). Note that this option also sets the percent limit (threshold) |
| 381 | of callchains. However the default value of callchain threshold is |
| 382 | different than the default value of hist entries. Please see the |
| 383 | --call-graph option for details. |
Namhyung Kim | 064f198 | 2013-05-14 11:09:04 +0900 | [diff] [blame] | 384 | |
Namhyung Kim | f214833 | 2014-01-14 11:52:48 +0900 | [diff] [blame] | 385 | --percentage:: |
| 386 | Determine how to display the overhead percentage of filtered entries. |
| 387 | Filters can be applied by --comms, --dsos and/or --symbols options and |
| 388 | Zoom operations on the TUI (thread, dso, etc). |
| 389 | |
| 390 | "relative" means it's relative to filtered entries only so that the |
| 391 | sum of shown entries will be always 100%. "absolute" means it retains |
| 392 | the original value before and after the filter is applied. |
| 393 | |
Jiri Olsa | 5cfe2c8 | 2013-12-09 11:02:49 +0100 | [diff] [blame] | 394 | --header:: |
| 395 | Show header information in the perf.data file. This includes |
| 396 | various information like hostname, OS and perf version, cpu/mem |
| 397 | info, perf command line, event list and so on. Currently only |
| 398 | --stdio output supports this feature. |
| 399 | |
| 400 | --header-only:: |
| 401 | Show only perf.data header (forces --stdio). |
| 402 | |
David Ahern | 46690a8 | 2016-11-29 10:15:46 -0700 | [diff] [blame] | 403 | --time:: |
| 404 | Only analyze samples within given time window: <start>,<stop>. Times |
| 405 | have the format seconds.microseconds. If start is not given (i.e., time |
| 406 | string is ',x.y') then analysis starts at the beginning of the file. If |
| 407 | stop time is not given (i.e, time string is 'x.y,') then analysis goes |
| 408 | to end of file. |
| 409 | |
Jin Yao | 5b969bc | 2017-12-08 21:13:45 +0800 | [diff] [blame] | 410 | Also support time percent with multiple time range. Time string is |
Jin Yao | 0a3cc3a | 2018-01-10 23:00:32 +0800 | [diff] [blame] | 411 | 'a%/n,b%/m,...' or 'a%-b%,c%-%d,...'. |
Jin Yao | 5b969bc | 2017-12-08 21:13:45 +0800 | [diff] [blame] | 412 | |
| 413 | For example: |
| 414 | Select the second 10% time slice: |
| 415 | |
| 416 | perf report --time 10%/2 |
| 417 | |
| 418 | Select from 0% to 10% time slice: |
| 419 | |
| 420 | perf report --time 0%-10% |
| 421 | |
| 422 | Select the first and second 10% time slices: |
| 423 | |
| 424 | perf report --time 10%/1,10%/2 |
| 425 | |
| 426 | Select from 0% to 10% and 30% to 40% slices: |
| 427 | |
| 428 | perf report --time 0%-10%,30%-40% |
| 429 | |
Adrian Hunter | 520a2eb | 2015-04-24 22:29:45 +0300 | [diff] [blame] | 430 | --itrace:: |
| 431 | Options for decoding instruction tracing data. The options are: |
| 432 | |
Adrian Hunter | 60b88d8 | 2015-07-17 19:33:44 +0300 | [diff] [blame] | 433 | include::itrace.txt[] |
Adrian Hunter | 520a2eb | 2015-04-24 22:29:45 +0300 | [diff] [blame] | 434 | |
| 435 | To disable decoding entirely, use --no-itrace. |
| 436 | |
Andi Kleen | a9710ba | 2015-08-07 15:24:05 -0700 | [diff] [blame] | 437 | --full-source-path:: |
| 438 | Show the full path for source files for srcline output. |
Adrian Hunter | 64a7e61 | 2015-04-30 17:37:24 +0300 | [diff] [blame] | 439 | |
Kan Liang | 9e207dd | 2015-08-11 06:30:49 -0400 | [diff] [blame] | 440 | --show-ref-call-graph:: |
| 441 | When multiple events are sampled, it may not be needed to collect |
| 442 | callgraphs for all of them. The sample sites are usually nearby, |
| 443 | and it's enough to collect the callgraphs on a reference event. |
| 444 | So user can use "call-graph=no" event modifier to disable callgraph |
| 445 | for other events to reduce the overhead. |
| 446 | However, perf report cannot show callgraphs for the event which |
| 447 | disable the callgraph. |
| 448 | This option extends the perf report to show reference callgraphs, |
| 449 | which collected by reference event, in no callgraph event. |
| 450 | |
Kan Liang | 21394d9 | 2015-09-04 10:45:44 -0400 | [diff] [blame] | 451 | --socket-filter:: |
| 452 | Only report the samples on the processor socket that match with this filter |
| 453 | |
Namhyung Kim | 053a398 | 2015-12-23 02:07:05 +0900 | [diff] [blame] | 454 | --raw-trace:: |
| 455 | When displaying traceevent output, do not use print fmt or plugins. |
| 456 | |
Namhyung Kim | 4251446 | 2016-02-25 00:13:48 +0900 | [diff] [blame] | 457 | --hierarchy:: |
| 458 | Enable hierarchical output. |
| 459 | |
Jin Yao | f3a6064 | 2017-03-26 04:34:27 +0800 | [diff] [blame] | 460 | --inline:: |
| 461 | If a callgraph address belongs to an inlined function, the inline stack |
Milian Wolff | d8a88dd | 2017-10-19 13:38:36 +0200 | [diff] [blame] | 462 | will be printed. Each entry is function name or file/line. Enabled by |
| 463 | default, disable with --no-inline. |
Jin Yao | f3a6064 | 2017-03-26 04:34:27 +0800 | [diff] [blame] | 464 | |
Arnaldo Carvalho de Melo | 6439d7d | 2018-01-09 15:25:03 -0300 | [diff] [blame] | 465 | --mmaps:: |
| 466 | Show --tasks output plus mmap information in a format similar to |
| 467 | /proc/<PID>/maps. |
| 468 | |
| 469 | Please note that not all mmaps are stored, options affecting which ones |
| 470 | are include 'perf record --data', for instance. |
| 471 | |
Jiri Olsa | a4a4d0a | 2018-01-07 17:03:55 +0100 | [diff] [blame] | 472 | --stats:: |
| 473 | Display overall events statistics without any further processing. |
| 474 | (like the one at the end of the perf report -D command) |
| 475 | |
Jiri Olsa | 930f8b3 | 2018-01-07 17:03:56 +0100 | [diff] [blame] | 476 | --tasks:: |
| 477 | Display monitored tasks stored in perf data. Displaying pid/tid/ppid |
| 478 | plus the command string aligned to distinguish parent and child tasks. |
| 479 | |
Adrian Hunter | 64a7e61 | 2015-04-30 17:37:24 +0300 | [diff] [blame] | 480 | include::callchain-overhead-calculation.txt[] |
| 481 | |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 482 | SEE ALSO |
| 483 | -------- |
Arnaldo Carvalho de Melo | 6439d7d | 2018-01-09 15:25:03 -0300 | [diff] [blame] | 484 | linkperf:perf-stat[1], linkperf:perf-annotate[1], linkperf:perf-record[1] |