Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 1 | perf-report(1) |
Ingo Molnar | c1c2365 | 2009-05-30 12:38:51 +0200 | [diff] [blame] | 2 | ============== |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 3 | |
| 4 | NAME |
| 5 | ---- |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 6 | perf-report - Read perf.data (created by perf record) and display the profile |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 7 | |
| 8 | SYNOPSIS |
| 9 | -------- |
| 10 | [verse] |
| 11 | 'perf report' [-i <file> | --input=file] |
| 12 | |
| 13 | DESCRIPTION |
| 14 | ----------- |
| 15 | This command displays the performance counter profile information recorded |
Ingo Molnar | b0a2858 | 2009-06-23 16:39:53 +0200 | [diff] [blame] | 16 | via perf record. |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 17 | |
| 18 | OPTIONS |
| 19 | ------- |
| 20 | -i:: |
| 21 | --input=:: |
Robert Richter | efad141 | 2011-12-07 10:02:54 +0100 | [diff] [blame] | 22 | Input file name. (default: perf.data unless stdin is a fifo) |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 23 | |
| 24 | -v:: |
| 25 | --verbose:: |
| 26 | Be more verbose. (show symbol address, etc) |
| 27 | |
Zeev Tarantov | ca2b900 | 2009-11-09 13:26:13 +0200 | [diff] [blame] | 28 | -n:: |
| 29 | --show-nr-samples:: |
Arnaldo Carvalho de Melo | e3d7e18 | 2009-07-11 12:18:37 -0300 | [diff] [blame] | 30 | Show the number of samples for each symbol |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 31 | |
| 32 | --showcpuutilization:: |
| 33 | Show sample percentage for different cpu modes. |
| 34 | |
Zeev Tarantov | ca2b900 | 2009-11-09 13:26:13 +0200 | [diff] [blame] | 35 | -T:: |
| 36 | --threads:: |
Brice Goglin | 8d51327 | 2009-08-07 13:55:24 +0200 | [diff] [blame] | 37 | Show per-thread event counters |
David Ahern | c8e6672 | 2011-11-13 11:30:08 -0700 | [diff] [blame] | 38 | -c:: |
Arnaldo Carvalho de Melo | cc8b88b | 2009-06-30 19:01:21 -0300 | [diff] [blame] | 39 | --comms=:: |
| 40 | Only consider symbols in these comms. CSV that understands |
Namhyung Kim | f214833 | 2014-01-14 11:52:48 +0900 | [diff] [blame] | 41 | file://filename entries. This option will affect the percentage of |
| 42 | the overhead column. See --percentage for more info. |
| 43 | -d:: |
| 44 | --dsos=:: |
| 45 | Only consider symbols in these dsos. CSV that understands |
| 46 | file://filename entries. This option will affect the percentage of |
| 47 | the overhead column. See --percentage for more info. |
Arnaldo Carvalho de Melo | 7bec7a9 | 2009-06-30 19:01:22 -0300 | [diff] [blame] | 48 | -S:: |
| 49 | --symbols=:: |
| 50 | Only consider these symbols. CSV that understands |
Namhyung Kim | f214833 | 2014-01-14 11:52:48 +0900 | [diff] [blame] | 51 | file://filename entries. This option will affect the percentage of |
| 52 | the overhead column. See --percentage for more info. |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 53 | |
Namhyung Kim | fde0eea | 2012-03-19 11:53:48 +0900 | [diff] [blame] | 54 | --symbol-filter=:: |
| 55 | Only show symbols that match (partially) with this filter. |
| 56 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 57 | -U:: |
| 58 | --hide-unresolved:: |
| 59 | Only display entries resolved to a symbol. |
| 60 | |
Arnaldo Carvalho de Melo | c351c28 | 2009-12-16 13:49:27 -0200 | [diff] [blame] | 61 | -s:: |
| 62 | --sort=:: |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 63 | Sort histogram entries by given key(s) - multiple keys can be specified |
| 64 | in CSV format. Following sort keys are available: |
Andi Kleen | 0548429 | 2013-01-24 16:10:29 +0100 | [diff] [blame] | 65 | pid, comm, dso, symbol, parent, cpu, srcline, weight, local_weight. |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 66 | |
| 67 | Each key has following meaning: |
| 68 | |
| 69 | - comm: command (name) of the task which can be read via /proc/<pid>/comm |
| 70 | - pid: command and tid of the task |
| 71 | - dso: name of library or module executed at the time of sample |
| 72 | - symbol: name of function executed at the time of sample |
| 73 | - parent: name of function matched to the parent regex filter. Unmatched |
| 74 | entries are displayed as "[other]". |
| 75 | - cpu: cpu number the task ran at the time of sample |
| 76 | - srcline: filename and line number executed at the time of sample. The |
Andi Kleen | f5d05bc | 2013-09-20 07:40:41 -0700 | [diff] [blame] | 77 | DWARF debugging info must be provided. |
Andi Kleen | 475eeab | 2013-09-20 07:40:43 -0700 | [diff] [blame] | 78 | - weight: Event specific weight, e.g. memory latency or transaction |
| 79 | abort cost. This is the global weight. |
| 80 | - local_weight: Local weight version of the weight above. |
| 81 | - transaction: Transaction abort flags. |
Namhyung Kim | a2ce067 | 2014-03-04 09:06:42 +0900 | [diff] [blame] | 82 | - overhead: Overhead percentage of sample |
| 83 | - overhead_sys: Overhead percentage of sample running in system mode |
| 84 | - overhead_us: Overhead percentage of sample running in user mode |
| 85 | - overhead_guest_sys: Overhead percentage of sample running in system mode |
| 86 | on guest machine |
| 87 | - overhead_guest_us: Overhead percentage of sample running in user mode on |
| 88 | guest machine |
| 89 | - sample: Number of sample |
| 90 | - period: Raw number of event count of sample |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 91 | |
| 92 | By default, comm, dso and symbol keys are used. |
| 93 | (i.e. --sort comm,dso,symbol) |
| 94 | |
| 95 | If --branch-stack option is used, following sort keys are also |
| 96 | available: |
| 97 | dso_from, dso_to, symbol_from, symbol_to, mispredict. |
| 98 | |
| 99 | - dso_from: name of library or module branched from |
| 100 | - dso_to: name of library or module branched to |
| 101 | - symbol_from: name of function branched from |
| 102 | - symbol_to: name of function branched to |
| 103 | - mispredict: "N" for predicted branch, "Y" for mispredicted branch |
Andi Kleen | f5d05bc | 2013-09-20 07:40:41 -0700 | [diff] [blame] | 104 | - in_tx: branch in TSX transaction |
| 105 | - abort: TSX transaction abort. |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 106 | |
| 107 | And default sort keys are changed to comm, dso_from, symbol_from, dso_to |
| 108 | and symbol_to, see '--branch-stack'. |
Arnaldo Carvalho de Melo | c351c28 | 2009-12-16 13:49:27 -0200 | [diff] [blame] | 109 | |
Namhyung Kim | a7d945b | 2014-03-04 10:46:34 +0900 | [diff] [blame] | 110 | -F:: |
| 111 | --fields=:: |
| 112 | Specify output field - multiple keys can be specified in CSV format. |
| 113 | Following fields are available: |
Namhyung Kim | 793aaaa | 2013-10-30 17:05:55 +0900 | [diff] [blame] | 114 | overhead, overhead_sys, overhead_us, overhead_children, sample and period. |
Namhyung Kim | a7d945b | 2014-03-04 10:46:34 +0900 | [diff] [blame] | 115 | Also it can contain any sort key(s). |
| 116 | |
| 117 | By default, every sort keys not specified in -F will be appended |
| 118 | automatically. |
| 119 | |
Don Zickus | 75e906c | 2014-05-23 18:41:23 +0200 | [diff] [blame] | 120 | If --mem-mode option is used, following sort keys are also available |
| 121 | (incompatible with --branch-stack): |
Don Zickus | 9b32ba7 | 2014-06-01 15:38:29 +0200 | [diff] [blame] | 122 | symbol_daddr, dso_daddr, locked, tlb, mem, snoop, dcacheline. |
Don Zickus | 75e906c | 2014-05-23 18:41:23 +0200 | [diff] [blame] | 123 | |
| 124 | - symbol_daddr: name of data symbol being executed on at the time of sample |
| 125 | - dso_daddr: name of library or module containing the data being executed |
| 126 | on at the time of sample |
| 127 | - locked: whether the bus was locked at the time of sample |
| 128 | - tlb: type of tlb access for the data at the time of sample |
| 129 | - mem: type of memory access for the data at the time of sample |
| 130 | - snoop: type of snoop (if any) for the data at the time of sample |
Don Zickus | 9b32ba7 | 2014-06-01 15:38:29 +0200 | [diff] [blame] | 131 | - dcacheline: the cacheline the data address is on at the time of sample |
Don Zickus | 75e906c | 2014-05-23 18:41:23 +0200 | [diff] [blame] | 132 | |
| 133 | And default sort keys are changed to local_weight, mem, sym, dso, |
| 134 | symbol_daddr, dso_daddr, snoop, tlb, locked, see '--mem-mode'. |
| 135 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 136 | -p:: |
| 137 | --parent=<regex>:: |
Namhyung Kim | 9811360 | 2012-12-27 18:11:47 +0900 | [diff] [blame] | 138 | A regex filter to identify parent. The parent is a caller of this |
| 139 | function and searched through the callchain, thus it requires callchain |
| 140 | information recorded. The pattern is in the exteneded regex format and |
| 141 | defaults to "\^sys_|^do_page_fault", see '--sort parent'. |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 142 | |
| 143 | -x:: |
| 144 | --exclude-other:: |
| 145 | Only display entries with parent-match. |
| 146 | |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 147 | -w:: |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 148 | --column-widths=<width[,width...]>:: |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 149 | Force each column width to the provided list, for large terminal |
Namhyung Kim | cf59002 | 2014-07-31 14:47:39 +0900 | [diff] [blame] | 150 | readability. 0 means no limit (default behavior). |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 151 | |
| 152 | -t:: |
| 153 | --field-separator=:: |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 154 | Use a special separator character and don't pad with spaces, replacing |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 155 | all occurrences of this separator in symbol names (and other output) |
Arnaldo Carvalho de Melo | 52d422d | 2009-07-10 22:47:28 -0300 | [diff] [blame] | 156 | with a '.' character, that thus it's the only non valid separator. |
| 157 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 158 | -D:: |
| 159 | --dump-raw-trace:: |
| 160 | Dump raw trace in ASCII. |
| 161 | |
Andi Kleen | 8b7bad5 | 2014-11-12 18:05:20 -0800 | [diff] [blame^] | 162 | -g [type,min[,limit],order[,key][,branch]]:: |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 163 | --call-graph:: |
Namhyung Kim | 6581f6e | 2011-12-13 00:16:50 +0900 | [diff] [blame] | 164 | Display call chains using type, min percent threshold, optional print |
| 165 | limit and order. |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 166 | type can be either: |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 167 | - flat: single column, linear exposure of call chains. |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 168 | - graph: use a graph tree, displaying absolute overhead rates. |
| 169 | - fractal: like graph, but displays relative rates. Each branch of |
| 170 | the tree is considered as a new profiled object. + |
Sam Liao | d797fdc | 2011-06-07 23:49:46 +0800 | [diff] [blame] | 171 | |
| 172 | order can be either: |
| 173 | - callee: callee based call graph. |
| 174 | - caller: inverted caller based call graph. |
| 175 | |
Andi Kleen | 99571ab | 2013-07-18 15:33:57 -0700 | [diff] [blame] | 176 | key can be: |
| 177 | - function: compare on functions |
| 178 | - address: compare on individual code addresses |
| 179 | |
Andi Kleen | 8b7bad5 | 2014-11-12 18:05:20 -0800 | [diff] [blame^] | 180 | branch can be: |
| 181 | - branch: include last branch information in callgraph |
| 182 | when available. Usually more convenient to use --branch-history |
| 183 | for this. |
| 184 | |
Andi Kleen | 99571ab | 2013-07-18 15:33:57 -0700 | [diff] [blame] | 185 | Default: fractal,0.5,callee,function. |
Sam Liao | d797fdc | 2011-06-07 23:49:46 +0800 | [diff] [blame] | 186 | |
Namhyung Kim | 793aaaa | 2013-10-30 17:05:55 +0900 | [diff] [blame] | 187 | --children:: |
| 188 | Accumulate callchain of children to parent entry so that then can |
| 189 | show up in the output. The output will have a new "Children" column |
| 190 | and will be sorted on the data. It requires callchains are recorded. |
| 191 | |
Waiman Long | 91e9561 | 2013-10-18 10:38:48 -0400 | [diff] [blame] | 192 | --max-stack:: |
| 193 | Set the stack depth limit when parsing the callchain, anything |
| 194 | beyond the specified depth will be ignored. This is a trade-off |
| 195 | between information loss and faster processing especially for |
| 196 | workloads that can have a very long callchain stack. |
| 197 | |
| 198 | Default: 127 |
| 199 | |
Sam Liao | d797fdc | 2011-06-07 23:49:46 +0800 | [diff] [blame] | 200 | -G:: |
| 201 | --inverted:: |
| 202 | alias for inverted caller based call graph. |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 203 | |
Greg Price | b21484f | 2012-12-06 21:48:05 -0800 | [diff] [blame] | 204 | --ignore-callees=<regex>:: |
| 205 | Ignore callees of the function(s) matching the given regex. |
| 206 | This has the effect of collecting the callers of each such |
| 207 | function into one place in the call-graph tree. |
| 208 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 209 | --pretty=<key>:: |
| 210 | Pretty printing style. key: normal, raw |
| 211 | |
Arnaldo Carvalho de Melo | 8b9e74e | 2010-08-21 10:38:16 -0300 | [diff] [blame] | 212 | --stdio:: Use the stdio interface. |
| 213 | |
| 214 | --tui:: Use the TUI interface, that is integrated with annotate and allows |
| 215 | zooming into DSOs or threads, among other features. Use of --tui |
| 216 | requires a tty, if one is not present, as when piping to other |
| 217 | commands, the stdio interface is used. |
| 218 | |
Pekka Enberg | c31a945 | 2012-03-19 15:13:29 -0300 | [diff] [blame] | 219 | --gtk:: Use the GTK2 interface. |
| 220 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 221 | -k:: |
| 222 | --vmlinux=<file>:: |
| 223 | vmlinux pathname |
| 224 | |
David Ahern | b226a5a7 | 2010-12-07 19:39:46 -0700 | [diff] [blame] | 225 | --kallsyms=<file>:: |
| 226 | kallsyms pathname |
| 227 | |
Shawn Bohrer | e04fffc | 2010-11-30 19:57:17 -0600 | [diff] [blame] | 228 | -m:: |
| 229 | --modules:: |
| 230 | Load module symbols. WARNING: This should only be used with -k and |
| 231 | a LIVE kernel. |
| 232 | |
| 233 | -f:: |
| 234 | --force:: |
| 235 | Don't complain, do it. |
| 236 | |
David Ahern | ec5761e | 2010-12-09 13:27:07 -0700 | [diff] [blame] | 237 | --symfs=<directory>:: |
| 238 | Look for files with symbols relative to this directory. |
| 239 | |
David Ahern | c8e6672 | 2011-11-13 11:30:08 -0700 | [diff] [blame] | 240 | -C:: |
Anton Blanchard | 5d67be9 | 2011-07-04 21:57:50 +1000 | [diff] [blame] | 241 | --cpu:: Only report samples for the list of CPUs provided. Multiple CPUs can |
| 242 | be provided as a comma-separated list with no space: 0,1. Ranges of |
| 243 | CPUs are specified with -: 0-2. Default is to report samples on all |
| 244 | CPUs. |
| 245 | |
Andi Kleen | f69b64f | 2011-09-15 14:31:41 -0700 | [diff] [blame] | 246 | -M:: |
| 247 | --disassembler-style=:: Set disassembler style for objdump. |
| 248 | |
Arnaldo Carvalho de Melo | 64c6f0c | 2011-10-06 12:48:31 -0300 | [diff] [blame] | 249 | --source:: |
| 250 | Interleave source code with assembly code. Enabled by default, |
| 251 | disable with --no-source. |
| 252 | |
| 253 | --asm-raw:: |
| 254 | Show raw instruction encoding of assembly instructions. |
| 255 | |
Arnaldo Carvalho de Melo | 3f2728b | 2011-10-05 16:10:06 -0300 | [diff] [blame] | 256 | --show-total-period:: Show a column with the sum of periods. |
| 257 | |
Stephane Eranian | fbe96f2 | 2011-09-30 15:40:40 +0200 | [diff] [blame] | 258 | -I:: |
| 259 | --show-info:: |
| 260 | Display extended information about the perf.data file. This adds |
| 261 | information which may be very large and thus may clutter the display. |
| 262 | It currently includes: cpu and numa topology of the host system. |
| 263 | |
Roberto Agostino Vitillo | b50311d | 2012-02-09 23:21:03 +0100 | [diff] [blame] | 264 | -b:: |
| 265 | --branch-stack:: |
| 266 | Use the addresses of sampled taken branches instead of the instruction |
| 267 | address to build the histograms. To generate meaningful output, the |
Stephane Eranian | 993ac88 | 2012-03-08 23:47:47 +0100 | [diff] [blame] | 268 | perf.data file must have been obtained using perf record -b or |
| 269 | perf record --branch-filter xxx where xxx is a branch filter option. |
| 270 | perf report is able to auto-detect whether a perf.data file contains |
| 271 | branch stacks and it will automatically switch to the branch view mode, |
| 272 | unless --no-branch-stack is used. |
Roberto Agostino Vitillo | b50311d | 2012-02-09 23:21:03 +0100 | [diff] [blame] | 273 | |
Maciek Borzecki | 7a4ec93 | 2012-09-04 12:32:30 +0200 | [diff] [blame] | 274 | --objdump=<path>:: |
| 275 | Path to objdump binary. |
| 276 | |
Namhyung Kim | 01d14f1 | 2013-01-22 18:09:45 +0900 | [diff] [blame] | 277 | --group:: |
| 278 | Show event group information together. |
| 279 | |
Namhyung Kim | 328ccda | 2013-03-25 18:18:18 +0900 | [diff] [blame] | 280 | --demangle:: |
| 281 | Demangle symbol names to human readable form. It's enabled by default, |
| 282 | disable with --no-demangle. |
| 283 | |
Avi Kivity | 763122a | 2014-09-13 07:15:05 +0300 | [diff] [blame] | 284 | --demangle-kernel:: |
| 285 | Demangle kernel symbol names to human readable form (for C++ kernels). |
| 286 | |
Don Zickus | 75e906c | 2014-05-23 18:41:23 +0200 | [diff] [blame] | 287 | --mem-mode:: |
| 288 | Use the data addresses of samples in addition to instruction addresses |
| 289 | to build the histograms. To generate meaningful output, the perf.data |
| 290 | file must have been obtained using perf record -d -W and using a |
| 291 | special event -e cpu/mem-loads/ or -e cpu/mem-stores/. See |
| 292 | 'perf mem' for simpler access. |
| 293 | |
Namhyung Kim | 064f198 | 2013-05-14 11:09:04 +0900 | [diff] [blame] | 294 | --percent-limit:: |
| 295 | Do not show entries which have an overhead under that percent. |
| 296 | (Default: 0). |
| 297 | |
Namhyung Kim | f214833 | 2014-01-14 11:52:48 +0900 | [diff] [blame] | 298 | --percentage:: |
| 299 | Determine how to display the overhead percentage of filtered entries. |
| 300 | Filters can be applied by --comms, --dsos and/or --symbols options and |
| 301 | Zoom operations on the TUI (thread, dso, etc). |
| 302 | |
| 303 | "relative" means it's relative to filtered entries only so that the |
| 304 | sum of shown entries will be always 100%. "absolute" means it retains |
| 305 | the original value before and after the filter is applied. |
| 306 | |
Jiri Olsa | 5cfe2c8 | 2013-12-09 11:02:49 +0100 | [diff] [blame] | 307 | --header:: |
| 308 | Show header information in the perf.data file. This includes |
| 309 | various information like hostname, OS and perf version, cpu/mem |
| 310 | info, perf command line, event list and so on. Currently only |
| 311 | --stdio output supports this feature. |
| 312 | |
| 313 | --header-only:: |
| 314 | Show only perf.data header (forces --stdio). |
| 315 | |
Ingo Molnar | 0bec253 | 2009-05-26 09:17:18 +0200 | [diff] [blame] | 316 | SEE ALSO |
| 317 | -------- |
Arnaldo Carvalho de Melo | 64c6f0c | 2011-10-06 12:48:31 -0300 | [diff] [blame] | 318 | linkperf:perf-stat[1], linkperf:perf-annotate[1] |