blob: 98064a9b646bb66e05f2beac4e2066b5c6b7912d [file] [log] [blame]
Brendan Gregg60393ea2016-10-04 15:18:11 -07001Demonstrations of tcptop, the Linux eBPF/bcc version.
2
3
4tcptop summarizes throughput by host and port. Eg:
5
6# tcptop
7Tracing... Output every 1 secs. Hit Ctrl-C to end
8<screen clears>
919:46:24 loadavg: 1.86 2.67 2.91 3/362 16681
10
11PID COMM LADDR RADDR RX_KB TX_KB
1216648 16648 100.66.3.172:22 100.127.69.165:6684 1 0
1316647 sshd 100.66.3.172:22 100.127.69.165:6684 0 2149
1414374 sshd 100.66.3.172:22 100.127.69.165:25219 0 0
1514458 sshd 100.66.3.172:22 100.127.69.165:7165 0 0
16
17PID COMM LADDR6 RADDR6 RX_KB TX_KB
1816681 sshd fe80::8a3:9dff:fed5:6b19:22 fe80::8a3:9dff:fed5:6b19:16606 1 1
1916679 ssh fe80::8a3:9dff:fed5:6b19:16606 fe80::8a3:9dff:fed5:6b19:22 1 1
2016680 sshd fe80::8a3:9dff:fed5:6b19:22 fe80::8a3:9dff:fed5:6b19:16606 0 0
21
22This example output shows two listings of TCP connections, for IPv4 and IPv6.
23If there is only traffic for one of these, then only one group is shown.
24
25The output in each listing is sorted by total throughput (send then receive),
26and when printed it is rounded (floor) to the nearest Kbyte. The example output
27shows PID 16647, sshd, transmitted 2149 Kbytes during the tracing interval.
28The other IPv4 sessions had such low throughput they rounded to zero.
29
30All TCP sessions, including over loopback, are included.
31
32The session with the process name (COMM) of 16648 is really a short-lived
33process with PID 16648 where we didn't catch the process name when printing
34the output. If this behavior is a serious issue for you, you can modify the
35tool's code to include bpf_get_current_comm() in the key structs, so that it's
36fetched during the event and will always be seen. I did it this way to start
37with, but it was measurably increasing the overhead of this tool, so I switched
38to the asynchronous model.
39
40The overhead is relative to TCP event rate (the rate of tcp_sendmsg() and
41tcp_recvmsg() or tcp_cleanup_rbuf()). Due to buffering, this should be lower
42than the packet rate. You can measure the rate of these using funccount.
43Some sample production servers tested found total rates of 4k to 15k per
44second. The CPU overhead at these rates ranged from 0.5% to 2.0% of one CPU.
45Maybe your workloads have higher rates and therefore higher overhead, or,
46lower rates.
47
48
49I much prefer not clearing the screen, so that historic output is in the
50scroll-back buffer, and patterns or intermittent issues can be better seen.
51You can do this with -C:
52
53# tcptop -C
54Tracing... Output every 1 secs. Hit Ctrl-C to end
55
5620:27:12 loadavg: 0.08 0.02 0.17 2/367 17342
57
58PID COMM LADDR RADDR RX_KB TX_KB
5917287 17287 100.66.3.172:22 100.127.69.165:57585 3 1
6017286 sshd 100.66.3.172:22 100.127.69.165:57585 0 1
6114374 sshd 100.66.3.172:22 100.127.69.165:25219 0 0
62
6320:27:13 loadavg: 0.08 0.02 0.17 1/367 17342
64
65PID COMM LADDR RADDR RX_KB TX_KB
6617286 sshd 100.66.3.172:22 100.127.69.165:57585 1 7761
6714374 sshd 100.66.3.172:22 100.127.69.165:25219 0 0
68
6920:27:14 loadavg: 0.08 0.02 0.17 2/365 17347
70
71PID COMM LADDR RADDR RX_KB TX_KB
7217286 17286 100.66.3.172:22 100.127.69.165:57585 1 2501
7314374 sshd 100.66.3.172:22 100.127.69.165:25219 0 0
74
7520:27:15 loadavg: 0.07 0.02 0.17 2/367 17403
76
77PID COMM LADDR RADDR RX_KB TX_KB
7817349 17349 100.66.3.172:22 100.127.69.165:10161 3 1
7917348 sshd 100.66.3.172:22 100.127.69.165:10161 0 1
8014374 sshd 100.66.3.172:22 100.127.69.165:25219 0 0
81
8220:27:16 loadavg: 0.07 0.02 0.17 1/367 17403
83
84PID COMM LADDR RADDR RX_KB TX_KB
8517348 sshd 100.66.3.172:22 100.127.69.165:10161 3333 0
8614374 sshd 100.66.3.172:22 100.127.69.165:25219 0 0
87
8820:27:17 loadavg: 0.07 0.02 0.17 2/366 17409
89
90PID COMM LADDR RADDR RX_KB TX_KB
9117348 17348 100.66.3.172:22 100.127.69.165:10161 6909 2
92
93You can disable the loadavg summary line with -S if needed.
94
Alban Crequy7d626562020-03-08 16:41:34 +010095The --cgroupmap option filters based on a cgroup set. It is meant to be used
96with an externally created map.
97
98# tcptop --cgroupmap /sys/fs/bpf/test01
99
Alban Crequy32ab8582020-03-22 16:06:44 +0100100For more details, see docs/special_filtering.md
Alban Crequy7d626562020-03-08 16:41:34 +0100101
Brendan Gregg60393ea2016-10-04 15:18:11 -0700102
103USAGE:
104
105# tcptop -h
Alban Crequy7d626562020-03-08 16:41:34 +0100106usage: tcptop.py [-h] [-C] [-S] [-p PID] [--cgroupmap CGROUPMAP]
Alban Crequy32ab8582020-03-22 16:06:44 +0100107 [--mntnsmap MNTNSMAP]
Hariharan Ananthakrishnan04893e32021-08-12 05:55:21 -0700108 [interval] [count] [-4 | -6]
Brendan Gregg60393ea2016-10-04 15:18:11 -0700109
110Summarize TCP send/recv throughput by host
111
112positional arguments:
Alban Crequy7d626562020-03-08 16:41:34 +0100113 interval output interval, in seconds (default 1)
114 count number of outputs
Brendan Gregg60393ea2016-10-04 15:18:11 -0700115
116optional arguments:
Alban Crequy7d626562020-03-08 16:41:34 +0100117 -h, --help show this help message and exit
118 -C, --noclear don't clear the screen
119 -S, --nosummary skip system summary line
120 -p PID, --pid PID trace this PID only
121 --cgroupmap CGROUPMAP
122 trace cgroups in this BPF map only
Hariharan Ananthakrishnan04893e32021-08-12 05:55:21 -0700123 -4, --ipv4 trace IPv4 family only
124 -6, --ipv6 trace IPv6 family only
Brendan Gregg60393ea2016-10-04 15:18:11 -0700125
126examples:
127 ./tcptop # trace TCP send/recv by host
128 ./tcptop -C # don't clear the screen
129 ./tcptop -p 181 # only trace PID 181
Alban Crequy7d626562020-03-08 16:41:34 +0100130 ./tcptop --cgroupmap ./mappath # only trace cgroups in this BPF map
Alban Crequy32ab8582020-03-22 16:06:44 +0100131 ./tcptop --mntnsmap mappath # only trace mount namespaces in the map
Hariharan Ananthakrishnan04893e32021-08-12 05:55:21 -0700132 ./tcptop -4 # trace IPv4 family only
133 ./tcptop -6 # trace IPv6 family only