blob: f216b4cb00c2665796826d37c5df159973b5fe16 [file] [log] [blame]
Aaron Carrolld60e92d2007-09-17 10:32:59 +02001.TH fio 1 "September 2007" "User Manual"
2.SH NAME
3fio \- flexible I/O tester
4.SH SYNOPSIS
5.B fio
6[\fIoptions\fR] [\fIjobfile\fR]...
7.SH DESCRIPTION
8.B fio
9is a tool that will spawn a number of threads or processes doing a
10particular type of I/O action as specified by the user.
11The typical use of fio is to write a job file matching the I/O load
12one wants to simulate.
13.SH OPTIONS
14.TP
15.BI \-\-output \fR=\fPfilename
16Write output to \fIfilename\fR.
17.TP
18.BI \-\-timeout \fR=\fPtimeout
19Limit run time to \fItimeout\fR seconds.
20.TP
21.B \-\-latency\-log
22Generate per-job latency logs.
23.TP
24.B \-\-bandwidth\-log
25Generate per-job bandwidth logs.
26.TP
27.B \-\-minimal
Aaron Carrolld1429b52007-09-18 08:10:57 +020028Print statistics in a terse, semicolon-delimited format.
Aaron Carrolld60e92d2007-09-17 10:32:59 +020029.TP
30.BI \-\-showcmd \fR=\fPjobfile
31Convert \fIjobfile\fR to a set of command-line options.
32.TP
33.B \-\-readonly
34Enable read-only safety checks.
35.TP
36.BI \-\-eta \fR=\fPwhen
37Specifies when real-time ETA estimate should be printed. \fIwhen\fR may
38be one of `always', `never' or `auto'.
39.TP
Aaron Carrollc0a5d352008-02-26 23:10:39 +010040.BI \-\-section \fR=\fPsec
41Only run section \fIsec\fR from job file.
42.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +020043.BI \-\-cmdhelp \fR=\fPcommand
44Print help information for \fIcommand\fR. May be `all' for all commands.
45.TP
Jens Axboe91837882008-02-05 12:02:07 +010046.BI \-\-debug \fR=\fPtype
47Enable verbose tracing of various fio actions. May be `all' for all types
Michael Prokopc6e13ea2009-08-12 17:24:41 +020048or individual types seperated by a comma (eg \-\-debug=io,file). `help' will
Jens Axboe91837882008-02-05 12:02:07 +010049list all available tracing options.
50.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +020051.B \-\-help
52Display usage information and exit.
53.TP
54.B \-\-version
55Display version information and exit.
56.SH "JOB FILE FORMAT"
57Job files are in `ini' format. They consist of one or more
58job definitions, which begin with a job name in square brackets and
59extend to the next job name. The job name can be any ASCII string
60except `global', which has a special meaning. Following the job name is
61a sequence of zero or more parameters, one per line, that define the
62behavior of the job. Any line starting with a `;' or `#' character is
Aaron Carrolld1429b52007-09-18 08:10:57 +020063considered a comment and ignored.
Aaron Carrolld9956b62007-11-16 12:12:45 +010064.P
65If \fIjobfile\fR is specified as `-', the job file will be read from
66standard input.
Aaron Carrolld60e92d2007-09-17 10:32:59 +020067.SS "Global Section"
68The global section contains default parameters for jobs specified in the
69job file. A job is only affected by global sections residing above it,
70and there may be any number of global sections. Specific job definitions
71may override any parameter set in global sections.
72.SH "JOB PARAMETERS"
73.SS Types
74Some parameters may take arguments of a specific type. The types used are:
75.TP
76.I str
77String: a sequence of alphanumeric characters.
78.TP
79.I int
Aaron Carrolld60e92d2007-09-17 10:32:59 +020080SI integer: a whole number, possibly containing a suffix denoting the base unit
Jens Axboeb09da8f2009-07-17 23:16:17 +020081of the value. Accepted suffixes are `k', 'M', 'G', 'T', and 'P', denoting
82kilo (1024), mega (1024^2), giga (1024^3), tera (1024^4), and peta (1024^5)
83respectively. The suffix is not case sensitive. If prefixed with '0x', the
84value is assumed to be base 16 (hexadecimal).
Aaron Carrolld60e92d2007-09-17 10:32:59 +020085.TP
86.I bool
87Boolean: a true or false value. `0' denotes false, `1' denotes true.
88.TP
89.I irange
90Integer range: a range of integers specified in the format
Aaron Carrolld1429b52007-09-18 08:10:57 +020091\fIlower\fR:\fIupper\fR or \fIlower\fR\-\fIupper\fR. \fIlower\fR and
92\fIupper\fR may contain a suffix as described above. If an option allows two
93sets of ranges, they are separated with a `,' or `/' character. For example:
94`8\-8k/8M\-4G'.
Aaron Carrolld60e92d2007-09-17 10:32:59 +020095.SS "Parameter List"
96.TP
97.BI name \fR=\fPstr
Aaron Carrolld9956b62007-11-16 12:12:45 +010098May be used to override the job name. On the command line, this parameter
Aaron Carrolld60e92d2007-09-17 10:32:59 +020099has the special purpose of signalling the start of a new job.
100.TP
101.BI description \fR=\fPstr
102Human-readable description of the job. It is printed when the job is run, but
103otherwise has no special purpose.
104.TP
105.BI directory \fR=\fPstr
106Prefix filenames with this directory. Used to place files in a location other
107than `./'.
108.TP
109.BI filename \fR=\fPstr
110.B fio
111normally makes up a file name based on the job name, thread number, and file
Aaron Carrolld1429b52007-09-18 08:10:57 +0200112number. If you want to share files between threads in a job or several jobs,
113specify a \fIfilename\fR for each of them to override the default. If the I/O
114engine used is `net', \fIfilename\fR is the host and port to connect to in the
115format \fIhost\fR/\fIport\fR. If the I/O engine is file-based, you can specify
116a number of files by separating the names with a `:' character. `\-' is a
117reserved name, meaning stdin or stdout, depending on the read/write direction
118set.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200119.TP
Jens Axboe3ce9dca2009-06-10 08:55:21 +0200120.BI lockfile \fR=\fPstr
121Fio defaults to not locking any files before it does IO to them. If a file or
122file descriptor is shared, fio can serialize IO to that file to make the end
123result consistent. This is usual for emulating real workloads that share files.
124The lock modes are:
125.RS
126.RS
127.TP
128.B none
129No locking. This is the default.
130.TP
131.B exclusive
132Only one thread or process may do IO at the time, excluding all others.
133.TP
134.B readwrite
135Read-write locking on the file. Many readers may access the file at the same
136time, but writes get exclusive access.
137.RE
138.P
139The option may be post-fixed with a lock batch number. If set, then each
140thread/process may do that amount of IOs to the file before giving up the lock.
141Since lock acquisition is expensive, batching the lock/unlocks will speed up IO.
142.RE
143.P
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200144.BI opendir \fR=\fPstr
145Recursively open any files below directory \fIstr\fR.
146.TP
147.BI readwrite \fR=\fPstr "\fR,\fP rw" \fR=\fPstr
148Type of I/O pattern. Accepted values are:
149.RS
150.RS
151.TP
152.B read
Aaron Carrolld1429b52007-09-18 08:10:57 +0200153Sequential reads.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200154.TP
155.B write
Aaron Carrolld1429b52007-09-18 08:10:57 +0200156Sequential writes.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200157.TP
158.B randread
Aaron Carrolld1429b52007-09-18 08:10:57 +0200159Random reads.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200160.TP
161.B randwrite
Aaron Carrolld1429b52007-09-18 08:10:57 +0200162Random writes.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200163.TP
164.B rw
Aaron Carrolld1429b52007-09-18 08:10:57 +0200165Mixed sequential reads and writes.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200166.TP
167.B randrw
Aaron Carrolld1429b52007-09-18 08:10:57 +0200168Mixed random reads and writes.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200169.RE
170.P
Aaron Carrolld1429b52007-09-18 08:10:57 +0200171For mixed I/O, the default split is 50/50. For random I/O, the number of I/Os
172to perform before getting a new offset can be specified by appending
173`:\fIint\fR' to the pattern type. The default is 1.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200174.RE
175.TP
Jens Axboe90fef2d2009-07-17 22:33:32 +0200176.BI kb_base \fR=\fPint
177The base unit for a kilobyte. The defacto base is 2^10, 1024. Storage
178manufacturers like to use 10^3 or 1000 as a base ten unit instead, for obvious
179reasons. Allow values are 1024 or 1000, with 1024 being the default.
180.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200181.BI randrepeat \fR=\fPbool
182Seed the random number generator in a predictable way so results are repeatable
Aaron Carrolld1429b52007-09-18 08:10:57 +0200183across runs. Default: true.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200184.TP
185.BI fadvise_hint \fR=\fPbool
Aaron Carrolld1429b52007-09-18 08:10:57 +0200186Disable use of \fIposix_fadvise\fR\|(2) to advise the kernel what I/O patterns
187are likely to be issued. Default: true.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200188.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100189.BI size \fR=\fPint
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200190Total size of I/O for this job. \fBfio\fR will run until this many bytes have
191been transfered, unless limited by other options (\fBruntime\fR, for instance).
192Unless \fBnr_files\fR and \fBfilesize\fR options are given, this amount will be
193divided between the available files for the job.
194.TP
Jens Axboe3ce9dca2009-06-10 08:55:21 +0200195.BI fill_device \fR=\fPbool
196Sets size to something really large and waits for ENOSPC (no space left on
197device) as the terminating condition. Only makes sense with sequential write.
198For a read workload, the mount point will be filled first then IO started on
199the result.
200.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200201.BI filesize \fR=\fPirange
202Individual file sizes. May be a range, in which case \fBfio\fR will select sizes
Aaron Carrolld1429b52007-09-18 08:10:57 +0200203for files at random within the given range, limited to \fBsize\fR in total (if
204that is given). If \fBfilesize\fR is not specified, each created file is the
205same size.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200206.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100207.BI blocksize \fR=\fPint[,int] "\fR,\fB bs" \fR=\fPint[,int]
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200208Block size for I/O units. Default: 4k. Values for reads and writes can be
209specified seperately in the format \fIread\fR,\fIwrite\fR, either of
210which may be empty to leave that value at its default.
211.TP
Jens Axboe91837882008-02-05 12:02:07 +0100212.BI blocksize_range \fR=\fPirange[,irange] "\fR,\fB bsrange" \fR=\fPirange[,irange]
Aaron Carrolld1429b52007-09-18 08:10:57 +0200213Specify a range of I/O block sizes. The issued I/O unit will always be a
214multiple of the minimum size, unless \fBblocksize_unaligned\fR is set. Applies
Jens Axboe91837882008-02-05 12:02:07 +0100215to both reads and writes if only one range is given, but can be specified
216seperately with a comma seperating the values. Example: bsrange=1k-4k,2k-8k.
217Also (see \fBblocksize\fR).
218.TP
219.BI bssplit \fR=\fPstr
220This option allows even finer grained control of the block sizes issued,
221not just even splits between them. With this option, you can weight various
222block sizes for exact control of the issued IO for a job that has mixed
223block sizes. The format of the option is bssplit=blocksize/percentage,
224optionally adding as many definitions as needed seperated by a colon.
225Example: bssplit=4k/10:64k/50:32k/40 would issue 50% 64k blocks, 10% 4k
Jens Axboec83cdd32009-04-24 14:23:59 +0200226blocks and 40% 32k blocks. \fBbssplit\fR also supports giving separate
227splits to reads and writes. The format is identical to what the
228\fBbs\fR option accepts, the read and write parts are separated with a
229comma.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200230.TP
231.B blocksize_unaligned\fR,\fP bs_unaligned
Aaron Carrolld1429b52007-09-18 08:10:57 +0200232If set, any size in \fBblocksize_range\fR may be used. This typically won't
233work with direct I/O, as that normally requires sector alignment.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200234.TP
Jens Axboe2b7a01d2009-03-11 11:00:13 +0100235.BI blockalign \fR=\fPint[,int] "\fR,\fB ba" \fR=\fPint[,int]
Martin Steigerwald639ce0f2009-05-20 11:33:49 +0200236At what boundary to align random IO offsets. Defaults to the same as 'blocksize'
237the minimum blocksize given. Minimum alignment is typically 512b
Jens Axboe2b7a01d2009-03-11 11:00:13 +0100238for using direct IO, though it usually depends on the hardware block size.
239This option is mutually exclusive with using a random map for files, so it
240will turn off that option.
Jens Axboe43602662009-03-14 20:08:47 +0100241.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200242.B zero_buffers
243Initialise buffers with all zeros. Default: fill buffers with random data.
244.TP
Jens Axboe901bb992009-03-14 20:17:36 +0100245.B refill_buffers
246If this option is given, fio will refill the IO buffers on every submit. The
247default is to only fill it at init time and reuse that data. Only makes sense
248if zero_buffers isn't specified, naturally. If data verification is enabled,
249refill_buffers is also automatically enabled.
250.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200251.BI nrfiles \fR=\fPint
252Number of files to use for this job. Default: 1.
253.TP
254.BI openfiles \fR=\fPint
255Number of files to keep open at the same time. Default: \fBnrfiles\fR.
256.TP
257.BI file_service_type \fR=\fPstr
258Defines how files to service are selected. The following types are defined:
259.RS
260.RS
261.TP
262.B random
263Choose a file at random
264.TP
265.B roundrobin
266Round robin over open files (default).
Jens Axboe6b7f6852009-03-09 14:22:56 +0100267.B sequential
268Do each file in the set sequentially.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200269.RE
270.P
271The number of I/Os to issue before switching a new file can be specified by
272appending `:\fIint\fR' to the service type.
273.RE
274.TP
275.BI ioengine \fR=\fPstr
276Defines how the job issues I/O. The following types are defined:
277.RS
278.RS
279.TP
280.B sync
281Basic \fIread\fR\|(2) or \fIwrite\fR\|(2) I/O. \fIfseek\fR\|(2) is used to
282position the I/O location.
283.TP
gurudas paia31041e2007-10-23 15:12:30 +0200284.B psync
285Basic \fIpread\fR\|(2) or \fIpwrite\fR\|(2) I/O.
286.TP
Jens Axboe91837882008-02-05 12:02:07 +0100287.B vsync
288Basic \fIreadv\fR\|(2) or \fIwritev\fR\|(2) I/O. Will emulate queuing by
289coalescing adjacents IOs into a single submission.
290.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200291.B libaio
292Linux native asynchronous I/O.
293.TP
294.B posixaio
295glibc POSIX asynchronous I/O using \fIaio_read\fR\|(3) and \fIaio_write\fR\|(3).
296.TP
297.B mmap
Aaron Carrolld1429b52007-09-18 08:10:57 +0200298File is memory mapped with \fImmap\fR\|(2) and data copied using
299\fImemcpy\fR\|(3).
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200300.TP
301.B splice
Aaron Carrolld1429b52007-09-18 08:10:57 +0200302\fIsplice\fR\|(2) is used to transfer the data and \fIvmsplice\fR\|(2) to
303transfer data from user-space to the kernel.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200304.TP
305.B syslet-rw
306Use the syslet system calls to make regular read/write asynchronous.
307.TP
308.B sg
309SCSI generic sg v3 I/O. May be either synchronous using the SG_IO ioctl, or if
Aaron Carrolld1429b52007-09-18 08:10:57 +0200310the target is an sg character device, we use \fIread\fR\|(2) and
311\fIwrite\fR\|(2) for asynchronous I/O.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200312.TP
313.B null
314Doesn't transfer any data, just pretends to. Mainly used to exercise \fBfio\fR
315itself and for debugging and testing purposes.
316.TP
317.B net
318Transfer over the network. \fBfilename\fR must be set appropriately to
319`\fIhost\fR/\fIport\fR' regardless of data direction. If receiving, only the
320\fIport\fR argument is used.
321.TP
322.B netsplice
323Like \fBnet\fR, but uses \fIsplice\fR\|(2) and \fIvmsplice\fR\|(2) to map data
324and send/receive.
325.TP
gurudas pai53aec0a2007-10-05 13:20:18 +0200326.B cpuio
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200327Doesn't transfer any data, but burns CPU cycles according to \fBcpuload\fR and
328\fBcpucycles\fR parameters.
329.TP
330.B guasi
331The GUASI I/O engine is the Generic Userspace Asynchronous Syscall Interface
332approach to asycnronous I/O.
Aaron Carrolld1429b52007-09-18 08:10:57 +0200333.br
334See <http://www.xmailserver.org/guasi\-lib.html>.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200335.TP
336.B external
337Loads an external I/O engine object file. Append the engine filename as
338`:\fIenginepath\fR'.
339.RE
340.RE
341.TP
342.BI iodepth \fR=\fPint
343Number of I/O units to keep in flight against the file. Default: 1.
344.TP
345.BI iodepth_batch \fR=\fPint
346Number of I/Os to submit at once. Default: \fBiodepth\fR.
347.TP
Jens Axboe3ce9dca2009-06-10 08:55:21 +0200348.BI iodepth_batch_complete \fR=\fPint
349This defines how many pieces of IO to retrieve at once. It defaults to 1 which
350 means that we'll ask for a minimum of 1 IO in the retrieval process from the
351kernel. The IO retrieval will go on until we hit the limit set by
352\fBiodepth_low\fR. If this variable is set to 0, then fio will always check for
353completed events before queuing more IO. This helps reduce IO latency, at the
354cost of more retrieval system calls.
355.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200356.BI iodepth_low \fR=\fPint
357Low watermark indicating when to start filling the queue again. Default:
358\fBiodepth\fR.
359.TP
360.BI direct \fR=\fPbool
361If true, use non-buffered I/O (usually O_DIRECT). Default: false.
362.TP
363.BI buffered \fR=\fPbool
364If true, use buffered I/O. This is the opposite of the \fBdirect\fR parameter.
365Default: true.
366.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100367.BI offset \fR=\fPint
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200368Offset in the file to start I/O. Data before the offset will not be touched.
369.TP
370.BI fsync \fR=\fPint
Aaron Carrolld1429b52007-09-18 08:10:57 +0200371How many I/Os to perform before issuing an \fBfsync\fR\|(2) of dirty data. If
3720, don't sync. Default: 0.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200373.TP
Jens Axboe5f9099e2009-06-16 22:40:26 +0200374.BI fdatasync \fR=\fPint
375Like \fBfsync\fR, but uses \fBfdatasync\fR\|(2) instead to only sync the
376data parts of the file. Default: 0.
377.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200378.BI overwrite \fR=\fPbool
Aaron Carrolld1429b52007-09-18 08:10:57 +0200379If writing, setup the file first and do overwrites. Default: false.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200380.TP
381.BI end_fsync \fR=\fPbool
Aaron Carrolld1429b52007-09-18 08:10:57 +0200382Sync file contents when job exits. Default: false.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200383.TP
384.BI fsync_on_close \fR=\fPbool
385If true, sync file contents on close. This differs from \fBend_fsync\fR in that
Aaron Carrolld1429b52007-09-18 08:10:57 +0200386it will happen on every close, not just at the end of the job. Default: false.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200387.TP
388.BI rwmixcycle \fR=\fPint
389How many milliseconds before switching between reads and writes for a mixed
390workload. Default: 500ms.
391.TP
392.BI rwmixread \fR=\fPint
393Percentage of a mixed workload that should be reads. Default: 50.
394.TP
395.BI rwmixwrite \fR=\fPint
Aaron Carrolld1429b52007-09-18 08:10:57 +0200396Percentage of a mixed workload that should be writes. If \fBrwmixread\fR and
Jens Axboec35dd7a2009-06-10 08:39:16 +0200397\fBrwmixwrite\fR are given and do not sum to 100%, the latter of the two
398overrides the first. This may interfere with a given rate setting, if fio is
399asked to limit reads or writes to a certain rate. If that is the case, then
400the distribution may be skewed. Default: 50.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200401.TP
402.B norandommap
403Normally \fBfio\fR will cover every block of the file when doing random I/O. If
404this parameter is given, a new offset will be chosen without looking at past
405I/O history. This parameter is mutually exclusive with \fBverify\fR.
406.TP
Jens Axboe3ce9dca2009-06-10 08:55:21 +0200407.B softrandommap
408See \fBnorandommap\fR. If fio runs with the random block map enabled and it
409fails to allocate the map, if this option is set it will continue without a
410random block map. As coverage will not be as complete as with random maps, this
411option is disabled by default.
412.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200413.BI nice \fR=\fPint
414Run job with given nice value. See \fInice\fR\|(2).
415.TP
416.BI prio \fR=\fPint
417Set I/O priority value of this job between 0 (highest) and 7 (lowest). See
418\fIionice\fR\|(1).
419.TP
420.BI prioclass \fR=\fPint
421Set I/O priority class. See \fIionice\fR\|(1).
422.TP
423.BI thinktime \fR=\fPint
424Stall job for given number of microseconds between issuing I/Os.
425.TP
426.BI thinktime_spin \fR=\fPint
427Pretend to spend CPU time for given number of microseconds, sleeping the rest
428of the time specified by \fBthinktime\fR. Only valid if \fBthinktime\fR is set.
429.TP
430.BI thinktime_blocks \fR=\fPint
431Number of blocks to issue before waiting \fBthinktime\fR microseconds.
432Default: 1.
433.TP
434.BI rate \fR=\fPint
Jens Axboec35dd7a2009-06-10 08:39:16 +0200435Cap bandwidth used by this job. The number is in bytes/sec, the normal postfix
436rules apply. You can use \fBrate\fR=500k to limit reads and writes to 500k each,
437or you can specify read and writes separately. Using \fBrate\fR=1m,500k would
438limit reads to 1MB/sec and writes to 500KB/sec. Capping only reads or writes
439can be done with \fBrate\fR=,500k or \fBrate\fR=500k,. The former will only
440limit writes (to 500KB/sec), the latter will only limit reads.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200441.TP
442.BI ratemin \fR=\fPint
443Tell \fBfio\fR to do whatever it can to maintain at least the given bandwidth.
Jens Axboec35dd7a2009-06-10 08:39:16 +0200444Failing to meet this requirement will cause the job to exit. The same format
445as \fBrate\fR is used for read vs write separation.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200446.TP
447.BI rate_iops \fR=\fPint
Jens Axboec35dd7a2009-06-10 08:39:16 +0200448Cap the bandwidth to this number of IOPS. Basically the same as rate, just
449specified independently of bandwidth. The same format as \fBrate\fR is used for
450read vs write seperation. If \fBblocksize\fR is a range, the smallest block
451size is used as the metric.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200452.TP
453.BI rate_iops_min \fR=\fPint
Jens Axboec35dd7a2009-06-10 08:39:16 +0200454If this rate of I/O is not met, the job will exit. The same format as \fBrate\fR
455is used for read vs write seperation.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200456.TP
457.BI ratecycle \fR=\fPint
458Average bandwidth for \fBrate\fR and \fBratemin\fR over this number of
459milliseconds. Default: 1000ms.
460.TP
461.BI cpumask \fR=\fPint
462Set CPU affinity for this job. \fIint\fR is a bitmask of allowed CPUs the job
463may run on. See \fBsched_setaffinity\fR\|(2).
464.TP
465.BI cpus_allowed \fR=\fPstr
466Same as \fBcpumask\fR, but allows a comma-delimited list of CPU numbers.
467.TP
468.BI startdelay \fR=\fPint
469Delay start of job for the specified number of seconds.
470.TP
471.BI runtime \fR=\fPint
472Terminate processing after the specified number of seconds.
473.TP
474.B time_based
475If given, run for the specified \fBruntime\fR duration even if the files are
476completely read or written. The same workload will be repeated as many times
477as \fBruntime\fR allows.
478.TP
Jens Axboe901bb992009-03-14 20:17:36 +0100479.BI ramp_time \fR=\fPint
480If set, fio will run the specified workload for this amount of time before
481logging any performance numbers. Useful for letting performance settle before
482logging results, thus minimizing the runtime required for stable results. Note
Jens Axboec35dd7a2009-06-10 08:39:16 +0200483that the \fBramp_time\fR is considered lead in time for a job, thus it will
484increase the total runtime if a special timeout or runtime is specified.
Jens Axboe901bb992009-03-14 20:17:36 +0100485.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200486.BI invalidate \fR=\fPbool
487Invalidate buffer-cache for the file prior to starting I/O. Default: true.
488.TP
489.BI sync \fR=\fPbool
490Use synchronous I/O for buffered writes. For the majority of I/O engines,
Aaron Carrolld1429b52007-09-18 08:10:57 +0200491this means using O_SYNC. Default: false.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200492.TP
493.BI iomem \fR=\fPstr "\fR,\fP mem" \fR=\fPstr
494Allocation method for I/O unit buffer. Allowed values are:
495.RS
496.RS
497.TP
498.B malloc
499Allocate memory with \fImalloc\fR\|(3).
500.TP
501.B shm
502Use shared memory buffers allocated through \fIshmget\fR\|(2).
503.TP
504.B shmhuge
505Same as \fBshm\fR, but use huge pages as backing.
506.TP
507.B mmap
508Use \fImmap\fR\|(2) for allocation. Uses anonymous memory unless a filename
509is given after the option in the format `:\fIfile\fR'.
510.TP
511.B mmaphuge
512Same as \fBmmap\fR, but use huge files as backing.
513.RE
514.P
515The amount of memory allocated is the maximum allowed \fBblocksize\fR for the
516job multiplied by \fBiodepth\fR. For \fBshmhuge\fR or \fBmmaphuge\fR to work,
517the system must have free huge pages allocated. \fBmmaphuge\fR also needs to
Jens Axboe2e266ba2009-09-14 08:56:53 +0200518have hugetlbfs mounted, and \fIfile\fR must point there. At least on Linux,
519huge pages must be manually allocated. See \fB/proc/sys/vm/nr_hugehages\fR
520and the documentation for that. Normally you just need to echo an appropriate
521number, eg echoing 8 will ensure that the OS has 8 huge pages ready for
522use.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200523.RE
524.TP
Jens Axboed529ee12009-07-01 10:33:03 +0200525.BI iomem_align \fR=\fPint
526This indiciates the memory alignment of the IO memory buffers. Note that the
527given alignment is applied to the first IO unit buffer, if using \fBiodepth\fR
528the alignment of the following buffers are given by the \fBbs\fR used. In
529other words, if using a \fBbs\fR that is a multiple of the page sized in the
530system, all buffers will be aligned to this value. If using a \fBbs\fR that
531is not page aligned, the alignment of subsequent IO memory buffers is the
532sum of the \fBiomem_align\fR and \fBbs\fR used.
533.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100534.BI hugepage\-size \fR=\fPint
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200535Defines the size of a huge page. Must be at least equal to the system setting.
Jens Axboeb22989b2009-07-17 22:29:23 +0200536Should be a multiple of 1MB. Default: 4MB.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200537.TP
538.B exitall
539Terminate all jobs when one finishes. Default: wait for each job to finish.
540.TP
541.BI bwavgtime \fR=\fPint
542Average bandwidth calculations over the given time in milliseconds. Default:
543500ms.
544.TP
545.BI create_serialize \fR=\fPbool
Aaron Carrolld1429b52007-09-18 08:10:57 +0200546If true, serialize file creation for the jobs. Default: true.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200547.TP
548.BI create_fsync \fR=\fPbool
549\fIfsync\fR\|(2) data file after creation. Default: true.
550.TP
Jens Axboe6b7f6852009-03-09 14:22:56 +0100551.BI create_on_open \fR=\fPbool
552If true, the files are not created until they are opened for IO by the job.
553.TP
Jens Axboee9f48472009-06-03 12:14:08 +0200554.BI pre_read \fR=\fPbool
555If this is given, files will be pre-read into memory before starting the given
556IO operation. This will also clear the \fR \fBinvalidate\fR flag, since it is
Jens Axboe9c0d2242009-07-01 12:26:28 +0200557pointless to pre-read and then drop the cache. This will only work for IO
558engines that are seekable, since they allow you to read the same data
559multiple times. Thus it will not work on eg network or splice IO.
Jens Axboee9f48472009-06-03 12:14:08 +0200560.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200561.BI unlink \fR=\fPbool
562Unlink job files when done. Default: false.
563.TP
564.BI loops \fR=\fPint
565Specifies the number of iterations (runs of the same workload) of this job.
566Default: 1.
567.TP
568.BI do_verify \fR=\fPbool
569Run the verify phase after a write phase. Only valid if \fBverify\fR is set.
570Default: true.
571.TP
572.BI verify \fR=\fPstr
573Method of verifying file contents after each iteration of the job. Allowed
574values are:
575.RS
576.RS
577.TP
Jens Axboeb892dc02009-09-05 20:37:35 +0200578.B md5 crc16 crc32 crc32c crc32c-intel crc64 crc7 sha256 sha512 sha1
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200579Store appropriate checksum in the header of each block.
580.TP
581.B meta
582Write extra information about each I/O (timestamp, block number, etc.). The
583block number is verified.
584.TP
585.B pattern
586Fill I/O buffers with a specific pattern that is used to verify. The pattern is
587specified by appending `:\fIint\fR' to the parameter. \fIint\fR cannot be larger
588than 32-bits.
589.TP
590.B null
591Pretend to verify. Used for testing internals.
592.RE
Jens Axboeb892dc02009-09-05 20:37:35 +0200593
594This option can be used for repeated burn-in tests of a system to make sure
595that the written data is also correctly read back. If the data direction given
596is a read or random read, fio will assume that it should verify a previously
597written file. If the data direction includes any form of write, the verify will
598be of the newly written data.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200599.RE
600.TP
601.BI verify_sort \fR=\fPbool
602If true, written verify blocks are sorted if \fBfio\fR deems it to be faster to
603read them back in a sorted manner. Default: true.
604.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100605.BI verify_offset \fR=\fPint
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200606Swap the verification header with data somewhere else in the block before
Aaron Carrolld1429b52007-09-18 08:10:57 +0200607writing. It is swapped back before verifying.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200608.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100609.BI verify_interval \fR=\fPint
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200610Write the verification header for this number of bytes, which should divide
611\fBblocksize\fR. Default: \fBblocksize\fR.
612.TP
613.BI verify_fatal \fR=\fPbool
614If true, exit the job on the first observed verification failure. Default:
615false.
616.TP
Jens Axboee8462bd2009-07-06 12:59:04 +0200617.BI verify_async \fR=\fPint
618Fio will normally verify IO inline from the submitting thread. This option
619takes an integer describing how many async offload threads to create for IO
620verification instead, causing fio to offload the duty of verifying IO contents
Jens Axboec85c3242009-07-06 14:12:57 +0200621to one or more separate threads. If using this offload option, even sync IO
622engines can benefit from using an \fBiodepth\fR setting higher than 1, as it
623allows them to have IO in flight while verifies are running.
Jens Axboee8462bd2009-07-06 12:59:04 +0200624.TP
625.BI verify_async_cpus \fR=\fPstr
626Tell fio to set the given CPU affinity on the async IO verification threads.
627See \fBcpus_allowed\fP for the format used.
628.TP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200629.B stonewall
Aaron Carrolld1429b52007-09-18 08:10:57 +0200630Wait for preceeding jobs in the job file to exit before starting this one.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200631\fBstonewall\fR implies \fBnew_group\fR.
632.TP
633.B new_group
634Start a new reporting group. If not given, all jobs in a file will be part
635of the same reporting group, unless separated by a stonewall.
636.TP
637.BI numjobs \fR=\fPint
638Number of clones (processes/threads performing the same workload) of this job.
639Default: 1.
640.TP
641.B group_reporting
642If set, display per-group reports instead of per-job when \fBnumjobs\fR is
643specified.
644.TP
645.B thread
646Use threads created with \fBpthread_create\fR\|(3) instead of processes created
647with \fBfork\fR\|(2).
648.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100649.BI zonesize \fR=\fPint
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200650Divide file into zones of the specified size in bytes. See \fBzoneskip\fR.
651.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100652.BI zoneskip \fR=\fPint
Aaron Carrolld1429b52007-09-18 08:10:57 +0200653Skip the specified number of bytes when \fBzonesize\fR bytes of data have been
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200654read.
655.TP
656.BI write_iolog \fR=\fPstr
657Write the issued I/O patterns to the specified file.
658.TP
659.BI read_iolog \fR=\fPstr
660Replay the I/O patterns contained in the specified file generated by
661\fBwrite_iolog\fR, or may be a \fBblktrace\fR binary file.
662.TP
Jens Axboe901bb992009-03-14 20:17:36 +0100663.B write_bw_log \fR=\fPstr
664If given, write a bandwidth log of the jobs in this job file. Can be used to
665store data of the bandwidth of the jobs in their lifetime. The included
666fio_generate_plots script uses gnuplot to turn these text files into nice
667graphs. See \fBwrite_log_log\fR for behaviour of given filename. For this
668option, the postfix is _bw.log.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200669.TP
670.B write_lat_log
Jens Axboe901bb992009-03-14 20:17:36 +0100671Same as \fBwrite_bw_log\fR, but writes I/O completion latencies. If no
672filename is given with this option, the default filename of "jobname_type.log"
673is used. Even if the filename is given, fio will still append the type of log.
674.TP
675.B disable_clat \fR=\fPbool
676Disable measurements of completion latency numbers. Useful only for cutting
677back the number of calls to gettimeofday, as that does impact performance at
678really high IOPS rates. Note that to really get rid of a large amount of these
679calls, this option must be used with disable_slat and disable_bw as well.
680.TP
681.B disable_slat \fR=\fPbool
682Disable measurements of submission latency numbers. See \fBdisable_clat\fR.
683.TP
684.B disable_bw_measurement \fR=\fPbool
685Disable measurements of throughput/bandwidth numbers. See \fBdisable_clat\fR.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200686.TP
Jens Axboef7fa2652009-03-09 14:20:20 +0100687.BI lockmem \fR=\fPint
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200688Pin the specified amount of memory with \fBmlock\fR\|(2). Can be used to
689simulate a smaller amount of memory.
690.TP
691.BI exec_prerun \fR=\fPstr
692Before running the job, execute the specified command with \fBsystem\fR\|(3).
693.TP
694.BI exec_postrun \fR=\fPstr
695Same as \fBexec_prerun\fR, but the command is executed after the job completes.
696.TP
697.BI ioscheduler \fR=\fPstr
698Attempt to switch the device hosting the file to the specified I/O scheduler.
699.TP
700.BI cpuload \fR=\fPint
701If the job is a CPU cycle-eater, attempt to use the specified percentage of
702CPU cycles.
703.TP
704.BI cpuchunks \fR=\fPint
705If the job is a CPU cycle-eater, split the load into cycles of the
706given time in milliseconds.
707.TP
708.BI disk_util \fR=\fPbool
Aaron Carrolld1429b52007-09-18 08:10:57 +0200709Generate disk utilization statistics if the platform supports it. Default: true.
Jens Axboe901bb992009-03-14 20:17:36 +0100710.TP
711.BI gtod_reduce \fR=\fPbool
712Enable all of the gettimeofday() reducing options (disable_clat, disable_slat,
713disable_bw) plus reduce precision of the timeout somewhat to really shrink the
714gettimeofday() call count. With this option enabled, we only do about 0.4% of
715the gtod() calls we would have done if all time keeping was enabled.
716.TP
717.BI gtod_cpu \fR=\fPint
718Sometimes it's cheaper to dedicate a single thread of execution to just getting
719the current time. Fio (and databases, for instance) are very intensive on
720gettimeofday() calls. With this option, you can set one CPU aside for doing
721nothing but logging current time to a shared memory location. Then the other
722threads/processes that run IO workloads need only copy that segment, instead of
723entering the kernel with a gettimeofday() call. The CPU set aside for doing
724these time calls will be excluded from other uses. Fio will manually clear it
725from the CPU mask of other jobs.
Radha Ramachandranf2bba182009-06-15 08:40:16 +0200726.TP
727.BI continue_on_error \fR=\fPbool
728Normally fio will exit the job on the first observed failure. If this option is
729set, fio will continue the job when there is a 'non-fatal error'
730(\fBEIO\fR or \fBEILSEQ\fR) until the runtime is exceeded or the I/O size
731specified is completed. If this option is used, there are two more stats that
732are appended, the total error count and the first error. The error field given
733in the stats is the first error that was hit during the run.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200734.SH OUTPUT
Aaron Carrolld1429b52007-09-18 08:10:57 +0200735While running, \fBfio\fR will display the status of the created jobs. For
736example:
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200737.RS
Aaron Carrolld1429b52007-09-18 08:10:57 +0200738.P
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200739Threads: 1: [_r] [24.8% done] [ 13509/ 8334 kb/s] [eta 00h:01m:31s]
740.RE
741.P
Aaron Carrolld1429b52007-09-18 08:10:57 +0200742The characters in the first set of brackets denote the current status of each
743threads. The possible values are:
744.P
745.PD 0
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200746.RS
747.TP
748.B P
749Setup but not started.
750.TP
751.B C
752Thread created.
753.TP
754.B I
755Initialized, waiting.
756.TP
757.B R
758Running, doing sequential reads.
759.TP
760.B r
761Running, doing random reads.
762.TP
763.B W
764Running, doing sequential writes.
765.TP
766.B w
767Running, doing random writes.
768.TP
769.B M
770Running, doing mixed sequential reads/writes.
771.TP
772.B m
773Running, doing mixed random reads/writes.
774.TP
775.B F
776Running, currently waiting for \fBfsync\fR\|(2).
777.TP
778.B V
779Running, verifying written data.
780.TP
781.B E
782Exited, not reaped by main thread.
783.TP
784.B \-
785Exited, thread reaped.
786.RE
Aaron Carrolld1429b52007-09-18 08:10:57 +0200787.PD
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200788.P
789The second set of brackets shows the estimated completion percentage of
790the current group. The third set shows the read and write I/O rate,
791respectively. Finally, the estimated run time of the job is displayed.
792.P
793When \fBfio\fR completes (or is interrupted by Ctrl-C), it will show data
794for each thread, each group of threads, and each disk, in that order.
795.P
796Per-thread statistics first show the threads client number, group-id, and
797error code. The remaining figures are as follows:
798.RS
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200799.TP
800.B io
801Number of megabytes of I/O performed.
802.TP
803.B bw
804Average data rate (bandwidth).
805.TP
806.B runt
807Threads run time.
808.TP
809.B slat
810Submission latency minimum, maximum, average and standard deviation. This is
811the time it took to submit the I/O.
812.TP
813.B clat
814Completion latency minimum, maximum, average and standard deviation. This
815is the time between submission and completion.
816.TP
817.B bw
818Bandwidth minimum, maximum, percentage of aggregate bandwidth received, average
819and standard deviation.
820.TP
821.B cpu
822CPU usage statistics. Includes user and system time, number of context switches
823this thread went through and number of major and minor page faults.
824.TP
825.B IO depths
826Distribution of I/O depths. Each depth includes everything less than (or equal)
827to it, but greater than the previous depth.
828.TP
829.B IO issued
830Number of read/write requests issued, and number of short read/write requests.
831.TP
832.B IO latencies
833Distribution of I/O completion latencies. The numbers follow the same pattern
834as \fBIO depths\fR.
835.RE
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200836.P
837The group statistics show:
Aaron Carrolld1429b52007-09-18 08:10:57 +0200838.PD 0
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200839.RS
840.TP
841.B io
842Number of megabytes I/O performed.
843.TP
844.B aggrb
845Aggregate bandwidth of threads in the group.
846.TP
847.B minb
848Minimum average bandwidth a thread saw.
849.TP
850.B maxb
851Maximum average bandwidth a thread saw.
852.TP
853.B mint
Aaron Carrolld1429b52007-09-18 08:10:57 +0200854Shortest runtime of threads in the group.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200855.TP
856.B maxt
857Longest runtime of threads in the group.
858.RE
Aaron Carrolld1429b52007-09-18 08:10:57 +0200859.PD
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200860.P
861Finally, disk statistics are printed with reads first:
Aaron Carrolld1429b52007-09-18 08:10:57 +0200862.PD 0
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200863.RS
864.TP
865.B ios
866Number of I/Os performed by all groups.
867.TP
868.B merge
869Number of merges in the I/O scheduler.
870.TP
871.B ticks
872Number of ticks we kept the disk busy.
873.TP
874.B io_queue
875Total time spent in the disk queue.
876.TP
877.B util
878Disk utilization.
879.RE
Aaron Carrolld1429b52007-09-18 08:10:57 +0200880.PD
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200881.SH TERSE OUTPUT
882If the \fB\-\-minimal\fR option is given, the results will be printed in a
883semicolon-delimited format suitable for scripted use. The fields are:
884.P
885.RS
886.B jobname, groupid, error
887.P
888Read status:
889.RS
Jens Axboeb22989b2009-07-17 22:29:23 +0200890.B KB I/O, bandwidth \fR(KB/s)\fP, runtime \fR(ms)\fP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200891.P
892Submission latency:
893.RS
894.B min, max, mean, standard deviation
895.RE
896Completion latency:
897.RS
898.B min, max, mean, standard deviation
899.RE
900Bandwidth:
901.RS
902.B min, max, aggregate percentage of total, mean, standard deviation
903.RE
904.RE
905.P
906Write status:
907.RS
Jens Axboeb22989b2009-07-17 22:29:23 +0200908.B KB I/O, bandwidth \fR(KB/s)\fP, runtime \fR(ms)\fP
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200909.P
910Submission latency:
911.RS
912.B min, max, mean, standard deviation
913.RE
914Completion latency:
915.RS
916.B min, max, mean, standard deviation
917.RE
918Bandwidth:
919.RS
920.B min, max, aggregate percentage of total, mean, standard deviation
921.RE
922.RE
923.P
Aaron Carrolld1429b52007-09-18 08:10:57 +0200924CPU usage:
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200925.RS
Carl Henrik Lundebd2626f2008-06-12 09:17:46 +0200926.B user, system, context switches, major page faults, minor page faults
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200927.RE
928.P
929IO depth distribution:
930.RS
931.B <=1, 2, 4, 8, 16, 32, >=64
932.RE
933.P
934IO latency distribution (ms):
935.RS
936.B <=2, 4, 10, 20, 50, 100, 250, 500, 750, 1000, >=2000
937.RE
938.P
939.B text description
940.RE
941.SH AUTHORS
942.B fio
943was written by Jens Axboe <jens.axboe@oracle.com>.
Aaron Carrolld1429b52007-09-18 08:10:57 +0200944.br
945This man page was written by Aaron Carroll <aaronc@cse.unsw.edu.au> based
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200946on documentation by Jens Axboe.
947.SH "REPORTING BUGS"
Jens Axboe482900c2009-06-02 12:15:51 +0200948Report bugs to the \fBfio\fR mailing list <fio@vger.kernel.org>.
Aaron Carrolld1429b52007-09-18 08:10:57 +0200949See \fBREADME\fR.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200950.SH "SEE ALSO"
Aaron Carrolld1429b52007-09-18 08:10:57 +0200951For further documentation see \fBHOWTO\fR and \fBREADME\fR.
952.br
953Sample jobfiles are available in the \fBexamples\fR directory.
Aaron Carrolld60e92d2007-09-17 10:32:59 +0200954