| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 1 | Table of contents | 
 | 2 | ----------------- | 
 | 3 |  | 
 | 4 | 1. Overview | 
 | 5 | 2. How fio works | 
 | 6 | 3. Running fio | 
 | 7 | 4. Job file format | 
 | 8 | 5. Detailed list of parameters | 
 | 9 | 6. Normal output | 
 | 10 | 7. Terse output | 
 | 11 |  | 
 | 12 |  | 
 | 13 | 1.0 Overview and history | 
 | 14 | ------------------------ | 
 | 15 | fio was originally written to save me the hassle of writing special test | 
 | 16 | case programs when I wanted to test a specific workload, either for | 
 | 17 | performance reasons or to find/reproduce a bug. The process of writing | 
 | 18 | such a test app can be tiresome, especially if you have to do it often. | 
 | 19 | Hence I needed a tool that would be able to simulate a given io workload | 
 | 20 | without resorting to writing a tailored test case again and again. | 
 | 21 |  | 
 | 22 | A test work load is difficult to define, though. There can be any number | 
 | 23 | of processes or threads involved, and they can each be using their own | 
 | 24 | way of generating io. You could have someone dirtying large amounts of | 
 | 25 | memory in an memory mapped file, or maybe several threads issuing | 
 | 26 | reads using asynchronous io. fio needed to be flexible enough to | 
 | 27 | simulate both of these cases, and many more. | 
 | 28 |  | 
 | 29 | 2.0 How fio works | 
 | 30 | ----------------- | 
 | 31 | The first step in getting fio to simulate a desired io workload, is | 
 | 32 | writing a job file describing that specific setup. A job file may contain | 
 | 33 | any number of threads and/or files - the typical contents of the job file | 
 | 34 | is a global section defining shared parameters, and one or more job | 
 | 35 | sections describing the jobs involved. When run, fio parses this file | 
 | 36 | and sets everything up as described. If we break down a job from top to | 
 | 37 | bottom, it contains the following basic parameters: | 
 | 38 |  | 
 | 39 | 	IO type		Defines the io pattern issued to the file(s). | 
 | 40 | 			We may only be reading sequentially from this | 
 | 41 | 			file(s), or we may be writing randomly. Or even | 
 | 42 | 			mixing reads and writes, sequentially or randomly. | 
 | 43 |  | 
 | 44 | 	Block size	In how large chunks are we issuing io? This may be | 
 | 45 | 			a single value, or it may describe a range of | 
 | 46 | 			block sizes. | 
 | 47 |  | 
 | 48 | 	IO size		How much data are we going to be reading/writing. | 
 | 49 |  | 
 | 50 | 	IO engine	How do we issue io? We could be memory mapping the | 
 | 51 | 			file, we could be using regular read/write, we | 
 | 52 | 			could be using splice, async io, or even | 
 | 53 | 			SG (SCSI generic sg). | 
 | 54 |  | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 55 | 	IO depth	If the io engine is async, how large a queuing | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 56 | 			depth do we want to maintain? | 
 | 57 |  | 
 | 58 | 	IO type		Should we be doing buffered io, or direct/raw io? | 
 | 59 |  | 
 | 60 | 	Num files	How many files are we spreading the workload over. | 
 | 61 |  | 
 | 62 | 	Num threads	How many threads or processes should we spread | 
 | 63 | 			this workload over. | 
 | 64 | 	 | 
 | 65 | The above are the basic parameters defined for a workload, in addition | 
 | 66 | there's a multitude of parameters that modify other aspects of how this | 
 | 67 | job behaves. | 
 | 68 |  | 
 | 69 |  | 
 | 70 | 3.0 Running fio | 
 | 71 | --------------- | 
 | 72 | See the README file for command line parameters, there are only a few | 
 | 73 | of them. | 
 | 74 |  | 
 | 75 | Running fio is normally the easiest part - you just give it the job file | 
 | 76 | (or job files) as parameters: | 
 | 77 |  | 
 | 78 | $ fio job_file | 
 | 79 |  | 
 | 80 | and it will start doing what the job_file tells it to do. You can give | 
 | 81 | more than one job file on the command line, fio will serialize the running | 
 | 82 | of those files. Internally that is the same as using the 'stonewall' | 
 | 83 | parameter described the the parameter section. | 
 | 84 |  | 
| Jens Axboe | b469282 | 2006-10-27 13:43:22 +0200 | [diff] [blame] | 85 | If the job file contains only one job, you may as well just give the | 
 | 86 | parameters on the command line. The command line parameters are identical | 
 | 87 | to the job parameters, with a few extra that control global parameters | 
 | 88 | (see README). For example, for the job file parameter iodepth=2, the | 
| Jens Axboe | c2b1e75 | 2006-10-30 09:03:13 +0100 | [diff] [blame] | 89 | mirror command line option would be --iodepth 2 or --iodepth=2. You can | 
 | 90 | also use the command line for giving more than one job entry. For each | 
 | 91 | --name option that fio sees, it will start a new job with that name. | 
 | 92 | Command line entries following a --name entry will apply to that job, | 
 | 93 | until there are no more entries or a new --name entry is seen. This is | 
 | 94 | similar to the job file options, where each option applies to the current | 
 | 95 | job until a new [] job entry is seen. | 
| Jens Axboe | b469282 | 2006-10-27 13:43:22 +0200 | [diff] [blame] | 96 |  | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 97 | fio does not need to run as root, except if the files or devices specified | 
 | 98 | in the job section requires that. Some other options may also be restricted, | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 99 | such as memory locking, io scheduler switching, and decreasing the nice value. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 100 |  | 
 | 101 |  | 
 | 102 | 4.0 Job file format | 
 | 103 | ------------------- | 
 | 104 | As previously described, fio accepts one or more job files describing | 
 | 105 | what it is supposed to do. The job file format is the classic ini file, | 
 | 106 | where the names enclosed in [] brackets define the job name. You are free | 
 | 107 | to use any ascii name you want, except 'global' which has special meaning. | 
 | 108 | A global section sets defaults for the jobs described in that file. A job | 
 | 109 | may override a global section parameter, and a job file may even have | 
 | 110 | several global sections if so desired. A job is only affected by a global | 
 | 111 | section residing above it. If the first character in a line is a ';', the | 
 | 112 | entire line is discarded as a comment. | 
 | 113 |  | 
 | 114 | So lets look at a really simple job file that define to threads, each | 
 | 115 | randomly reading from a 128MiB file. | 
 | 116 |  | 
 | 117 | ; -- start job file -- | 
 | 118 | [global] | 
 | 119 | rw=randread | 
 | 120 | size=128m | 
 | 121 |  | 
 | 122 | [job1] | 
 | 123 |  | 
 | 124 | [job2] | 
 | 125 |  | 
 | 126 | ; -- end job file -- | 
 | 127 |  | 
 | 128 | As you can see, the job file sections themselves are empty as all the | 
 | 129 | described parameters are shared. As no filename= option is given, fio | 
| Jens Axboe | c2b1e75 | 2006-10-30 09:03:13 +0100 | [diff] [blame] | 130 | makes up a filename for each of the jobs as it sees fit. On the command | 
 | 131 | line, this job would look as follows: | 
 | 132 |  | 
 | 133 | $ fio --name=global --rw=randread --size=128m --name=job1 --name=job2 | 
 | 134 |  | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 135 |  | 
 | 136 | Lets look at an example that have a number of processes writing randomly | 
 | 137 | to files. | 
 | 138 |  | 
 | 139 | ; -- start job file -- | 
 | 140 | [random-writers] | 
 | 141 | ioengine=libaio | 
 | 142 | iodepth=4 | 
 | 143 | rw=randwrite | 
 | 144 | bs=32k | 
 | 145 | direct=0 | 
 | 146 | size=64m | 
 | 147 | numjobs=4 | 
 | 148 |  | 
 | 149 | ; -- end job file -- | 
 | 150 |  | 
 | 151 | Here we have no global section, as we only have one job defined anyway. | 
 | 152 | We want to use async io here, with a depth of 4 for each file. We also | 
 | 153 | increased the buffer size used to 32KiB and define numjobs to 4 to | 
 | 154 | fork 4 identical jobs. The result is 4 processes each randomly writing | 
| Jens Axboe | b469282 | 2006-10-27 13:43:22 +0200 | [diff] [blame] | 155 | to their own 64MiB file. Instead of using the above job file, you could | 
 | 156 | have given the parameters on the command line. For this case, you would | 
 | 157 | specify: | 
 | 158 |  | 
 | 159 | $ fio --name=random-writers --ioengine=libaio --iodepth=4 --rw=randwrite --bs=32k --direct=0 --size=64m --numjobs=4 | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 160 |  | 
 | 161 | fio ships with a few example job files, you can also look there for | 
 | 162 | inspiration. | 
 | 163 |  | 
 | 164 |  | 
 | 165 | 5.0 Detailed list of parameters | 
 | 166 | ------------------------------- | 
 | 167 |  | 
 | 168 | This section describes in details each parameter associated with a job. | 
 | 169 | Some parameters take an option of a given type, such as an integer or | 
 | 170 | a string. The following types are used: | 
 | 171 |  | 
 | 172 | str	String. This is a sequence of alpha characters. | 
 | 173 | int	Integer. A whole number value, may be negative. | 
 | 174 | siint	SI integer. A whole number value, which may contain a postfix | 
 | 175 | 	describing the base of the number. Accepted postfixes are k/m/g, | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 176 | 	meaning kilo, mega, and giga. So if you want to specify 4096, | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 177 | 	you could either write out '4096' or just give 4k. The postfixes | 
 | 178 | 	signify base 2 values, so 1024 is 1k and 1024k is 1m and so on. | 
 | 179 | bool	Boolean. Usually parsed as an integer, however only defined for | 
 | 180 | 	true and false (1 and 0). | 
 | 181 | irange	Integer range with postfix. Allows value range to be given, such | 
 | 182 | 	as 1024-4096. Also see siint. | 
 | 183 |  | 
 | 184 | With the above in mind, here follows the complete list of fio job | 
 | 185 | parameters. | 
 | 186 |  | 
 | 187 | name=str	ASCII name of the job. This may be used to override the | 
 | 188 | 		name printed by fio for this job. Otherwise the job | 
| Jens Axboe | c2b1e75 | 2006-10-30 09:03:13 +0100 | [diff] [blame] | 189 | 		name is used. On the command line this parameter has the | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 190 | 		special purpose of also signaling the start of a new | 
| Jens Axboe | c2b1e75 | 2006-10-30 09:03:13 +0100 | [diff] [blame] | 191 | 		job. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 192 |  | 
 | 193 | directory=str	Prefix filenames with this directory. Used to places files | 
 | 194 | 		in a different location than "./". | 
 | 195 |  | 
 | 196 | filename=str	Fio normally makes up a filename based on the job name, | 
 | 197 | 		thread number, and file number. If you want to share | 
 | 198 | 		files between threads in a job or several jobs, specify | 
 | 199 | 		a filename for each of them to override the default. | 
 | 200 |  | 
 | 201 | rw=str		Type of io pattern. Accepted values are: | 
 | 202 |  | 
 | 203 | 			read		Sequential reads | 
 | 204 | 			write		Sequential writes | 
 | 205 | 			randwrite	Random writes | 
 | 206 | 			randread	Random reads | 
 | 207 | 			rw		Sequential mixed reads and writes | 
 | 208 | 			randrw		Random mixed reads and writes | 
 | 209 |  | 
 | 210 | 		For the mixed io types, the default is to split them 50/50. | 
 | 211 | 		For certain types of io the result may still be skewed a bit, | 
 | 212 | 		since the speed may be different. | 
 | 213 |  | 
 | 214 | size=siint	The total size of file io for this job. This may describe | 
 | 215 | 		the size of the single file the job uses, or it may be | 
 | 216 | 		divided between the number of files in the job. If the | 
 | 217 | 		file already exists, the file size will be adjusted to this | 
 | 218 | 		size if larger than the current file size. If this parameter | 
 | 219 | 		is not given and the file exists, the file size will be used. | 
 | 220 |  | 
| Jens Axboe | f90eff5 | 2006-11-06 11:08:21 +0100 | [diff] [blame] | 221 | bs=siint	The block size used for the io units. Defaults to 4k. Values | 
 | 222 | 		can be given for both read and writes. If a single siint is | 
 | 223 | 		given, it will apply to both. If a second siint is specified | 
 | 224 | 		after a comma, it will apply to writes only. In other words, | 
 | 225 | 		the format is either bs=read_and_write or bs=read,write. | 
 | 226 | 		bs=4k,8k will thus use 4k blocks for reads, and 8k blocks | 
| Jens Axboe | 787f7e9 | 2006-11-06 13:26:29 +0100 | [diff] [blame] | 227 | 		for writes. If you only wish to set the write size, you | 
 | 228 | 		can do so by passing an empty read size - bs=,8k will set | 
 | 229 | 		8k for writes and leave the read default value. | 
| Jens Axboe | a00735e | 2006-11-03 08:58:08 +0100 | [diff] [blame] | 230 |  | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 231 | bsrange=irange	Instead of giving a single block size, specify a range | 
 | 232 | 		and fio will mix the issued io block sizes. The issued | 
 | 233 | 		io unit will always be a multiple of the minimum value | 
| Jens Axboe | f90eff5 | 2006-11-06 11:08:21 +0100 | [diff] [blame] | 234 | 		given (also see bs_unaligned). Applies to both reads and | 
 | 235 | 		writes, however a second range can be given after a comma. | 
 | 236 | 		See bs=. | 
| Jens Axboe | a00735e | 2006-11-03 08:58:08 +0100 | [diff] [blame] | 237 |  | 
| Jens Axboe | 690adba | 2006-10-30 15:25:09 +0100 | [diff] [blame] | 238 | bs_unaligned	If this option is given, any byte size value within bsrange | 
 | 239 | 		may be used as a block range. This typically wont work with | 
 | 240 | 		direct IO, as that normally requires sector alignment. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 241 |  | 
 | 242 | nrfiles=int	Number of files to use for this job. Defaults to 1. | 
 | 243 |  | 
 | 244 | ioengine=str	Defines how the job issues io to the file. The following | 
 | 245 | 		types are defined: | 
 | 246 |  | 
 | 247 | 			sync	Basic read(2) or write(2) io. lseek(2) is | 
 | 248 | 				used to position the io location. | 
 | 249 |  | 
 | 250 | 			libaio	Linux native asynchronous io. | 
 | 251 |  | 
 | 252 | 			posixaio glibc posix asynchronous io. | 
 | 253 |  | 
 | 254 | 			mmap	File is memory mapped and data copied | 
 | 255 | 				to/from using memcpy(3). | 
 | 256 |  | 
 | 257 | 			splice	splice(2) is used to transfer the data and | 
 | 258 | 				vmsplice(2) to transfer data from user | 
 | 259 | 				space to the kernel. | 
 | 260 |  | 
 | 261 | 			sg	SCSI generic sg v3 io. May either be | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 262 | 				synchronous using the SG_IO ioctl, or if | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 263 | 				the target is an sg character device | 
 | 264 | 				we use read(2) and write(2) for asynchronous | 
 | 265 | 				io. | 
 | 266 |  | 
| Jens Axboe | a94ea28 | 2006-11-24 12:37:34 +0100 | [diff] [blame] | 267 | 			null	Doesn't transfer any data, just pretends | 
 | 268 | 				to. This is mainly used to exercise fio | 
 | 269 | 				itself and for debugging/testing purposes. | 
 | 270 |  | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 271 | iodepth=int	This defines how many io units to keep in flight against | 
 | 272 | 		the file. The default is 1 for each file defined in this | 
 | 273 | 		job, can be overridden with a larger value for higher | 
 | 274 | 		concurrency. | 
 | 275 |  | 
 | 276 | direct=bool	If value is true, use non-buffered io. This is usually | 
 | 277 | 		O_DIRECT. Defaults to true. | 
 | 278 |  | 
 | 279 | offset=siint	Start io at the given offset in the file. The data before | 
 | 280 | 		the given offset will not be touched. This effectively | 
 | 281 | 		caps the file size at real_size - offset. | 
 | 282 |  | 
 | 283 | fsync=int	If writing to a file, issue a sync of the dirty data | 
 | 284 | 		for every number of blocks given. For example, if you give | 
 | 285 | 		32 as a parameter, fio will sync the file for every 32 | 
 | 286 | 		writes issued. If fio is using non-buffered io, we may | 
 | 287 | 		not sync the file. The exception is the sg io engine, which | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 288 | 		synchronizes the disk cache anyway. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 289 |  | 
 | 290 | overwrite=bool	If writing to a file, setup the file first and do overwrites. | 
 | 291 |  | 
 | 292 | end_fsync=bool	If true, fsync file contents when the job exits. | 
 | 293 |  | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 294 | rwmixcycle=int	Value in milliseconds describing how often to switch between | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 295 | 		reads and writes for a mixed workload. The default is | 
 | 296 | 		500 msecs. | 
 | 297 |  | 
 | 298 | rwmixread=int	How large a percentage of the mix should be reads. | 
 | 299 |  | 
 | 300 | rwmixwrite=int	How large a percentage of the mix should be writes. If both | 
 | 301 | 		rwmixread and rwmixwrite is given and the values do not add | 
 | 302 | 		up to 100%, the latter of the two will be used to override | 
 | 303 | 		the first. | 
 | 304 |  | 
| Jens Axboe | bb8895e | 2006-10-30 15:14:48 +0100 | [diff] [blame] | 305 | norandommap	Normally fio will cover every block of the file when doing | 
 | 306 | 		random IO. If this option is given, fio will just get a | 
 | 307 | 		new random offset without looking at past io history. This | 
 | 308 | 		means that some blocks may not be read or written, and that | 
 | 309 | 		some blocks may be read/written more than once. This option | 
 | 310 | 		is mutually exclusive with verify= for that reason. | 
 | 311 |  | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 312 | nice=int	Run the job with the given nice value. See man nice(2). | 
 | 313 |  | 
 | 314 | prio=int	Set the io priority value of this job. Linux limits us to | 
 | 315 | 		a positive value between 0 and 7, with 0 being the highest. | 
 | 316 | 		See man ionice(1). | 
 | 317 |  | 
 | 318 | prioclass=int	Set the io priority class. See man ionice(1). | 
 | 319 |  | 
 | 320 | thinktime=int	Stall the job x microseconds after an io has completed before | 
 | 321 | 		issuing the next. May be used to simulate processing being | 
 | 322 | 		done by an application. | 
 | 323 |  | 
 | 324 | rate=int	Cap the bandwidth used by this job to this number of KiB/sec. | 
 | 325 |  | 
 | 326 | ratemin=int	Tell fio to do whatever it can to maintain at least this | 
 | 327 | 		bandwidth. | 
 | 328 |  | 
 | 329 | ratecycle=int	Average bandwidth for 'rate' and 'ratemin' over this number | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 330 | 		of milliseconds. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 331 |  | 
 | 332 | cpumask=int	Set the CPU affinity of this job. The parameter given is a | 
 | 333 | 		bitmask of allowed CPU's the job may run on. See man | 
 | 334 | 		sched_setaffinity(2). | 
 | 335 |  | 
 | 336 | startdelay=int	Start this job the specified number of seconds after fio | 
 | 337 | 		has started. Only useful if the job file contains several | 
 | 338 | 		jobs, and you want to delay starting some jobs to a certain | 
 | 339 | 		time. | 
 | 340 |  | 
 | 341 | timeout=int	Tell fio to terminate processing after the specified number | 
 | 342 | 		of seconds. It can be quite hard to determine for how long | 
 | 343 | 		a specified job will run, so this parameter is handy to | 
 | 344 | 		cap the total runtime to a given time. | 
 | 345 |  | 
 | 346 | invalidate=bool	Invalidate the buffer/page cache parts for this file prior | 
 | 347 | 		to starting io. Defaults to true. | 
 | 348 |  | 
 | 349 | sync=bool	Use sync io for buffered writes. For the majority of the | 
 | 350 | 		io engines, this means using O_SYNC. | 
 | 351 |  | 
 | 352 | mem=str		Fio can use various types of memory as the io unit buffer. | 
 | 353 | 		The allowed values are: | 
 | 354 |  | 
 | 355 | 			malloc	Use memory from malloc(3) as the buffers. | 
 | 356 |  | 
 | 357 | 			shm	Use shared memory as the buffers. Allocated | 
 | 358 | 				through shmget(2). | 
 | 359 |  | 
| Jens Axboe | 74b025b | 2006-12-19 15:18:14 +0100 | [diff] [blame] | 360 | 			shmhuge	Same as shm, but use huge pages as backing. | 
 | 361 |  | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 362 | 			mmap	Use anonymous memory maps as the buffers. | 
 | 363 | 				Allocated through mmap(2). | 
 | 364 |  | 
 | 365 | 		The area allocated is a function of the maximum allowed | 
 | 366 | 		bs size for the job, multiplied by the io depth given. | 
 | 367 |  | 
 | 368 | exitall		When one job finishes, terminate the rest. The default is | 
 | 369 | 		to wait for each job to finish, sometimes that is not the | 
 | 370 | 		desired action. | 
 | 371 |  | 
 | 372 | bwavgtime=int	Average the calculated bandwidth over the given time. Value | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 373 | 		is specified in milliseconds. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 374 |  | 
 | 375 | create_serialize=bool	If true, serialize the file creating for the jobs. | 
 | 376 | 			This may be handy to avoid interleaving of data | 
 | 377 | 			files, which may greatly depend on the filesystem | 
 | 378 | 			used and even the number of processors in the system. | 
 | 379 |  | 
 | 380 | create_fsync=bool	fsync the data file after creation. This is the | 
 | 381 | 			default. | 
 | 382 |  | 
 | 383 | unlink		Unlink the job files when done. fio defaults to doing this, | 
 | 384 | 		if it created the file itself. | 
 | 385 |  | 
 | 386 | loops=int	Run the specified number of iterations of this job. Used | 
 | 387 | 		to repeat the same workload a given number of times. Defaults | 
 | 388 | 		to 1. | 
 | 389 |  | 
 | 390 | verify=str	If writing to a file, fio can verify the file contents | 
 | 391 | 		after each iteration of the job. The allowed values are: | 
 | 392 |  | 
 | 393 | 			md5	Use an md5 sum of the data area and store | 
 | 394 | 				it in the header of each block. | 
 | 395 |  | 
 | 396 | 			crc32	Use a crc32 sum of the data area and store | 
 | 397 | 				it in the header of each block. | 
 | 398 |  | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 399 | 		This option can be used for repeated burn-in tests of a | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 400 | 		system to make sure that the written data is also | 
 | 401 | 		correctly read back. | 
 | 402 |  | 
 | 403 | stonewall	Wait for preceeding jobs in the job file to exit, before | 
 | 404 | 		starting this one. Can be used to insert serialization | 
 | 405 | 		points in the job file. | 
 | 406 |  | 
 | 407 | numjobs=int	Create the specified number of clones of this job. May be | 
 | 408 | 		used to setup a larger number of threads/processes doing | 
 | 409 | 		the same thing. | 
 | 410 |  | 
 | 411 | thread		fio defaults to forking jobs, however if this option is | 
 | 412 | 		given, fio will use pthread_create(3) to create threads | 
 | 413 | 		instead. | 
 | 414 |  | 
 | 415 | zonesize=siint	Divide a file into zones of the specified size. See zoneskip. | 
 | 416 |  | 
 | 417 | zoneskip=siint	Skip the specified number of bytes when zonesize data has | 
 | 418 | 		been read. The two zone options can be used to only do | 
 | 419 | 		io on zones of a file. | 
 | 420 |  | 
| Jens Axboe | 076efc7 | 2006-10-27 11:24:25 +0200 | [diff] [blame] | 421 | write_iolog=str	Write the issued io patterns to the specified file. See | 
 | 422 | 		read_iolog. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 423 |  | 
| Jens Axboe | 076efc7 | 2006-10-27 11:24:25 +0200 | [diff] [blame] | 424 | read_iolog=str	Open an iolog with the specified file name and replay the | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 425 | 		io patterns it contains. This can be used to store a | 
 | 426 | 		workload and replay it sometime later. | 
 | 427 |  | 
 | 428 | write_bw_log	If given, write a bandwidth log of the jobs in this job | 
 | 429 | 		file. Can be used to store data of the bandwidth of the | 
| Jens Axboe | e0da9bc | 2006-10-25 13:08:57 +0200 | [diff] [blame] | 430 | 		jobs in their lifetime. The included fio_generate_plots | 
 | 431 | 		script uses gnuplot to turn these text files into nice | 
 | 432 | 		graphs. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 433 |  | 
 | 434 | write_lat_log	Same as write_bw_log, except that this option stores io | 
 | 435 | 		completion latencies instead. | 
 | 436 |  | 
 | 437 | lockmem=siint	Pin down the specified amount of memory with mlock(2). Can | 
 | 438 | 		potentially be used instead of removing memory or booting | 
 | 439 | 		with less memory to simulate a smaller amount of memory. | 
 | 440 |  | 
 | 441 | exec_prerun=str	Before running this job, issue the command specified | 
 | 442 | 		through system(3). | 
 | 443 |  | 
 | 444 | exec_postrun=str After the job completes, issue the command specified | 
 | 445 | 		 though system(3). | 
 | 446 |  | 
 | 447 | ioscheduler=str	Attempt to switch the device hosting the file to the specified | 
 | 448 | 		io scheduler before running. | 
 | 449 |  | 
 | 450 | cpuload=int	If the job is a CPU cycle eater, attempt to use the specified | 
 | 451 | 		percentage of CPU cycles. | 
 | 452 |  | 
 | 453 | cpuchunks=int	If the job is a CPU cycle eater, split the load into | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 454 | 		cycles of the given time. In milliseconds. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 455 |  | 
 | 456 |  | 
 | 457 | 6.0 Interpreting the output | 
 | 458 | --------------------------- | 
 | 459 |  | 
 | 460 | fio spits out a lot of output. While running, fio will display the | 
 | 461 | status of the jobs created. An example of that would be: | 
 | 462 |  | 
| Jens Axboe | 6043c57 | 2006-11-03 11:37:47 +0100 | [diff] [blame] | 463 | Threads running: 1: [_r] [24.79% done] [ 13509/  8334 kb/s] [eta 00h:01m:31s] | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 464 |  | 
 | 465 | The characters inside the square brackets denote the current status of | 
 | 466 | each thread. The possible values (in typical life cycle order) are: | 
 | 467 |  | 
 | 468 | Idle	Run | 
 | 469 | ----    --- | 
 | 470 | P		Thread setup, but not started. | 
 | 471 | C		Thread created. | 
 | 472 | I		Thread initialized, waiting. | 
 | 473 | 	R	Running, doing sequential reads. | 
 | 474 | 	r	Running, doing random reads. | 
 | 475 | 	W	Running, doing sequential writes. | 
 | 476 | 	w	Running, doing random writes. | 
 | 477 | 	M	Running, doing mixed sequential reads/writes. | 
 | 478 | 	m	Running, doing mixed random reads/writes. | 
 | 479 | 	F	Running, currently waiting for fsync() | 
 | 480 | V		Running, doing verification of written data. | 
 | 481 | E		Thread exited, not reaped by main thread yet. | 
 | 482 | _		Thread reaped. | 
 | 483 |  | 
 | 484 | The other values are fairly self explanatory - number of threads | 
| Jens Axboe | 6043c57 | 2006-11-03 11:37:47 +0100 | [diff] [blame] | 485 | currently running and doing io, rate of io since last check, and the estimated | 
 | 486 | completion percentage and time for the running group. It's impossible to | 
 | 487 | estimate runtime of the following groups (if any). | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 488 |  | 
 | 489 | When fio is done (or interrupted by ctrl-c), it will show the data for | 
 | 490 | each thread, group of threads, and disks in that order. For each data | 
 | 491 | direction, the output looks like: | 
 | 492 |  | 
 | 493 | Client1 (g=0): err= 0: | 
 | 494 |   write: io=    32MiB, bw=   666KiB/s, runt= 50320msec | 
 | 495 |     slat (msec): min=    0, max=  136, avg= 0.03, dev= 1.92 | 
 | 496 |     clat (msec): min=    0, max=  631, avg=48.50, dev=86.82 | 
 | 497 |     bw (KiB/s) : min=    0, max= 1196, per=51.00%, avg=664.02, dev=681.68 | 
 | 498 |   cpu        : usr=1.49%, sys=0.25%, ctx=7969 | 
 | 499 |  | 
 | 500 | The client number is printed, along with the group id and error of that | 
 | 501 | thread. Below is the io statistics, here for writes. In the order listed, | 
 | 502 | they denote: | 
 | 503 |  | 
 | 504 | io=		Number of megabytes io performed | 
 | 505 | bw=		Average bandwidth rate | 
 | 506 | runt=		The runtime of that thread | 
 | 507 | 	slat=	Submission latency (avg being the average, dev being the | 
 | 508 | 		standard deviation). This is the time it took to submit | 
 | 509 | 		the io. For sync io, the slat is really the completion | 
 | 510 | 		latency, since queue/complete is one operation there. | 
 | 511 | 	clat=	Completion latency. Same names as slat, this denotes the | 
 | 512 | 		time from submission to completion of the io pieces. For | 
 | 513 | 		sync io, clat will usually be equal (or very close) to 0, | 
 | 514 | 		as the time from submit to complete is basically just | 
 | 515 | 		CPU time (io has already been done, see slat explanation). | 
 | 516 | 	bw=	Bandwidth. Same names as the xlat stats, but also includes | 
 | 517 | 		an approximate percentage of total aggregate bandwidth | 
 | 518 | 		this thread received in this group. This last value is | 
 | 519 | 		only really useful if the threads in this group are on the | 
 | 520 | 		same disk, since they are then competing for disk access. | 
 | 521 | cpu=		CPU usage. User and system time, along with the number | 
 | 522 | 		of context switches this thread went through. | 
 | 523 |  | 
 | 524 | After each client has been listed, the group statistics are printed. They | 
 | 525 | will look like this: | 
 | 526 |  | 
 | 527 | Run status group 0 (all jobs): | 
 | 528 |    READ: io=64MiB, aggrb=22178, minb=11355, maxb=11814, mint=2840msec, maxt=2955msec | 
 | 529 |   WRITE: io=64MiB, aggrb=1302, minb=666, maxb=669, mint=50093msec, maxt=50320msec | 
 | 530 |  | 
 | 531 | For each data direction, it prints: | 
 | 532 |  | 
 | 533 | io=		Number of megabytes io performed. | 
 | 534 | aggrb=		Aggregate bandwidth of threads in this group. | 
 | 535 | minb=		The minimum average bandwidth a thread saw. | 
 | 536 | maxb=		The maximum average bandwidth a thread saw. | 
 | 537 | mint=		The smallest runtime of the threads in that group. | 
 | 538 | maxt=		The longest runtime of the threads in that group. | 
 | 539 |  | 
 | 540 | And finally, the disk statistics are printed. They will look like this: | 
 | 541 |  | 
 | 542 | Disk stats (read/write): | 
 | 543 |   sda: ios=16398/16511, merge=30/162, ticks=6853/819634, in_queue=826487, util=100.00% | 
 | 544 |  | 
 | 545 | Each value is printed for both reads and writes, with reads first. The | 
 | 546 | numbers denote: | 
 | 547 |  | 
 | 548 | ios=		Number of ios performed by all groups. | 
 | 549 | merge=		Number of merges io the io scheduler. | 
 | 550 | ticks=		Number of ticks we kept the disk busy. | 
 | 551 | io_queue=	Total time spent in the disk queue. | 
 | 552 | util=		The disk utilization. A value of 100% means we kept the disk | 
 | 553 | 		busy constantly, 50% would be a disk idling half of the time. | 
 | 554 |  | 
 | 555 |  | 
 | 556 | 7.0 Terse output | 
 | 557 | ---------------- | 
 | 558 |  | 
 | 559 | For scripted usage where you typically want to generate tables or graphs | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 560 | of the results, fio can output the results in a comma separated format. | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 561 | The format is one long line of values, such as: | 
 | 562 |  | 
 | 563 | client1,0,0,936,331,2894,0,0,0.000000,0.000000,1,170,22.115385,34.290410,16,714,84.252874%,366.500000,566.417819,3496,1237,2894,0,0,0.000000,0.000000,0,246,6.671625,21.436952,0,2534,55.465300%,1406.600000,2008.044216,0.000000%,0.431928%,1109 | 
 | 564 |  | 
 | 565 | Split up, the format is as follows: | 
 | 566 |  | 
 | 567 | 	jobname, groupid, error | 
 | 568 | 	READ status: | 
 | 569 | 		KiB IO, bandwidth (KiB/sec), runtime (msec) | 
 | 570 | 		Submission latency: min, max, mean, deviation | 
 | 571 | 		Completion latency: min, max, mean, deviation | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 572 | 		Bw: min, max, aggregate percentage of total, mean, deviation | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 573 | 	WRITE status: | 
 | 574 | 		KiB IO, bandwidth (KiB/sec), runtime (msec) | 
 | 575 | 		Submission latency: min, max, mean, deviation | 
 | 576 | 		Completion latency: min, max, mean, deviation | 
| Jens Axboe | 6c21976 | 2006-11-03 15:51:45 +0100 | [diff] [blame] | 577 | 		Bw: min, max, aggregate percentage of total, mean, deviation | 
| Jens Axboe | 71bfa16 | 2006-10-25 11:08:19 +0200 | [diff] [blame] | 578 | 	CPU usage: user, system, context switches | 
 | 579 |  |