blob: 13a7cac8043083c976fe41e0cba766bb3059db08 [file] [log] [blame]
David Woodhouse709ac062009-01-07 09:54:24 -05001
Eric Sandeenc854a992013-03-26 19:36:12 +00002BTRFS
3=====
David Woodhouse709ac062009-01-07 09:54:24 -05004
Eric Sandeenc854a992013-03-26 19:36:12 +00005Btrfs is a copy on write filesystem for Linux aimed at
David Woodhouse709ac062009-01-07 09:54:24 -05006implementing advanced features while focusing on fault tolerance,
7repair and easy administration. Initially developed by Oracle, Btrfs
8is licensed under the GPL and open for contribution from anyone.
9
10Linux has a wealth of filesystems to choose from, but we are facing a
11number of challenges with scaling to the large storage subsystems that
12are becoming common in today's data centers. Filesystems need to scale
13in their ability to address and manage large storage, and also in
14their ability to detect, repair and tolerate errors in the data stored
15on disk. Btrfs is under heavy development, and is not suitable for
16any uses other than benchmarking and review. The Btrfs disk format is
17not yet finalized.
18
19The main Btrfs features include:
20
21 * Extent based file storage (2^64 max file size)
22 * Space efficient packing of small files
23 * Space efficient indexed directories
24 * Dynamic inode allocation
25 * Writable snapshots
26 * Subvolumes (separate internal filesystem roots)
27 * Object level mirroring and striping
28 * Checksums on data and metadata (multiple algorithms available)
29 * Compression
30 * Integrated multiple device support, with several raid algorithms
31 * Online filesystem check (not yet implemented)
32 * Very fast offline filesystem check
33 * Efficient incremental backup and FS mirroring (not yet implemented)
34 * Online filesystem defragmentation
35
36
Eric Sandeenc854a992013-03-26 19:36:12 +000037Mount Options
38=============
David Woodhouse709ac062009-01-07 09:54:24 -050039
Eric Sandeenc854a992013-03-26 19:36:12 +000040When mounting a btrfs filesystem, the following option are accepted.
Qu Wenruo842bef52014-01-06 09:58:25 +080041Options with (*) are default options and will not show in the mount options.
Eric Sandeenc854a992013-03-26 19:36:12 +000042
43 alloc_start=<bytes>
44 Debugging option to force all block allocations above a certain
45 byte threshold on each block device. The value is specified in
46 bytes, optionally with a K, M, or G suffix, case insensitive.
47 Default is 1MB.
48
Qu Wenruofc0ca9af2014-01-06 09:58:26 +080049 noautodefrag(*)
Eric Sandeenc854a992013-03-26 19:36:12 +000050 autodefrag
Qu Wenruofc0ca9af2014-01-06 09:58:26 +080051 Disable/enable auto defragmentation.
52 Auto defragmentation detects small random writes into files and queue
53 them up for the defrag process. Works best for small files;
54 Not well suited for large database workloads.
Eric Sandeenc854a992013-03-26 19:36:12 +000055
56 check_int
57 check_int_data
58 check_int_print_mask=<value>
59 These debugging options control the behavior of the integrity checking
60 module (the BTRFS_FS_CHECK_INTEGRITY config option required).
61
62 check_int enables the integrity checker module, which examines all
63 block write requests to ensure on-disk consistency, at a large
64 memory and CPU cost.
65
66 check_int_data includes extent data in the integrity checks, and
67 implies the check_int option.
68
69 check_int_print_mask takes a bitmask of BTRFSIC_PRINT_MASK_* values
70 as defined in fs/btrfs/check-integrity.c, to control the integrity
71 checker module behavior.
72
73 See comments at the top of fs/btrfs/check-integrity.c for more info.
74
David Sterba906c1762013-11-20 15:05:51 +010075 commit=<seconds>
76 Set the interval of periodic commit, 30 seconds by default. Higher
77 values defer data being synced to permanent storage with obvious
78 consequences when the system crashes. The upper bound is not forced,
79 but a warning is printed if it's more than 300 seconds (5 minutes).
80
Eric Sandeenc854a992013-03-26 19:36:12 +000081 compress
82 compress=<type>
83 compress-force
84 compress-force=<type>
85 Control BTRFS file data compression. Type may be specified as "zlib"
86 "lzo" or "no" (for no compression, used for remounting). If no type
87 is specified, zlib is used. If compress-force is specified,
88 all files will be compressed, whether or not they compress well.
89 If compression is enabled, nodatacow and nodatasum are disabled.
90
91 degraded
92 Allow mounts to continue with missing devices. A read-write mount may
93 fail with too many devices missing, for example if a stripe member
94 is completely missing.
95
96 device=<devicepath>
97 Specify a device during mount so that ioctls on the control device
Masanari Iida9ed354b2013-08-20 20:33:17 +090098 can be avoided. Especially useful when trying to mount a multi-device
Eric Sandeenc854a992013-03-26 19:36:12 +000099 setup as root. May be specified multiple times for multiple devices.
100
Qu Wenruoe07a2ad2014-01-06 09:58:27 +0800101 nodiscard(*)
Eric Sandeenc854a992013-03-26 19:36:12 +0000102 discard
Qu Wenruoe07a2ad2014-01-06 09:58:27 +0800103 Disable/enable discard mount option.
104 Discard issues frequent commands to let the block device reclaim space
105 freed by the filesystem.
106 This is useful for SSD devices, thinly provisioned
Eric Sandeenc854a992013-03-26 19:36:12 +0000107 LUNs and virtual machine images, but may have a significant
108 performance impact. (The fstrim command is also available to
109 initiate batch trims from userspace).
110
Qu Wenruo53036292014-01-06 09:58:28 +0800111 noenospc_debug(*)
Eric Sandeenc854a992013-03-26 19:36:12 +0000112 enospc_debug
Qu Wenruo53036292014-01-06 09:58:28 +0800113 Disable/enable debugging option to be more verbose in some ENOSPC conditions.
Eric Sandeenc854a992013-03-26 19:36:12 +0000114
115 fatal_errors=<action>
116 Action to take when encountering a fatal error:
117 "bug" - BUG() on a fatal error. This is the default.
118 "panic" - panic() on a fatal error.
119
120 flushoncommit
121 The 'flushoncommit' mount option forces any data dirtied by a write in a
122 prior transaction to commit as part of the current commit. This makes
123 the committed state a fully consistent view of the file system from the
124 application's perspective (i.e., it includes all completed file system
125 operations). This was previously the behavior only when a snapshot is
126 created.
127
128 inode_cache
129 Enable free inode number caching. Defaults to off due to an overflow
130 problem when the free space crcs don't fit inside a single page.
131
132 max_inline=<bytes>
133 Specify the maximum amount of space, in bytes, that can be inlined in
134 a metadata B-tree leaf. The value is specified in bytes, optionally
135 with a K, M, or G suffix, case insensitive. In practice, this value
136 is limited by the root sector size, with some space unavailable due
137 to leaf headers. For a 4k sectorsize, max inline data is ~3900 bytes.
138
139 metadata_ratio=<value>
140 Specify that 1 metadata chunk should be allocated after every <value>
141 data chunks. Off by default.
142
143 noacl
144 Disable support for Posix Access Control Lists (ACLs). See the
145 acl(5) manual page for more information about ACLs.
146
Qu Wenruo842bef52014-01-06 09:58:25 +0800147 barrier(*)
Eric Sandeenc854a992013-03-26 19:36:12 +0000148 nobarrier
Qu Wenruo842bef52014-01-06 09:58:25 +0800149 Enable/disable the use of block layer write barriers. Write barriers
150 ensure that certain IOs make it through the device cache and are on
151 persistent storage. If disabled on a device with a volatile
152 (non-battery-backed) write-back cache, nobarrier option will lead to
153 filesystem corruption on a system crash or power loss.
Eric Sandeenc854a992013-03-26 19:36:12 +0000154
155 nodatacow
156 Disable data copy-on-write for newly created files. Implies nodatasum,
157 and disables all compression.
158
159 nodatasum
160 Disable data checksumming for newly created files.
161
162 notreelog
163 Disable the tree logging used for fsync and O_SYNC writes.
164
165 recovery
166 Enable autorecovery attempts if a bad tree root is found at mount time.
167 Currently this scans a list of several previous tree roots and tries to
168 use the first readable.
169
David Sterba906c1762013-11-20 15:05:51 +0100170 rescan_uuid_tree
171 Force check and rebuild procedure of the UUID tree. This should not
172 normally be needed.
173
174 skip_balance
Eric Sandeenc854a992013-03-26 19:36:12 +0000175 Skip automatic resume of interrupted balance operation after mount.
176 May be resumed with "btrfs balance resume."
177
178 space_cache (*)
179 Enable the on-disk freespace cache.
180 nospace_cache
181 Disable freespace cache loading without clearing the cache.
182 clear_cache
183 Force clearing and rebuilding of the disk space cache if something
184 has gone wrong.
185
186 ssd
187 nossd
188 ssd_spread
189 Options to control ssd allocation schemes. By default, BTRFS will
190 enable or disable ssd allocation heuristics depending on whether a
191 rotational or nonrotational disk is in use. The ssd and nossd options
192 can override this autodetection.
193
194 The ssd_spread mount option attempts to allocate into big chunks
195 of unused space, and may perform better on low-end ssds. ssd_spread
196 implies ssd, enabling all other ssd heuristics as well.
197
198 subvol=<path>
199 Mount subvolume at <path> rather than the root subvolume. <path> is
200 relative to the top level subvolume.
201
202 subvolid=<ID>
203 Mount subvolume specified by an ID number rather than the root subvolume.
204 This allows mounting of subvolumes which are not in the root of the mounted
205 filesystem.
206 You can use "btrfs subvolume list" to see subvolume ID numbers.
207
208 subvolrootid=<objectid> (deprecated)
209 Mount subvolume specified by <objectid> rather than the root subvolume.
210 This allows mounting of subvolumes which are not in the root of the mounted
211 filesystem.
212 You can use "btrfs subvolume show " to see the object ID for a subvolume.
213
214 thread_pool=<number>
215 The number of worker threads to allocate. The default number is equal
216 to the number of CPUs + 2, or 8, whichever is smaller.
217
218 user_subvol_rm_allowed
219 Allow subvolumes to be deleted by a non-root user. Use with caution.
220
221MAILING LIST
222============
David Woodhouse709ac062009-01-07 09:54:24 -0500223
224There is a Btrfs mailing list hosted on vger.kernel.org. You can
225find details on how to subscribe here:
226
227http://vger.kernel.org/vger-lists.html#linux-btrfs
228
229Mailing list archives are available from gmane:
230
231http://dir.gmane.org/gmane.comp.file-systems.btrfs
232
233
234
Eric Sandeenc854a992013-03-26 19:36:12 +0000235IRC
236===
David Woodhouse709ac062009-01-07 09:54:24 -0500237
238Discussion of Btrfs also occurs on the #btrfs channel of the Freenode
239IRC network.
240
241
242
243 UTILITIES
244 =========
245
246Userspace tools for creating and manipulating Btrfs file systems are
247available from the git repository at the following location:
248
Arnd Hannemannb52f75a2011-11-16 17:35:37 +0100249 http://git.kernel.org/?p=linux/kernel/git/mason/btrfs-progs.git
250 git://git.kernel.org/pub/scm/linux/kernel/git/mason/btrfs-progs.git
David Woodhouse709ac062009-01-07 09:54:24 -0500251
252These include the following tools:
253
David Sterbac7501792013-11-20 15:06:08 +0100254* mkfs.btrfs: create a filesystem
David Woodhouse709ac062009-01-07 09:54:24 -0500255
David Sterbac7501792013-11-20 15:06:08 +0100256* btrfs: a single tool to manage the filesystems, refer to the manpage for more details
David Woodhouse709ac062009-01-07 09:54:24 -0500257
David Sterbac7501792013-11-20 15:06:08 +0100258* 'btrfsck' or 'btrfs check': do a consistency check of the filesystem
David Woodhouse709ac062009-01-07 09:54:24 -0500259
David Sterbac7501792013-11-20 15:06:08 +0100260Other tools for specific tasks:
David Woodhouse709ac062009-01-07 09:54:24 -0500261
David Sterbac7501792013-11-20 15:06:08 +0100262* btrfs-convert: in-place conversion from ext2/3/4 filesystems
David Woodhouse709ac062009-01-07 09:54:24 -0500263
David Sterbac7501792013-11-20 15:06:08 +0100264* btrfs-image: dump filesystem metadata for debugging