Linus Torvalds | 1da177e | 2005-04-16 15:20:36 -0700 | [diff] [blame] | 1 | USING VFAT |
| 2 | ---------------------------------------------------------------------- |
| 3 | To use the vfat filesystem, use the filesystem type 'vfat'. i.e. |
| 4 | mount -t vfat /dev/fd0 /mnt |
| 5 | |
| 6 | No special partition formatter is required. mkdosfs will work fine |
| 7 | if you want to format from within Linux. |
| 8 | |
| 9 | VFAT MOUNT OPTIONS |
| 10 | ---------------------------------------------------------------------- |
| 11 | umask=### -- The permission mask (for files and directories, see umask(1)). |
| 12 | The default is the umask of current process. |
| 13 | |
| 14 | dmask=### -- The permission mask for the directory. |
| 15 | The default is the umask of current process. |
| 16 | |
| 17 | fmask=### -- The permission mask for files. |
| 18 | The default is the umask of current process. |
| 19 | |
| 20 | codepage=### -- Sets the codepage number for converting to shortname |
| 21 | characters on FAT filesystem. |
| 22 | By default, FAT_DEFAULT_CODEPAGE setting is used. |
| 23 | |
| 24 | iocharset=name -- Character set to use for converting between the |
| 25 | encoding is used for user visible filename and 16 bit |
| 26 | Unicode characters. Long filenames are stored on disk |
| 27 | in Unicode format, but Unix for the most part doesn't |
| 28 | know how to deal with Unicode. |
| 29 | By default, FAT_DEFAULT_IOCHARSET setting is used. |
| 30 | |
| 31 | There is also an option of doing UTF8 translations |
| 32 | with the utf8 option. |
| 33 | |
| 34 | NOTE: "iocharset=utf8" is not recommended. If unsure, |
| 35 | you should consider the following option instead. |
| 36 | |
| 37 | utf8=<bool> -- UTF8 is the filesystem safe version of Unicode that |
| 38 | is used by the console. It can be be enabled for the |
| 39 | filesystem with this option. If 'uni_xlate' gets set, |
| 40 | UTF8 gets disabled. |
| 41 | |
| 42 | uni_xlate=<bool> -- Translate unhandled Unicode characters to special |
| 43 | escaped sequences. This would let you backup and |
| 44 | restore filenames that are created with any Unicode |
| 45 | characters. Until Linux supports Unicode for real, |
| 46 | this gives you an alternative. Without this option, |
| 47 | a '?' is used when no translation is possible. The |
| 48 | escape character is ':' because it is otherwise |
| 49 | illegal on the vfat filesystem. The escape sequence |
| 50 | that gets used is ':' and the four digits of hexadecimal |
| 51 | unicode. |
| 52 | |
| 53 | nonumtail=<bool> -- When creating 8.3 aliases, normally the alias will |
| 54 | end in '~1' or tilde followed by some number. If this |
| 55 | option is set, then if the filename is |
| 56 | "longfilename.txt" and "longfile.txt" does not |
| 57 | currently exist in the directory, 'longfile.txt' will |
| 58 | be the short alias instead of 'longfi~1.txt'. |
| 59 | |
| 60 | quiet -- Stops printing certain warning messages. |
| 61 | |
| 62 | check=s|r|n -- Case sensitivity checking setting. |
| 63 | s: strict, case sensitive |
| 64 | r: relaxed, case insensitive |
| 65 | n: normal, default setting, currently case insensitive |
| 66 | |
| 67 | shortname=lower|win95|winnt|mixed |
| 68 | -- Shortname display/create setting. |
| 69 | lower: convert to lowercase for display, |
| 70 | emulate the Windows 95 rule for create. |
| 71 | win95: emulate the Windows 95 rule for display/create. |
| 72 | winnt: emulate the Windows NT rule for display/create. |
| 73 | mixed: emulate the Windows NT rule for display, |
| 74 | emulate the Windows 95 rule for create. |
| 75 | Default setting is `lower'. |
| 76 | |
| 77 | <bool>: 0,1,yes,no,true,false |
| 78 | |
| 79 | TODO |
| 80 | ---------------------------------------------------------------------- |
| 81 | * Need to get rid of the raw scanning stuff. Instead, always use |
| 82 | a get next directory entry approach. The only thing left that uses |
| 83 | raw scanning is the directory renaming code. |
| 84 | |
| 85 | |
| 86 | POSSIBLE PROBLEMS |
| 87 | ---------------------------------------------------------------------- |
| 88 | * vfat_valid_longname does not properly checked reserved names. |
| 89 | * When a volume name is the same as a directory name in the root |
| 90 | directory of the filesystem, the directory name sometimes shows |
| 91 | up as an empty file. |
| 92 | * autoconv option does not work correctly. |
| 93 | |
| 94 | BUG REPORTS |
| 95 | ---------------------------------------------------------------------- |
| 96 | If you have trouble with the VFAT filesystem, mail bug reports to |
| 97 | chaffee@bmrc.cs.berkeley.edu. Please specify the filename |
| 98 | and the operation that gave you trouble. |
| 99 | |
| 100 | TEST SUITE |
| 101 | ---------------------------------------------------------------------- |
| 102 | If you plan to make any modifications to the vfat filesystem, please |
| 103 | get the test suite that comes with the vfat distribution at |
| 104 | |
| 105 | http://bmrc.berkeley.edu/people/chaffee/vfat.html |
| 106 | |
| 107 | This tests quite a few parts of the vfat filesystem and additional |
| 108 | tests for new features or untested features would be appreciated. |
| 109 | |
| 110 | NOTES ON THE STRUCTURE OF THE VFAT FILESYSTEM |
| 111 | ---------------------------------------------------------------------- |
| 112 | (This documentation was provided by Galen C. Hunt <gchunt@cs.rochester.edu> |
| 113 | and lightly annotated by Gordon Chaffee). |
| 114 | |
| 115 | This document presents a very rough, technical overview of my |
| 116 | knowledge of the extended FAT file system used in Windows NT 3.5 and |
| 117 | Windows 95. I don't guarantee that any of the following is correct, |
| 118 | but it appears to be so. |
| 119 | |
| 120 | The extended FAT file system is almost identical to the FAT |
| 121 | file system used in DOS versions up to and including 6.223410239847 |
| 122 | :-). The significant change has been the addition of long file names. |
| 123 | These names support up to 255 characters including spaces and lower |
| 124 | case characters as opposed to the traditional 8.3 short names. |
| 125 | |
| 126 | Here is the description of the traditional FAT entry in the current |
| 127 | Windows 95 filesystem: |
| 128 | |
| 129 | struct directory { // Short 8.3 names |
| 130 | unsigned char name[8]; // file name |
| 131 | unsigned char ext[3]; // file extension |
| 132 | unsigned char attr; // attribute byte |
| 133 | unsigned char lcase; // Case for base and extension |
| 134 | unsigned char ctime_ms; // Creation time, milliseconds |
| 135 | unsigned char ctime[2]; // Creation time |
| 136 | unsigned char cdate[2]; // Creation date |
| 137 | unsigned char adate[2]; // Last access date |
| 138 | unsigned char reserved[2]; // reserved values (ignored) |
| 139 | unsigned char time[2]; // time stamp |
| 140 | unsigned char date[2]; // date stamp |
| 141 | unsigned char start[2]; // starting cluster number |
| 142 | unsigned char size[4]; // size of the file |
| 143 | }; |
| 144 | |
| 145 | The lcase field specifies if the base and/or the extension of an 8.3 |
| 146 | name should be capitalized. This field does not seem to be used by |
| 147 | Windows 95 but it is used by Windows NT. The case of filenames is not |
| 148 | completely compatible from Windows NT to Windows 95. It is not completely |
| 149 | compatible in the reverse direction, however. Filenames that fit in |
| 150 | the 8.3 namespace and are written on Windows NT to be lowercase will |
| 151 | show up as uppercase on Windows 95. |
| 152 | |
| 153 | Note that the "start" and "size" values are actually little |
| 154 | endian integer values. The descriptions of the fields in this |
| 155 | structure are public knowledge and can be found elsewhere. |
| 156 | |
| 157 | With the extended FAT system, Microsoft has inserted extra |
| 158 | directory entries for any files with extended names. (Any name which |
| 159 | legally fits within the old 8.3 encoding scheme does not have extra |
| 160 | entries.) I call these extra entries slots. Basically, a slot is a |
| 161 | specially formatted directory entry which holds up to 13 characters of |
| 162 | a file's extended name. Think of slots as additional labeling for the |
| 163 | directory entry of the file to which they correspond. Microsoft |
| 164 | prefers to refer to the 8.3 entry for a file as its alias and the |
| 165 | extended slot directory entries as the file name. |
| 166 | |
| 167 | The C structure for a slot directory entry follows: |
| 168 | |
| 169 | struct slot { // Up to 13 characters of a long name |
| 170 | unsigned char id; // sequence number for slot |
| 171 | unsigned char name0_4[10]; // first 5 characters in name |
| 172 | unsigned char attr; // attribute byte |
| 173 | unsigned char reserved; // always 0 |
| 174 | unsigned char alias_checksum; // checksum for 8.3 alias |
| 175 | unsigned char name5_10[12]; // 6 more characters in name |
| 176 | unsigned char start[2]; // starting cluster number |
| 177 | unsigned char name11_12[4]; // last 2 characters in name |
| 178 | }; |
| 179 | |
| 180 | If the layout of the slots looks a little odd, it's only |
| 181 | because of Microsoft's efforts to maintain compatibility with old |
| 182 | software. The slots must be disguised to prevent old software from |
| 183 | panicking. To this end, a number of measures are taken: |
| 184 | |
| 185 | 1) The attribute byte for a slot directory entry is always set |
| 186 | to 0x0f. This corresponds to an old directory entry with |
| 187 | attributes of "hidden", "system", "read-only", and "volume |
| 188 | label". Most old software will ignore any directory |
| 189 | entries with the "volume label" bit set. Real volume label |
| 190 | entries don't have the other three bits set. |
| 191 | |
| 192 | 2) The starting cluster is always set to 0, an impossible |
| 193 | value for a DOS file. |
| 194 | |
| 195 | Because the extended FAT system is backward compatible, it is |
| 196 | possible for old software to modify directory entries. Measures must |
| 197 | be taken to ensure the validity of slots. An extended FAT system can |
| 198 | verify that a slot does in fact belong to an 8.3 directory entry by |
| 199 | the following: |
| 200 | |
| 201 | 1) Positioning. Slots for a file always immediately proceed |
| 202 | their corresponding 8.3 directory entry. In addition, each |
| 203 | slot has an id which marks its order in the extended file |
| 204 | name. Here is a very abbreviated view of an 8.3 directory |
| 205 | entry and its corresponding long name slots for the file |
| 206 | "My Big File.Extension which is long": |
| 207 | |
| 208 | <proceeding files...> |
| 209 | <slot #3, id = 0x43, characters = "h is long"> |
| 210 | <slot #2, id = 0x02, characters = "xtension whic"> |
| 211 | <slot #1, id = 0x01, characters = "My Big File.E"> |
| 212 | <directory entry, name = "MYBIGFIL.EXT"> |
| 213 | |
| 214 | Note that the slots are stored from last to first. Slots |
| 215 | are numbered from 1 to N. The Nth slot is or'ed with 0x40 |
| 216 | to mark it as the last one. |
| 217 | |
| 218 | 2) Checksum. Each slot has an "alias_checksum" value. The |
| 219 | checksum is calculated from the 8.3 name using the |
| 220 | following algorithm: |
| 221 | |
| 222 | for (sum = i = 0; i < 11; i++) { |
| 223 | sum = (((sum&1)<<7)|((sum&0xfe)>>1)) + name[i] |
| 224 | } |
| 225 | |
| 226 | 3) If there is free space in the final slot, a Unicode NULL (0x0000) |
| 227 | is stored after the final character. After that, all unused |
| 228 | characters in the final slot are set to Unicode 0xFFFF. |
| 229 | |
| 230 | Finally, note that the extended name is stored in Unicode. Each Unicode |
| 231 | character takes two bytes. |