Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 1 | :mod:`binascii` --- Convert between binary and ASCII |
| 2 | ==================================================== |
| 3 | |
| 4 | .. module:: binascii |
| 5 | :synopsis: Tools for converting between binary and various ASCII-encoded binary |
| 6 | representations. |
| 7 | |
| 8 | |
| 9 | .. index:: |
| 10 | module: uu |
| 11 | module: base64 |
| 12 | module: binhex |
| 13 | |
| 14 | The :mod:`binascii` module contains a number of methods to convert between |
| 15 | binary and various ASCII-encoded binary representations. Normally, you will not |
| 16 | use these functions directly but use wrapper modules like :mod:`uu`, |
| 17 | :mod:`base64`, or :mod:`binhex` instead. The :mod:`binascii` module contains |
| 18 | low-level functions written in C for greater speed that are used by the |
| 19 | higher-level modules. |
| 20 | |
Florent Xicluna | f1046ca | 2010-07-27 21:20:15 +0000 | [diff] [blame] | 21 | .. note:: |
| 22 | |
| 23 | Encoding and decoding functions do not accept Unicode strings. Only bytestring |
| 24 | and bytearray objects can be processed. |
| 25 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 26 | The :mod:`binascii` module defines the following functions: |
| 27 | |
| 28 | |
| 29 | .. function:: a2b_uu(string) |
| 30 | |
| 31 | Convert a single line of uuencoded data back to binary and return the binary |
| 32 | data. Lines normally contain 45 (binary) bytes, except for the last line. Line |
| 33 | data may be followed by whitespace. |
| 34 | |
| 35 | |
| 36 | .. function:: b2a_uu(data) |
| 37 | |
| 38 | Convert binary data to a line of ASCII characters, the return value is the |
| 39 | converted line, including a newline char. The length of *data* should be at most |
| 40 | 45. |
| 41 | |
| 42 | |
| 43 | .. function:: a2b_base64(string) |
| 44 | |
| 45 | Convert a block of base64 data back to binary and return the binary data. More |
| 46 | than one line may be passed at a time. |
| 47 | |
| 48 | |
| 49 | .. function:: b2a_base64(data) |
| 50 | |
| 51 | Convert binary data to a line of ASCII characters in base64 coding. The return |
| 52 | value is the converted line, including a newline char. The length of *data* |
| 53 | should be at most 57 to adhere to the base64 standard. |
| 54 | |
| 55 | |
Georg Brandl | b868a66 | 2009-04-02 02:56:10 +0000 | [diff] [blame] | 56 | .. function:: a2b_qp(string, header=False) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 57 | |
| 58 | Convert a block of quoted-printable data back to binary and return the binary |
| 59 | data. More than one line may be passed at a time. If the optional argument |
| 60 | *header* is present and true, underscores will be decoded as spaces. |
| 61 | |
Florent Xicluna | f1046ca | 2010-07-27 21:20:15 +0000 | [diff] [blame] | 62 | .. versionchanged:: 3.2 |
Georg Brandl | 67b21b7 | 2010-08-17 15:07:14 +0000 | [diff] [blame] | 63 | Accept only bytestring or bytearray objects as input. |
Florent Xicluna | f1046ca | 2010-07-27 21:20:15 +0000 | [diff] [blame] | 64 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 65 | |
Georg Brandl | b868a66 | 2009-04-02 02:56:10 +0000 | [diff] [blame] | 66 | .. function:: b2a_qp(data, quotetabs=False, istext=True, header=False) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 67 | |
| 68 | Convert binary data to a line(s) of ASCII characters in quoted-printable |
| 69 | encoding. The return value is the converted line(s). If the optional argument |
| 70 | *quotetabs* is present and true, all tabs and spaces will be encoded. If the |
| 71 | optional argument *istext* is present and true, newlines are not encoded but |
| 72 | trailing whitespace will be encoded. If the optional argument *header* is |
| 73 | present and true, spaces will be encoded as underscores per RFC1522. If the |
| 74 | optional argument *header* is present and false, newline characters will be |
| 75 | encoded as well; otherwise linefeed conversion might corrupt the binary data |
| 76 | stream. |
| 77 | |
| 78 | |
| 79 | .. function:: a2b_hqx(string) |
| 80 | |
| 81 | Convert binhex4 formatted ASCII data to binary, without doing RLE-decompression. |
| 82 | The string should contain a complete number of binary bytes, or (in case of the |
| 83 | last portion of the binhex4 data) have the remaining bits zero. |
| 84 | |
| 85 | |
| 86 | .. function:: rledecode_hqx(data) |
| 87 | |
| 88 | Perform RLE-decompression on the data, as per the binhex4 standard. The |
| 89 | algorithm uses ``0x90`` after a byte as a repeat indicator, followed by a count. |
| 90 | A count of ``0`` specifies a byte value of ``0x90``. The routine returns the |
| 91 | decompressed data, unless data input data ends in an orphaned repeat indicator, |
| 92 | in which case the :exc:`Incomplete` exception is raised. |
| 93 | |
Florent Xicluna | f1046ca | 2010-07-27 21:20:15 +0000 | [diff] [blame] | 94 | .. versionchanged:: 3.2 |
Georg Brandl | 67b21b7 | 2010-08-17 15:07:14 +0000 | [diff] [blame] | 95 | Accept only bytestring or bytearray objects as input. |
Florent Xicluna | f1046ca | 2010-07-27 21:20:15 +0000 | [diff] [blame] | 96 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 97 | |
| 98 | .. function:: rlecode_hqx(data) |
| 99 | |
| 100 | Perform binhex4 style RLE-compression on *data* and return the result. |
| 101 | |
| 102 | |
| 103 | .. function:: b2a_hqx(data) |
| 104 | |
| 105 | Perform hexbin4 binary-to-ASCII translation and return the resulting string. The |
| 106 | argument should already be RLE-coded, and have a length divisible by 3 (except |
| 107 | possibly the last fragment). |
| 108 | |
| 109 | |
| 110 | .. function:: crc_hqx(data, crc) |
| 111 | |
| 112 | Compute the binhex4 crc value of *data*, starting with an initial *crc* and |
| 113 | returning the result. |
| 114 | |
| 115 | |
| 116 | .. function:: crc32(data[, crc]) |
| 117 | |
| 118 | Compute CRC-32, the 32-bit checksum of data, starting with an initial crc. This |
| 119 | is consistent with the ZIP file checksum. Since the algorithm is designed for |
| 120 | use as a checksum algorithm, it is not suitable for use as a general hash |
| 121 | algorithm. Use as follows:: |
| 122 | |
Georg Brandl | ede6c2a | 2010-01-05 10:22:04 +0000 | [diff] [blame] | 123 | print(binascii.crc32(b"hello world")) |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 124 | # Or, in two pieces: |
Georg Brandl | ede6c2a | 2010-01-05 10:22:04 +0000 | [diff] [blame] | 125 | crc = binascii.crc32(b"hello") |
| 126 | crc = binascii.crc32(b" world", crc) & 0xffffffff |
| 127 | print('crc32 = {:#010x}'.format(crc)) |
Benjamin Peterson | 058e31e | 2009-01-16 03:54:08 +0000 | [diff] [blame] | 128 | |
| 129 | .. note:: |
| 130 | To generate the same numeric value across all Python versions and |
| 131 | platforms use crc32(data) & 0xffffffff. If you are only using |
| 132 | the checksum in packed binary format this is not necessary as the |
Gregory P. Smith | fa6cf39 | 2009-02-01 00:30:50 +0000 | [diff] [blame] | 133 | return value is the correct 32bit binary representation |
Benjamin Peterson | 058e31e | 2009-01-16 03:54:08 +0000 | [diff] [blame] | 134 | regardless of sign. |
| 135 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 136 | |
| 137 | .. function:: b2a_hex(data) |
| 138 | hexlify(data) |
| 139 | |
| 140 | Return the hexadecimal representation of the binary *data*. Every byte of |
| 141 | *data* is converted into the corresponding 2-digit hex representation. The |
| 142 | resulting string is therefore twice as long as the length of *data*. |
| 143 | |
| 144 | |
| 145 | .. function:: a2b_hex(hexstr) |
| 146 | unhexlify(hexstr) |
| 147 | |
| 148 | Return the binary data represented by the hexadecimal string *hexstr*. This |
| 149 | function is the inverse of :func:`b2a_hex`. *hexstr* must contain an even number |
| 150 | of hexadecimal digits (which can be upper or lower case), otherwise a |
| 151 | :exc:`TypeError` is raised. |
| 152 | |
Florent Xicluna | f1046ca | 2010-07-27 21:20:15 +0000 | [diff] [blame] | 153 | .. versionchanged:: 3.2 |
Georg Brandl | 67b21b7 | 2010-08-17 15:07:14 +0000 | [diff] [blame] | 154 | Accept only bytestring or bytearray objects as input. |
Florent Xicluna | f1046ca | 2010-07-27 21:20:15 +0000 | [diff] [blame] | 155 | |
Georg Brandl | 116aa62 | 2007-08-15 14:28:22 +0000 | [diff] [blame] | 156 | |
| 157 | .. exception:: Error |
| 158 | |
| 159 | Exception raised on errors. These are usually programming errors. |
| 160 | |
| 161 | |
| 162 | .. exception:: Incomplete |
| 163 | |
| 164 | Exception raised on incomplete data. These are usually not programming errors, |
| 165 | but may be handled by reading a little more data and trying again. |
| 166 | |
| 167 | |
| 168 | .. seealso:: |
| 169 | |
| 170 | Module :mod:`base64` |
| 171 | Support for base64 encoding used in MIME email messages. |
| 172 | |
| 173 | Module :mod:`binhex` |
| 174 | Support for the binhex format used on the Macintosh. |
| 175 | |
| 176 | Module :mod:`uu` |
| 177 | Support for UU encoding used on Unix. |
| 178 | |
| 179 | Module :mod:`quopri` |
| 180 | Support for quoted-printable encoding used in MIME email messages. |