commit | 489b56e04480b8ca3f2d1676265e67c65bae788d | [log] [tgz] |
---|---|---|
author | Marc-André Lemburg <mal@egenix.com> | Mon May 21 20:30:15 2001 +0000 |
committer | Marc-André Lemburg <mal@egenix.com> | Mon May 21 20:30:15 2001 +0000 |
tree | a148a1f74890d004f6434a77eb14185b76c73c77 | |
parent | f52d27e52d289b99837b4555fb3f757f2c89f4ad [diff] |
This patch changes the behaviour of the UTF-16 codec family. Only the UTF-16 codec will now interpret and remove a *leading* BOM mark. Sub- sequent BOM characters are no longer interpreted and removed. UTF-16-LE and -BE pass through all BOM mark characters. These changes should get the UTF-16 codec more in line with what the Unicode FAQ recommends w/r to BOM marks.