URL: https://github.com/hyphenation/tex-hyphen
Version: ecf976ab6995acb653d38ab1af0b9b9829ec0c77
License: various
License File: NOTICE

The tex-hyphen package contains hyphenation patterns under several different licenses.
Patterns licensed under the following open source licenses can be imported here:
* Apache 2.0
* Unlicense
* MIT
* BSD
* MPL
* LGPL 2.1

Description:
Hyphenation patterns for automatic hyphenation, taken from the TeX hyphenation
patterns project at http://www.hyphenation.org.

Patterns for a few languages come from other sources, listed below by repository:
Czech (cs) - https://github.com/tensojka/cshyphen
Dutch (nl) - https://github.com/OpenTaal/opentaal-hunspell
Swedish (sv) - https://cgit.freedesktop.org/libreoffice/dictionaries/tree/sv_SE/hyph_sv.dic
Russian (ru) - https://github.com/laboratory50/russian-spellpack

Local Modifications:
The patterns are taken from the hyph-utf8/tex/generic/hyph-utf8/patterns/txt directory.

Only the following patterns are included from that directory:

* Amharic (am)
* Armenian (hy)
* Assamese (as)
* Basque (eu)
* Belarusian (be)
* Bengali (bn)
* Bulgarian (bg)
* Church Slavonic (cu)
* Croatian (hr)
* Danish (da)
* Estonian (et)
* Ethiopic script (und-Ethi)
* French (fr)
* Galician (gl)
* German (de)
* Gujarati (gu)
* Hindi (hi)
* Hungarian (hu)
* Irish (ga)
* Kannada (kn)
* Latin (la)
* Latvian (lv)
* Lithuanian (lt)
* Malayalam (ml)
* Marathi (mr)
* Mongolian written in Cyrillic (mn-cyrl)
* Norwegian Bokmål (nb)
* Norwegian Nynorsk (nn)
* Oriya (or)
* Portuguese (pt)
* Punjabi (pa)
* Sanskrit (sa)
* Slovenian (sl)
* Spanish (es)
* Tamil (ta)
* Telugu (te)
* Turkmen (tk)
* Ukrainian (uk)
* US English (en-US)
* Welsh (cy)

The Ethiopic script pattern files have been renamed from mul-ethi to und-ethi,
to properly follow BCP 47 naming.

Where hyphenation patterns are dual-licensed, the hyph-{lang}.lic.txt files
are edited to keep only the license that Android applies, as listed in the
corresponding NOTICE files.

For the German (de) patterns, the *.chr.txt file has been automatically
generated from the contents of the pattern files, using the create_chr.py
script created by Android engineers.

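The idea of deriving a character list from a pattern file can be sketched
roughly as follows. This is not the actual create_chr.py script: the function
name build_chr_lines and the output layout (one line per lowercase letter,
followed by its uppercase form when that is a single character) are
assumptions made purely for illustration.

```python
from typing import List


def build_chr_lines(pattern_text: str) -> List[str]:
    """Collect the letters used in a pattern file and pair each with its uppercase.

    Hypothetical helper; the output layout is an assumption, not the
    documented *.chr.txt format.
    """
    # Keep only alphabetic characters; digits and dots in patterns are skipped.
    letters = {
        ch.lower()
        for ch in pattern_text
        if ch.isalpha() and len(ch.lower()) == 1
    }
    lines = []
    for ch in sorted(letters):
        upper = ch.upper()
        # Some letters, such as the German sharp s, have no single-character
        # uppercase form in Python; those are emitted on their own.
        if len(upper) == 1 and upper != ch:
            lines.append(ch + upper)
        else:
            lines.append(ch)
    return lines
```

Calling build_chr_lines on the text of a pattern file returns the lines to
write out, one per letter found in the patterns.
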
British English (en-GB) patterns are taken from a different part of the
tex-hyphen repository. Please see the README.android file in the 'en-GB'
directory.

US English (en-US) patterns have been modified to remove patterns added to
the end of the file that had been incorrectly computed based on the TUGboat
hyphenation exception list. To compensate for that, the TUGboat hyphenation
exception list for US English has been downloaded from
<http://mirror.ctan.org/info/digests/tugboat/hyphenex/ushyphex.tex> and
explicitly added to the end of the hyphenation exception file for US
English.

Swedish (sv) patterns are converted to UTF-8, and the first three lines
(encoding, leftmin, rightmin) are removed so that *.pat.txt contains only
the hyphenation patterns.
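
The Swedish conversion described above could be done along these lines. This
is a minimal sketch, not the procedure actually used: the function names are
hypothetical, and it assumes that the first line of the source file is an
encoding name Python recognizes and that the next two lines are the leftmin
and rightmin settings.

```python
import codecs


def strip_hyph_header(raw: bytes) -> str:
    """Return only the pattern lines of the source file, decoded to text.

    Assumes line 1 names the encoding and lines 2-3 hold leftmin/rightmin.
    """
    first_line, _, rest = raw.partition(b"\n")
    encoding = first_line.decode("ascii").strip()
    codecs.lookup(encoding)  # raises LookupError if the name is unknown to Python
    lines = rest.decode(encoding).splitlines()
    # lines[0] and lines[1] are the leftmin and rightmin settings; drop them.
    return "\n".join(lines[2:]) + "\n"


def convert_file(src_path: str, dst_path: str) -> None:
    """Read the source file as bytes and write the patterns back as UTF-8."""
    with open(src_path, "rb") as src:
        patterns = strip_hyph_header(src.read())
    with open(dst_path, "w", encoding="utf-8", newline="\n") as dst:
        dst.write(patterns)
```

Reading the header line first means the encoding does not have to be
hard-coded; if the header is missing or names an unknown encoding, the sketch
fails loudly instead of writing a mis-decoded file.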