blob: 93748a2e47268d4285655a8a3eb73535579a71ad [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`gettext` --- Multilingual internationalization services
2=============================================================
3
4.. module:: gettext
5 :synopsis: Multilingual internationalization services.
Terry Jan Reedyfa089b92016-06-11 15:02:54 -04006
Andrew Kuchling587e9702013-11-12 10:02:35 -05007.. moduleauthor:: Barry A. Warsaw <barry@python.org>
8.. sectionauthor:: Barry A. Warsaw <barry@python.org>
Georg Brandl116aa622007-08-15 14:28:22 +00009
Raymond Hettinger469271d2011-01-27 20:38:46 +000010**Source code:** :source:`Lib/gettext.py`
11
12--------------
Georg Brandl116aa622007-08-15 14:28:22 +000013
14The :mod:`gettext` module provides internationalization (I18N) and localization
15(L10N) services for your Python modules and applications. It supports both the
16GNU ``gettext`` message catalog API and a higher level, class-based API that may
17be more appropriate for Python files. The interface described below allows you
18to write your module and application messages in one natural language, and
19provide a catalog of translated messages for running under different natural
20languages.
21
22Some hints on localizing your Python modules and applications are also given.
23
24
25GNU :program:`gettext` API
26--------------------------
27
28The :mod:`gettext` module defines the following API, which is very similar to
29the GNU :program:`gettext` API. If you use this API you will affect the
30translation of your entire application globally. Often this is what you want if
31your application is monolingual, with the choice of language dependent on the
32locale of your user. If you are localizing a Python module, or if your
33application needs to switch languages on the fly, you probably want to use the
34class-based API instead.
35
36
Georg Brandl036490d2009-05-17 13:00:36 +000037.. function:: bindtextdomain(domain, localedir=None)
Georg Brandl116aa622007-08-15 14:28:22 +000038
39 Bind the *domain* to the locale directory *localedir*. More concretely,
40 :mod:`gettext` will look for binary :file:`.mo` files for the given domain using
41 the path (on Unix): :file:`localedir/language/LC_MESSAGES/domain.mo`, where
42 *languages* is searched for in the environment variables :envvar:`LANGUAGE`,
43 :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and :envvar:`LANG` respectively.
44
45 If *localedir* is omitted or ``None``, then the current binding for *domain* is
46 returned. [#]_
47
48
Georg Brandl036490d2009-05-17 13:00:36 +000049.. function:: bind_textdomain_codeset(domain, codeset=None)
Georg Brandl116aa622007-08-15 14:28:22 +000050
Serhiy Storchaka26cb4652017-06-20 17:13:29 +030051 Bind the *domain* to *codeset*, changing the encoding of byte strings
52 returned by the :func:`lgettext`, :func:`ldgettext`, :func:`lngettext`
53 and :func:`ldngettext` functions.
54 If *codeset* is omitted, then the current binding is returned.
Georg Brandl116aa622007-08-15 14:28:22 +000055
Georg Brandl116aa622007-08-15 14:28:22 +000056
Georg Brandl036490d2009-05-17 13:00:36 +000057.. function:: textdomain(domain=None)
Georg Brandl116aa622007-08-15 14:28:22 +000058
59 Change or query the current global domain. If *domain* is ``None``, then the
60 current global domain is returned, otherwise the global domain is set to
61 *domain*, which is returned.
62
63
Serhiy Storchakaddb961d2018-10-26 09:00:49 +030064.. index:: single: _; gettext
Georg Brandl116aa622007-08-15 14:28:22 +000065.. function:: gettext(message)
66
67 Return the localized translation of *message*, based on the current global
68 domain, language, and locale directory. This function is usually aliased as
69 :func:`_` in the local namespace (see examples below).
70
71
Georg Brandl116aa622007-08-15 14:28:22 +000072.. function:: dgettext(domain, message)
73
Serhiy Storchaka26cb4652017-06-20 17:13:29 +030074 Like :func:`.gettext`, but look the message up in the specified *domain*.
Georg Brandl116aa622007-08-15 14:28:22 +000075
Georg Brandl116aa622007-08-15 14:28:22 +000076
77.. function:: ngettext(singular, plural, n)
78
Serhiy Storchaka26cb4652017-06-20 17:13:29 +030079 Like :func:`.gettext`, but consider plural forms. If a translation is found,
Georg Brandl116aa622007-08-15 14:28:22 +000080 apply the plural formula to *n*, and return the resulting message (some
81 languages have more than two plural forms). If no translation is found, return
82 *singular* if *n* is 1; return *plural* otherwise.
83
84 The Plural formula is taken from the catalog header. It is a C or Python
85 expression that has a free variable *n*; the expression evaluates to the index
Andrew Kuchling30c5ad22013-11-19 11:05:20 -050086 of the plural in the catalog. See
87 `the GNU gettext documentation <https://www.gnu.org/software/gettext/manual/gettext.html>`__
88 for the precise syntax to be used in :file:`.po` files and the
89 formulas for a variety of languages.
Georg Brandl116aa622007-08-15 14:28:22 +000090
Georg Brandl116aa622007-08-15 14:28:22 +000091
Georg Brandl116aa622007-08-15 14:28:22 +000092.. function:: dngettext(domain, singular, plural, n)
93
94 Like :func:`ngettext`, but look the message up in the specified *domain*.
95
Georg Brandl116aa622007-08-15 14:28:22 +000096
Serhiy Storchaka26cb4652017-06-20 17:13:29 +030097.. function:: lgettext(message)
98.. function:: ldgettext(domain, message)
99.. function:: lngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000100.. function:: ldngettext(domain, singular, plural, n)
101
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300102 Equivalent to the corresponding functions without the ``l`` prefix
103 (:func:`.gettext`, :func:`dgettext`, :func:`ngettext` and :func:`dngettext`),
104 but the translation is returned as a byte string encoded in the preferred
105 system encoding if no other encoding was explicitly set with
Georg Brandl116aa622007-08-15 14:28:22 +0000106 :func:`bind_textdomain_codeset`.
107
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300108 .. warning::
109
110 These functions should be avoided in Python 3, because they return
111 encoded bytes. It's much better to use alternatives which return
112 Unicode strings instead, since most Python applications will want to
113 manipulate human readable text as strings instead of bytes. Further,
114 it's possible that you may get unexpected Unicode-related exceptions
115 if there are encoding problems with the translated strings. It is
116 possible that the ``l*()`` functions will be deprecated in future Python
117 versions due to their inherent problems and limitations.
118
Georg Brandl116aa622007-08-15 14:28:22 +0000119
120Note that GNU :program:`gettext` also defines a :func:`dcgettext` method, but
121this was deemed not useful and so it is currently unimplemented.
122
123Here's an example of typical usage for this API::
124
125 import gettext
126 gettext.bindtextdomain('myapplication', '/path/to/my/language/directory')
127 gettext.textdomain('myapplication')
128 _ = gettext.gettext
129 # ...
Georg Brandl6911e3c2007-09-04 07:15:32 +0000130 print(_('This is a translatable string.'))
Georg Brandl116aa622007-08-15 14:28:22 +0000131
132
133Class-based API
134---------------
135
136The class-based API of the :mod:`gettext` module gives you more flexibility and
137greater convenience than the GNU :program:`gettext` API. It is the recommended
Serhiy Storchakac02a1f42017-10-04 20:28:20 +0300138way of localizing your Python applications and modules. :mod:`!gettext` defines
Georg Brandl116aa622007-08-15 14:28:22 +0000139a "translations" class which implements the parsing of GNU :file:`.mo` format
Georg Brandlf6945182008-02-01 11:56:49 +0000140files, and has methods for returning strings. Instances of this "translations"
141class can also install themselves in the built-in namespace as the function
142:func:`_`.
Georg Brandl116aa622007-08-15 14:28:22 +0000143
144
Georg Brandl036490d2009-05-17 13:00:36 +0000145.. function:: find(domain, localedir=None, languages=None, all=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000146
147 This function implements the standard :file:`.mo` file search algorithm. It
148 takes a *domain*, identical to what :func:`textdomain` takes. Optional
149 *localedir* is as in :func:`bindtextdomain` Optional *languages* is a list of
150 strings, where each string is a language code.
151
152 If *localedir* is not given, then the default system locale directory is used.
153 [#]_ If *languages* is not given, then the following environment variables are
154 searched: :envvar:`LANGUAGE`, :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and
155 :envvar:`LANG`. The first one returning a non-empty value is used for the
156 *languages* variable. The environment variables should contain a colon separated
157 list of languages, which will be split on the colon to produce the expected list
158 of language code strings.
159
160 :func:`find` then expands and normalizes the languages, and then iterates
161 through them, searching for an existing file built of these components:
162
Georg Brandl036490d2009-05-17 13:00:36 +0000163 :file:`{localedir}/{language}/LC_MESSAGES/{domain}.mo`
Georg Brandl116aa622007-08-15 14:28:22 +0000164
165 The first such file name that exists is returned by :func:`find`. If no such
166 file is found, then ``None`` is returned. If *all* is given, it returns a list
167 of all file names, in the order in which they appear in the languages list or
168 the environment variables.
169
170
Georg Brandl036490d2009-05-17 13:00:36 +0000171.. function:: translation(domain, localedir=None, languages=None, class_=None, fallback=False, codeset=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000172
Georg Brandlbded4d32008-07-17 18:15:35 +0000173 Return a :class:`Translations` instance based on the *domain*, *localedir*,
174 and *languages*, which are first passed to :func:`find` to get a list of the
Georg Brandl116aa622007-08-15 14:28:22 +0000175 associated :file:`.mo` file paths. Instances with identical :file:`.mo` file
Georg Brandlbded4d32008-07-17 18:15:35 +0000176 names are cached. The actual class instantiated is either *class_* if
177 provided, otherwise :class:`GNUTranslations`. The class's constructor must
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000178 take a single :term:`file object` argument. If provided, *codeset* will change
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300179 the charset used to encode translated strings in the
180 :meth:`~NullTranslations.lgettext` and :meth:`~NullTranslations.lngettext`
181 methods.
Georg Brandl116aa622007-08-15 14:28:22 +0000182
183 If multiple files are found, later files are used as fallbacks for earlier ones.
184 To allow setting the fallback, :func:`copy.copy` is used to clone each
185 translation object from the cache; the actual instance data is still shared with
186 the cache.
187
Antoine Pitrou62ab10a02011-10-12 20:10:51 +0200188 If no :file:`.mo` file is found, this function raises :exc:`OSError` if
Georg Brandl116aa622007-08-15 14:28:22 +0000189 *fallback* is false (which is the default), and returns a
190 :class:`NullTranslations` instance if *fallback* is true.
191
Antoine Pitrou62ab10a02011-10-12 20:10:51 +0200192 .. versionchanged:: 3.3
193 :exc:`IOError` used to be raised instead of :exc:`OSError`.
194
Georg Brandl116aa622007-08-15 14:28:22 +0000195
Georg Brandl036490d2009-05-17 13:00:36 +0000196.. function:: install(domain, localedir=None, codeset=None, names=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000197
Georg Brandl22b34312009-07-26 14:54:51 +0000198 This installs the function :func:`_` in Python's builtins namespace, based on
Georg Brandl116aa622007-08-15 14:28:22 +0000199 *domain*, *localedir*, and *codeset* which are passed to the function
Benjamin Peterson801844d2008-07-14 14:32:15 +0000200 :func:`translation`.
Georg Brandl116aa622007-08-15 14:28:22 +0000201
202 For the *names* parameter, please see the description of the translation
Alexandre Vassalotti6d3dfc32009-07-29 19:54:39 +0000203 object's :meth:`~NullTranslations.install` method.
Georg Brandl116aa622007-08-15 14:28:22 +0000204
205 As seen below, you usually mark the strings in your application that are
206 candidates for translation, by wrapping them in a call to the :func:`_`
207 function, like this::
208
Georg Brandl6911e3c2007-09-04 07:15:32 +0000209 print(_('This string will be translated.'))
Georg Brandl116aa622007-08-15 14:28:22 +0000210
211 For convenience, you want the :func:`_` function to be installed in Python's
Georg Brandl22b34312009-07-26 14:54:51 +0000212 builtins namespace, so it is easily accessible in all modules of your
Georg Brandl116aa622007-08-15 14:28:22 +0000213 application.
214
Georg Brandl116aa622007-08-15 14:28:22 +0000215
216The :class:`NullTranslations` class
217^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
218
219Translation classes are what actually implement the translation of original
220source file message strings to translated message strings. The base class used
221by all translation classes is :class:`NullTranslations`; this provides the basic
222interface you can use to write your own specialized translation classes. Here
Serhiy Storchakac02a1f42017-10-04 20:28:20 +0300223are the methods of :class:`!NullTranslations`:
Georg Brandl116aa622007-08-15 14:28:22 +0000224
225
Georg Brandl036490d2009-05-17 13:00:36 +0000226.. class:: NullTranslations(fp=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000227
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000228 Takes an optional :term:`file object` *fp*, which is ignored by the base class.
Georg Brandl116aa622007-08-15 14:28:22 +0000229 Initializes "protected" instance variables *_info* and *_charset* which are set
230 by derived classes, as well as *_fallback*, which is set through
231 :meth:`add_fallback`. It then calls ``self._parse(fp)`` if *fp* is not
232 ``None``.
233
Georg Brandlbded4d32008-07-17 18:15:35 +0000234 .. method:: _parse(fp)
Georg Brandl116aa622007-08-15 14:28:22 +0000235
Georg Brandlbded4d32008-07-17 18:15:35 +0000236 No-op'd in the base class, this method takes file object *fp*, and reads
237 the data from the file, initializing its message catalog. If you have an
238 unsupported message catalog file format, you should override this method
239 to parse your format.
Georg Brandl116aa622007-08-15 14:28:22 +0000240
241
Georg Brandlbded4d32008-07-17 18:15:35 +0000242 .. method:: add_fallback(fallback)
Georg Brandl116aa622007-08-15 14:28:22 +0000243
Georg Brandlbded4d32008-07-17 18:15:35 +0000244 Add *fallback* as the fallback object for the current translation object.
245 A translation object should consult the fallback if it cannot provide a
246 translation for a given message.
Georg Brandl116aa622007-08-15 14:28:22 +0000247
248
Georg Brandlbded4d32008-07-17 18:15:35 +0000249 .. method:: gettext(message)
Georg Brandl116aa622007-08-15 14:28:22 +0000250
Serhiy Storchakac02a1f42017-10-04 20:28:20 +0300251 If a fallback has been set, forward :meth:`!gettext` to the fallback.
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300252 Otherwise, return *message*. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000253
Georg Brandl116aa622007-08-15 14:28:22 +0000254
Georg Brandlbded4d32008-07-17 18:15:35 +0000255 .. method:: ngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000256
Serhiy Storchakac02a1f42017-10-04 20:28:20 +0300257 If a fallback has been set, forward :meth:`!ngettext` to the fallback.
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300258 Otherwise, return *singular* if *n* is 1; return *plural* otherwise.
259 Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000260
261
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300262 .. method:: lgettext(message)
Georg Brandlbded4d32008-07-17 18:15:35 +0000263 .. method:: lngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000264
Serhiy Storchakac02a1f42017-10-04 20:28:20 +0300265 Equivalent to :meth:`.gettext` and :meth:`.ngettext`, but the translation
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300266 is returned as a byte string encoded in the preferred system encoding
267 if no encoding was explicitly set with :meth:`set_output_charset`.
268 Overridden in derived classes.
269
270 .. warning::
271
272 These methods should be avoided in Python 3. See the warning for the
273 :func:`lgettext` function.
Georg Brandl116aa622007-08-15 14:28:22 +0000274
Georg Brandl116aa622007-08-15 14:28:22 +0000275
Georg Brandlbded4d32008-07-17 18:15:35 +0000276 .. method:: info()
Georg Brandl116aa622007-08-15 14:28:22 +0000277
Georg Brandlbded4d32008-07-17 18:15:35 +0000278 Return the "protected" :attr:`_info` variable.
Georg Brandl116aa622007-08-15 14:28:22 +0000279
Georg Brandl116aa622007-08-15 14:28:22 +0000280
Georg Brandlbded4d32008-07-17 18:15:35 +0000281 .. method:: charset()
Georg Brandl116aa622007-08-15 14:28:22 +0000282
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300283 Return the encoding of the message catalog file.
Georg Brandl116aa622007-08-15 14:28:22 +0000284
Georg Brandl116aa622007-08-15 14:28:22 +0000285
Georg Brandlbded4d32008-07-17 18:15:35 +0000286 .. method:: output_charset()
Georg Brandl116aa622007-08-15 14:28:22 +0000287
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300288 Return the encoding used to return translated messages in :meth:`.lgettext`
289 and :meth:`.lngettext`.
Georg Brandl116aa622007-08-15 14:28:22 +0000290
291
Georg Brandlbded4d32008-07-17 18:15:35 +0000292 .. method:: set_output_charset(charset)
Georg Brandl116aa622007-08-15 14:28:22 +0000293
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300294 Change the encoding used to return translated messages.
Georg Brandl116aa622007-08-15 14:28:22 +0000295
Benjamin Peterson801844d2008-07-14 14:32:15 +0000296
Georg Brandl036490d2009-05-17 13:00:36 +0000297 .. method:: install(names=None)
Benjamin Peterson801844d2008-07-14 14:32:15 +0000298
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300299 This method installs :meth:`.gettext` into the built-in namespace,
Georg Brandlbded4d32008-07-17 18:15:35 +0000300 binding it to ``_``.
Benjamin Peterson801844d2008-07-14 14:32:15 +0000301
Georg Brandlbded4d32008-07-17 18:15:35 +0000302 If the *names* parameter is given, it must be a sequence containing the
Georg Brandl22b34312009-07-26 14:54:51 +0000303 names of functions you want to install in the builtins namespace in
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300304 addition to :func:`_`. Supported names are ``'gettext'``, ``'ngettext'``,
Georg Brandlbded4d32008-07-17 18:15:35 +0000305 ``'lgettext'`` and ``'lngettext'``.
Benjamin Peterson801844d2008-07-14 14:32:15 +0000306
Georg Brandlbded4d32008-07-17 18:15:35 +0000307 Note that this is only one way, albeit the most convenient way, to make
308 the :func:`_` function available to your application. Because it affects
309 the entire application globally, and specifically the built-in namespace,
310 localized modules should never install :func:`_`. Instead, they should use
311 this code to make :func:`_` available to their module::
Benjamin Peterson801844d2008-07-14 14:32:15 +0000312
Georg Brandlbded4d32008-07-17 18:15:35 +0000313 import gettext
314 t = gettext.translation('mymodule', ...)
315 _ = t.gettext
Benjamin Peterson801844d2008-07-14 14:32:15 +0000316
Georg Brandlbded4d32008-07-17 18:15:35 +0000317 This puts :func:`_` only in the module's global namespace and so only
318 affects calls within this module.
Georg Brandl116aa622007-08-15 14:28:22 +0000319
Georg Brandl116aa622007-08-15 14:28:22 +0000320
321The :class:`GNUTranslations` class
322^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
323
324The :mod:`gettext` module provides one additional class derived from
325:class:`NullTranslations`: :class:`GNUTranslations`. This class overrides
326:meth:`_parse` to enable reading GNU :program:`gettext` format :file:`.mo` files
Benjamin Peterson801844d2008-07-14 14:32:15 +0000327in both big-endian and little-endian format.
Georg Brandl116aa622007-08-15 14:28:22 +0000328
329:class:`GNUTranslations` parses optional meta-data out of the translation
330catalog. It is convention with GNU :program:`gettext` to include meta-data as
331the translation for the empty string. This meta-data is in :rfc:`822`\ -style
332``key: value`` pairs, and should contain the ``Project-Id-Version`` key. If the
333key ``Content-Type`` is found, then the ``charset`` property is used to
334initialize the "protected" :attr:`_charset` instance variable, defaulting to
335``None`` if not found. If the charset encoding is specified, then all message
336ids and message strings read from the catalog are converted to Unicode using
Georg Brandlbded4d32008-07-17 18:15:35 +0000337this encoding, else ASCII encoding is assumed.
338
339Since message ids are read as Unicode strings too, all :meth:`*gettext` methods
340will assume message ids as Unicode strings, not byte strings.
Georg Brandl116aa622007-08-15 14:28:22 +0000341
342The entire set of key/value pairs are placed into a dictionary and set as the
343"protected" :attr:`_info` instance variable.
344
Antoine Pitroube8d06f2014-10-28 20:17:51 +0100345If the :file:`.mo` file's magic number is invalid, the major version number is
346unexpected, or if other problems occur while reading the file, instantiating a
347:class:`GNUTranslations` class can raise :exc:`OSError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000348
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300349.. class:: GNUTranslations
350
351 The following methods are overridden from the base class implementation:
352
353 .. method:: gettext(message)
354
355 Look up the *message* id in the catalog and return the corresponding message
356 string, as a Unicode string. If there is no entry in the catalog for the
357 *message* id, and a fallback has been set, the look up is forwarded to the
358 fallback's :meth:`~NullTranslations.gettext` method. Otherwise, the
359 *message* id is returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000360
361
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300362 .. method:: ngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000363
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300364 Do a plural-forms lookup of a message id. *singular* is used as the message id
365 for purposes of lookup in the catalog, while *n* is used to determine which
366 plural form to use. The returned message string is a Unicode string.
367
368 If the message id is not found in the catalog, and a fallback is specified,
369 the request is forwarded to the fallback's :meth:`~NullTranslations.ngettext`
370 method. Otherwise, when *n* is 1 *singular* is returned, and *plural* is
371 returned in all other cases.
372
373 Here is an example::
374
375 n = len(os.listdir('.'))
376 cat = GNUTranslations(somefile)
377 message = cat.ngettext(
378 'There is %(num)d file in this directory',
379 'There are %(num)d files in this directory',
380 n) % {'num': n}
Georg Brandl116aa622007-08-15 14:28:22 +0000381
382
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300383 .. method:: lgettext(message)
384 .. method:: lngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000385
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300386 Equivalent to :meth:`.gettext` and :meth:`.ngettext`, but the translation
387 is returned as a byte string encoded in the preferred system encoding
388 if no encoding was explicitly set with
389 :meth:`~NullTranslations.set_output_charset`.
Georg Brandl116aa622007-08-15 14:28:22 +0000390
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300391 .. warning::
Georg Brandl116aa622007-08-15 14:28:22 +0000392
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300393 These methods should be avoided in Python 3. See the warning for the
394 :func:`lgettext` function.
Georg Brandl116aa622007-08-15 14:28:22 +0000395
Georg Brandl116aa622007-08-15 14:28:22 +0000396
Georg Brandl116aa622007-08-15 14:28:22 +0000397Solaris message catalog support
398^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
399
400The Solaris operating system defines its own binary :file:`.mo` file format, but
401since no documentation can be found on this format, it is not supported at this
402time.
403
404
405The Catalog constructor
406^^^^^^^^^^^^^^^^^^^^^^^
407
408.. index:: single: GNOME
409
410GNOME uses a version of the :mod:`gettext` module by James Henstridge, but this
411version has a slightly different API. Its documented usage was::
412
413 import gettext
414 cat = gettext.Catalog(domain, localedir)
415 _ = cat.gettext
Georg Brandl6911e3c2007-09-04 07:15:32 +0000416 print(_('hello world'))
Georg Brandl116aa622007-08-15 14:28:22 +0000417
418For compatibility with this older module, the function :func:`Catalog` is an
419alias for the :func:`translation` function described above.
420
421One difference between this module and Henstridge's: his catalog objects
422supported access through a mapping API, but this appears to be unused and so is
423not currently supported.
424
425
426Internationalizing your programs and modules
427--------------------------------------------
428
429Internationalization (I18N) refers to the operation by which a program is made
430aware of multiple languages. Localization (L10N) refers to the adaptation of
431your program, once internationalized, to the local language and cultural habits.
432In order to provide multilingual messages for your Python programs, you need to
433take the following steps:
434
435#. prepare your program or module by specially marking translatable strings
436
437#. run a suite of tools over your marked files to generate raw messages catalogs
438
439#. create language specific translations of the message catalogs
440
441#. use the :mod:`gettext` module so that message strings are properly translated
442
443In order to prepare your code for I18N, you need to look at all the strings in
444your files. Any string that needs to be translated should be marked by wrapping
445it in ``_('...')`` --- that is, a call to the function :func:`_`. For example::
446
447 filename = 'mylog.txt'
448 message = _('writing a log message')
449 fp = open(filename, 'w')
450 fp.write(message)
451 fp.close()
452
453In this example, the string ``'writing a log message'`` is marked as a candidate
454for translation, while the strings ``'mylog.txt'`` and ``'w'`` are not.
455
Andrew Kuchling30c5ad22013-11-19 11:05:20 -0500456There are a few tools to extract the strings meant for translation.
457The original GNU :program:`gettext` only supported C or C++ source
458code but its extended version :program:`xgettext` scans code written
459in a number of languages, including Python, to find strings marked as
460translatable. `Babel <http://babel.pocoo.org/>`__ is a Python
461internationalization library that includes a :file:`pybabel` script to
462extract and compile message catalogs. François Pinard's program
463called :program:`xpot` does a similar job and is available as part of
Georg Brandl525d3552014-10-29 10:26:56 +0100464his `po-utils package <https://github.com/pinard/po-utils>`__.
Georg Brandl116aa622007-08-15 14:28:22 +0000465
Andrew Kuchling30c5ad22013-11-19 11:05:20 -0500466(Python also includes pure-Python versions of these programs, called
467:program:`pygettext.py` and :program:`msgfmt.py`; some Python distributions
468will install them for you. :program:`pygettext.py` is similar to
469:program:`xgettext`, but only understands Python source code and
470cannot handle other programming languages such as C or C++.
471:program:`pygettext.py` supports a command-line interface similar to
472:program:`xgettext`; for details on its use, run ``pygettext.py
473--help``. :program:`msgfmt.py` is binary compatible with GNU
474:program:`msgfmt`. With these two programs, you may not need the GNU
475:program:`gettext` package to internationalize your Python
476applications.)
Georg Brandl116aa622007-08-15 14:28:22 +0000477
Andrew Kuchling30c5ad22013-11-19 11:05:20 -0500478:program:`xgettext`, :program:`pygettext`, and similar tools generate
479:file:`.po` files that are message catalogs. They are structured
Georg Brandled007d52013-11-24 16:09:26 +0100480human-readable files that contain every marked string in the source
481code, along with a placeholder for the translated versions of these
482strings.
Georg Brandl116aa622007-08-15 14:28:22 +0000483
Andrew Kuchling30c5ad22013-11-19 11:05:20 -0500484Copies of these :file:`.po` files are then handed over to the
485individual human translators who write translations for every
486supported natural language. They send back the completed
487language-specific versions as a :file:`<language-name>.po` file that's
488compiled into a machine-readable :file:`.mo` binary catalog file using
489the :program:`msgfmt` program. The :file:`.mo` files are used by the
490:mod:`gettext` module for the actual translation processing at
491run-time.
Georg Brandl116aa622007-08-15 14:28:22 +0000492
493How you use the :mod:`gettext` module in your code depends on whether you are
494internationalizing a single module or your entire application. The next two
495sections will discuss each case.
496
497
498Localizing your module
499^^^^^^^^^^^^^^^^^^^^^^
500
501If you are localizing your module, you must take care not to make global
502changes, e.g. to the built-in namespace. You should not use the GNU ``gettext``
503API but instead the class-based API.
504
505Let's say your module is called "spam" and the module's various natural language
506translation :file:`.mo` files reside in :file:`/usr/share/locale` in GNU
507:program:`gettext` format. Here's what you would put at the top of your
508module::
509
510 import gettext
511 t = gettext.translation('spam', '/usr/share/locale')
Serhiy Storchaka26cb4652017-06-20 17:13:29 +0300512 _ = t.gettext
Georg Brandl116aa622007-08-15 14:28:22 +0000513
Georg Brandl116aa622007-08-15 14:28:22 +0000514
515Localizing your application
516^^^^^^^^^^^^^^^^^^^^^^^^^^^
517
518If you are localizing your application, you can install the :func:`_` function
519globally into the built-in namespace, usually in the main driver file of your
520application. This will let all your application-specific files just use
521``_('...')`` without having to explicitly install it in each file.
522
523In the simple case then, you need only add the following bit of code to the main
524driver file of your application::
525
526 import gettext
527 gettext.install('myapplication')
528
Andrew Kuchling30c5ad22013-11-19 11:05:20 -0500529If you need to set the locale directory, you can pass it into the
Benjamin Peterson801844d2008-07-14 14:32:15 +0000530:func:`install` function::
Georg Brandl116aa622007-08-15 14:28:22 +0000531
532 import gettext
Benjamin Peterson801844d2008-07-14 14:32:15 +0000533 gettext.install('myapplication', '/usr/share/locale')
Georg Brandl116aa622007-08-15 14:28:22 +0000534
535
536Changing languages on the fly
537^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
538
539If your program needs to support many languages at the same time, you may want
540to create multiple translation instances and then switch between them
541explicitly, like so::
542
543 import gettext
544
545 lang1 = gettext.translation('myapplication', languages=['en'])
546 lang2 = gettext.translation('myapplication', languages=['fr'])
547 lang3 = gettext.translation('myapplication', languages=['de'])
548
549 # start by using language1
550 lang1.install()
551
552 # ... time goes by, user selects language 2
553 lang2.install()
554
555 # ... more time goes by, user selects language 3
556 lang3.install()
557
558
559Deferred translations
560^^^^^^^^^^^^^^^^^^^^^
561
562In most coding situations, strings are translated where they are coded.
563Occasionally however, you need to mark strings for translation, but defer actual
564translation until later. A classic example is::
565
566 animals = ['mollusk',
567 'albatross',
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000568 'rat',
569 'penguin',
570 'python', ]
Georg Brandl116aa622007-08-15 14:28:22 +0000571 # ...
572 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000573 print(a)
Georg Brandl116aa622007-08-15 14:28:22 +0000574
575Here, you want to mark the strings in the ``animals`` list as being
576translatable, but you don't actually want to translate them until they are
577printed.
578
579Here is one way you can handle this situation::
580
581 def _(message): return message
582
583 animals = [_('mollusk'),
584 _('albatross'),
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000585 _('rat'),
586 _('penguin'),
587 _('python'), ]
Georg Brandl116aa622007-08-15 14:28:22 +0000588
589 del _
590
591 # ...
592 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000593 print(_(a))
Georg Brandl116aa622007-08-15 14:28:22 +0000594
595This works because the dummy definition of :func:`_` simply returns the string
596unchanged. And this dummy definition will temporarily override any definition
597of :func:`_` in the built-in namespace (until the :keyword:`del` command). Take
598care, though if you have a previous definition of :func:`_` in the local
599namespace.
600
601Note that the second use of :func:`_` will not identify "a" as being
Andrew Kuchling30c5ad22013-11-19 11:05:20 -0500602translatable to the :program:`gettext` program, because the parameter
603is not a string literal.
Georg Brandl116aa622007-08-15 14:28:22 +0000604
605Another way to handle this is with the following example::
606
607 def N_(message): return message
608
609 animals = [N_('mollusk'),
610 N_('albatross'),
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000611 N_('rat'),
612 N_('penguin'),
613 N_('python'), ]
Georg Brandl116aa622007-08-15 14:28:22 +0000614
615 # ...
616 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000617 print(_(a))
Georg Brandl116aa622007-08-15 14:28:22 +0000618
Andrew Kuchling30c5ad22013-11-19 11:05:20 -0500619In this case, you are marking translatable strings with the function
620:func:`N_`, which won't conflict with any definition of :func:`_`.
621However, you will need to teach your message extraction program to
622look for translatable strings marked with :func:`N_`. :program:`xgettext`,
623:program:`pygettext`, ``pybabel extract``, and :program:`xpot` all
Martin Panter5c679332016-10-30 04:20:17 +0000624support this through the use of the :option:`!-k` command-line switch.
Andrew Kuchling30c5ad22013-11-19 11:05:20 -0500625The choice of :func:`N_` here is totally arbitrary; it could have just
626as easily been :func:`MarkThisStringForTranslation`.
Georg Brandl116aa622007-08-15 14:28:22 +0000627
628
Georg Brandl116aa622007-08-15 14:28:22 +0000629Acknowledgements
630----------------
631
632The following people contributed code, feedback, design suggestions, previous
633implementations, and valuable experience to the creation of this module:
634
635* Peter Funk
636
637* James Henstridge
638
639* Juan David Ibáñez Palomar
640
641* Marc-André Lemburg
642
643* Martin von Löwis
644
645* François Pinard
646
647* Barry Warsaw
648
649* Gustavo Niemeyer
650
651.. rubric:: Footnotes
652
653.. [#] The default locale directory is system dependent; for example, on RedHat Linux
654 it is :file:`/usr/share/locale`, but on Solaris it is :file:`/usr/lib/locale`.
655 The :mod:`gettext` module does not try to support these system dependent
656 defaults; instead its default is :file:`sys.prefix/share/locale`. For this
657 reason, it is always best to call :func:`bindtextdomain` with an explicit
658 absolute path at the start of your application.
659
660.. [#] See the footnote for :func:`bindtextdomain` above.