blob: bc825cc8223c2da15b99a09f69f7d07de3d5c5c7 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`gettext` --- Multilingual internationalization services
2=============================================================
3
4.. module:: gettext
5 :synopsis: Multilingual internationalization services.
6.. moduleauthor:: Barry A. Warsaw <barry@zope.com>
7.. sectionauthor:: Barry A. Warsaw <barry@zope.com>
8
Raymond Hettinger469271d2011-01-27 20:38:46 +00009**Source code:** :source:`Lib/gettext.py`
10
11--------------
Georg Brandl116aa622007-08-15 14:28:22 +000012
13The :mod:`gettext` module provides internationalization (I18N) and localization
14(L10N) services for your Python modules and applications. It supports both the
15GNU ``gettext`` message catalog API and a higher level, class-based API that may
16be more appropriate for Python files. The interface described below allows you
17to write your module and application messages in one natural language, and
18provide a catalog of translated messages for running under different natural
19languages.
20
21Some hints on localizing your Python modules and applications are also given.
22
23
24GNU :program:`gettext` API
25--------------------------
26
27The :mod:`gettext` module defines the following API, which is very similar to
28the GNU :program:`gettext` API. If you use this API you will affect the
29translation of your entire application globally. Often this is what you want if
30your application is monolingual, with the choice of language dependent on the
31locale of your user. If you are localizing a Python module, or if your
32application needs to switch languages on the fly, you probably want to use the
33class-based API instead.
34
35
Georg Brandl036490d2009-05-17 13:00:36 +000036.. function:: bindtextdomain(domain, localedir=None)
Georg Brandl116aa622007-08-15 14:28:22 +000037
38 Bind the *domain* to the locale directory *localedir*. More concretely,
39 :mod:`gettext` will look for binary :file:`.mo` files for the given domain using
40 the path (on Unix): :file:`localedir/language/LC_MESSAGES/domain.mo`, where
41 *languages* is searched for in the environment variables :envvar:`LANGUAGE`,
42 :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and :envvar:`LANG` respectively.
43
44 If *localedir* is omitted or ``None``, then the current binding for *domain* is
45 returned. [#]_
46
47
Georg Brandl036490d2009-05-17 13:00:36 +000048.. function:: bind_textdomain_codeset(domain, codeset=None)
Georg Brandl116aa622007-08-15 14:28:22 +000049
50 Bind the *domain* to *codeset*, changing the encoding of strings returned by the
51 :func:`gettext` family of functions. If *codeset* is omitted, then the current
52 binding is returned.
53
Georg Brandl116aa622007-08-15 14:28:22 +000054
Georg Brandl036490d2009-05-17 13:00:36 +000055.. function:: textdomain(domain=None)
Georg Brandl116aa622007-08-15 14:28:22 +000056
57 Change or query the current global domain. If *domain* is ``None``, then the
58 current global domain is returned, otherwise the global domain is set to
59 *domain*, which is returned.
60
61
62.. function:: gettext(message)
63
64 Return the localized translation of *message*, based on the current global
65 domain, language, and locale directory. This function is usually aliased as
66 :func:`_` in the local namespace (see examples below).
67
68
69.. function:: lgettext(message)
70
Georg Brandlbded4d32008-07-17 18:15:35 +000071 Equivalent to :func:`gettext`, but the translation is returned in the
72 preferred system encoding, if no other encoding was explicitly set with
Georg Brandl116aa622007-08-15 14:28:22 +000073 :func:`bind_textdomain_codeset`.
74
Georg Brandl116aa622007-08-15 14:28:22 +000075
76.. function:: dgettext(domain, message)
77
78 Like :func:`gettext`, but look the message up in the specified *domain*.
79
80
81.. function:: ldgettext(domain, message)
82
Georg Brandlbded4d32008-07-17 18:15:35 +000083 Equivalent to :func:`dgettext`, but the translation is returned in the
84 preferred system encoding, if no other encoding was explicitly set with
Georg Brandl116aa622007-08-15 14:28:22 +000085 :func:`bind_textdomain_codeset`.
86
Georg Brandl116aa622007-08-15 14:28:22 +000087
88.. function:: ngettext(singular, plural, n)
89
90 Like :func:`gettext`, but consider plural forms. If a translation is found,
91 apply the plural formula to *n*, and return the resulting message (some
92 languages have more than two plural forms). If no translation is found, return
93 *singular* if *n* is 1; return *plural* otherwise.
94
95 The Plural formula is taken from the catalog header. It is a C or Python
96 expression that has a free variable *n*; the expression evaluates to the index
97 of the plural in the catalog. See the GNU gettext documentation for the precise
98 syntax to be used in :file:`.po` files and the formulas for a variety of
99 languages.
100
Georg Brandl116aa622007-08-15 14:28:22 +0000101
102.. function:: lngettext(singular, plural, n)
103
Georg Brandlbded4d32008-07-17 18:15:35 +0000104 Equivalent to :func:`ngettext`, but the translation is returned in the
105 preferred system encoding, if no other encoding was explicitly set with
Georg Brandl116aa622007-08-15 14:28:22 +0000106 :func:`bind_textdomain_codeset`.
107
Georg Brandl116aa622007-08-15 14:28:22 +0000108
109.. function:: dngettext(domain, singular, plural, n)
110
111 Like :func:`ngettext`, but look the message up in the specified *domain*.
112
Georg Brandl116aa622007-08-15 14:28:22 +0000113
114.. function:: ldngettext(domain, singular, plural, n)
115
116 Equivalent to :func:`dngettext`, but the translation is returned in the
117 preferred system encoding, if no other encoding was explicitly set with
118 :func:`bind_textdomain_codeset`.
119
Georg Brandl116aa622007-08-15 14:28:22 +0000120
121Note that GNU :program:`gettext` also defines a :func:`dcgettext` method, but
122this was deemed not useful and so it is currently unimplemented.
123
124Here's an example of typical usage for this API::
125
126 import gettext
127 gettext.bindtextdomain('myapplication', '/path/to/my/language/directory')
128 gettext.textdomain('myapplication')
129 _ = gettext.gettext
130 # ...
Georg Brandl6911e3c2007-09-04 07:15:32 +0000131 print(_('This is a translatable string.'))
Georg Brandl116aa622007-08-15 14:28:22 +0000132
133
134Class-based API
135---------------
136
137The class-based API of the :mod:`gettext` module gives you more flexibility and
138greater convenience than the GNU :program:`gettext` API. It is the recommended
139way of localizing your Python applications and modules. :mod:`gettext` defines
140a "translations" class which implements the parsing of GNU :file:`.mo` format
Georg Brandlf6945182008-02-01 11:56:49 +0000141files, and has methods for returning strings. Instances of this "translations"
142class can also install themselves in the built-in namespace as the function
143:func:`_`.
Georg Brandl116aa622007-08-15 14:28:22 +0000144
145
Georg Brandl036490d2009-05-17 13:00:36 +0000146.. function:: find(domain, localedir=None, languages=None, all=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000147
148 This function implements the standard :file:`.mo` file search algorithm. It
149 takes a *domain*, identical to what :func:`textdomain` takes. Optional
150 *localedir* is as in :func:`bindtextdomain` Optional *languages* is a list of
151 strings, where each string is a language code.
152
153 If *localedir* is not given, then the default system locale directory is used.
154 [#]_ If *languages* is not given, then the following environment variables are
155 searched: :envvar:`LANGUAGE`, :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and
156 :envvar:`LANG`. The first one returning a non-empty value is used for the
157 *languages* variable. The environment variables should contain a colon separated
158 list of languages, which will be split on the colon to produce the expected list
159 of language code strings.
160
161 :func:`find` then expands and normalizes the languages, and then iterates
162 through them, searching for an existing file built of these components:
163
Georg Brandl036490d2009-05-17 13:00:36 +0000164 :file:`{localedir}/{language}/LC_MESSAGES/{domain}.mo`
Georg Brandl116aa622007-08-15 14:28:22 +0000165
166 The first such file name that exists is returned by :func:`find`. If no such
167 file is found, then ``None`` is returned. If *all* is given, it returns a list
168 of all file names, in the order in which they appear in the languages list or
169 the environment variables.
170
171
Georg Brandl036490d2009-05-17 13:00:36 +0000172.. function:: translation(domain, localedir=None, languages=None, class_=None, fallback=False, codeset=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000173
Georg Brandlbded4d32008-07-17 18:15:35 +0000174 Return a :class:`Translations` instance based on the *domain*, *localedir*,
175 and *languages*, which are first passed to :func:`find` to get a list of the
Georg Brandl116aa622007-08-15 14:28:22 +0000176 associated :file:`.mo` file paths. Instances with identical :file:`.mo` file
Georg Brandlbded4d32008-07-17 18:15:35 +0000177 names are cached. The actual class instantiated is either *class_* if
178 provided, otherwise :class:`GNUTranslations`. The class's constructor must
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000179 take a single :term:`file object` argument. If provided, *codeset* will change
180 the charset used to encode translated strings in the :meth:`lgettext` and
Georg Brandlbded4d32008-07-17 18:15:35 +0000181 :meth:`lngettext` methods.
Georg Brandl116aa622007-08-15 14:28:22 +0000182
183 If multiple files are found, later files are used as fallbacks for earlier ones.
184 To allow setting the fallback, :func:`copy.copy` is used to clone each
185 translation object from the cache; the actual instance data is still shared with
186 the cache.
187
188 If no :file:`.mo` file is found, this function raises :exc:`IOError` if
189 *fallback* is false (which is the default), and returns a
190 :class:`NullTranslations` instance if *fallback* is true.
191
Georg Brandl116aa622007-08-15 14:28:22 +0000192
Georg Brandl036490d2009-05-17 13:00:36 +0000193.. function:: install(domain, localedir=None, codeset=None, names=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000194
Georg Brandl22b34312009-07-26 14:54:51 +0000195 This installs the function :func:`_` in Python's builtins namespace, based on
Georg Brandl116aa622007-08-15 14:28:22 +0000196 *domain*, *localedir*, and *codeset* which are passed to the function
Benjamin Peterson801844d2008-07-14 14:32:15 +0000197 :func:`translation`.
Georg Brandl116aa622007-08-15 14:28:22 +0000198
199 For the *names* parameter, please see the description of the translation
Alexandre Vassalotti6d3dfc32009-07-29 19:54:39 +0000200 object's :meth:`~NullTranslations.install` method.
Georg Brandl116aa622007-08-15 14:28:22 +0000201
202 As seen below, you usually mark the strings in your application that are
203 candidates for translation, by wrapping them in a call to the :func:`_`
204 function, like this::
205
Georg Brandl6911e3c2007-09-04 07:15:32 +0000206 print(_('This string will be translated.'))
Georg Brandl116aa622007-08-15 14:28:22 +0000207
208 For convenience, you want the :func:`_` function to be installed in Python's
Georg Brandl22b34312009-07-26 14:54:51 +0000209 builtins namespace, so it is easily accessible in all modules of your
Georg Brandl116aa622007-08-15 14:28:22 +0000210 application.
211
Georg Brandl116aa622007-08-15 14:28:22 +0000212
213The :class:`NullTranslations` class
214^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
215
216Translation classes are what actually implement the translation of original
217source file message strings to translated message strings. The base class used
218by all translation classes is :class:`NullTranslations`; this provides the basic
219interface you can use to write your own specialized translation classes. Here
220are the methods of :class:`NullTranslations`:
221
222
Georg Brandl036490d2009-05-17 13:00:36 +0000223.. class:: NullTranslations(fp=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000224
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000225 Takes an optional :term:`file object` *fp*, which is ignored by the base class.
Georg Brandl116aa622007-08-15 14:28:22 +0000226 Initializes "protected" instance variables *_info* and *_charset* which are set
227 by derived classes, as well as *_fallback*, which is set through
228 :meth:`add_fallback`. It then calls ``self._parse(fp)`` if *fp* is not
229 ``None``.
230
Georg Brandlbded4d32008-07-17 18:15:35 +0000231 .. method:: _parse(fp)
Georg Brandl116aa622007-08-15 14:28:22 +0000232
Georg Brandlbded4d32008-07-17 18:15:35 +0000233 No-op'd in the base class, this method takes file object *fp*, and reads
234 the data from the file, initializing its message catalog. If you have an
235 unsupported message catalog file format, you should override this method
236 to parse your format.
Georg Brandl116aa622007-08-15 14:28:22 +0000237
238
Georg Brandlbded4d32008-07-17 18:15:35 +0000239 .. method:: add_fallback(fallback)
Georg Brandl116aa622007-08-15 14:28:22 +0000240
Georg Brandlbded4d32008-07-17 18:15:35 +0000241 Add *fallback* as the fallback object for the current translation object.
242 A translation object should consult the fallback if it cannot provide a
243 translation for a given message.
Georg Brandl116aa622007-08-15 14:28:22 +0000244
245
Georg Brandlbded4d32008-07-17 18:15:35 +0000246 .. method:: gettext(message)
Georg Brandl116aa622007-08-15 14:28:22 +0000247
Georg Brandlbded4d32008-07-17 18:15:35 +0000248 If a fallback has been set, forward :meth:`gettext` to the fallback.
249 Otherwise, return the translated message. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000250
251
Georg Brandlbded4d32008-07-17 18:15:35 +0000252 .. method:: lgettext(message)
Georg Brandl116aa622007-08-15 14:28:22 +0000253
Georg Brandlbded4d32008-07-17 18:15:35 +0000254 If a fallback has been set, forward :meth:`lgettext` to the fallback.
255 Otherwise, return the translated message. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000256
Georg Brandl116aa622007-08-15 14:28:22 +0000257
Georg Brandlbded4d32008-07-17 18:15:35 +0000258 .. method:: ngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000259
Georg Brandlbded4d32008-07-17 18:15:35 +0000260 If a fallback has been set, forward :meth:`ngettext` to the fallback.
261 Otherwise, return the translated message. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000262
263
Georg Brandlbded4d32008-07-17 18:15:35 +0000264 .. method:: lngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000265
Georg Brandlbded4d32008-07-17 18:15:35 +0000266 If a fallback has been set, forward :meth:`ngettext` to the fallback.
267 Otherwise, return the translated message. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000268
Georg Brandl116aa622007-08-15 14:28:22 +0000269
Georg Brandlbded4d32008-07-17 18:15:35 +0000270 .. method:: info()
Georg Brandl116aa622007-08-15 14:28:22 +0000271
Georg Brandlbded4d32008-07-17 18:15:35 +0000272 Return the "protected" :attr:`_info` variable.
Georg Brandl116aa622007-08-15 14:28:22 +0000273
Georg Brandl116aa622007-08-15 14:28:22 +0000274
Georg Brandlbded4d32008-07-17 18:15:35 +0000275 .. method:: charset()
Georg Brandl116aa622007-08-15 14:28:22 +0000276
Georg Brandlbded4d32008-07-17 18:15:35 +0000277 Return the "protected" :attr:`_charset` variable, which is the encoding of
278 the message catalog file.
Georg Brandl116aa622007-08-15 14:28:22 +0000279
Georg Brandl116aa622007-08-15 14:28:22 +0000280
Georg Brandlbded4d32008-07-17 18:15:35 +0000281 .. method:: output_charset()
Georg Brandl116aa622007-08-15 14:28:22 +0000282
Georg Brandlbded4d32008-07-17 18:15:35 +0000283 Return the "protected" :attr:`_output_charset` variable, which defines the
284 encoding used to return translated messages in :meth:`lgettext` and
285 :meth:`lngettext`.
Georg Brandl116aa622007-08-15 14:28:22 +0000286
287
Georg Brandlbded4d32008-07-17 18:15:35 +0000288 .. method:: set_output_charset(charset)
Georg Brandl116aa622007-08-15 14:28:22 +0000289
Georg Brandlbded4d32008-07-17 18:15:35 +0000290 Change the "protected" :attr:`_output_charset` variable, which defines the
291 encoding used to return translated messages.
Georg Brandl116aa622007-08-15 14:28:22 +0000292
Benjamin Peterson801844d2008-07-14 14:32:15 +0000293
Georg Brandl036490d2009-05-17 13:00:36 +0000294 .. method:: install(names=None)
Benjamin Peterson801844d2008-07-14 14:32:15 +0000295
Georg Brandlbded4d32008-07-17 18:15:35 +0000296 This method installs :meth:`self.gettext` into the built-in namespace,
297 binding it to ``_``.
Benjamin Peterson801844d2008-07-14 14:32:15 +0000298
Georg Brandlbded4d32008-07-17 18:15:35 +0000299 If the *names* parameter is given, it must be a sequence containing the
Georg Brandl22b34312009-07-26 14:54:51 +0000300 names of functions you want to install in the builtins namespace in
Georg Brandlbded4d32008-07-17 18:15:35 +0000301 addition to :func:`_`. Supported names are ``'gettext'`` (bound to
302 :meth:`self.gettext`), ``'ngettext'`` (bound to :meth:`self.ngettext`),
303 ``'lgettext'`` and ``'lngettext'``.
Benjamin Peterson801844d2008-07-14 14:32:15 +0000304
Georg Brandlbded4d32008-07-17 18:15:35 +0000305 Note that this is only one way, albeit the most convenient way, to make
306 the :func:`_` function available to your application. Because it affects
307 the entire application globally, and specifically the built-in namespace,
308 localized modules should never install :func:`_`. Instead, they should use
309 this code to make :func:`_` available to their module::
Benjamin Peterson801844d2008-07-14 14:32:15 +0000310
Georg Brandlbded4d32008-07-17 18:15:35 +0000311 import gettext
312 t = gettext.translation('mymodule', ...)
313 _ = t.gettext
Benjamin Peterson801844d2008-07-14 14:32:15 +0000314
Georg Brandlbded4d32008-07-17 18:15:35 +0000315 This puts :func:`_` only in the module's global namespace and so only
316 affects calls within this module.
Georg Brandl116aa622007-08-15 14:28:22 +0000317
Georg Brandl116aa622007-08-15 14:28:22 +0000318
319The :class:`GNUTranslations` class
320^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
321
322The :mod:`gettext` module provides one additional class derived from
323:class:`NullTranslations`: :class:`GNUTranslations`. This class overrides
324:meth:`_parse` to enable reading GNU :program:`gettext` format :file:`.mo` files
Benjamin Peterson801844d2008-07-14 14:32:15 +0000325in both big-endian and little-endian format.
Georg Brandl116aa622007-08-15 14:28:22 +0000326
327:class:`GNUTranslations` parses optional meta-data out of the translation
328catalog. It is convention with GNU :program:`gettext` to include meta-data as
329the translation for the empty string. This meta-data is in :rfc:`822`\ -style
330``key: value`` pairs, and should contain the ``Project-Id-Version`` key. If the
331key ``Content-Type`` is found, then the ``charset`` property is used to
332initialize the "protected" :attr:`_charset` instance variable, defaulting to
333``None`` if not found. If the charset encoding is specified, then all message
334ids and message strings read from the catalog are converted to Unicode using
Georg Brandlbded4d32008-07-17 18:15:35 +0000335this encoding, else ASCII encoding is assumed.
336
337Since message ids are read as Unicode strings too, all :meth:`*gettext` methods
338will assume message ids as Unicode strings, not byte strings.
Georg Brandl116aa622007-08-15 14:28:22 +0000339
340The entire set of key/value pairs are placed into a dictionary and set as the
341"protected" :attr:`_info` instance variable.
342
343If the :file:`.mo` file's magic number is invalid, or if other problems occur
344while reading the file, instantiating a :class:`GNUTranslations` class can raise
345:exc:`IOError`.
346
347The following methods are overridden from the base class implementation:
348
349
350.. method:: GNUTranslations.gettext(message)
351
352 Look up the *message* id in the catalog and return the corresponding message
Georg Brandlbded4d32008-07-17 18:15:35 +0000353 string, as a Unicode string. If there is no entry in the catalog for the
354 *message* id, and a fallback has been set, the look up is forwarded to the
355 fallback's :meth:`gettext` method. Otherwise, the *message* id is returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000356
357
358.. method:: GNUTranslations.lgettext(message)
359
Georg Brandlbded4d32008-07-17 18:15:35 +0000360 Equivalent to :meth:`gettext`, but the translation is returned as a
361 bytestring encoded in the selected output charset, or in the preferred system
362 encoding if no encoding was explicitly set with :meth:`set_output_charset`.
Georg Brandl116aa622007-08-15 14:28:22 +0000363
Georg Brandl116aa622007-08-15 14:28:22 +0000364
Georg Brandl116aa622007-08-15 14:28:22 +0000365.. method:: GNUTranslations.ngettext(singular, plural, n)
366
367 Do a plural-forms lookup of a message id. *singular* is used as the message id
368 for purposes of lookup in the catalog, while *n* is used to determine which
Georg Brandlbded4d32008-07-17 18:15:35 +0000369 plural form to use. The returned message string is a Unicode string.
Georg Brandl116aa622007-08-15 14:28:22 +0000370
371 If the message id is not found in the catalog, and a fallback is specified, the
372 request is forwarded to the fallback's :meth:`ngettext` method. Otherwise, when
373 *n* is 1 *singular* is returned, and *plural* is returned in all other cases.
Georg Brandl48310cd2009-01-03 21:18:54 +0000374
Benjamin Peterson801844d2008-07-14 14:32:15 +0000375 Here is an example::
376
377 n = len(os.listdir('.'))
378 cat = GNUTranslations(somefile)
379 message = cat.ngettext(
380 'There is %(num)d file in this directory',
381 'There are %(num)d files in this directory',
382 n) % {'num': n}
Georg Brandl116aa622007-08-15 14:28:22 +0000383
Georg Brandl116aa622007-08-15 14:28:22 +0000384
385.. method:: GNUTranslations.lngettext(singular, plural, n)
386
Georg Brandlbded4d32008-07-17 18:15:35 +0000387 Equivalent to :meth:`gettext`, but the translation is returned as a
388 bytestring encoded in the selected output charset, or in the preferred system
389 encoding if no encoding was explicitly set with :meth:`set_output_charset`.
Georg Brandl116aa622007-08-15 14:28:22 +0000390
Georg Brandl116aa622007-08-15 14:28:22 +0000391
Georg Brandl116aa622007-08-15 14:28:22 +0000392Solaris message catalog support
393^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
394
395The Solaris operating system defines its own binary :file:`.mo` file format, but
396since no documentation can be found on this format, it is not supported at this
397time.
398
399
400The Catalog constructor
401^^^^^^^^^^^^^^^^^^^^^^^
402
403.. index:: single: GNOME
404
405GNOME uses a version of the :mod:`gettext` module by James Henstridge, but this
406version has a slightly different API. Its documented usage was::
407
408 import gettext
409 cat = gettext.Catalog(domain, localedir)
410 _ = cat.gettext
Georg Brandl6911e3c2007-09-04 07:15:32 +0000411 print(_('hello world'))
Georg Brandl116aa622007-08-15 14:28:22 +0000412
413For compatibility with this older module, the function :func:`Catalog` is an
414alias for the :func:`translation` function described above.
415
416One difference between this module and Henstridge's: his catalog objects
417supported access through a mapping API, but this appears to be unused and so is
418not currently supported.
419
420
421Internationalizing your programs and modules
422--------------------------------------------
423
424Internationalization (I18N) refers to the operation by which a program is made
425aware of multiple languages. Localization (L10N) refers to the adaptation of
426your program, once internationalized, to the local language and cultural habits.
427In order to provide multilingual messages for your Python programs, you need to
428take the following steps:
429
430#. prepare your program or module by specially marking translatable strings
431
432#. run a suite of tools over your marked files to generate raw messages catalogs
433
434#. create language specific translations of the message catalogs
435
436#. use the :mod:`gettext` module so that message strings are properly translated
437
438In order to prepare your code for I18N, you need to look at all the strings in
439your files. Any string that needs to be translated should be marked by wrapping
440it in ``_('...')`` --- that is, a call to the function :func:`_`. For example::
441
442 filename = 'mylog.txt'
443 message = _('writing a log message')
444 fp = open(filename, 'w')
445 fp.write(message)
446 fp.close()
447
448In this example, the string ``'writing a log message'`` is marked as a candidate
449for translation, while the strings ``'mylog.txt'`` and ``'w'`` are not.
450
451The Python distribution comes with two tools which help you generate the message
452catalogs once you've prepared your source code. These may or may not be
453available from a binary distribution, but they can be found in a source
454distribution, in the :file:`Tools/i18n` directory.
455
456The :program:`pygettext` [#]_ program scans all your Python source code looking
457for the strings you previously marked as translatable. It is similar to the GNU
458:program:`gettext` program except that it understands all the intricacies of
459Python source code, but knows nothing about C or C++ source code. You don't
460need GNU ``gettext`` unless you're also going to be translating C code (such as
461C extension modules).
462
463:program:`pygettext` generates textual Uniforum-style human readable message
464catalog :file:`.pot` files, essentially structured human readable files which
465contain every marked string in the source code, along with a placeholder for the
466translation strings. :program:`pygettext` is a command line script that supports
467a similar command line interface as :program:`xgettext`; for details on its use,
468run::
469
470 pygettext.py --help
471
472Copies of these :file:`.pot` files are then handed over to the individual human
473translators who write language-specific versions for every supported natural
474language. They send you back the filled in language-specific versions as a
475:file:`.po` file. Using the :program:`msgfmt.py` [#]_ program (in the
476:file:`Tools/i18n` directory), you take the :file:`.po` files from your
477translators and generate the machine-readable :file:`.mo` binary catalog files.
478The :file:`.mo` files are what the :mod:`gettext` module uses for the actual
479translation processing during run-time.
480
481How you use the :mod:`gettext` module in your code depends on whether you are
482internationalizing a single module or your entire application. The next two
483sections will discuss each case.
484
485
486Localizing your module
487^^^^^^^^^^^^^^^^^^^^^^
488
489If you are localizing your module, you must take care not to make global
490changes, e.g. to the built-in namespace. You should not use the GNU ``gettext``
491API but instead the class-based API.
492
493Let's say your module is called "spam" and the module's various natural language
494translation :file:`.mo` files reside in :file:`/usr/share/locale` in GNU
495:program:`gettext` format. Here's what you would put at the top of your
496module::
497
498 import gettext
499 t = gettext.translation('spam', '/usr/share/locale')
500 _ = t.lgettext
501
Georg Brandl116aa622007-08-15 14:28:22 +0000502
503Localizing your application
504^^^^^^^^^^^^^^^^^^^^^^^^^^^
505
506If you are localizing your application, you can install the :func:`_` function
507globally into the built-in namespace, usually in the main driver file of your
508application. This will let all your application-specific files just use
509``_('...')`` without having to explicitly install it in each file.
510
511In the simple case then, you need only add the following bit of code to the main
512driver file of your application::
513
514 import gettext
515 gettext.install('myapplication')
516
Benjamin Peterson801844d2008-07-14 14:32:15 +0000517If you need to set the locale directory, you can pass these into the
518:func:`install` function::
Georg Brandl116aa622007-08-15 14:28:22 +0000519
520 import gettext
Benjamin Peterson801844d2008-07-14 14:32:15 +0000521 gettext.install('myapplication', '/usr/share/locale')
Georg Brandl116aa622007-08-15 14:28:22 +0000522
523
524Changing languages on the fly
525^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
526
527If your program needs to support many languages at the same time, you may want
528to create multiple translation instances and then switch between them
529explicitly, like so::
530
531 import gettext
532
533 lang1 = gettext.translation('myapplication', languages=['en'])
534 lang2 = gettext.translation('myapplication', languages=['fr'])
535 lang3 = gettext.translation('myapplication', languages=['de'])
536
537 # start by using language1
538 lang1.install()
539
540 # ... time goes by, user selects language 2
541 lang2.install()
542
543 # ... more time goes by, user selects language 3
544 lang3.install()
545
546
547Deferred translations
548^^^^^^^^^^^^^^^^^^^^^
549
550In most coding situations, strings are translated where they are coded.
551Occasionally however, you need to mark strings for translation, but defer actual
552translation until later. A classic example is::
553
554 animals = ['mollusk',
555 'albatross',
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000556 'rat',
557 'penguin',
558 'python', ]
Georg Brandl116aa622007-08-15 14:28:22 +0000559 # ...
560 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000561 print(a)
Georg Brandl116aa622007-08-15 14:28:22 +0000562
563Here, you want to mark the strings in the ``animals`` list as being
564translatable, but you don't actually want to translate them until they are
565printed.
566
567Here is one way you can handle this situation::
568
569 def _(message): return message
570
571 animals = [_('mollusk'),
572 _('albatross'),
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000573 _('rat'),
574 _('penguin'),
575 _('python'), ]
Georg Brandl116aa622007-08-15 14:28:22 +0000576
577 del _
578
579 # ...
580 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000581 print(_(a))
Georg Brandl116aa622007-08-15 14:28:22 +0000582
583This works because the dummy definition of :func:`_` simply returns the string
584unchanged. And this dummy definition will temporarily override any definition
585of :func:`_` in the built-in namespace (until the :keyword:`del` command). Take
586care, though if you have a previous definition of :func:`_` in the local
587namespace.
588
589Note that the second use of :func:`_` will not identify "a" as being
590translatable to the :program:`pygettext` program, since it is not a string.
591
592Another way to handle this is with the following example::
593
594 def N_(message): return message
595
596 animals = [N_('mollusk'),
597 N_('albatross'),
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000598 N_('rat'),
599 N_('penguin'),
600 N_('python'), ]
Georg Brandl116aa622007-08-15 14:28:22 +0000601
602 # ...
603 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000604 print(_(a))
Georg Brandl116aa622007-08-15 14:28:22 +0000605
606In this case, you are marking translatable strings with the function :func:`N_`,
607[#]_ which won't conflict with any definition of :func:`_`. However, you will
608need to teach your message extraction program to look for translatable strings
609marked with :func:`N_`. :program:`pygettext` and :program:`xpot` both support
610this through the use of command line switches.
611
612
Georg Brandl116aa622007-08-15 14:28:22 +0000613Acknowledgements
614----------------
615
616The following people contributed code, feedback, design suggestions, previous
617implementations, and valuable experience to the creation of this module:
618
619* Peter Funk
620
621* James Henstridge
622
623* Juan David Ibáñez Palomar
624
625* Marc-André Lemburg
626
627* Martin von Löwis
628
629* François Pinard
630
631* Barry Warsaw
632
633* Gustavo Niemeyer
634
635.. rubric:: Footnotes
636
637.. [#] The default locale directory is system dependent; for example, on RedHat Linux
638 it is :file:`/usr/share/locale`, but on Solaris it is :file:`/usr/lib/locale`.
639 The :mod:`gettext` module does not try to support these system dependent
640 defaults; instead its default is :file:`sys.prefix/share/locale`. For this
641 reason, it is always best to call :func:`bindtextdomain` with an explicit
642 absolute path at the start of your application.
643
644.. [#] See the footnote for :func:`bindtextdomain` above.
645
646.. [#] François Pinard has written a program called :program:`xpot` which does a
647 similar job. It is available as part of his :program:`po-utils` package at http
648 ://po-utils.progiciels-bpi.ca/.
649
650.. [#] :program:`msgfmt.py` is binary compatible with GNU :program:`msgfmt` except that
651 it provides a simpler, all-Python implementation. With this and
652 :program:`pygettext.py`, you generally won't need to install the GNU
653 :program:`gettext` package to internationalize your Python applications.
654
655.. [#] The choice of :func:`N_` here is totally arbitrary; it could have just as easily
656 been :func:`MarkThisStringForTranslation`.
657