blob: 825311bfdc2baf8b4bbcdfd5c623657b7d394716 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001:mod:`gettext` --- Multilingual internationalization services
2=============================================================
3
4.. module:: gettext
5 :synopsis: Multilingual internationalization services.
6.. moduleauthor:: Barry A. Warsaw <barry@zope.com>
7.. sectionauthor:: Barry A. Warsaw <barry@zope.com>
8
Raymond Hettinger469271d2011-01-27 20:38:46 +00009**Source code:** :source:`Lib/gettext.py`
10
11--------------
Georg Brandl116aa622007-08-15 14:28:22 +000012
13The :mod:`gettext` module provides internationalization (I18N) and localization
14(L10N) services for your Python modules and applications. It supports both the
15GNU ``gettext`` message catalog API and a higher level, class-based API that may
16be more appropriate for Python files. The interface described below allows you
17to write your module and application messages in one natural language, and
18provide a catalog of translated messages for running under different natural
19languages.
20
21Some hints on localizing your Python modules and applications are also given.
22
23
24GNU :program:`gettext` API
25--------------------------
26
27The :mod:`gettext` module defines the following API, which is very similar to
28the GNU :program:`gettext` API. If you use this API you will affect the
29translation of your entire application globally. Often this is what you want if
30your application is monolingual, with the choice of language dependent on the
31locale of your user. If you are localizing a Python module, or if your
32application needs to switch languages on the fly, you probably want to use the
33class-based API instead.
34
35
Georg Brandl036490d2009-05-17 13:00:36 +000036.. function:: bindtextdomain(domain, localedir=None)
Georg Brandl116aa622007-08-15 14:28:22 +000037
38 Bind the *domain* to the locale directory *localedir*. More concretely,
39 :mod:`gettext` will look for binary :file:`.mo` files for the given domain using
40 the path (on Unix): :file:`localedir/language/LC_MESSAGES/domain.mo`, where
41 *languages* is searched for in the environment variables :envvar:`LANGUAGE`,
42 :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and :envvar:`LANG` respectively.
43
44 If *localedir* is omitted or ``None``, then the current binding for *domain* is
45 returned. [#]_
46
47
Georg Brandl036490d2009-05-17 13:00:36 +000048.. function:: bind_textdomain_codeset(domain, codeset=None)
Georg Brandl116aa622007-08-15 14:28:22 +000049
50 Bind the *domain* to *codeset*, changing the encoding of strings returned by the
51 :func:`gettext` family of functions. If *codeset* is omitted, then the current
52 binding is returned.
53
Georg Brandl116aa622007-08-15 14:28:22 +000054
Georg Brandl036490d2009-05-17 13:00:36 +000055.. function:: textdomain(domain=None)
Georg Brandl116aa622007-08-15 14:28:22 +000056
57 Change or query the current global domain. If *domain* is ``None``, then the
58 current global domain is returned, otherwise the global domain is set to
59 *domain*, which is returned.
60
61
62.. function:: gettext(message)
63
64 Return the localized translation of *message*, based on the current global
65 domain, language, and locale directory. This function is usually aliased as
66 :func:`_` in the local namespace (see examples below).
67
68
69.. function:: lgettext(message)
70
Georg Brandlbded4d32008-07-17 18:15:35 +000071 Equivalent to :func:`gettext`, but the translation is returned in the
72 preferred system encoding, if no other encoding was explicitly set with
Georg Brandl116aa622007-08-15 14:28:22 +000073 :func:`bind_textdomain_codeset`.
74
Georg Brandl116aa622007-08-15 14:28:22 +000075
76.. function:: dgettext(domain, message)
77
78 Like :func:`gettext`, but look the message up in the specified *domain*.
79
80
81.. function:: ldgettext(domain, message)
82
Georg Brandlbded4d32008-07-17 18:15:35 +000083 Equivalent to :func:`dgettext`, but the translation is returned in the
84 preferred system encoding, if no other encoding was explicitly set with
Georg Brandl116aa622007-08-15 14:28:22 +000085 :func:`bind_textdomain_codeset`.
86
Georg Brandl116aa622007-08-15 14:28:22 +000087
88.. function:: ngettext(singular, plural, n)
89
90 Like :func:`gettext`, but consider plural forms. If a translation is found,
91 apply the plural formula to *n*, and return the resulting message (some
92 languages have more than two plural forms). If no translation is found, return
93 *singular* if *n* is 1; return *plural* otherwise.
94
95 The Plural formula is taken from the catalog header. It is a C or Python
96 expression that has a free variable *n*; the expression evaluates to the index
97 of the plural in the catalog. See the GNU gettext documentation for the precise
98 syntax to be used in :file:`.po` files and the formulas for a variety of
99 languages.
100
Georg Brandl116aa622007-08-15 14:28:22 +0000101
102.. function:: lngettext(singular, plural, n)
103
Georg Brandlbded4d32008-07-17 18:15:35 +0000104 Equivalent to :func:`ngettext`, but the translation is returned in the
105 preferred system encoding, if no other encoding was explicitly set with
Georg Brandl116aa622007-08-15 14:28:22 +0000106 :func:`bind_textdomain_codeset`.
107
Georg Brandl116aa622007-08-15 14:28:22 +0000108
109.. function:: dngettext(domain, singular, plural, n)
110
111 Like :func:`ngettext`, but look the message up in the specified *domain*.
112
Georg Brandl116aa622007-08-15 14:28:22 +0000113
114.. function:: ldngettext(domain, singular, plural, n)
115
116 Equivalent to :func:`dngettext`, but the translation is returned in the
117 preferred system encoding, if no other encoding was explicitly set with
118 :func:`bind_textdomain_codeset`.
119
Georg Brandl116aa622007-08-15 14:28:22 +0000120
121Note that GNU :program:`gettext` also defines a :func:`dcgettext` method, but
122this was deemed not useful and so it is currently unimplemented.
123
124Here's an example of typical usage for this API::
125
126 import gettext
127 gettext.bindtextdomain('myapplication', '/path/to/my/language/directory')
128 gettext.textdomain('myapplication')
129 _ = gettext.gettext
130 # ...
Georg Brandl6911e3c2007-09-04 07:15:32 +0000131 print(_('This is a translatable string.'))
Georg Brandl116aa622007-08-15 14:28:22 +0000132
133
134Class-based API
135---------------
136
137The class-based API of the :mod:`gettext` module gives you more flexibility and
138greater convenience than the GNU :program:`gettext` API. It is the recommended
139way of localizing your Python applications and modules. :mod:`gettext` defines
140a "translations" class which implements the parsing of GNU :file:`.mo` format
Georg Brandlf6945182008-02-01 11:56:49 +0000141files, and has methods for returning strings. Instances of this "translations"
142class can also install themselves in the built-in namespace as the function
143:func:`_`.
Georg Brandl116aa622007-08-15 14:28:22 +0000144
145
Georg Brandl036490d2009-05-17 13:00:36 +0000146.. function:: find(domain, localedir=None, languages=None, all=False)
Georg Brandl116aa622007-08-15 14:28:22 +0000147
148 This function implements the standard :file:`.mo` file search algorithm. It
149 takes a *domain*, identical to what :func:`textdomain` takes. Optional
150 *localedir* is as in :func:`bindtextdomain` Optional *languages* is a list of
151 strings, where each string is a language code.
152
153 If *localedir* is not given, then the default system locale directory is used.
154 [#]_ If *languages* is not given, then the following environment variables are
155 searched: :envvar:`LANGUAGE`, :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and
156 :envvar:`LANG`. The first one returning a non-empty value is used for the
157 *languages* variable. The environment variables should contain a colon separated
158 list of languages, which will be split on the colon to produce the expected list
159 of language code strings.
160
161 :func:`find` then expands and normalizes the languages, and then iterates
162 through them, searching for an existing file built of these components:
163
Georg Brandl036490d2009-05-17 13:00:36 +0000164 :file:`{localedir}/{language}/LC_MESSAGES/{domain}.mo`
Georg Brandl116aa622007-08-15 14:28:22 +0000165
166 The first such file name that exists is returned by :func:`find`. If no such
167 file is found, then ``None`` is returned. If *all* is given, it returns a list
168 of all file names, in the order in which they appear in the languages list or
169 the environment variables.
170
171
Georg Brandl036490d2009-05-17 13:00:36 +0000172.. function:: translation(domain, localedir=None, languages=None, class_=None, fallback=False, codeset=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000173
Georg Brandlbded4d32008-07-17 18:15:35 +0000174 Return a :class:`Translations` instance based on the *domain*, *localedir*,
175 and *languages*, which are first passed to :func:`find` to get a list of the
Georg Brandl116aa622007-08-15 14:28:22 +0000176 associated :file:`.mo` file paths. Instances with identical :file:`.mo` file
Georg Brandlbded4d32008-07-17 18:15:35 +0000177 names are cached. The actual class instantiated is either *class_* if
178 provided, otherwise :class:`GNUTranslations`. The class's constructor must
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000179 take a single :term:`file object` argument. If provided, *codeset* will change
180 the charset used to encode translated strings in the :meth:`lgettext` and
Georg Brandlbded4d32008-07-17 18:15:35 +0000181 :meth:`lngettext` methods.
Georg Brandl116aa622007-08-15 14:28:22 +0000182
183 If multiple files are found, later files are used as fallbacks for earlier ones.
184 To allow setting the fallback, :func:`copy.copy` is used to clone each
185 translation object from the cache; the actual instance data is still shared with
186 the cache.
187
Antoine Pitrou62ab10a02011-10-12 20:10:51 +0200188 If no :file:`.mo` file is found, this function raises :exc:`OSError` if
Georg Brandl116aa622007-08-15 14:28:22 +0000189 *fallback* is false (which is the default), and returns a
190 :class:`NullTranslations` instance if *fallback* is true.
191
Antoine Pitrou62ab10a02011-10-12 20:10:51 +0200192 .. versionchanged:: 3.3
193 :exc:`IOError` used to be raised instead of :exc:`OSError`.
194
Georg Brandl116aa622007-08-15 14:28:22 +0000195
Georg Brandl036490d2009-05-17 13:00:36 +0000196.. function:: install(domain, localedir=None, codeset=None, names=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000197
Georg Brandl22b34312009-07-26 14:54:51 +0000198 This installs the function :func:`_` in Python's builtins namespace, based on
Georg Brandl116aa622007-08-15 14:28:22 +0000199 *domain*, *localedir*, and *codeset* which are passed to the function
Benjamin Peterson801844d2008-07-14 14:32:15 +0000200 :func:`translation`.
Georg Brandl116aa622007-08-15 14:28:22 +0000201
202 For the *names* parameter, please see the description of the translation
Alexandre Vassalotti6d3dfc32009-07-29 19:54:39 +0000203 object's :meth:`~NullTranslations.install` method.
Georg Brandl116aa622007-08-15 14:28:22 +0000204
205 As seen below, you usually mark the strings in your application that are
206 candidates for translation, by wrapping them in a call to the :func:`_`
207 function, like this::
208
Georg Brandl6911e3c2007-09-04 07:15:32 +0000209 print(_('This string will be translated.'))
Georg Brandl116aa622007-08-15 14:28:22 +0000210
211 For convenience, you want the :func:`_` function to be installed in Python's
Georg Brandl22b34312009-07-26 14:54:51 +0000212 builtins namespace, so it is easily accessible in all modules of your
Georg Brandl116aa622007-08-15 14:28:22 +0000213 application.
214
Georg Brandl116aa622007-08-15 14:28:22 +0000215
216The :class:`NullTranslations` class
217^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
218
219Translation classes are what actually implement the translation of original
220source file message strings to translated message strings. The base class used
221by all translation classes is :class:`NullTranslations`; this provides the basic
222interface you can use to write your own specialized translation classes. Here
223are the methods of :class:`NullTranslations`:
224
225
Georg Brandl036490d2009-05-17 13:00:36 +0000226.. class:: NullTranslations(fp=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000227
Antoine Pitrou11cb9612010-09-15 11:11:28 +0000228 Takes an optional :term:`file object` *fp*, which is ignored by the base class.
Georg Brandl116aa622007-08-15 14:28:22 +0000229 Initializes "protected" instance variables *_info* and *_charset* which are set
230 by derived classes, as well as *_fallback*, which is set through
231 :meth:`add_fallback`. It then calls ``self._parse(fp)`` if *fp* is not
232 ``None``.
233
Georg Brandlbded4d32008-07-17 18:15:35 +0000234 .. method:: _parse(fp)
Georg Brandl116aa622007-08-15 14:28:22 +0000235
Georg Brandlbded4d32008-07-17 18:15:35 +0000236 No-op'd in the base class, this method takes file object *fp*, and reads
237 the data from the file, initializing its message catalog. If you have an
238 unsupported message catalog file format, you should override this method
239 to parse your format.
Georg Brandl116aa622007-08-15 14:28:22 +0000240
241
Georg Brandlbded4d32008-07-17 18:15:35 +0000242 .. method:: add_fallback(fallback)
Georg Brandl116aa622007-08-15 14:28:22 +0000243
Georg Brandlbded4d32008-07-17 18:15:35 +0000244 Add *fallback* as the fallback object for the current translation object.
245 A translation object should consult the fallback if it cannot provide a
246 translation for a given message.
Georg Brandl116aa622007-08-15 14:28:22 +0000247
248
Georg Brandlbded4d32008-07-17 18:15:35 +0000249 .. method:: gettext(message)
Georg Brandl116aa622007-08-15 14:28:22 +0000250
Georg Brandlbded4d32008-07-17 18:15:35 +0000251 If a fallback has been set, forward :meth:`gettext` to the fallback.
252 Otherwise, return the translated message. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000253
254
Georg Brandlbded4d32008-07-17 18:15:35 +0000255 .. method:: lgettext(message)
Georg Brandl116aa622007-08-15 14:28:22 +0000256
Georg Brandlbded4d32008-07-17 18:15:35 +0000257 If a fallback has been set, forward :meth:`lgettext` to the fallback.
258 Otherwise, return the translated message. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000259
Georg Brandl116aa622007-08-15 14:28:22 +0000260
Georg Brandlbded4d32008-07-17 18:15:35 +0000261 .. method:: ngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000262
Georg Brandlbded4d32008-07-17 18:15:35 +0000263 If a fallback has been set, forward :meth:`ngettext` to the fallback.
264 Otherwise, return the translated message. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000265
266
Georg Brandlbded4d32008-07-17 18:15:35 +0000267 .. method:: lngettext(singular, plural, n)
Georg Brandl116aa622007-08-15 14:28:22 +0000268
Éric Araujo35a502b2011-10-07 22:02:58 +0200269 If a fallback has been set, forward :meth:`lngettext` to the fallback.
Georg Brandlbded4d32008-07-17 18:15:35 +0000270 Otherwise, return the translated message. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000271
Georg Brandl116aa622007-08-15 14:28:22 +0000272
Georg Brandlbded4d32008-07-17 18:15:35 +0000273 .. method:: info()
Georg Brandl116aa622007-08-15 14:28:22 +0000274
Georg Brandlbded4d32008-07-17 18:15:35 +0000275 Return the "protected" :attr:`_info` variable.
Georg Brandl116aa622007-08-15 14:28:22 +0000276
Georg Brandl116aa622007-08-15 14:28:22 +0000277
Georg Brandlbded4d32008-07-17 18:15:35 +0000278 .. method:: charset()
Georg Brandl116aa622007-08-15 14:28:22 +0000279
Georg Brandlbded4d32008-07-17 18:15:35 +0000280 Return the "protected" :attr:`_charset` variable, which is the encoding of
281 the message catalog file.
Georg Brandl116aa622007-08-15 14:28:22 +0000282
Georg Brandl116aa622007-08-15 14:28:22 +0000283
Georg Brandlbded4d32008-07-17 18:15:35 +0000284 .. method:: output_charset()
Georg Brandl116aa622007-08-15 14:28:22 +0000285
Georg Brandlbded4d32008-07-17 18:15:35 +0000286 Return the "protected" :attr:`_output_charset` variable, which defines the
287 encoding used to return translated messages in :meth:`lgettext` and
288 :meth:`lngettext`.
Georg Brandl116aa622007-08-15 14:28:22 +0000289
290
Georg Brandlbded4d32008-07-17 18:15:35 +0000291 .. method:: set_output_charset(charset)
Georg Brandl116aa622007-08-15 14:28:22 +0000292
Georg Brandlbded4d32008-07-17 18:15:35 +0000293 Change the "protected" :attr:`_output_charset` variable, which defines the
294 encoding used to return translated messages.
Georg Brandl116aa622007-08-15 14:28:22 +0000295
Benjamin Peterson801844d2008-07-14 14:32:15 +0000296
Georg Brandl036490d2009-05-17 13:00:36 +0000297 .. method:: install(names=None)
Benjamin Peterson801844d2008-07-14 14:32:15 +0000298
Georg Brandlbded4d32008-07-17 18:15:35 +0000299 This method installs :meth:`self.gettext` into the built-in namespace,
300 binding it to ``_``.
Benjamin Peterson801844d2008-07-14 14:32:15 +0000301
Georg Brandlbded4d32008-07-17 18:15:35 +0000302 If the *names* parameter is given, it must be a sequence containing the
Georg Brandl22b34312009-07-26 14:54:51 +0000303 names of functions you want to install in the builtins namespace in
Georg Brandlbded4d32008-07-17 18:15:35 +0000304 addition to :func:`_`. Supported names are ``'gettext'`` (bound to
305 :meth:`self.gettext`), ``'ngettext'`` (bound to :meth:`self.ngettext`),
306 ``'lgettext'`` and ``'lngettext'``.
Benjamin Peterson801844d2008-07-14 14:32:15 +0000307
Georg Brandlbded4d32008-07-17 18:15:35 +0000308 Note that this is only one way, albeit the most convenient way, to make
309 the :func:`_` function available to your application. Because it affects
310 the entire application globally, and specifically the built-in namespace,
311 localized modules should never install :func:`_`. Instead, they should use
312 this code to make :func:`_` available to their module::
Benjamin Peterson801844d2008-07-14 14:32:15 +0000313
Georg Brandlbded4d32008-07-17 18:15:35 +0000314 import gettext
315 t = gettext.translation('mymodule', ...)
316 _ = t.gettext
Benjamin Peterson801844d2008-07-14 14:32:15 +0000317
Georg Brandlbded4d32008-07-17 18:15:35 +0000318 This puts :func:`_` only in the module's global namespace and so only
319 affects calls within this module.
Georg Brandl116aa622007-08-15 14:28:22 +0000320
Georg Brandl116aa622007-08-15 14:28:22 +0000321
322The :class:`GNUTranslations` class
323^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
324
325The :mod:`gettext` module provides one additional class derived from
326:class:`NullTranslations`: :class:`GNUTranslations`. This class overrides
327:meth:`_parse` to enable reading GNU :program:`gettext` format :file:`.mo` files
Benjamin Peterson801844d2008-07-14 14:32:15 +0000328in both big-endian and little-endian format.
Georg Brandl116aa622007-08-15 14:28:22 +0000329
330:class:`GNUTranslations` parses optional meta-data out of the translation
331catalog. It is convention with GNU :program:`gettext` to include meta-data as
332the translation for the empty string. This meta-data is in :rfc:`822`\ -style
333``key: value`` pairs, and should contain the ``Project-Id-Version`` key. If the
334key ``Content-Type`` is found, then the ``charset`` property is used to
335initialize the "protected" :attr:`_charset` instance variable, defaulting to
336``None`` if not found. If the charset encoding is specified, then all message
337ids and message strings read from the catalog are converted to Unicode using
Georg Brandlbded4d32008-07-17 18:15:35 +0000338this encoding, else ASCII encoding is assumed.
339
340Since message ids are read as Unicode strings too, all :meth:`*gettext` methods
341will assume message ids as Unicode strings, not byte strings.
Georg Brandl116aa622007-08-15 14:28:22 +0000342
343The entire set of key/value pairs are placed into a dictionary and set as the
344"protected" :attr:`_info` instance variable.
345
346If the :file:`.mo` file's magic number is invalid, or if other problems occur
347while reading the file, instantiating a :class:`GNUTranslations` class can raise
Antoine Pitrou62ab10a02011-10-12 20:10:51 +0200348:exc:`OSError`.
Georg Brandl116aa622007-08-15 14:28:22 +0000349
350The following methods are overridden from the base class implementation:
351
352
353.. method:: GNUTranslations.gettext(message)
354
355 Look up the *message* id in the catalog and return the corresponding message
Georg Brandlbded4d32008-07-17 18:15:35 +0000356 string, as a Unicode string. If there is no entry in the catalog for the
357 *message* id, and a fallback has been set, the look up is forwarded to the
358 fallback's :meth:`gettext` method. Otherwise, the *message* id is returned.
Georg Brandl116aa622007-08-15 14:28:22 +0000359
360
361.. method:: GNUTranslations.lgettext(message)
362
Georg Brandlbded4d32008-07-17 18:15:35 +0000363 Equivalent to :meth:`gettext`, but the translation is returned as a
364 bytestring encoded in the selected output charset, or in the preferred system
365 encoding if no encoding was explicitly set with :meth:`set_output_charset`.
Georg Brandl116aa622007-08-15 14:28:22 +0000366
Georg Brandl116aa622007-08-15 14:28:22 +0000367
Georg Brandl116aa622007-08-15 14:28:22 +0000368.. method:: GNUTranslations.ngettext(singular, plural, n)
369
370 Do a plural-forms lookup of a message id. *singular* is used as the message id
371 for purposes of lookup in the catalog, while *n* is used to determine which
Georg Brandlbded4d32008-07-17 18:15:35 +0000372 plural form to use. The returned message string is a Unicode string.
Georg Brandl116aa622007-08-15 14:28:22 +0000373
374 If the message id is not found in the catalog, and a fallback is specified, the
375 request is forwarded to the fallback's :meth:`ngettext` method. Otherwise, when
376 *n* is 1 *singular* is returned, and *plural* is returned in all other cases.
Georg Brandl48310cd2009-01-03 21:18:54 +0000377
Benjamin Peterson801844d2008-07-14 14:32:15 +0000378 Here is an example::
379
380 n = len(os.listdir('.'))
381 cat = GNUTranslations(somefile)
382 message = cat.ngettext(
383 'There is %(num)d file in this directory',
384 'There are %(num)d files in this directory',
385 n) % {'num': n}
Georg Brandl116aa622007-08-15 14:28:22 +0000386
Georg Brandl116aa622007-08-15 14:28:22 +0000387
388.. method:: GNUTranslations.lngettext(singular, plural, n)
389
Georg Brandlbded4d32008-07-17 18:15:35 +0000390 Equivalent to :meth:`gettext`, but the translation is returned as a
391 bytestring encoded in the selected output charset, or in the preferred system
392 encoding if no encoding was explicitly set with :meth:`set_output_charset`.
Georg Brandl116aa622007-08-15 14:28:22 +0000393
Georg Brandl116aa622007-08-15 14:28:22 +0000394
Georg Brandl116aa622007-08-15 14:28:22 +0000395Solaris message catalog support
396^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
397
398The Solaris operating system defines its own binary :file:`.mo` file format, but
399since no documentation can be found on this format, it is not supported at this
400time.
401
402
403The Catalog constructor
404^^^^^^^^^^^^^^^^^^^^^^^
405
406.. index:: single: GNOME
407
408GNOME uses a version of the :mod:`gettext` module by James Henstridge, but this
409version has a slightly different API. Its documented usage was::
410
411 import gettext
412 cat = gettext.Catalog(domain, localedir)
413 _ = cat.gettext
Georg Brandl6911e3c2007-09-04 07:15:32 +0000414 print(_('hello world'))
Georg Brandl116aa622007-08-15 14:28:22 +0000415
416For compatibility with this older module, the function :func:`Catalog` is an
417alias for the :func:`translation` function described above.
418
419One difference between this module and Henstridge's: his catalog objects
420supported access through a mapping API, but this appears to be unused and so is
421not currently supported.
422
423
424Internationalizing your programs and modules
425--------------------------------------------
426
427Internationalization (I18N) refers to the operation by which a program is made
428aware of multiple languages. Localization (L10N) refers to the adaptation of
429your program, once internationalized, to the local language and cultural habits.
430In order to provide multilingual messages for your Python programs, you need to
431take the following steps:
432
433#. prepare your program or module by specially marking translatable strings
434
435#. run a suite of tools over your marked files to generate raw messages catalogs
436
437#. create language specific translations of the message catalogs
438
439#. use the :mod:`gettext` module so that message strings are properly translated
440
441In order to prepare your code for I18N, you need to look at all the strings in
442your files. Any string that needs to be translated should be marked by wrapping
443it in ``_('...')`` --- that is, a call to the function :func:`_`. For example::
444
445 filename = 'mylog.txt'
446 message = _('writing a log message')
447 fp = open(filename, 'w')
448 fp.write(message)
449 fp.close()
450
451In this example, the string ``'writing a log message'`` is marked as a candidate
452for translation, while the strings ``'mylog.txt'`` and ``'w'`` are not.
453
454The Python distribution comes with two tools which help you generate the message
455catalogs once you've prepared your source code. These may or may not be
456available from a binary distribution, but they can be found in a source
457distribution, in the :file:`Tools/i18n` directory.
458
459The :program:`pygettext` [#]_ program scans all your Python source code looking
460for the strings you previously marked as translatable. It is similar to the GNU
461:program:`gettext` program except that it understands all the intricacies of
462Python source code, but knows nothing about C or C++ source code. You don't
463need GNU ``gettext`` unless you're also going to be translating C code (such as
464C extension modules).
465
466:program:`pygettext` generates textual Uniforum-style human readable message
467catalog :file:`.pot` files, essentially structured human readable files which
468contain every marked string in the source code, along with a placeholder for the
469translation strings. :program:`pygettext` is a command line script that supports
470a similar command line interface as :program:`xgettext`; for details on its use,
471run::
472
473 pygettext.py --help
474
475Copies of these :file:`.pot` files are then handed over to the individual human
476translators who write language-specific versions for every supported natural
477language. They send you back the filled in language-specific versions as a
478:file:`.po` file. Using the :program:`msgfmt.py` [#]_ program (in the
479:file:`Tools/i18n` directory), you take the :file:`.po` files from your
480translators and generate the machine-readable :file:`.mo` binary catalog files.
481The :file:`.mo` files are what the :mod:`gettext` module uses for the actual
482translation processing during run-time.
483
484How you use the :mod:`gettext` module in your code depends on whether you are
485internationalizing a single module or your entire application. The next two
486sections will discuss each case.
487
488
489Localizing your module
490^^^^^^^^^^^^^^^^^^^^^^
491
492If you are localizing your module, you must take care not to make global
493changes, e.g. to the built-in namespace. You should not use the GNU ``gettext``
494API but instead the class-based API.
495
496Let's say your module is called "spam" and the module's various natural language
497translation :file:`.mo` files reside in :file:`/usr/share/locale` in GNU
498:program:`gettext` format. Here's what you would put at the top of your
499module::
500
501 import gettext
502 t = gettext.translation('spam', '/usr/share/locale')
503 _ = t.lgettext
504
Georg Brandl116aa622007-08-15 14:28:22 +0000505
506Localizing your application
507^^^^^^^^^^^^^^^^^^^^^^^^^^^
508
509If you are localizing your application, you can install the :func:`_` function
510globally into the built-in namespace, usually in the main driver file of your
511application. This will let all your application-specific files just use
512``_('...')`` without having to explicitly install it in each file.
513
514In the simple case then, you need only add the following bit of code to the main
515driver file of your application::
516
517 import gettext
518 gettext.install('myapplication')
519
Benjamin Peterson801844d2008-07-14 14:32:15 +0000520If you need to set the locale directory, you can pass these into the
521:func:`install` function::
Georg Brandl116aa622007-08-15 14:28:22 +0000522
523 import gettext
Benjamin Peterson801844d2008-07-14 14:32:15 +0000524 gettext.install('myapplication', '/usr/share/locale')
Georg Brandl116aa622007-08-15 14:28:22 +0000525
526
527Changing languages on the fly
528^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
529
530If your program needs to support many languages at the same time, you may want
531to create multiple translation instances and then switch between them
532explicitly, like so::
533
534 import gettext
535
536 lang1 = gettext.translation('myapplication', languages=['en'])
537 lang2 = gettext.translation('myapplication', languages=['fr'])
538 lang3 = gettext.translation('myapplication', languages=['de'])
539
540 # start by using language1
541 lang1.install()
542
543 # ... time goes by, user selects language 2
544 lang2.install()
545
546 # ... more time goes by, user selects language 3
547 lang3.install()
548
549
550Deferred translations
551^^^^^^^^^^^^^^^^^^^^^
552
553In most coding situations, strings are translated where they are coded.
554Occasionally however, you need to mark strings for translation, but defer actual
555translation until later. A classic example is::
556
557 animals = ['mollusk',
558 'albatross',
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000559 'rat',
560 'penguin',
561 'python', ]
Georg Brandl116aa622007-08-15 14:28:22 +0000562 # ...
563 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000564 print(a)
Georg Brandl116aa622007-08-15 14:28:22 +0000565
566Here, you want to mark the strings in the ``animals`` list as being
567translatable, but you don't actually want to translate them until they are
568printed.
569
570Here is one way you can handle this situation::
571
572 def _(message): return message
573
574 animals = [_('mollusk'),
575 _('albatross'),
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000576 _('rat'),
577 _('penguin'),
578 _('python'), ]
Georg Brandl116aa622007-08-15 14:28:22 +0000579
580 del _
581
582 # ...
583 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000584 print(_(a))
Georg Brandl116aa622007-08-15 14:28:22 +0000585
586This works because the dummy definition of :func:`_` simply returns the string
587unchanged. And this dummy definition will temporarily override any definition
588of :func:`_` in the built-in namespace (until the :keyword:`del` command). Take
589care, though if you have a previous definition of :func:`_` in the local
590namespace.
591
592Note that the second use of :func:`_` will not identify "a" as being
593translatable to the :program:`pygettext` program, since it is not a string.
594
595Another way to handle this is with the following example::
596
597 def N_(message): return message
598
599 animals = [N_('mollusk'),
600 N_('albatross'),
Georg Brandla1c6a1c2009-01-03 21:26:05 +0000601 N_('rat'),
602 N_('penguin'),
603 N_('python'), ]
Georg Brandl116aa622007-08-15 14:28:22 +0000604
605 # ...
606 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000607 print(_(a))
Georg Brandl116aa622007-08-15 14:28:22 +0000608
609In this case, you are marking translatable strings with the function :func:`N_`,
610[#]_ which won't conflict with any definition of :func:`_`. However, you will
611need to teach your message extraction program to look for translatable strings
612marked with :func:`N_`. :program:`pygettext` and :program:`xpot` both support
613this through the use of command line switches.
614
615
Georg Brandl116aa622007-08-15 14:28:22 +0000616Acknowledgements
617----------------
618
619The following people contributed code, feedback, design suggestions, previous
620implementations, and valuable experience to the creation of this module:
621
622* Peter Funk
623
624* James Henstridge
625
626* Juan David Ibáñez Palomar
627
628* Marc-André Lemburg
629
630* Martin von Löwis
631
632* François Pinard
633
634* Barry Warsaw
635
636* Gustavo Niemeyer
637
638.. rubric:: Footnotes
639
640.. [#] The default locale directory is system dependent; for example, on RedHat Linux
641 it is :file:`/usr/share/locale`, but on Solaris it is :file:`/usr/lib/locale`.
642 The :mod:`gettext` module does not try to support these system dependent
643 defaults; instead its default is :file:`sys.prefix/share/locale`. For this
644 reason, it is always best to call :func:`bindtextdomain` with an explicit
645 absolute path at the start of your application.
646
647.. [#] See the footnote for :func:`bindtextdomain` above.
648
649.. [#] François Pinard has written a program called :program:`xpot` which does a
Éric Araujo35a502b2011-10-07 22:02:58 +0200650 similar job. It is available as part of his `po-utils package
651 <http://po-utils.progiciels-bpi.ca/>`_.
Georg Brandl116aa622007-08-15 14:28:22 +0000652
653.. [#] :program:`msgfmt.py` is binary compatible with GNU :program:`msgfmt` except that
654 it provides a simpler, all-Python implementation. With this and
655 :program:`pygettext.py`, you generally won't need to install the GNU
656 :program:`gettext` package to internationalize your Python applications.
657
658.. [#] The choice of :func:`N_` here is totally arbitrary; it could have just as easily
659 been :func:`MarkThisStringForTranslation`.
660