blob: da8f4f1256f213403cb33e2d1462c78775d90d10 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001
2:mod:`gettext` --- Multilingual internationalization services
3=============================================================
4
5.. module:: gettext
6 :synopsis: Multilingual internationalization services.
7.. moduleauthor:: Barry A. Warsaw <barry@zope.com>
8.. sectionauthor:: Barry A. Warsaw <barry@zope.com>
9
10
11The :mod:`gettext` module provides internationalization (I18N) and localization
12(L10N) services for your Python modules and applications. It supports both the
13GNU ``gettext`` message catalog API and a higher level, class-based API that may
14be more appropriate for Python files. The interface described below allows you
15to write your module and application messages in one natural language, and
16provide a catalog of translated messages for running under different natural
17languages.
18
19Some hints on localizing your Python modules and applications are also given.
20
21
22GNU :program:`gettext` API
23--------------------------
24
25The :mod:`gettext` module defines the following API, which is very similar to
26the GNU :program:`gettext` API. If you use this API you will affect the
27translation of your entire application globally. Often this is what you want if
28your application is monolingual, with the choice of language dependent on the
29locale of your user. If you are localizing a Python module, or if your
30application needs to switch languages on the fly, you probably want to use the
31class-based API instead.
32
33
34.. function:: bindtextdomain(domain[, localedir])
35
36 Bind the *domain* to the locale directory *localedir*. More concretely,
37 :mod:`gettext` will look for binary :file:`.mo` files for the given domain using
38 the path (on Unix): :file:`localedir/language/LC_MESSAGES/domain.mo`, where
39 *languages* is searched for in the environment variables :envvar:`LANGUAGE`,
40 :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and :envvar:`LANG` respectively.
41
42 If *localedir* is omitted or ``None``, then the current binding for *domain* is
43 returned. [#]_
44
45
46.. function:: bind_textdomain_codeset(domain[, codeset])
47
48 Bind the *domain* to *codeset*, changing the encoding of strings returned by the
49 :func:`gettext` family of functions. If *codeset* is omitted, then the current
50 binding is returned.
51
Georg Brandl116aa622007-08-15 14:28:22 +000052
53.. function:: textdomain([domain])
54
55 Change or query the current global domain. If *domain* is ``None``, then the
56 current global domain is returned, otherwise the global domain is set to
57 *domain*, which is returned.
58
59
60.. function:: gettext(message)
61
62 Return the localized translation of *message*, based on the current global
63 domain, language, and locale directory. This function is usually aliased as
64 :func:`_` in the local namespace (see examples below).
65
66
67.. function:: lgettext(message)
68
69 Equivalent to :func:`gettext`, but the translation is returned in the preferred
70 system encoding, if no other encoding was explicitly set with
71 :func:`bind_textdomain_codeset`.
72
Georg Brandl116aa622007-08-15 14:28:22 +000073
74.. function:: dgettext(domain, message)
75
76 Like :func:`gettext`, but look the message up in the specified *domain*.
77
78
79.. function:: ldgettext(domain, message)
80
81 Equivalent to :func:`dgettext`, but the translation is returned in the preferred
82 system encoding, if no other encoding was explicitly set with
83 :func:`bind_textdomain_codeset`.
84
Georg Brandl116aa622007-08-15 14:28:22 +000085
86.. function:: ngettext(singular, plural, n)
87
88 Like :func:`gettext`, but consider plural forms. If a translation is found,
89 apply the plural formula to *n*, and return the resulting message (some
90 languages have more than two plural forms). If no translation is found, return
91 *singular* if *n* is 1; return *plural* otherwise.
92
93 The Plural formula is taken from the catalog header. It is a C or Python
94 expression that has a free variable *n*; the expression evaluates to the index
95 of the plural in the catalog. See the GNU gettext documentation for the precise
96 syntax to be used in :file:`.po` files and the formulas for a variety of
97 languages.
98
Georg Brandl116aa622007-08-15 14:28:22 +000099
100.. function:: lngettext(singular, plural, n)
101
102 Equivalent to :func:`ngettext`, but the translation is returned in the preferred
103 system encoding, if no other encoding was explicitly set with
104 :func:`bind_textdomain_codeset`.
105
Georg Brandl116aa622007-08-15 14:28:22 +0000106
107.. function:: dngettext(domain, singular, plural, n)
108
109 Like :func:`ngettext`, but look the message up in the specified *domain*.
110
Georg Brandl116aa622007-08-15 14:28:22 +0000111
112.. function:: ldngettext(domain, singular, plural, n)
113
114 Equivalent to :func:`dngettext`, but the translation is returned in the
115 preferred system encoding, if no other encoding was explicitly set with
116 :func:`bind_textdomain_codeset`.
117
Georg Brandl116aa622007-08-15 14:28:22 +0000118
119Note that GNU :program:`gettext` also defines a :func:`dcgettext` method, but
120this was deemed not useful and so it is currently unimplemented.
121
122Here's an example of typical usage for this API::
123
124 import gettext
125 gettext.bindtextdomain('myapplication', '/path/to/my/language/directory')
126 gettext.textdomain('myapplication')
127 _ = gettext.gettext
128 # ...
Georg Brandl6911e3c2007-09-04 07:15:32 +0000129 print(_('This is a translatable string.'))
Georg Brandl116aa622007-08-15 14:28:22 +0000130
131
132Class-based API
133---------------
134
135The class-based API of the :mod:`gettext` module gives you more flexibility and
136greater convenience than the GNU :program:`gettext` API. It is the recommended
137way of localizing your Python applications and modules. :mod:`gettext` defines
138a "translations" class which implements the parsing of GNU :file:`.mo` format
Georg Brandlf6945182008-02-01 11:56:49 +0000139files, and has methods for returning strings. Instances of this "translations"
140class can also install themselves in the built-in namespace as the function
141:func:`_`.
Georg Brandl116aa622007-08-15 14:28:22 +0000142
143
144.. function:: find(domain[, localedir[, languages[, all]]])
145
146 This function implements the standard :file:`.mo` file search algorithm. It
147 takes a *domain*, identical to what :func:`textdomain` takes. Optional
148 *localedir* is as in :func:`bindtextdomain` Optional *languages* is a list of
149 strings, where each string is a language code.
150
151 If *localedir* is not given, then the default system locale directory is used.
152 [#]_ If *languages* is not given, then the following environment variables are
153 searched: :envvar:`LANGUAGE`, :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and
154 :envvar:`LANG`. The first one returning a non-empty value is used for the
155 *languages* variable. The environment variables should contain a colon separated
156 list of languages, which will be split on the colon to produce the expected list
157 of language code strings.
158
159 :func:`find` then expands and normalizes the languages, and then iterates
160 through them, searching for an existing file built of these components:
161
162 :file:`localedir/language/LC_MESSAGES/domain.mo`
163
164 The first such file name that exists is returned by :func:`find`. If no such
165 file is found, then ``None`` is returned. If *all* is given, it returns a list
166 of all file names, in the order in which they appear in the languages list or
167 the environment variables.
168
169
170.. function:: translation(domain[, localedir[, languages[, class_[, fallback[, codeset]]]]])
171
172 Return a :class:`Translations` instance based on the *domain*, *localedir*, and
173 *languages*, which are first passed to :func:`find` to get a list of the
174 associated :file:`.mo` file paths. Instances with identical :file:`.mo` file
175 names are cached. The actual class instantiated is either *class_* if provided,
176 otherwise :class:`GNUTranslations`. The class's constructor must take a single
177 file object argument. If provided, *codeset* will change the charset used to
178 encode translated strings.
179
180 If multiple files are found, later files are used as fallbacks for earlier ones.
181 To allow setting the fallback, :func:`copy.copy` is used to clone each
182 translation object from the cache; the actual instance data is still shared with
183 the cache.
184
185 If no :file:`.mo` file is found, this function raises :exc:`IOError` if
186 *fallback* is false (which is the default), and returns a
187 :class:`NullTranslations` instance if *fallback* is true.
188
Georg Brandl116aa622007-08-15 14:28:22 +0000189
190.. function:: install(domain[, localedir[, unicode [, codeset[, names]]]])
191
192 This installs the function :func:`_` in Python's builtin namespace, based on
193 *domain*, *localedir*, and *codeset* which are passed to the function
194 :func:`translation`. The *unicode* flag is passed to the resulting translation
195 object's :meth:`install` method.
196
197 For the *names* parameter, please see the description of the translation
198 object's :meth:`install` method.
199
200 As seen below, you usually mark the strings in your application that are
201 candidates for translation, by wrapping them in a call to the :func:`_`
202 function, like this::
203
Georg Brandl6911e3c2007-09-04 07:15:32 +0000204 print(_('This string will be translated.'))
Georg Brandl116aa622007-08-15 14:28:22 +0000205
206 For convenience, you want the :func:`_` function to be installed in Python's
207 builtin namespace, so it is easily accessible in all modules of your
208 application.
209
Georg Brandl116aa622007-08-15 14:28:22 +0000210
211The :class:`NullTranslations` class
212^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
213
214Translation classes are what actually implement the translation of original
215source file message strings to translated message strings. The base class used
216by all translation classes is :class:`NullTranslations`; this provides the basic
217interface you can use to write your own specialized translation classes. Here
218are the methods of :class:`NullTranslations`:
219
220
221.. method:: NullTranslations.__init__([fp])
222
223 Takes an optional file object *fp*, which is ignored by the base class.
224 Initializes "protected" instance variables *_info* and *_charset* which are set
225 by derived classes, as well as *_fallback*, which is set through
226 :meth:`add_fallback`. It then calls ``self._parse(fp)`` if *fp* is not
227 ``None``.
228
229
230.. method:: NullTranslations._parse(fp)
231
232 No-op'd in the base class, this method takes file object *fp*, and reads the
233 data from the file, initializing its message catalog. If you have an
234 unsupported message catalog file format, you should override this method to
235 parse your format.
236
237
238.. method:: NullTranslations.add_fallback(fallback)
239
240 Add *fallback* as the fallback object for the current translation object. A
241 translation object should consult the fallback if it cannot provide a
242 translation for a given message.
243
244
245.. method:: NullTranslations.gettext(message)
246
247 If a fallback has been set, forward :meth:`gettext` to the fallback. Otherwise,
248 return the translated message. Overridden in derived classes.
249
250
251.. method:: NullTranslations.lgettext(message)
252
253 If a fallback has been set, forward :meth:`lgettext` to the fallback. Otherwise,
254 return the translated message. Overridden in derived classes.
255
Georg Brandl116aa622007-08-15 14:28:22 +0000256
257.. method:: NullTranslations.ugettext(message)
258
259 If a fallback has been set, forward :meth:`ugettext` to the fallback. Otherwise,
Georg Brandlf6945182008-02-01 11:56:49 +0000260 return the translated message as a string. Overridden in derived classes.
Georg Brandl116aa622007-08-15 14:28:22 +0000261
262
263.. method:: NullTranslations.ngettext(singular, plural, n)
264
265 If a fallback has been set, forward :meth:`ngettext` to the fallback. Otherwise,
266 return the translated message. Overridden in derived classes.
267
Georg Brandl116aa622007-08-15 14:28:22 +0000268
269.. method:: NullTranslations.lngettext(singular, plural, n)
270
271 If a fallback has been set, forward :meth:`ngettext` to the fallback. Otherwise,
272 return the translated message. Overridden in derived classes.
273
Georg Brandl116aa622007-08-15 14:28:22 +0000274
275.. method:: NullTranslations.ungettext(singular, plural, n)
276
277 If a fallback has been set, forward :meth:`ungettext` to the fallback.
Georg Brandlf6945182008-02-01 11:56:49 +0000278 Otherwise, return the translated message as a string. Overridden in
Georg Brandl116aa622007-08-15 14:28:22 +0000279 derived classes.
280
Georg Brandl116aa622007-08-15 14:28:22 +0000281
282.. method:: NullTranslations.info()
283
284 Return the "protected" :attr:`_info` variable.
285
286
287.. method:: NullTranslations.charset()
288
289 Return the "protected" :attr:`_charset` variable.
290
291
292.. method:: NullTranslations.output_charset()
293
294 Return the "protected" :attr:`_output_charset` variable, which defines the
295 encoding used to return translated messages.
296
Georg Brandl116aa622007-08-15 14:28:22 +0000297
298.. method:: NullTranslations.set_output_charset(charset)
299
300 Change the "protected" :attr:`_output_charset` variable, which defines the
301 encoding used to return translated messages.
302
Georg Brandl116aa622007-08-15 14:28:22 +0000303
304.. method:: NullTranslations.install([unicode [, names]])
305
306 If the *unicode* flag is false, this method installs :meth:`self.gettext` into
307 the built-in namespace, binding it to ``_``. If *unicode* is true, it binds
308 :meth:`self.ugettext` instead. By default, *unicode* is false.
309
310 If the *names* parameter is given, it must be a sequence containing the names of
311 functions you want to install in the builtin namespace in addition to :func:`_`.
312 Supported names are ``'gettext'`` (bound to :meth:`self.gettext` or
313 :meth:`self.ugettext` according to the *unicode* flag), ``'ngettext'`` (bound to
314 :meth:`self.ngettext` or :meth:`self.ungettext` according to the *unicode*
315 flag), ``'lgettext'`` and ``'lngettext'``.
316
317 Note that this is only one way, albeit the most convenient way, to make the
318 :func:`_` function available to your application. Because it affects the entire
319 application globally, and specifically the built-in namespace, localized modules
320 should never install :func:`_`. Instead, they should use this code to make
321 :func:`_` available to their module::
322
323 import gettext
324 t = gettext.translation('mymodule', ...)
325 _ = t.gettext
326
327 This puts :func:`_` only in the module's global namespace and so only affects
328 calls within this module.
329
Georg Brandl116aa622007-08-15 14:28:22 +0000330
331The :class:`GNUTranslations` class
332^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
333
334The :mod:`gettext` module provides one additional class derived from
335:class:`NullTranslations`: :class:`GNUTranslations`. This class overrides
336:meth:`_parse` to enable reading GNU :program:`gettext` format :file:`.mo` files
337in both big-endian and little-endian format. It also coerces both message ids
338and message strings to Unicode.
339
340:class:`GNUTranslations` parses optional meta-data out of the translation
341catalog. It is convention with GNU :program:`gettext` to include meta-data as
342the translation for the empty string. This meta-data is in :rfc:`822`\ -style
343``key: value`` pairs, and should contain the ``Project-Id-Version`` key. If the
344key ``Content-Type`` is found, then the ``charset`` property is used to
345initialize the "protected" :attr:`_charset` instance variable, defaulting to
346``None`` if not found. If the charset encoding is specified, then all message
347ids and message strings read from the catalog are converted to Unicode using
348this encoding. The :meth:`ugettext` method always returns a Unicode, while the
Georg Brandlf6945182008-02-01 11:56:49 +0000349:meth:`gettext` returns an encoded bytestring. For the message id arguments
350of both methods, either Unicode strings or bytestrings containing only
Georg Brandl116aa622007-08-15 14:28:22 +0000351US-ASCII characters are acceptable. Note that the Unicode version of the
352methods (i.e. :meth:`ugettext` and :meth:`ungettext`) are the recommended
353interface to use for internationalized Python programs.
354
355The entire set of key/value pairs are placed into a dictionary and set as the
356"protected" :attr:`_info` instance variable.
357
358If the :file:`.mo` file's magic number is invalid, or if other problems occur
359while reading the file, instantiating a :class:`GNUTranslations` class can raise
360:exc:`IOError`.
361
362The following methods are overridden from the base class implementation:
363
364
365.. method:: GNUTranslations.gettext(message)
366
367 Look up the *message* id in the catalog and return the corresponding message
Georg Brandlf6945182008-02-01 11:56:49 +0000368 string, as a bytestring encoded with the catalog's charset encoding, if
Georg Brandl116aa622007-08-15 14:28:22 +0000369 known. If there is no entry in the catalog for the *message* id, and a fallback
370 has been set, the look up is forwarded to the fallback's :meth:`gettext` method.
371 Otherwise, the *message* id is returned.
372
373
374.. method:: GNUTranslations.lgettext(message)
375
376 Equivalent to :meth:`gettext`, but the translation is returned in the preferred
377 system encoding, if no other encoding was explicitly set with
378 :meth:`set_output_charset`.
379
Georg Brandl116aa622007-08-15 14:28:22 +0000380
381.. method:: GNUTranslations.ugettext(message)
382
383 Look up the *message* id in the catalog and return the corresponding message
Georg Brandlf6945182008-02-01 11:56:49 +0000384 string, as a string. If there is no entry in the catalog for the
Georg Brandl116aa622007-08-15 14:28:22 +0000385 *message* id, and a fallback has been set, the look up is forwarded to the
386 fallback's :meth:`ugettext` method. Otherwise, the *message* id is returned.
387
388
389.. method:: GNUTranslations.ngettext(singular, plural, n)
390
391 Do a plural-forms lookup of a message id. *singular* is used as the message id
392 for purposes of lookup in the catalog, while *n* is used to determine which
Georg Brandlf6945182008-02-01 11:56:49 +0000393 plural form to use. The returned message string is a bytestring encoded with
Georg Brandl116aa622007-08-15 14:28:22 +0000394 the catalog's charset encoding, if known.
395
396 If the message id is not found in the catalog, and a fallback is specified, the
397 request is forwarded to the fallback's :meth:`ngettext` method. Otherwise, when
398 *n* is 1 *singular* is returned, and *plural* is returned in all other cases.
399
Georg Brandl116aa622007-08-15 14:28:22 +0000400
401.. method:: GNUTranslations.lngettext(singular, plural, n)
402
403 Equivalent to :meth:`gettext`, but the translation is returned in the preferred
404 system encoding, if no other encoding was explicitly set with
405 :meth:`set_output_charset`.
406
Georg Brandl116aa622007-08-15 14:28:22 +0000407
408.. method:: GNUTranslations.ungettext(singular, plural, n)
409
410 Do a plural-forms lookup of a message id. *singular* is used as the message id
411 for purposes of lookup in the catalog, while *n* is used to determine which
Georg Brandlf6945182008-02-01 11:56:49 +0000412 plural form to use. The returned message string is a string.
Georg Brandl116aa622007-08-15 14:28:22 +0000413
414 If the message id is not found in the catalog, and a fallback is specified, the
415 request is forwarded to the fallback's :meth:`ungettext` method. Otherwise,
416 when *n* is 1 *singular* is returned, and *plural* is returned in all other
417 cases.
418
419 Here is an example::
420
421 n = len(os.listdir('.'))
422 cat = GNUTranslations(somefile)
423 message = cat.ungettext(
424 'There is %(num)d file in this directory',
425 'There are %(num)d files in this directory',
426 n) % {'num': n}
427
Georg Brandl116aa622007-08-15 14:28:22 +0000428
429Solaris message catalog support
430^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
431
432The Solaris operating system defines its own binary :file:`.mo` file format, but
433since no documentation can be found on this format, it is not supported at this
434time.
435
436
437The Catalog constructor
438^^^^^^^^^^^^^^^^^^^^^^^
439
440.. index:: single: GNOME
441
442GNOME uses a version of the :mod:`gettext` module by James Henstridge, but this
443version has a slightly different API. Its documented usage was::
444
445 import gettext
446 cat = gettext.Catalog(domain, localedir)
447 _ = cat.gettext
Georg Brandl6911e3c2007-09-04 07:15:32 +0000448 print(_('hello world'))
Georg Brandl116aa622007-08-15 14:28:22 +0000449
450For compatibility with this older module, the function :func:`Catalog` is an
451alias for the :func:`translation` function described above.
452
453One difference between this module and Henstridge's: his catalog objects
454supported access through a mapping API, but this appears to be unused and so is
455not currently supported.
456
457
458Internationalizing your programs and modules
459--------------------------------------------
460
461Internationalization (I18N) refers to the operation by which a program is made
462aware of multiple languages. Localization (L10N) refers to the adaptation of
463your program, once internationalized, to the local language and cultural habits.
464In order to provide multilingual messages for your Python programs, you need to
465take the following steps:
466
467#. prepare your program or module by specially marking translatable strings
468
469#. run a suite of tools over your marked files to generate raw messages catalogs
470
471#. create language specific translations of the message catalogs
472
473#. use the :mod:`gettext` module so that message strings are properly translated
474
475In order to prepare your code for I18N, you need to look at all the strings in
476your files. Any string that needs to be translated should be marked by wrapping
477it in ``_('...')`` --- that is, a call to the function :func:`_`. For example::
478
479 filename = 'mylog.txt'
480 message = _('writing a log message')
481 fp = open(filename, 'w')
482 fp.write(message)
483 fp.close()
484
485In this example, the string ``'writing a log message'`` is marked as a candidate
486for translation, while the strings ``'mylog.txt'`` and ``'w'`` are not.
487
488The Python distribution comes with two tools which help you generate the message
489catalogs once you've prepared your source code. These may or may not be
490available from a binary distribution, but they can be found in a source
491distribution, in the :file:`Tools/i18n` directory.
492
493The :program:`pygettext` [#]_ program scans all your Python source code looking
494for the strings you previously marked as translatable. It is similar to the GNU
495:program:`gettext` program except that it understands all the intricacies of
496Python source code, but knows nothing about C or C++ source code. You don't
497need GNU ``gettext`` unless you're also going to be translating C code (such as
498C extension modules).
499
500:program:`pygettext` generates textual Uniforum-style human readable message
501catalog :file:`.pot` files, essentially structured human readable files which
502contain every marked string in the source code, along with a placeholder for the
503translation strings. :program:`pygettext` is a command line script that supports
504a similar command line interface as :program:`xgettext`; for details on its use,
505run::
506
507 pygettext.py --help
508
509Copies of these :file:`.pot` files are then handed over to the individual human
510translators who write language-specific versions for every supported natural
511language. They send you back the filled in language-specific versions as a
512:file:`.po` file. Using the :program:`msgfmt.py` [#]_ program (in the
513:file:`Tools/i18n` directory), you take the :file:`.po` files from your
514translators and generate the machine-readable :file:`.mo` binary catalog files.
515The :file:`.mo` files are what the :mod:`gettext` module uses for the actual
516translation processing during run-time.
517
518How you use the :mod:`gettext` module in your code depends on whether you are
519internationalizing a single module or your entire application. The next two
520sections will discuss each case.
521
522
523Localizing your module
524^^^^^^^^^^^^^^^^^^^^^^
525
526If you are localizing your module, you must take care not to make global
527changes, e.g. to the built-in namespace. You should not use the GNU ``gettext``
528API but instead the class-based API.
529
530Let's say your module is called "spam" and the module's various natural language
531translation :file:`.mo` files reside in :file:`/usr/share/locale` in GNU
532:program:`gettext` format. Here's what you would put at the top of your
533module::
534
535 import gettext
536 t = gettext.translation('spam', '/usr/share/locale')
537 _ = t.lgettext
538
539If your translators were providing you with Unicode strings in their :file:`.po`
540files, you'd instead do::
541
542 import gettext
543 t = gettext.translation('spam', '/usr/share/locale')
544 _ = t.ugettext
545
546
547Localizing your application
548^^^^^^^^^^^^^^^^^^^^^^^^^^^
549
550If you are localizing your application, you can install the :func:`_` function
551globally into the built-in namespace, usually in the main driver file of your
552application. This will let all your application-specific files just use
553``_('...')`` without having to explicitly install it in each file.
554
555In the simple case then, you need only add the following bit of code to the main
556driver file of your application::
557
558 import gettext
559 gettext.install('myapplication')
560
561If you need to set the locale directory or the *unicode* flag, you can pass
562these into the :func:`install` function::
563
564 import gettext
565 gettext.install('myapplication', '/usr/share/locale', unicode=1)
566
567
568Changing languages on the fly
569^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
570
571If your program needs to support many languages at the same time, you may want
572to create multiple translation instances and then switch between them
573explicitly, like so::
574
575 import gettext
576
577 lang1 = gettext.translation('myapplication', languages=['en'])
578 lang2 = gettext.translation('myapplication', languages=['fr'])
579 lang3 = gettext.translation('myapplication', languages=['de'])
580
581 # start by using language1
582 lang1.install()
583
584 # ... time goes by, user selects language 2
585 lang2.install()
586
587 # ... more time goes by, user selects language 3
588 lang3.install()
589
590
591Deferred translations
592^^^^^^^^^^^^^^^^^^^^^
593
594In most coding situations, strings are translated where they are coded.
595Occasionally however, you need to mark strings for translation, but defer actual
596translation until later. A classic example is::
597
598 animals = ['mollusk',
599 'albatross',
600 'rat',
601 'penguin',
602 'python',
603 ]
604 # ...
605 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000606 print(a)
Georg Brandl116aa622007-08-15 14:28:22 +0000607
608Here, you want to mark the strings in the ``animals`` list as being
609translatable, but you don't actually want to translate them until they are
610printed.
611
612Here is one way you can handle this situation::
613
614 def _(message): return message
615
616 animals = [_('mollusk'),
617 _('albatross'),
618 _('rat'),
619 _('penguin'),
620 _('python'),
621 ]
622
623 del _
624
625 # ...
626 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000627 print(_(a))
Georg Brandl116aa622007-08-15 14:28:22 +0000628
629This works because the dummy definition of :func:`_` simply returns the string
630unchanged. And this dummy definition will temporarily override any definition
631of :func:`_` in the built-in namespace (until the :keyword:`del` command). Take
632care, though if you have a previous definition of :func:`_` in the local
633namespace.
634
635Note that the second use of :func:`_` will not identify "a" as being
636translatable to the :program:`pygettext` program, since it is not a string.
637
638Another way to handle this is with the following example::
639
640 def N_(message): return message
641
642 animals = [N_('mollusk'),
643 N_('albatross'),
644 N_('rat'),
645 N_('penguin'),
646 N_('python'),
647 ]
648
649 # ...
650 for a in animals:
Georg Brandl6911e3c2007-09-04 07:15:32 +0000651 print(_(a))
Georg Brandl116aa622007-08-15 14:28:22 +0000652
653In this case, you are marking translatable strings with the function :func:`N_`,
654[#]_ which won't conflict with any definition of :func:`_`. However, you will
655need to teach your message extraction program to look for translatable strings
656marked with :func:`N_`. :program:`pygettext` and :program:`xpot` both support
657this through the use of command line switches.
658
659
660:func:`gettext` vs. :func:`lgettext`
661^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
662
663In Python 2.4 the :func:`lgettext` family of functions were introduced. The
664intention of these functions is to provide an alternative which is more
665compliant with the current implementation of GNU gettext. Unlike
666:func:`gettext`, which returns strings encoded with the same codeset used in the
667translation file, :func:`lgettext` will return strings encoded with the
668preferred system encoding, as returned by :func:`locale.getpreferredencoding`.
669Also notice that Python 2.4 introduces new functions to explicitly choose the
670codeset used in translated strings. If a codeset is explicitly set, even
671:func:`lgettext` will return translated strings in the requested codeset, as
672would be expected in the GNU gettext implementation.
673
674
675Acknowledgements
676----------------
677
678The following people contributed code, feedback, design suggestions, previous
679implementations, and valuable experience to the creation of this module:
680
681* Peter Funk
682
683* James Henstridge
684
685* Juan David Ibáñez Palomar
686
687* Marc-André Lemburg
688
689* Martin von Löwis
690
691* François Pinard
692
693* Barry Warsaw
694
695* Gustavo Niemeyer
696
697.. rubric:: Footnotes
698
699.. [#] The default locale directory is system dependent; for example, on RedHat Linux
700 it is :file:`/usr/share/locale`, but on Solaris it is :file:`/usr/lib/locale`.
701 The :mod:`gettext` module does not try to support these system dependent
702 defaults; instead its default is :file:`sys.prefix/share/locale`. For this
703 reason, it is always best to call :func:`bindtextdomain` with an explicit
704 absolute path at the start of your application.
705
706.. [#] See the footnote for :func:`bindtextdomain` above.
707
708.. [#] François Pinard has written a program called :program:`xpot` which does a
709 similar job. It is available as part of his :program:`po-utils` package at http
710 ://po-utils.progiciels-bpi.ca/.
711
712.. [#] :program:`msgfmt.py` is binary compatible with GNU :program:`msgfmt` except that
713 it provides a simpler, all-Python implementation. With this and
714 :program:`pygettext.py`, you generally won't need to install the GNU
715 :program:`gettext` package to internationalize your Python applications.
716
717.. [#] The choice of :func:`N_` here is totally arbitrary; it could have just as easily
718 been :func:`MarkThisStringForTranslation`.
719