| :mod:`email` --- An email and MIME handling package |
| =================================================== |
| |
| .. module:: email |
| :synopsis: Package supporting the parsing, manipulating, and generating |
| email messages, including MIME documents. |
| .. moduleauthor:: Barry A. Warsaw <barry@python.org> |
| .. sectionauthor:: Barry A. Warsaw <barry@python.org> |
| .. Copyright (C) 2001-2010 Python Software Foundation |
| |
| |
| The :mod:`email` package is a library for managing email messages, including |
| MIME and other :rfc:`2822`\ -based message documents. It is specifically *not* |
| designed to do any sending of email messages to SMTP (:rfc:`2821`), NNTP, or |
| other servers; those are functions of modules such as :mod:`smtplib` and |
| :mod:`nntplib`. The :mod:`email` package attempts to be as RFC-compliant as |
| possible, supporting in addition to :rfc:`2822`, such MIME-related RFCs as |
| :rfc:`2045`, :rfc:`2046`, :rfc:`2047`, and :rfc:`2231`. |
| |
| The primary distinguishing feature of the :mod:`email` package is that it splits |
| the parsing and generating of email messages from the internal *object model* |
| representation of email. Applications using the :mod:`email` package deal |
| primarily with objects; you can add sub-objects to messages, remove sub-objects |
| from messages, completely re-arrange the contents, etc. There is a separate |
| parser and a separate generator which handles the transformation from flat text |
| to the object model, and then back to flat text again. There are also handy |
| subclasses for some common MIME object types, and a few miscellaneous utilities |
| that help with such common tasks as extracting and parsing message field values, |
| creating RFC-compliant dates, etc. |
| |
| The following sections describe the functionality of the :mod:`email` package. |
| The ordering follows a progression that should be common in applications: an |
| email message is read as flat text from a file or other source, the text is |
| parsed to produce the object structure of the email message, this structure is |
| manipulated, and finally, the object tree is rendered back into flat text. |
| |
| It is perfectly feasible to create the object structure out of whole cloth --- |
| i.e. completely from scratch. From there, a similar progression can be taken as |
| above. |
| |
| Also included are detailed specifications of all the classes and modules that |
| the :mod:`email` package provides, the exception classes you might encounter |
| while using the :mod:`email` package, some auxiliary utilities, and a few |
| examples. For users of the older :mod:`mimelib` package, or previous versions |
| of the :mod:`email` package, a section on differences and porting is provided. |
| |
| Contents of the :mod:`email` package documentation: |
| |
| .. toctree:: |
| |
| email.message.rst |
| email.parser.rst |
| email.generator.rst |
| email.mime.rst |
| email.header.rst |
| email.charset.rst |
| email.encoders.rst |
| email.errors.rst |
| email.util.rst |
| email.iterators.rst |
| email-examples.rst |
| |
| |
| .. seealso:: |
| |
| Module :mod:`smtplib` |
| SMTP protocol client |
| |
| Module :mod:`nntplib` |
| NNTP protocol client |
| |
| |
| .. _email-pkg-history: |
| |
| Package History |
| --------------- |
| |
| This table describes the release history of the email package, corresponding to |
| the version of Python that the package was released with. For purposes of this |
| document, when you see a note about change or added versions, these refer to the |
| Python version the change was made in, *not* the email package version. This |
| table also describes the Python compatibility of each version of the package. |
| |
| +---------------+------------------------------+-----------------------+ |
| | email version | distributed with | compatible with | |
| +===============+==============================+=======================+ |
| | :const:`1.x` | Python 2.2.0 to Python 2.2.1 | *no longer supported* | |
| +---------------+------------------------------+-----------------------+ |
| | :const:`2.5` | Python 2.2.2+ and Python 2.3 | Python 2.1 to 2.5 | |
| +---------------+------------------------------+-----------------------+ |
| | :const:`3.0` | Python 2.4 | Python 2.3 to 2.5 | |
| +---------------+------------------------------+-----------------------+ |
| | :const:`4.0` | Python 2.5 | Python 2.3 to 2.5 | |
| +---------------+------------------------------+-----------------------+ |
| | :const:`5.0` | Python 3.0 and Python 3.1 | Python 3.0 to 3.2 | |
| +---------------+------------------------------+-----------------------+ |
| | :const:`5.1` | Python 3.2 | Python 3.0 to 3.2 | |
| +---------------+------------------------------+-----------------------+ |
| |
| Here are the major differences between :mod:`email` version 5.1 and |
| version 5.0: |
| |
| * It is once again possible to parse messages containing non-ASCII bytes, |
| and to reproduce such messages if the data containing the non-ASCII |
| bytes is not modified. |
| |
| * New functions :func:`message_from_bytes` and :func:`message_from_binary_file`, |
| and new classes :class:`~email.parser.BytesFeedParser` and |
| :class:`~email.parser.BytesParser` allow binary message data to be parsed |
| into model objects. |
| |
| * Given bytes input to the model, :meth:`~email.message.Message.get_payload` |
| will by default decode a message body that has a |
| :mailheader:`Content-Transfer-Encoding` of ``8bit`` using the charset |
| specified in the MIME headers and return the resulting string. |
| |
| * Given bytes input to the model, :class:`~email.generator.Generator` will |
| convert message bodies that have a :mailheader:`Content-Transfer-Encoding` of |
| 8bit to instead have a 7bit Content-Transfer-Encoding. |
| |
| * New class :class:`~email.generator.BytesGenerator` produces bytes |
| as output, preserving any unchanged non-ASCII data that was |
| present in the input used to build the model, including message bodies |
| with a :mailheader:`Content-Transfer-Encoding` of 8bit. |
| |
| Here are the major differences between :mod:`email` version 5.0 and version 4: |
| |
| * All operations are on unicode strings. Text inputs must be strings, |
| text outputs are strings. Outputs are limited to the ASCII character |
| set and so can be encoded to ASCII for transmission. Inputs are also |
| limited to ASCII; this is an acknowledged limitation of email 5.0 and |
| means it can only be used to parse email that is 7bit clean. |
| |
| Here are the major differences between :mod:`email` version 4 and version 3: |
| |
| * All modules have been renamed according to :pep:`8` standards. For example, |
| the version 3 module :mod:`email.Message` was renamed to :mod:`email.message` in |
| version 4. |
| |
| * A new subpackage :mod:`email.mime` was added and all the version 3 |
| :mod:`email.MIME\*` modules were renamed and situated into the :mod:`email.mime` |
| subpackage. For example, the version 3 module :mod:`email.MIMEText` was renamed |
| to :mod:`email.mime.text`. |
| |
| *Note that the version 3 names will continue to work until Python 2.6*. |
| |
| * The :mod:`email.mime.application` module was added, which contains the |
| :class:`MIMEApplication` class. |
| |
| * Methods that were deprecated in version 3 have been removed. These include |
| :meth:`Generator.__call__`, :meth:`Message.get_type`, |
| :meth:`Message.get_main_type`, :meth:`Message.get_subtype`. |
| |
| * Fixes have been added for :rfc:`2231` support which can change some of the |
| return types for :func:`Message.get_param` and friends. Under some |
| circumstances, values which used to return a 3-tuple now return simple strings |
| (specifically, if all extended parameter segments were unencoded, there is no |
| language and charset designation expected, so the return type is now a simple |
| string). Also, %-decoding used to be done for both encoded and unencoded |
| segments; this decoding is now done only for encoded segments. |
| |
| Here are the major differences between :mod:`email` version 3 and version 2: |
| |
| * The :class:`FeedParser` class was introduced, and the :class:`Parser` class |
| was implemented in terms of the :class:`FeedParser`. All parsing therefore is |
| non-strict, and parsing will make a best effort never to raise an exception. |
| Problems found while parsing messages are stored in the message's *defect* |
| attribute. |
| |
| * All aspects of the API which raised :exc:`DeprecationWarning`\ s in version 2 |
| have been removed. These include the *_encoder* argument to the |
| :class:`MIMEText` constructor, the :meth:`Message.add_payload` method, the |
| :func:`Utils.dump_address_pair` function, and the functions :func:`Utils.decode` |
| and :func:`Utils.encode`. |
| |
| * New :exc:`DeprecationWarning`\ s have been added to: |
| :meth:`Generator.__call__`, :meth:`Message.get_type`, |
| :meth:`Message.get_main_type`, :meth:`Message.get_subtype`, and the *strict* |
| argument to the :class:`Parser` class. These are expected to be removed in |
| future versions. |
| |
| * Support for Pythons earlier than 2.3 has been removed. |
| |
| Here are the differences between :mod:`email` version 2 and version 1: |
| |
| * The :mod:`email.Header` and :mod:`email.Charset` modules have been added. |
| |
| * The pickle format for :class:`Message` instances has changed. Since this was |
| never (and still isn't) formally defined, this isn't considered a backward |
| incompatibility. However if your application pickles and unpickles |
| :class:`Message` instances, be aware that in :mod:`email` version 2, |
| :class:`Message` instances now have private variables *_charset* and |
| *_default_type*. |
| |
| * Several methods in the :class:`Message` class have been deprecated, or their |
| signatures changed. Also, many new methods have been added. See the |
| documentation for the :class:`Message` class for details. The changes should be |
| completely backward compatible. |
| |
| * The object structure has changed in the face of :mimetype:`message/rfc822` |
| content types. In :mod:`email` version 1, such a type would be represented by a |
| scalar payload, i.e. the container message's :meth:`is_multipart` returned |
| false, :meth:`get_payload` was not a list object, but a single :class:`Message` |
| instance. |
| |
| This structure was inconsistent with the rest of the package, so the object |
| representation for :mimetype:`message/rfc822` content types was changed. In |
| :mod:`email` version 2, the container *does* return ``True`` from |
| :meth:`is_multipart`, and :meth:`get_payload` returns a list containing a single |
| :class:`Message` item. |
| |
| Note that this is one place that backward compatibility could not be completely |
| maintained. However, if you're already testing the return type of |
| :meth:`get_payload`, you should be fine. You just need to make sure your code |
| doesn't do a :meth:`set_payload` with a :class:`Message` instance on a container |
| with a content type of :mimetype:`message/rfc822`. |
| |
| * The :class:`Parser` constructor's *strict* argument was added, and its |
| :meth:`parse` and :meth:`parsestr` methods grew a *headersonly* argument. The |
| *strict* flag was also added to functions :func:`email.message_from_file` and |
| :func:`email.message_from_string`. |
| |
| * :meth:`Generator.__call__` is deprecated; use :meth:`Generator.flatten` |
| instead. The :class:`Generator` class has also grown the :meth:`clone` method. |
| |
| * The :class:`DecodedGenerator` class in the :mod:`email.Generator` module was |
| added. |
| |
| * The intermediate base classes :class:`MIMENonMultipart` and |
| :class:`MIMEMultipart` have been added, and interposed in the class hierarchy |
| for most of the other MIME-related derived classes. |
| |
| * The *_encoder* argument to the :class:`MIMEText` constructor has been |
| deprecated. Encoding now happens implicitly based on the *_charset* argument. |
| |
| * The following functions in the :mod:`email.Utils` module have been deprecated: |
| :func:`dump_address_pairs`, :func:`decode`, and :func:`encode`. The following |
| functions have been added to the module: :func:`make_msgid`, |
| :func:`decode_rfc2231`, :func:`encode_rfc2231`, and :func:`decode_params`. |
| |
| * The non-public function :func:`email.Iterators._structure` was added. |
| |
| |
| Differences from :mod:`mimelib` |
| ------------------------------- |
| |
| The :mod:`email` package was originally prototyped as a separate library called |
| `mimelib <http://mimelib.sf.net/>`_. Changes have been made so that method names |
| are more consistent, and some methods or modules have either been added or |
| removed. The semantics of some of the methods have also changed. For the most |
| part, any functionality available in :mod:`mimelib` is still available in the |
| :mod:`email` package, albeit often in a different way. Backward compatibility |
| between the :mod:`mimelib` package and the :mod:`email` package was not a |
| priority. |
| |
| Here is a brief description of the differences between the :mod:`mimelib` and |
| the :mod:`email` packages, along with hints on how to port your applications. |
| |
| Of course, the most visible difference between the two packages is that the |
| package name has been changed to :mod:`email`. In addition, the top-level |
| package has the following differences: |
| |
| * :func:`messageFromString` has been renamed to :func:`message_from_string`. |
| |
| * :func:`messageFromFile` has been renamed to :func:`message_from_file`. |
| |
| The :class:`Message` class has the following differences: |
| |
| * The method :meth:`asString` was renamed to :meth:`as_string`. |
| |
| * The method :meth:`ismultipart` was renamed to :meth:`is_multipart`. |
| |
| * The :meth:`get_payload` method has grown a *decode* optional argument. |
| |
| * The method :meth:`getall` was renamed to :meth:`get_all`. |
| |
| * The method :meth:`addheader` was renamed to :meth:`add_header`. |
| |
| * The method :meth:`gettype` was renamed to :meth:`get_type`. |
| |
| * The method :meth:`getmaintype` was renamed to :meth:`get_main_type`. |
| |
| * The method :meth:`getsubtype` was renamed to :meth:`get_subtype`. |
| |
| * The method :meth:`getparams` was renamed to :meth:`get_params`. Also, whereas |
| :meth:`getparams` returned a list of strings, :meth:`get_params` returns a list |
| of 2-tuples, effectively the key/value pairs of the parameters, split on the |
| ``'='`` sign. |
| |
| * The method :meth:`getparam` was renamed to :meth:`get_param`. |
| |
| * The method :meth:`getcharsets` was renamed to :meth:`get_charsets`. |
| |
| * The method :meth:`getfilename` was renamed to :meth:`get_filename`. |
| |
| * The method :meth:`getboundary` was renamed to :meth:`get_boundary`. |
| |
| * The method :meth:`setboundary` was renamed to :meth:`set_boundary`. |
| |
| * The method :meth:`getdecodedpayload` was removed. To get similar |
| functionality, pass the value 1 to the *decode* flag of the get_payload() |
| method. |
| |
| * The method :meth:`getpayloadastext` was removed. Similar functionality is |
| supported by the :class:`DecodedGenerator` class in the :mod:`email.generator` |
| module. |
| |
| * The method :meth:`getbodyastext` was removed. You can get similar |
| functionality by creating an iterator with :func:`typed_subpart_iterator` in the |
| :mod:`email.iterators` module. |
| |
| The :class:`Parser` class has no differences in its public interface. It does |
| have some additional smarts to recognize :mimetype:`message/delivery-status` |
| type messages, which it represents as a :class:`Message` instance containing |
| separate :class:`Message` subparts for each header block in the delivery status |
| notification [#]_. |
| |
| The :class:`Generator` class has no differences in its public interface. There |
| is a new class in the :mod:`email.generator` module though, called |
| :class:`DecodedGenerator` which provides most of the functionality previously |
| available in the :meth:`Message.getpayloadastext` method. |
| |
| The following modules and classes have been changed: |
| |
| * The :class:`MIMEBase` class constructor arguments *_major* and *_minor* have |
| changed to *_maintype* and *_subtype* respectively. |
| |
| * The ``Image`` class/module has been renamed to ``MIMEImage``. The *_minor* |
| argument has been renamed to *_subtype*. |
| |
| * The ``Text`` class/module has been renamed to ``MIMEText``. The *_minor* |
| argument has been renamed to *_subtype*. |
| |
| * The ``MessageRFC822`` class/module has been renamed to ``MIMEMessage``. Note |
| that an earlier version of :mod:`mimelib` called this class/module ``RFC822``, |
| but that clashed with the Python standard library module :mod:`rfc822` on some |
| case-insensitive file systems. |
| |
| Also, the :class:`MIMEMessage` class now represents any kind of MIME message |
| with main type :mimetype:`message`. It takes an optional argument *_subtype* |
| which is used to set the MIME subtype. *_subtype* defaults to |
| :mimetype:`rfc822`. |
| |
| :mod:`mimelib` provided some utility functions in its :mod:`address` and |
| :mod:`date` modules. All of these functions have been moved to the |
| :mod:`email.utils` module. |
| |
| The ``MsgReader`` class/module has been removed. Its functionality is most |
| closely supported in the :func:`body_line_iterator` function in the |
| :mod:`email.iterators` module. |
| |
| .. rubric:: Footnotes |
| |
| .. [#] Delivery Status Notifications (DSN) are defined in :rfc:`1894`. |