:mod:`email` --- An email and MIME handling package
===================================================

.. module:: email
   :synopsis: Package supporting the parsing, manipulation, and generation of
              email messages, including MIME documents.
.. moduleauthor:: Barry A. Warsaw <barry@python.org>
.. sectionauthor:: Barry A. Warsaw <barry@python.org>
.. Copyright (C) 2001-2010 Python Software Foundation


The :mod:`email` package is a library for managing email messages, including
MIME and other :rfc:`2822`\ -based message documents.  It is specifically *not*
designed to do any sending of email messages to SMTP (:rfc:`2821`), NNTP, or
other servers; those are functions of modules such as :mod:`smtplib` and
:mod:`nntplib`.  The :mod:`email` package attempts to be as RFC-compliant as
possible, supporting in addition to :rfc:`2822`, such MIME-related RFCs as
:rfc:`2045`, :rfc:`2046`, :rfc:`2047`, and :rfc:`2231`.

The primary distinguishing feature of the :mod:`email` package is that it splits
the parsing and generating of email messages from the internal *object model*
representation of email.  Applications using the :mod:`email` package deal
primarily with objects; you can add sub-objects to messages, remove sub-objects
from messages, completely re-arrange the contents, etc.  There is a separate
parser and a separate generator which handle the transformation from flat text
to the object model, and then back to flat text again.  There are also handy
subclasses for some common MIME object types, and a few miscellaneous utilities
that help with such common tasks as extracting and parsing message field values,
creating RFC-compliant dates, etc.

The following sections describe the functionality of the :mod:`email` package.
The ordering follows a progression that should be common in applications: an
email message is read as flat text from a file or other source, the text is
parsed to produce the object structure of the email message, this structure is
manipulated, and finally, the object tree is rendered back into flat text.
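
The progression described above can be sketched in a few lines. This is a
minimal illustration, not a recommended recipe: the addresses, the header
values, and the ``X-Reviewed`` header name are made up for the example.

```python
import email

# Flat text, as it might be read from a file or other source.
raw_text = (
    "From: alice@example.com\n"
    "To: bob@example.com\n"
    "Subject: Hello\n"
    "\n"
    "This is the body.\n"
)

# Parse the flat text into the object model.
msg = email.message_from_string(raw_text)

# Manipulate the object structure: change one header, add another.
msg.replace_header("Subject", "Hello again")
msg["X-Reviewed"] = "yes"  # hypothetical header, for illustration only

# Render the object tree back into flat text.
flat_text = msg.as_string()
print(flat_text)
```

The body is untouched by the header changes, so it comes back out exactly as
it went in.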

It is perfectly feasible to create the object structure out of whole cloth ---
i.e. completely from scratch.  From there, a similar progression can be taken as
above.
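
As a rough sketch of building a message from scratch, the following uses the
MIME subclasses from :mod:`email.mime`. The addresses, subject, and part
contents are placeholders chosen for the example.

```python
from email.mime.multipart import MIMEMultipart
from email.mime.text import MIMEText

# Create the container and set its headers directly on the object.
msg = MIMEMultipart()
msg["From"] = "alice@example.com"
msg["To"] = "bob@example.com"
msg["Subject"] = "Report"

# Add sub-objects; each attach() call appends one part to the payload.
msg.attach(MIMEText("Plain text part.\n"))
msg.attach(MIMEText("<p>HTML part.</p>\n", "html"))

# From here the progression is the same: render the tree to flat text.
flat_text = msg.as_string()
part_types = [part.get_content_type() for part in msg.get_payload()]
print(part_types)
```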

Also included are detailed specifications of all the classes and modules that
the :mod:`email` package provides, the exception classes you might encounter
while using the :mod:`email` package, some auxiliary utilities, and a few
examples.  For users of the older :mod:`mimelib` package, or previous versions
of the :mod:`email` package, a section on differences and porting is provided.

Contents of the :mod:`email` package documentation:

.. toctree::

   email.message.rst
   email.parser.rst
   email.generator.rst
   email.policy.rst
   email.headerregistry.rst
   email.mime.rst
   email.header.rst
   email.charset.rst
   email.encoders.rst
   email.errors.rst
   email.util.rst
   email.iterators.rst
   email-examples.rst


.. seealso::

   Module :mod:`smtplib`
      SMTP protocol client

   Module :mod:`nntplib`
      NNTP protocol client


.. _email-pkg-history:

Package History
---------------

This table describes the release history of the email package, corresponding to
the version of Python that the package was released with.  For purposes of this
document, when you see a note about change or added versions, these refer to the
Python version the change was made in, *not* the email package version.  This
table also describes the Python compatibility of each version of the package.

+---------------+------------------------------+-----------------------+
| email version | distributed with             | compatible with       |
+===============+==============================+=======================+
| :const:`1.x`  | Python 2.2.0 to Python 2.2.1 | *no longer supported* |
+---------------+------------------------------+-----------------------+
| :const:`2.5`  | Python 2.2.2+ and Python 2.3 | Python 2.1 to 2.5     |
+---------------+------------------------------+-----------------------+
| :const:`3.0`  | Python 2.4                   | Python 2.3 to 2.5     |
+---------------+------------------------------+-----------------------+
| :const:`4.0`  | Python 2.5                   | Python 2.3 to 2.5     |
+---------------+------------------------------+-----------------------+
| :const:`5.0`  | Python 3.0 and Python 3.1    | Python 3.0 to 3.2     |
+---------------+------------------------------+-----------------------+
| :const:`5.1`  | Python 3.2                   | Python 3.0 to 3.2     |
+---------------+------------------------------+-----------------------+

Here are the major differences between :mod:`email` version 5.1 and
version 5.0:

* It is once again possible to parse messages containing non-ASCII bytes,
  and to reproduce such messages if the data containing the non-ASCII
  bytes is not modified.

* New functions :func:`message_from_bytes` and :func:`message_from_binary_file`,
  and new classes :class:`~email.parser.BytesFeedParser` and
  :class:`~email.parser.BytesParser` allow binary message data to be parsed
  into model objects.

* Given bytes input to the model, :meth:`~email.message.Message.get_payload`
  will by default decode a message body that has a
  :mailheader:`Content-Transfer-Encoding` of ``8bit`` using the charset
  specified in the MIME headers and return the resulting string.

* Given bytes input to the model, :class:`~email.generator.Generator` will
  convert message bodies that have a :mailheader:`Content-Transfer-Encoding` of
  8bit to instead have a 7bit Content-Transfer-Encoding.

* New class :class:`~email.generator.BytesGenerator` produces bytes
  as output, preserving any unchanged non-ASCII data that was
  present in the input used to build the model, including message bodies
  with a :mailheader:`Content-Transfer-Encoding` of 8bit.
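
The bytes-handling behaviour listed above can be illustrated with a small
sketch. The sample message is invented for the example, and the exact 7bit
encoding that :class:`~email.generator.Generator` chooses is deliberately not
asserted here.

```python
import email
from email.generator import BytesGenerator
from io import BytesIO

# Binary message data with an 8bit body containing non-ASCII (UTF-8) bytes.
raw = (
    b"From: sender@example.com\n"
    b'Content-Type: text/plain; charset="utf-8"\n'
    b"Content-Transfer-Encoding: 8bit\n"
    b"\n"
    + "café\n".encode("utf-8")
)

# Parse bytes into model objects.
msg = email.message_from_bytes(raw)

# get_payload() decodes the 8bit body using the charset from the headers.
body_text = msg.get_payload()

# The string generator (used by as_string) re-encodes the body so that the
# output stays within ASCII.
ascii_text = msg.as_string()

# BytesGenerator writes bytes and preserves the unmodified non-ASCII data.
buffer = BytesIO()
BytesGenerator(buffer).flatten(msg)
raw_again = buffer.getvalue()
```

Calling ``as_string()`` does not alter the message itself, which is why the
original bytes are still available to ``BytesGenerator`` afterwards.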

Here are the major differences between :mod:`email` version 5.0 and version 4:

* All operations are on unicode strings.  Text inputs must be strings,
  text outputs are strings.  Outputs are limited to the ASCII character
  set and so can be encoded to ASCII for transmission.  Inputs are also
  limited to ASCII; this is an acknowledged limitation of email 5.0 and
  means it can only be used to parse email that is 7bit clean.

Here are the major differences between :mod:`email` version 4 and version 3:

* All modules have been renamed according to :pep:`8` standards.  For example,
  the version 3 module :mod:`email.Message` was renamed to :mod:`email.message` in
  version 4.

* A new subpackage :mod:`email.mime` was added and all the version 3
  :mod:`email.MIME\*` modules were renamed and situated into the :mod:`email.mime`
  subpackage.  For example, the version 3 module :mod:`email.MIMEText` was renamed
  to :mod:`email.mime.text`.

  *Note that the version 3 names will continue to work until Python 2.6*.

* The :mod:`email.mime.application` module was added, which contains the
  :class:`MIMEApplication` class.

* Methods that were deprecated in version 3 have been removed.  These include
  :meth:`Generator.__call__`, :meth:`Message.get_type`,
  :meth:`Message.get_main_type`, :meth:`Message.get_subtype`.

* Fixes have been added for :rfc:`2231` support which can change some of the
  return types for :func:`Message.get_param` and friends.  Under some
  circumstances, values which used to return a 3-tuple now return simple strings
  (specifically, if all extended parameter segments were unencoded, there is no
  language and charset designation expected, so the return type is now a simple
  string).  Also, %-decoding used to be done for both encoded and unencoded
  segments; this decoding is now done only for encoded segments.
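
The :rfc:`2231` return types mentioned in the last item can be seen in a short
sketch. The ``title`` parameter and its values are invented for the example;
``collapse_rfc2231_value`` is the helper in :mod:`email.utils` for turning
either form into a plain string.

```python
import email
from email.utils import collapse_rfc2231_value

# An encoded extended parameter: charset and language precede the value.
encoded_msg = email.message_from_string(
    "Content-Type: text/plain; title*=us-ascii'en'This%20is%20fun\n\n"
)

# Unencoded continuation segments: no charset or language designation.
unencoded_msg = email.message_from_string(
    'Content-Type: text/plain; title*0="This%20is "; title*1="fun"\n\n'
)

# Encoded segments yield a (charset, language, value) 3-tuple, %-decoded.
encoded_value = encoded_msg.get_param("title")

# Unencoded segments yield a simple string, and are not %-decoded.
unencoded_value = unencoded_msg.get_param("title")

print(encoded_value)
print(unencoded_value)
print(collapse_rfc2231_value(encoded_value))
```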

Here are the major differences between :mod:`email` version 3 and version 2:

* The :class:`FeedParser` class was introduced, and the :class:`Parser` class
  was implemented in terms of the :class:`FeedParser`.  All parsing therefore is
  non-strict, and parsing will make a best effort never to raise an exception.
  Problems found while parsing messages are stored in the message's *defects*
  attribute.

* All aspects of the API which raised :exc:`DeprecationWarning`\ s in version 2
  have been removed.  These include the *_encoder* argument to the
  :class:`MIMEText` constructor, the :meth:`Message.add_payload` method, the
  :func:`Utils.dump_address_pair` function, and the functions :func:`Utils.decode`
  and :func:`Utils.encode`.

* New :exc:`DeprecationWarning`\ s have been added to:
  :meth:`Generator.__call__`, :meth:`Message.get_type`,
  :meth:`Message.get_main_type`, :meth:`Message.get_subtype`, and the *strict*
  argument to the :class:`Parser` class.  These are expected to be removed in
  future versions.

* Support for Pythons earlier than 2.3 has been removed.
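
As a small sketch of the non-strict parsing described in the first item, the
following feeds a deliberately malformed message to a ``FeedParser`` in
arbitrary chunks. The message text is invented; it declares a multipart type
without a boundary parameter.

```python
from email import errors
from email.parser import FeedParser

parser = FeedParser()

# Data can be fed in pieces; chunks need not end on line boundaries.
parser.feed("Content-Type: multipart/mixed\nSubj")
parser.feed("ect: broken\n\nbody text\n")

# close() returns the root message; no exception is raised for the
# missing boundary.
msg = parser.close()

# Problems found during parsing are recorded on the message instead.
defect_names = [type(defect).__name__ for defect in msg.defects]
print(defect_names)
```

An application that cares about well-formedness can inspect ``msg.defects``
after parsing, while one that only wants a best-effort result can ignore it.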

Here are the differences between :mod:`email` version 2 and version 1:

* The :mod:`email.Header` and :mod:`email.Charset` modules have been added.

* The pickle format for :class:`Message` instances has changed.  Since this was
  never (and still isn't) formally defined, this isn't considered a backward
  incompatibility.  However if your application pickles and unpickles
  :class:`Message` instances, be aware that in :mod:`email` version 2,
  :class:`Message` instances now have private variables *_charset* and
  *_default_type*.

* Several methods in the :class:`Message` class have been deprecated, or their
  signatures changed.  Also, many new methods have been added.  See the
  documentation for the :class:`Message` class for details.  The changes should be
  completely backward compatible.

* The object structure has changed in the face of :mimetype:`message/rfc822`
  content types.  In :mod:`email` version 1, such a type would be represented by a
  scalar payload, i.e. the container message's :meth:`is_multipart` returned
  false, and :meth:`get_payload` returned not a list object but a single
  :class:`Message` instance.

  This structure was inconsistent with the rest of the package, so the object
  representation for :mimetype:`message/rfc822` content types was changed.  In
  :mod:`email` version 2, the container *does* return ``True`` from
  :meth:`is_multipart`, and :meth:`get_payload` returns a list containing a single
  :class:`Message` item.

  Note that this is one place that backward compatibility could not be completely
  maintained.  However, if you're already testing the return type of
  :meth:`get_payload`, you should be fine.  You just need to make sure your code
  doesn't do a :meth:`set_payload` with a :class:`Message` instance on a container
  with a content type of :mimetype:`message/rfc822`.

* The :class:`Parser` constructor's *strict* argument was added, and its
  :meth:`parse` and :meth:`parsestr` methods grew a *headersonly* argument.  The
  *strict* flag was also added to functions :func:`email.message_from_file` and
  :func:`email.message_from_string`.

* :meth:`Generator.__call__` is deprecated; use :meth:`Generator.flatten`
  instead.  The :class:`Generator` class has also grown the :meth:`clone` method.

* The :class:`DecodedGenerator` class in the :mod:`email.Generator` module was
  added.

* The intermediate base classes :class:`MIMENonMultipart` and
  :class:`MIMEMultipart` have been added, and interposed in the class hierarchy
  for most of the other MIME-related derived classes.

* The *_encoder* argument to the :class:`MIMEText` constructor has been
  deprecated.  Encoding now happens implicitly based on the *_charset* argument.

* The following functions in the :mod:`email.Utils` module have been deprecated:
  :func:`dump_address_pair`, :func:`decode`, and :func:`encode`.  The following
  functions have been added to the module: :func:`make_msgid`,
  :func:`decode_rfc2231`, :func:`encode_rfc2231`, and :func:`decode_params`.

* The non-public function :func:`email.Iterators._structure` was added.
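
The :mimetype:`message/rfc822` object structure described in the list above
can be checked with a short sketch. The nested message text is invented for
the example.

```python
import email

# A container whose body is itself a complete message.
raw_text = (
    "Content-Type: message/rfc822\n"
    "\n"
    "Subject: inner message\n"
    "\n"
    "Inner body.\n"
)

outer = email.message_from_string(raw_text)

# The container reports itself as multipart, and its payload is a list
# holding exactly one Message object.
payload = outer.get_payload()
inner = payload[0]

print(outer.is_multipart(), len(payload), inner["Subject"])
```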


Differences from :mod:`mimelib`
-------------------------------

The :mod:`email` package was originally prototyped as a separate library called
`mimelib <http://mimelib.sf.net/>`_. Changes have been made so that method names
are more consistent, and some methods or modules have either been added or
removed.  The semantics of some of the methods have also changed.  For the most
part, any functionality available in :mod:`mimelib` is still available in the
:mod:`email` package, albeit often in a different way.  Backward compatibility
between the :mod:`mimelib` package and the :mod:`email` package was not a
priority.

Here is a brief description of the differences between the :mod:`mimelib` and
the :mod:`email` packages, along with hints on how to port your applications.

Of course, the most visible difference between the two packages is that the
package name has been changed to :mod:`email`.  In addition, the top-level
package has the following differences:

* :func:`messageFromString` has been renamed to :func:`message_from_string`.

* :func:`messageFromFile` has been renamed to :func:`message_from_file`.

The :class:`Message` class has the following differences:

* The method :meth:`asString` was renamed to :meth:`as_string`.

* The method :meth:`ismultipart` was renamed to :meth:`is_multipart`.

* The :meth:`get_payload` method has grown a *decode* optional argument.

* The method :meth:`getall` was renamed to :meth:`get_all`.

* The method :meth:`addheader` was renamed to :meth:`add_header`.

* The method :meth:`gettype` was renamed to :meth:`get_type`.

* The method :meth:`getmaintype` was renamed to :meth:`get_main_type`.

* The method :meth:`getsubtype` was renamed to :meth:`get_subtype`.

* The method :meth:`getparams` was renamed to :meth:`get_params`. Also, whereas
  :meth:`getparams` returned a list of strings, :meth:`get_params` returns a list
  of 2-tuples, effectively the key/value pairs of the parameters, split on the
  ``'='`` sign.

* The method :meth:`getparam` was renamed to :meth:`get_param`.

* The method :meth:`getcharsets` was renamed to :meth:`get_charsets`.

* The method :meth:`getfilename` was renamed to :meth:`get_filename`.

* The method :meth:`getboundary` was renamed to :meth:`get_boundary`.

* The method :meth:`setboundary` was renamed to :meth:`set_boundary`.

* The method :meth:`getdecodedpayload` was removed.  To get similar
  functionality, pass a true value to the *decode* flag of the
  :meth:`get_payload` method.

* The method :meth:`getpayloadastext` was removed.  Similar functionality is
  supported by the :class:`DecodedGenerator` class in the :mod:`email.generator`
  module.

* The method :meth:`getbodyastext` was removed.  You can get similar
  functionality by creating an iterator with :func:`typed_subpart_iterator` in the
  :mod:`email.iterators` module.
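
For readers porting code, here is a brief sketch of several of the renamed
:class:`Message` methods as they exist in the :mod:`email` package. The sample
headers and the ``hello.txt`` filename are made up for the example.

```python
import email

raw_text = (
    "Received: from a.example.com\n"
    "Received: from b.example.com\n"
    'Content-Type: text/plain; charset="us-ascii"\n'
    "Content-Transfer-Encoding: base64\n"
    "\n"
    "aGVsbG8=\n"
)
msg = email.message_from_string(raw_text)

# get_all() (formerly getall) returns every value of a repeated header.
received = msg.get_all("Received")

# get_params() (formerly getparams) returns 2-tuples of key/value pairs.
params = msg.get_params()

# get_payload(decode=True) replaces the removed getdecodedpayload();
# the decoded result is a bytes object.
decoded = msg.get_payload(decode=True)

# add_header() (formerly addheader) accepts parameters as keywords.
msg.add_header("Content-Disposition", "attachment", filename="hello.txt")
filename = msg.get_filename()

print(received, params, decoded, filename)
```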

The :class:`Parser` class has no differences in its public interface. It does
have some additional smarts to recognize :mimetype:`message/delivery-status`
type messages, which it represents as a :class:`Message` instance containing
separate :class:`Message` subparts for each header block in the delivery status
notification [#]_.

The :class:`Generator` class has no differences in its public interface.  There
is a new class in the :mod:`email.generator` module though, called
:class:`DecodedGenerator` which provides most of the functionality previously
available in the :meth:`Message.getpayloadastext` method.

The following modules and classes have been changed:

* The :class:`MIMEBase` class constructor arguments *_major* and *_minor* have
  changed to *_maintype* and *_subtype* respectively.

* The ``Image`` class/module has been renamed to ``MIMEImage``.  The *_minor*
  argument has been renamed to *_subtype*.

* The ``Text`` class/module has been renamed to ``MIMEText``.  The *_minor*
  argument has been renamed to *_subtype*.

* The ``MessageRFC822`` class/module has been renamed to ``MIMEMessage``.  Note
  that an earlier version of :mod:`mimelib` called this class/module ``RFC822``,
  but that clashed with the Python standard library module :mod:`rfc822` on some
  case-insensitive file systems.

  Also, the :class:`MIMEMessage` class now represents any kind of MIME message
  with main type :mimetype:`message`.  It takes an optional argument *_subtype*
  which is used to set the MIME subtype.  *_subtype* defaults to
  :mimetype:`rfc822`.
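
A minimal sketch of the renamed classes and arguments follows, using the
module paths of current :mod:`email` versions. The message contents are
placeholders.

```python
from email.mime.base import MIMEBase
from email.mime.message import MIMEMessage
from email.mime.text import MIMEText

# MIMEText takes the subtype as its second argument (_subtype).
inner = MIMEText("Inner body.\n", "plain")
inner["Subject"] = "inner"

# MIMEMessage wraps another message; _subtype defaults to rfc822.
wrapper = MIMEMessage(inner)

# MIMEBase takes the main type and subtype (_maintype, _subtype).
generic = MIMEBase("application", "octet-stream")

print(wrapper.get_content_type())
print(generic.get_content_type())
```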

:mod:`mimelib` provided some utility functions in its :mod:`address` and
:mod:`date` modules.  All of these functions have been moved to the
:mod:`email.utils` module.

The ``MsgReader`` class/module has been removed.  Its functionality is most
closely supported in the :func:`body_line_iterator` function in the
:mod:`email.iterators` module.
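
The two iterator helpers named in this section can be sketched as follows.
The two-part message is invented for the example.

```python
from email.iterators import body_line_iterator, typed_subpart_iterator
from email.mime.multipart import MIMEMultipart
from email.mime.text import MIMEText

msg = MIMEMultipart()
msg.attach(MIMEText("first line\nsecond line\n"))
msg.attach(MIMEText("<p>markup</p>\n", "html"))

# body_line_iterator() walks the message tree and yields every line of
# every string payload, skipping headers.
lines = list(body_line_iterator(msg))

# typed_subpart_iterator() yields only subparts of the given MIME type,
# which is roughly what getbodyastext() callers tend to need.
plain_parts = list(typed_subpart_iterator(msg, "text", "plain"))

print(lines)
print(len(plain_parts))
```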

.. rubric:: Footnotes

.. [#] Delivery Status Notifications (DSN) are defined in :rfc:`1894`.