% Copyright (C) 2001,2002 Python Software Foundation
% Author: barry@zope.com (Barry Warsaw)

\section{\module{email} ---
	 An email and MIME handling package}

\declaremodule{standard}{email}
\modulesynopsis{Package supporting the parsing, manipulation, and
    generation of email messages, including MIME documents.}
\moduleauthor{Barry A. Warsaw}{barry@zope.com}
\sectionauthor{Barry A. Warsaw}{barry@zope.com}

\versionadded{2.2}

The \module{email} package is a library for managing email messages,
including MIME and other \rfc{2822}-based message documents.  It
subsumes most of the functionality in several older standard modules
such as \refmodule{rfc822}, \refmodule{mimetools}, and
\refmodule{multifile}, as well as other non-standard packages such as
\module{mimecntl}.  It is specifically \emph{not} designed to do any
sending of email messages to SMTP (\rfc{2821}) servers; that is the
function of the \refmodule{smtplib} module.  The \module{email}
package attempts to be as RFC-compliant as possible, supporting, in
addition to \rfc{2822}, such MIME-related RFCs as
\rfc{2045}-\rfc{2047}, and \rfc{2231}.

The primary distinguishing feature of the \module{email} package is
that it splits the parsing and generating of email messages from the
internal \emph{object model} representation of email.  Applications
using the \module{email} package deal primarily with objects; you can
add sub-objects to messages, remove sub-objects from messages,
completely re-arrange the contents, etc.  There is a separate parser
and a separate generator which handle the transformation from flat
text to the object model, and then back to flat text again.  There
are also handy subclasses for some common MIME object types, and a few
miscellaneous utilities that help with such common tasks as extracting
and parsing message field values, creating RFC-compliant dates, etc.

The following sections describe the functionality of the
\module{email} package.  The ordering follows a progression that
should be common in applications: an email message is read as flat
text from a file or other source, the text is parsed to produce the
object structure of the email message, this structure is manipulated,
and finally rendered back into flat text.
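
As a rough sketch of this progression, the snippet below parses a
flat-text message, changes it, and renders it back to text.  It
assumes a current Python~3 interpreter, where the generator class is
importable from \module{email.generator} (the lower-case module name
is an assumption relative to the names used in this document).  The
addresses and message text are invented for illustration.

```python
import email
from io import StringIO
from email.generator import Generator

# Hypothetical flat-text message used only for this illustration.
FLAT_TEXT = (
    "From: alice@example.com\n"
    "To: bob@example.com\n"
    "Subject: Lunch\n"
    "\n"
    "Are you free at noon?\n"
)

# Step 1: parse the flat text into the object model.
msg = email.message_from_string(FLAT_TEXT)

# Step 2: manipulate the object structure.
msg.replace_header("Subject", "Lunch (rescheduled)")
msg.set_payload("Are you free at one o'clock instead?\n")

# Step 3: render the object structure back into flat text.
out = StringIO()
Generator(out, mangle_from_=False).flatten(msg)
flat_again = out.getvalue()
print(flat_again)
```

The parser and the generator never appear in step 2; the application
only touches the message object.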

It is perfectly feasible to create the object structure out of whole
cloth --- i.e. completely from scratch.  From there, a similar
progression can be taken as above.
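
A minimal sketch of building a message from scratch might look like
the following.  It assumes Python~3 module paths
(\module{email.mime.multipart} and \module{email.mime.text}), which
differ from the capitalized module names of this era; the header
values are placeholders.

```python
from email.mime.multipart import MIMEMultipart
from email.mime.text import MIMEText

# Build the container first, then set its headers like dictionary items.
outer = MIMEMultipart()
outer["From"] = "alice@example.com"      # placeholder address
outer["To"] = "bob@example.com"          # placeholder address
outer["Subject"] = "Meeting notes"

# Attach two sub-objects: a plain-text part and an HTML part.
outer.attach(MIMEText("Notes are attached.\n"))
outer.attach(MIMEText("<p>Notes are attached.</p>\n", "html"))

# From here the usual progression applies: render to flat text.
flat_text = outer.as_string()
print(flat_text)
```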

Also included are detailed specifications of all the classes and
modules that the \module{email} package provides, the exception
classes you might encounter while using the \module{email} package,
some auxiliary utilities, and a few examples.  For users of the older
\module{mimelib} package, or previous versions of the \module{email}
package, a section on differences and porting is provided.

\begin{seealso}
    \seemodule{smtplib}{SMTP protocol client}
\end{seealso}

\subsection{Representing an email message}
\input{emailmessage}

\subsection{Parsing email messages}
\input{emailparser}

\subsection{Generating MIME documents}
\input{emailgenerator}

\subsection{Creating email and MIME objects from scratch}
\input{emailmimebase}

\subsection{Internationalized headers}
\input{emailheaders}

\subsection{Representing character sets}
\input{emailcharsets}

\subsection{Encoders}
\input{emailencoders}

\subsection{Exception classes}
\input{emailexc}

\subsection{Miscellaneous utilities}
\input{emailutil}

\subsection{Iterators}
\input{emailiter}

\subsection{Differences from \module{email} v1 (up to Python 2.2.1)}

Version 1 of the \module{email} package was bundled with Python
releases up to Python 2.2.1.  Version 2 was developed for the Python
2.3 release, and backported to Python 2.2.2.  It was also available as
a separate distutils-based package.  \module{email} version 2 is
almost entirely backward compatible with version 1, with the
following differences:

\begin{itemize}
\item The \module{email.Header} and \module{email.Charset} modules
      have been added.

\item The pickle format for \class{Message} instances has changed.
      Since this was never (and still isn't) formally defined, this
      isn't considered a backward incompatibility.  However, if your
      application pickles and unpickles \class{Message} instances, be
      aware that in \module{email} version 2, \class{Message}
      instances now have private variables \var{_charset} and
      \var{_default_type}.

\item Several methods in the \class{Message} class have been
      deprecated, or their signatures changed.  Also, many new methods
      have been added.  See the documentation for the \class{Message}
      class for details.  The changes should be completely backward
      compatible.

\item The object structure has changed in the face of
      \mimetype{message/rfc822} content types.  In \module{email}
      version 1, such a type would be represented by a scalar payload,
      i.e. the container message's \method{is_multipart()} returned
      false, and \method{get_payload()} returned not a list object but
      a single \class{Message} instance.

      This structure was inconsistent with the rest of the package, so
      the object representation for \mimetype{message/rfc822} content
      types was changed.  In \module{email} version 2, the container
      \emph{does} return \code{True} from \method{is_multipart()}, and
      \method{get_payload()} returns a list containing a single
      \class{Message} item.

      Note that this is one place that backward compatibility could
      not be completely maintained.  However, if you're already
      testing the return type of \method{get_payload()}, you should be
      fine.  You just need to make sure your code doesn't do a
      \method{set_payload()} with a \class{Message} instance on a
      container with a content type of \mimetype{message/rfc822}.

\item The \class{Parser} constructor's \var{strict} argument was
      added, and its \method{parse()} and \method{parsestr()} methods
      grew a \var{headersonly} argument.  The \var{strict} flag was
      also added to the functions \function{email.message_from_file()}
      and \function{email.message_from_string()}.

\item \method{Generator.__call__()} is deprecated; use
      \method{Generator.flatten()} instead.  The \class{Generator}
      class has also grown the \method{clone()} method.

\item The \class{DecodedGenerator} class in the
      \module{email.Generator} module was added.

\item The intermediate base classes \class{MIMENonMultipart} and
      \class{MIMEMultipart} have been added, and interposed in the
      class hierarchy for most of the other MIME-related derived
      classes.

\item The \var{_encoder} argument to the \class{MIMEText} constructor
      has been deprecated.  Encoding now happens implicitly based
      on the \var{_charset} argument.

\item The following functions in the \module{email.Utils} module have
      been deprecated: \function{dump_address_pairs()},
      \function{decode()}, and \function{encode()}.  The following
      functions have been added to the module:
      \function{make_msgid()}, \function{decode_rfc2231()},
      \function{encode_rfc2231()}, and \function{decode_params()}.

\item The non-public function \function{email.Iterators._structure()}
      was added.
\end{itemize}
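
The version 2 structure for \mimetype{message/rfc822} described in
the list above can be checked with a short sketch.  It assumes a
Python~3 interpreter, where \class{MIMEMessage} and \class{MIMEText}
live in \module{email.mime.message} and \module{email.mime.text};
the subject text is a placeholder.

```python
from email.message import Message
from email.mime.message import MIMEMessage
from email.mime.text import MIMEText

# The message to be wrapped (placeholder content).
inner = MIMEText("The original note.\n")
inner["Subject"] = "Original"

# Wrapping it produces a container of type message/rfc822.
container = MIMEMessage(inner)

# The container reports itself as multipart, and its payload is a
# list holding exactly one Message object.
payload = container.get_payload()
print(container.get_content_type(), container.is_multipart(), len(payload))

# Code that tests the payload type handles both layouts.
wrapped = payload[0] if isinstance(payload, list) else payload
print(wrapped["Subject"])
```

Testing the return type of \method{get_payload()}, as in the last two
statements, is the defensive pattern the compatibility note
recommends.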

\subsection{Differences from \module{mimelib}}

The \module{email} package was originally prototyped as a separate
library called
\ulink{\module{mimelib}}{http://mimelib.sf.net/}.
Changes have been made so that
method names are more consistent, and some methods or modules have
either been added or removed.  The semantics of some of the methods
have also changed.  For the most part, any functionality available in
\module{mimelib} is still available in the \refmodule{email} package,
albeit often in a different way.  Backward compatibility between
the \module{mimelib} package and the \module{email} package was not a
priority.

Here is a brief description of the differences between the
\module{mimelib} and the \refmodule{email} packages, along with hints on
how to port your applications.

Of course, the most visible difference between the two packages is
that the package name has been changed to \refmodule{email}.  In
addition, the top-level package has the following differences:

\begin{itemize}
\item \function{messageFromString()} has been renamed to
      \function{message_from_string()}.

\item \function{messageFromFile()} has been renamed to
      \function{message_from_file()}.

\end{itemize}

The \class{Message} class has the following differences:

\begin{itemize}
\item The method \method{asString()} was renamed to \method{as_string()}.

\item The method \method{ismultipart()} was renamed to
      \method{is_multipart()}.

\item The \method{get_payload()} method has grown an optional
      \var{decode} argument.

\item The method \method{getall()} was renamed to \method{get_all()}.

\item The method \method{addheader()} was renamed to \method{add_header()}.

\item The method \method{gettype()} was renamed to \method{get_type()}.

\item The method \method{getmaintype()} was renamed to
      \method{get_main_type()}.

\item The method \method{getsubtype()} was renamed to
      \method{get_subtype()}.

\item The method \method{getparams()} was renamed to
      \method{get_params()}.
      Also, whereas \method{getparams()} returned a list of strings,
      \method{get_params()} returns a list of 2-tuples, effectively
      the key/value pairs of the parameters, split on the \character{=}
      sign.

\item The method \method{getparam()} was renamed to \method{get_param()}.

\item The method \method{getcharsets()} was renamed to
      \method{get_charsets()}.

\item The method \method{getfilename()} was renamed to
      \method{get_filename()}.

\item The method \method{getboundary()} was renamed to
      \method{get_boundary()}.

\item The method \method{setboundary()} was renamed to
      \method{set_boundary()}.

\item The method \method{getdecodedpayload()} was removed.  To get
      similar functionality, pass the value 1 to the \var{decode} flag
      of the \method{get_payload()} method.

\item The method \method{getpayloadastext()} was removed.  Similar
      functionality is supported by the \class{DecodedGenerator} class
      in the \refmodule{email.Generator} module.

\item The method \method{getbodyastext()} was removed.  You can get
      similar functionality by creating an iterator with
      \function{typed_subpart_iterator()} in the
      \refmodule{email.Iterators} module.
\end{itemize}
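
To illustrate the renamed accessors from the list above, here is a
small sketch using \method{get_params()}, \method{get_param()}, and
the \var{decode} flag of \method{get_payload()}.  It is written for
Python~3, where (as an assumption relative to this document) the
decoded payload comes back as a \code{bytes} object and the flag is
usually passed as \code{True} rather than 1.  The message text is
invented.

```python
import email

# Hypothetical base64-encoded text message.
RAW = (
    'Content-Type: text/plain; charset="us-ascii"; format=flowed\n'
    "Content-Transfer-Encoding: base64\n"
    "\n"
    "aGVsbG8gd29ybGQ=\n"
)
msg = email.message_from_string(RAW)

# get_params() returns (key, value) 2-tuples rather than raw strings.
params = msg.get_params()
for key, value in params:
    print(repr(key), repr(value))

# get_param() looks up a single parameter by name.
charset = msg.get_param("charset")

# Without decode, the payload is returned as stored; with decode, the
# Content-Transfer-Encoding is undone.
encoded = msg.get_payload()
decoded = msg.get_payload(decode=True)
print(encoded.strip(), decoded)
```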

The \class{Parser} class has no differences in its public interface.
It does have some additional smarts to recognize
\mimetype{message/delivery-status} type messages, which it represents as
a \class{Message} instance containing separate \class{Message}
subparts for each header block in the delivery status
notification\footnote{Delivery Status Notifications (DSN) are defined
in \rfc{1894}.}.
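
The following sketch shows how such a notification might look once
parsed.  It assumes the Python~3 parser keeps the behavior described
here; the host name, recipient, and status values are made up, and a
real notification would normally arrive as one part of a larger
\mimetype{multipart/report} message.

```python
import email

# Hypothetical delivery-status body: two header blocks separated by a
# blank line.
RAW_DSN = (
    "Content-Type: message/delivery-status\n"
    "\n"
    "Reporting-MTA: dns; mail.example.com\n"
    "\n"
    "Final-Recipient: rfc822; nobody@example.com\n"
    "Action: failed\n"
    "Status: 5.1.1\n"
)
dsn = email.message_from_string(RAW_DSN)

# Each header block becomes its own Message subpart.
blocks = dsn.get_payload()
for block in blocks:
    print(block.items())
```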

The \class{Generator} class has no differences in its public
interface.  There is, however, a new class in the
\refmodule{email.Generator} module called \class{DecodedGenerator},
which provides most of the functionality previously available in the
\method{Message.getpayloadastext()} method.

The following modules and classes have been changed:

\begin{itemize}
\item The \class{MIMEBase} class constructor arguments \var{_major}
      and \var{_minor} have changed to \var{_maintype} and
      \var{_subtype} respectively.

\item The \code{Image} class/module has been renamed to
      \code{MIMEImage}.  The \var{_minor} argument has been renamed to
      \var{_subtype}.

\item The \code{Text} class/module has been renamed to
      \code{MIMEText}.  The \var{_minor} argument has been renamed to
      \var{_subtype}.

\item The \code{MessageRFC822} class/module has been renamed to
      \code{MIMEMessage}.  Note that an earlier version of
      \module{mimelib} called this class/module \code{RFC822}, but
      that clashed with the Python standard library module
      \refmodule{rfc822} on some case-insensitive file systems.

      Also, the \class{MIMEMessage} class now represents any kind of
      MIME message with main type \mimetype{message}.  It takes an
      optional argument \var{_subtype} which is used to set the MIME
      subtype.  \var{_subtype} defaults to \mimetype{rfc822}.
\end{itemize}
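
A brief sketch of the renamed constructor arguments follows.  It
assumes Python~3 import paths (\module{email.mime.base} and friends,
plus \module{email.encoders}) instead of the capitalized module names
used in this section; the payload bytes and file name are
placeholders.

```python
from email import encoders
from email.mime.base import MIMEBase
from email.mime.message import MIMEMessage
from email.mime.text import MIMEText

# MIMEBase takes _maintype and _subtype (formerly _major and _minor).
blob = MIMEBase(_maintype="application", _subtype="octet-stream")
blob.set_payload(b"\x00\x01\x02binary data")    # placeholder bytes
encoders.encode_base64(blob)
blob.add_header("Content-Disposition", "attachment", filename="data.bin")

# MIMEText takes _subtype as well (formerly _minor).
page = MIMEText("<p>Hello</p>\n", _subtype="html")

# MIMEMessage defaults its _subtype to rfc822.
wrapped = MIMEMessage(MIMEText("An enclosed note.\n"))

for part in (blob, page, wrapped):
    print(part.get_content_type())
```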

\module{mimelib} provided some utility functions in its
\module{address} and \module{date} modules.  All of these functions
have been moved to the \refmodule{email.Utils} module.

The \code{MsgReader} class/module has been removed.  Its functionality
is most closely supported in the \function{body_line_iterator()}
function in the \refmodule{email.Iterators} module.
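
As an illustration of where these helpers ended up, the sketch below
uses a few address and date utilities together with
\function{body_line_iterator()}.  It assumes Python~3, where the
modules are spelled \module{email.utils} and \module{email.iterators}
(lower case); the message itself is invented.

```python
import email
from email.iterators import body_line_iterator
from email.utils import formatdate, getaddresses, parseaddr

# Hypothetical message used only for this illustration.
msg = email.message_from_string(
    "From: Alice Example <alice@example.com>\n"
    "To: a@example.com, Bee <b@example.com>\n"
    "\n"
    "first line\n"
    "second line\n"
)

# Address helpers: split a header value into (real name, address).
name, address = parseaddr(msg["From"])
recipients = getaddresses(msg.get_all("To", []))

# Date helper: format a timestamp as an RFC-compliant date string.
date_header = formatdate(0)

# Iterate over the body one line at a time.
lines = list(body_line_iterator(msg))
print(name, address, recipients, date_header, lines)
```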

\subsection{Examples}

Here are a few examples of how to use the \module{email} package to
read, write, and send simple email messages, as well as more complex
MIME messages.

First, let's see how to create and send a simple text message:

\verbatiminput{email-simple.py}

Here's an example of how to send a MIME message containing a bunch of
family pictures that may be residing in a directory:

\verbatiminput{email-mime.py}

Here's an example of how to send the entire contents of a directory as
an email message:
\footnote{Thanks to Matthew Dixon Cowles for the original inspiration
          and examples.}

\verbatiminput{email-dir.py}

And finally, here's an example of how to unpack a MIME message like
the one above, into a directory of files:

\verbatiminput{email-unpack.py}