blob: 362b974a9ed3de6de53a800ff368da97dc62a9fc [file] [log] [blame]
\chapter{Structured Markup Processing Tools
\label{markup}}
Python supports a variety of modules to work with various forms of
structured data markup. This includes modules to work with the
Standard Generalized Markup Language (SGML) and the Hypertext Markup
Language (HTML), and several interfaces for working with the
Extensible Markup Language (XML).
It is important to note that modules in the \module{xml} package
require that there be at least one SAX-compliant XML parser available.
Starting with Python 2.3, the Expat parser is included with Python, so
the \refmodule{xml.parsers.expat} module will always be available.
You may still want to be aware of the \ulink{PyXML add-on
package}{http://pyxml.sourceforge.net/}; that package provides an
extended set of XML libraries for Python.
Python 2.5 introduces the \module{xmlcore} package; this package
provides the implementation of the \module{xml} package as distributed
with the standard library. The \module{xml} package, as in earlier
versions, provides an interface that will provide the PyXML
implementation of the interfaces when available, and the standard
library implementation if not. Applications that can use either the
PyXML implementation or the standard library's implementation may
continue to make imports from the \module{xml} package; applications
that want to only import the standard library's implementation can now
use the \module{xmlcore} package.
The documentation for the \module{xml.dom} and \module{xml.sax}
packages are the definition of the Python bindings for the DOM and SAX
interfaces.
\localmoduletable
\begin{seealso}
\seetitle[http://pyxml.sourceforge.net/]
{Python/XML Libraries}
{Home page for the PyXML package, containing an extension
of \module{xml} package bundled with Python.}
\end{seealso}