13. Structured Markup Processing Tools

Python provides several modules for processing structured data markup: parsers for the Standard Generalized Markup Language (SGML) and the Hypertext Markup Language (HTML), and several interfaces for working with the Extensible Markup Language (XML).

sgmllib            Only as much of an SGML parser as needed to parse HTML.
htmllib            A parser for HTML documents.
htmlentitydefs     Definitions of HTML general entities.
xmllib             A parser for XML documents.
xml.parsers.expat  An interface to the Expat non-validating XML parser.
xml.sax            Package containing SAX2 base classes and convenience functions.
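
As a rough illustration of how the two XML interfaces listed above differ in use, the sketch below collects element names from a small document, first with xml.parsers.expat callbacks and then with an xml.sax content handler. The names SAMPLE, element_names_expat, NameCollector, and element_names_sax are hypothetical and chosen only for this example; it is a minimal sketch, not a recommended structure for real applications.

```python
import xml.parsers.expat
import xml.sax

# Hypothetical sample document used only for this illustration.
SAMPLE = b"<doc><item id='1'>first</item><item id='2'>second</item></doc>"


def element_names_expat(data):
    """Return start-tag names in document order using Expat callbacks."""
    names = []
    parser = xml.parsers.expat.ParserCreate()
    # Expat calls this handler with the tag name and a dict of attributes.
    parser.StartElementHandler = lambda name, attrs: names.append(name)
    parser.Parse(data, True)  # True marks this as the final chunk of input
    return names


class NameCollector(xml.sax.ContentHandler):
    """SAX handler that records the name of every element it sees."""

    def __init__(self):
        xml.sax.ContentHandler.__init__(self)
        self.names = []

    def startElement(self, name, attrs):
        self.names.append(name)


def element_names_sax(data):
    """Return start-tag names in document order using the SAX interface."""
    handler = NameCollector()
    xml.sax.parseString(data, handler)
    return handler.names


if __name__ == "__main__":
    print(element_names_expat(SAMPLE))
    print(element_names_sax(SAMPLE))
```

Both functions walk the same document and should report the same element names. The Expat version assigns plain callables to parser attributes, while the SAX version groups the callbacks in a handler class.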

