Inheritance:
PageElement --> Tag --> BeautifulStoneSoup --> BeautifulSoup --> MinimalSoup
markupbase.ParserBase --> sgmllib.SGMLParser --> BeautifulStoneSoup
The MinimalSoup class is for parsing HTML that contains pathologically bad markup. It makes no assumptions about tag nesting, but it does know which tags are self-closing, that <script> tags contain JavaScript and should not be parsed, that META tags may contain encoding information, and so on.
Because it builds in so few assumptions, it is also a better base for subclassing than BeautifulStoneSoup or BeautifulSoup.
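The idea of a parser that applies no nesting rules but still recognizes self-closing tags and raw script content can be illustrated with a small sketch. This is not MinimalSoup's implementation and does not use its API: it is an independent illustration built on Python 3's standard `html.parser` module, and the names `SELF_CLOSING`, `Node`, `MinimalTreeBuilder`, `parse`, and `outline` are hypothetical. The set of self-closing tags is an assumed sample, and details such as how repeated tags of the same name are handled may differ from the library.

```python
from html.parser import HTMLParser

# Assumption: a small sample of self-closing tags, not the library's own list.
SELF_CLOSING = {"br", "hr", "img", "input", "meta", "link"}


class Node:
    """A tag in the tree; children are Node objects or text strings."""

    def __init__(self, name, attrs=None, parent=None):
        self.name = name
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []


class MinimalTreeBuilder(HTMLParser):
    """Tree builder with no nesting rules: a tag closes only on its own end tag."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.root = Node("[document]")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in SELF_CLOSING:
            # Descend into the new tag; no open tag is ever closed implicitly.
            self.current = node

    def handle_endtag(self, tag):
        # Find the nearest open tag with this name; ignore stray end tags.
        node = self.current
        while node is not self.root and node.name != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        # html.parser delivers <script> content here as raw, unparsed text.
        children = self.current.children
        if children and isinstance(children[-1], str):
            children[-1] += data  # merge adjacent text chunks
        else:
            children.append(data)


def parse(html):
    """Parse an HTML string and return the root Node of the resulting tree."""
    builder = MinimalTreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def outline(node):
    """Render a Node as nested (name, children) tuples for easy inspection."""
    return (node.name,
            [c if isinstance(c, str) else outline(c) for c in node.children])
```

In this sketch, `outline(parse('<p>intro<br><div>body<b>bold</p>'))` keeps the `div` inside the `p` exactly as written, treats `br` as an empty element, and lets the closing `</p>` close everything still open beneath it.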
RESET_NESTING_TAGS =
NESTABLE_TAGS =
Generated by Epydoc 3.0beta1 on Thu Nov 8 17:49:29 2007 (http://epydoc.sourceforge.net)