Home | Trees | Indices | Help |
---|
|
PageElement --+ | Tag --+ | markupbase.ParserBase --+ | | | sgmllib.SGMLParser --+ | BeautifulStoneSoup --+ | BeautifulSoup --+ | MinimalSoup
The MinimalSoup class is for parsing HTML that contains pathologically bad markup. It makes no assumptions about tag nesting, but it does know which tags are self-closing, that <script> tags contain Javascript and should not be parsed, that META tags may contain encoding information, and so on.
This also makes it better for subclassing than BeautifulStoneSoup or BeautifulSoup.
|
|||
Inherited from Inherited from Inherited from Inherited from Inherited from Inherited from Inherited from Inherited from Inherited from Inherited from Inherited from |
|
|||
RESET_NESTING_TAGS =
|
|||
NESTABLE_TAGS =
|
|||
Inherited from Inherited from Inherited from Inherited from Inherited from |
Home | Trees | Indices | Help |
---|
Generated by Epydoc 3.0beta1 on Thu Nov 8 17:49:29 2007 | http://epydoc.sourceforge.net |