Gumbo
0.9.2
A C library for parsing HTML.
|
This provides a conformant, no-dependencies implementation of the HTML5 parsing algorithm. It supports only UTF8; if you need to parse a different encoding, run a preprocessing step to convert to UTF8. It returns a parse tree made of the structs in this file.
Example:
HTML5 Spec:
http://www.whatwg.org/specs/web-apps/current-work/multipage/syntax.html