Episode 70
Processing Real-world HTML
Edward O'Connor from djangosd gives an overview of html5lib, a major-desktop-browser-compatible HTML parser and tokenizer for both Ruby and Python.
This talk was part of the DjangoSD/SD Ruby mashup meeting.
Bonus content: download the slides from this talk.
or