A robust web archive analytics toolkit
Pure-Python HTML parser with ElementTree support.
JavaScript Dom Api for Python, Html Parser and a Web scraping library