The sanitizer will remove certain tags (script, marquee, head, frame, menu, object, et al.). It retains predominantly ‘content’ tags.
The sanitizer will remove most attributes. It will keep only hrefs on a tags and colspans on td/th tags.
The sanitizer can be a great tool for cleaning up the HTML saved by the likes of Word and OpenOffice.