html to markdown article extractor