Just a suggestion to use https://github.com/lmmx/htmd to convert HTML to Markdown. Could be faster 🤷