Crawl websites recursively to build a hierarchical map of pages. Convert HTML into clean, LLM-ready Markdown while stripping boilerplate...