Production crawler for Markdown exports
Turn any sitemap into clean Markdown files.
Site2MD recursively parses sitemap indexes, fetches only listed URLs, extracts meaningful content, and creates frontmatter-rich Markdown plus a ZIP archive.
robots.txt aware rate-limited ZIP output
Start a sitemap crawl
Paste a top-level sitemap XML URL. Nested sitemap indexes are followed automatically.
1
Recursive sitemap parsing
Sitemap indexes are followed to a safe configurable depth with URL deduplication.
2
Clean Markdown conversion
Scripts, navigation, ads, forms, and tracking elements are removed before Turndown conversion.
3
Useful partial results
The result page shows converted and failed pages together with per-file downloads.