SiteOne Crawler logo

SiteOne Crawler

Cross-platform website crawler and analyzer for SEO, security, accessibility, and performance

Repository activity
  • Stars770
  • Forks68
  • Open Issues19
License

MIT

Languages
  • Rust
  • HTML
  • Shell
SiteOne Crawler screenshot

About SiteOne Crawler

SiteOne Crawler is a single-binary website crawler and analyzer for Windows, macOS, and Linux, with no runtime dependencies. Rewritten in Rust, it crawls every page and asset, follows robots.txt, and reports status codes, response times, sizes, and errors like 404s and redirects.

Built-in analyzers cover SEO, security headers, accessibility, best practices, performance, and SSL/TLS, with a quality score from 0.0 to 10.0 across five categories. Results come as an interactive HTML report, structured JSON, or colored terminal text, and can be emailed through a built-in SMTP sender.

It also clones whole sites to browsable offline files, converts pages to clean Markdown for feeding to AI tools, and generates sitemap.xml and sitemap.txt. A CI/CD quality gate with configurable thresholds exits with code 10 on failure, and the crawler doubles as a stress tester and cache warmer.

Key features

  • Crawls every page and asset, flagging 404s, redirects, and errors
  • Quality scoring across performance, SEO, security, accessibility, best practices
  • Interactive HTML, JSON, and text reports with built-in SMTP mailer
  • Offline website cloning and clean Markdown export for AI tools
  • CI/CD quality gate with configurable thresholds and stress testing

Details

On GitHub since
2023
Language
Rust
License
MIT
Platforms
Windows · macOS · Linux (x64 & arm64)
Install
Binary · Homebrew · apt · dnf · zypper · apk