Colly

Fast, elegant scraping and crawling framework for Go

Repository activity
  • Stars: 25.3k
  • Forks: 1.9k
  • Open Issues: 194
License

Apache-2.0

Languages
  • Go
  • HTML
About Colly

Colly is a Go framework for writing crawlers, scrapers, and spiders. It exposes a clean callback-based API for extracting structured data from websites, with uses spanning data mining, data processing, and archiving.

It manages request delays and maximum concurrency per domain, handles cookies and sessions automatically, and supports sync, async, and parallel scraping. Caching, automatic encoding of non-Unicode responses, robots.txt handling, distributed scraping, and configuration via environment variables round out the feature set.

Colly is an open-source Go library. It is installed with go get, and many other scraping projects build on it. It is released under the Apache License 2.0.

Key features

  • Callback-based API for scrapers and crawlers
  • Request delays and max concurrency per domain
  • Automatic cookie and session handling
  • Sync, async, and parallel scraping
  • robots.txt support and distributed scraping

Details

First released
2017
Latest release
v2.2.0 · 2025
Language
Go
Install
go get github.com/gocolly/colly/v2
Deployment
Library
License
Apache-2.0