Colly

Fast, elegant scraping and crawling framework for Go

Repository activity

Stars: 25.3k
Forks: 1.9k
Open Issues: 194

License

Apache-2.0

Languages

Go
HTML

Get it: Website, GitHub

About Colly

Colly is a Go framework for writing crawlers, scrapers, and spiders. It exposes a clean callback-based API for extracting structured data from websites, with uses spanning data mining, data processing, and archiving.

It manages request delays and maximum concurrency per domain, handles cookies and sessions automatically, and supports sync, async, and parallel scraping. Caching, automatic encoding of non-unicode responses, robots.txt handling, distributed scraping, and configuration via environment variables round out the feature set.

Colly is an open source Go library, installed with go get and used by many other scraping projects. It is released under the Apache License 2.0.

Key features

Callback-based API for scrapers and crawlers
Request delays and max concurrency per domain
Automatic cookie and session handling
Sync, async, and parallel scraping
robots.txt support and distributed scraping

Details

First released: 2017
Latest release: v2.2.0 · 2025
Language: Go
Install: go get github.com/gocolly/colly/v2
Deployment: Library
License: Apache-2.0