Open-source data annotation platform for image, video, and 3D datasets with AI-assisted labeling
- Stars16.1k
- Forks3.7k
- Open Issues614
MIT
- Python
- TypeScript
- JavaScript

About CVAT
CVAT is a free, self-hosted open-source platform for computer vision annotation. It is used to turn images, videos, and 3D data into model-ready datasets while keeping annotation infrastructure in your own environment. The platform is built for teams that need dataset creation, review, and management in one place.
It supports AI-assisted annotation, quality control, team collaboration, analytics, and developer APIs. CVAT also connects with cloud storage and can use your own ML models for detection, segmentation, and tracking to speed up labeling. Multi-user and multi-organization workflows include roles, task assignments, and review steps.
CVAT Community is the MIT-licensed core behind CVAT Online and CVAT Enterprise, and it is actively maintained by the CVAT engineering team. It can run entirely on your own infrastructure with Docker and Docker Compose, and the project provides browser access through CVAT Online for trying it without deployment.
Key features
- Image, video, and 3D annotation
- AI-assisted labeling with custom ML models
- Quality control and review workflows
- Multi-user and multi-organization collaboration
- Developer APIs and SDKs
Details
- First released
- 2018
- Self-hosting
- Runs on your own infrastructure
- Platforms
- Web · Docker
- Deployment
- self-hostable · docker
- License
- MIT
- Storage
- Cloud storage integration
