CVAT

Open-source data annotation platform for image, video, and 3D datasets with AI-assisted labeling

Repository activity
  • Stars: 16.1k
  • Forks: 3.7k
  • Open issues: 614
License

MIT

Languages
  • Python
  • TypeScript
  • JavaScript

About CVAT

CVAT is a free, self-hosted open-source platform for computer vision annotation. It is used to turn images, videos, and 3D data into model-ready datasets while keeping annotation infrastructure in your own environment. The platform is built for teams that need dataset creation, review, and management in one place.

It supports AI-assisted annotation, quality control, team collaboration, analytics, and developer APIs. CVAT also connects with cloud storage and can use your own ML models for detection, segmentation, and tracking to speed up labeling. Multi-user and multi-organization workflows include roles, task assignments, and review steps.

CVAT Community is the MIT-licensed core behind CVAT Online and CVAT Enterprise, and it is actively maintained by the CVAT engineering team. It can run entirely on your own infrastructure with Docker and Docker Compose. To try it without deploying anything, the project also offers browser access through CVAT Online.

Key features

  • Image, video, and 3D annotation
  • AI-assisted labeling with custom ML models
  • Quality control and review workflows
  • Multi-user and multi-organization collaboration
  • Developer APIs and SDKs

Details

  • First released: 2018
  • Self-hosting: Runs on your own infrastructure
  • Platforms: Web · Docker
  • Deployment: Self-hostable · Docker
  • License: MIT
  • Storage: Cloud storage integration