CVAT

Open-source data annotation platform for image, video, and 3D datasets with AI-assisted labeling

Repository activity

Stars: 16.1k
Forks: 3.7k
Open Issues: 614

License

MIT

Languages

Python
TypeScript
JavaScript

Get it: Website · Online · GitHub · Docker Server · Docker UI

About CVAT

CVAT is a free, self-hosted open-source platform for computer vision annotation. It is used to turn images, videos, and 3D data into model-ready datasets while keeping annotation infrastructure in your own environment. The platform is built for teams that need dataset creation, review, and management in one place.

It supports AI-assisted annotation, quality control, team collaboration, analytics, and developer APIs. CVAT also connects with cloud storage and can use your own ML models for detection, segmentation, and tracking to speed up labeling. Multi-user and multi-organization workflows include roles, task assignments, and review steps.

CVAT Community is the MIT-licensed core behind CVAT Online and CVAT Enterprise, and it is actively maintained by the CVAT engineering team. It can run entirely on your own infrastructure with Docker and Docker Compose, and CVAT Online offers browser access for trying the platform without deploying it.

Key features

Image, video, and 3D annotation
AI-assisted labeling with custom ML models
Quality control and review workflows
Multi-user and multi-organization collaboration
Developer APIs and SDKs

Details

First released: 2018
Self-hosting: Runs on your own infrastructure
Platforms: Web · Docker
Deployment: Self-hostable · Docker
License: MIT
Storage: Cloud storage integration