
Datasaur
NLP data labeling platform with AI-assisted automation, quality workflows, and private LLM options

Collaborative video and image annotation platform for computer vision
CVAT provides an interactive web interface to label images and videos, supporting dozens of formats, Docker deployment, SDK, CLI, and integrations with Roboflow and HuggingFace for scalable data annotation.

CVAT (Computer Vision Annotation Tool) is a web‑based platform that lets teams label images and videos through an interactive UI. It supports more than 30 import and export formats—including COCO, YOLO, PASCAL VOC, and MOT—so datasets can be created or converted without manual preprocessing.
The tool can be used instantly via the free online service at cvat.ai (limited to 10 tasks and 500 MB of data) or self‑hosted with pre‑built Docker images for server and UI, enabling on‑premise security and scalability. Automation is possible through a RESTful API, a Python SDK, and a command‑line client, while integrations with Roboflow and HuggingFace streamline auto‑annotation and model‑in‑the‑loop workflows. Enterprise options add SSO, LDAP, and upcoming analytics for larger teams.
A vibrant community contributes plugins, documentation, and Docker updates, while the project offers paid enterprise support with 24‑hour SLA, training, and dedicated assistance. This ecosystem makes CVAT suitable for both research prototypes and production‑grade annotation pipelines.
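As a rough illustration of the REST automation mentioned above, the sketch below builds (but does not send) an authenticated request for a server's task list using only the Python standard library. The `/api/tasks` path, the `Token` authorization scheme, the `http://localhost:8080` host, and the helper name `build_list_tasks_request` are assumptions made for this example, not details taken from this page; check them against the API documentation of the CVAT version you run.

```python
import urllib.request


def build_list_tasks_request(host: str, token: str) -> urllib.request.Request:
    """Build, without sending, a GET request for the task list.

    Assumptions (verify against your server's API docs):
    - tasks are listed under the path /api/tasks
    - the server accepts an "Authorization: Token <key>" header
    """
    url = host.rstrip("/") + "/api/tasks"
    return urllib.request.Request(
        url,
        headers={
            "Authorization": f"Token {token}",
            "Accept": "application/json",
        },
        method="GET",
    )


if __name__ == "__main__":
    # Placeholder host and token; nothing is sent over the network here.
    req = build_list_tasks_request("http://localhost:8080", "YOUR_TOKEN")
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would perform the call. Keeping construction separate from sending makes the request easy to inspect before it touches a real server.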
When teams consider CVAT, these hosted platforms usually appear on the same shortlist.

NLP data labeling platform with AI-assisted automation, quality workflows, and private LLM options

AI data labeling & evaluation platform for images, video, text, audio, and more

Computer vision labeling platform for images, video, LiDAR, and medical with AI-assisted tools
Semantic segmentation dataset creation: Produce high‑quality pixel masks for training segmentation models.
Video object tracking: Annotate bounding boxes across frames to generate training data for tracking algorithms.
Model fine‑tuning with auto‑annotations: Use the Roboflow integration to generate initial labels and refine them manually.
Dataset conversion: Import legacy PASCAL VOC labels and export to COCO format for downstream pipelines.
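To make the dataset conversion use case concrete, here is a minimal sketch of what a VOC-to-COCO conversion involves for bounding boxes. It is not CVAT's own converter. PASCAL VOC stores corner coordinates (`xmin`, `ymin`, `xmax`, `ymax`) in per-image XML, while COCO stores `[x, y, width, height]` in JSON. The function name `voc_to_coco`, the sample XML, and the category mapping are hypothetical, and the sketch assumes the corner values can be used as-is, with no one-pixel offset correction.

```python
import xml.etree.ElementTree as ET

# Hypothetical single-image VOC annotation used only for illustration.
SAMPLE_VOC = """
<annotation>
  <filename>street.jpg</filename>
  <size><width>640</width><height>480</height></size>
  <object>
    <name>car</name>
    <bndbox><xmin>48</xmin><ymin>240</ymin><xmax>195</xmax><ymax>371</ymax></bndbox>
  </object>
</annotation>
"""


def voc_to_coco(xml_text, image_id, category_ids):
    """Convert one VOC XML document into a COCO image entry and annotations."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    image = {
        "id": image_id,
        "file_name": root.findtext("filename"),
        "width": int(size.findtext("width")),
        "height": int(size.findtext("height")),
    }
    annotations = []
    for obj in root.iter("object"):
        box = obj.find("bndbox")
        xmin = float(box.findtext("xmin"))
        ymin = float(box.findtext("ymin"))
        xmax = float(box.findtext("xmax"))
        ymax = float(box.findtext("ymax"))
        width, height = xmax - xmin, ymax - ymin
        annotations.append({
            "id": len(annotations) + 1,  # ids are local to this image here
            "image_id": image_id,
            "category_id": category_ids[obj.findtext("name")],
            "bbox": [xmin, ymin, width, height],  # COCO: top-left + size
            "area": width * height,
            "iscrowd": 0,
        })
    return image, annotations


if __name__ == "__main__":
    image, annotations = voc_to_coco(SAMPLE_VOC, image_id=1, category_ids={"car": 1})
    print(image)
    print(annotations)
```

A full dataset conversion would also need globally unique annotation ids and a `categories` list, which an annotation tool's exporter normally handles.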
The free tier allows up to 10 annotation tasks and 500 MB of uploaded data per user.
CVAT can be self‑hosted: pre‑built Docker images for the server and UI enable on‑premise deployment, with optional Kubernetes support for larger installations.
CVAT can import and export more than 30 formats, including COCO, YOLO, PASCAL VOC, MOT, and many others.
Paid enterprise support offers SSO, LDAP, dedicated assistance, 24‑hour SLA, and upcoming analytics features.
Automation is possible via the REST API, Python SDK, and the cvat‑cli command‑line tool, as well as integrations with Roboflow and HuggingFace.
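Before uploading to the free online service, it can help to check a local dataset against the limits quoted above (10 tasks, 500 MB). The sketch below is one way to do that with the standard library. The helper names are hypothetical, and it assumes 1 MB means 10**6 bytes, since the page does not say how the limit is measured.

```python
from pathlib import Path

# Limits as stated for the free tier; the byte interpretation is an assumption.
FREE_TIER_MAX_TASKS = 10
FREE_TIER_MAX_BYTES = 500 * 1000 * 1000  # assumes 1 MB = 10**6 bytes


def directory_size(path) -> int:
    """Return the total size in bytes of all regular files under path."""
    return sum(p.stat().st_size for p in Path(path).rglob("*") if p.is_file())


def fits_free_tier(task_count: int, total_bytes: int) -> bool:
    """Check a planned workload against the stated task and data limits."""
    return task_count <= FREE_TIER_MAX_TASKS and total_bytes <= FREE_TIER_MAX_BYTES


if __name__ == "__main__":
    size = directory_size(".")
    print(f"{size} bytes; fits free tier with 3 tasks: {fits_free_tier(3, size)}")
```

If the workload does not fit, the self‑hosted Docker deployment described above avoids these limits.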
Project at a glance
Active. Last synced 4 days ago.