Open-source text annotation tool for collaborative labeling, classification, and sequence tasks
- Stars10.7k
- Forks1.8k
- Open Issues394
MIT
- Python
- Vue
- TypeScript

About doccano
doccano is an open-source text annotation tool for humans. It helps machine learning practitioners create labeled data for text classification, sequence labeling, and sequence to sequence tasks, including sentiment analysis, named entity recognition, and text summarization.
You create a project, upload data, and start annotating. It includes collaborative annotation, multi-language support, mobile support, emoji support, a dark theme, and a RESTful API.
doccano can be self-hosted with pip, Docker, or Docker Compose, so teams keep their data on their own infrastructure. A public demo is available for trying the labeling interface before installing.
Key features
- Text classification, sequence labeling, and sequence to sequence annotation
- Collaborative annotation
- Multi-language support
- Mobile support
- RESTful API
Details
- First released
- 2018
- Platforms
- Web · Docker
- Deployment
- self-hostable · docker
- Interface
- Dark theme
- Input
- Text data uploads
- Output
- Labeled datasets
