doccano

Open-source text annotation tool for collaborative labeling, classification, and sequence tasks

Repository activity
  • Stars: 10.7k
  • Forks: 1.8k
  • Open Issues: 394
License

MIT

Languages
  • Python
  • Vue
  • TypeScript
Get it: GitHub

About doccano

doccano is an open-source text annotation tool for humans. It helps machine learning practitioners create labeled data for text classification, sequence labeling, and sequence-to-sequence tasks, such as sentiment analysis, named entity recognition, and text summarization.

The workflow is to create a project, upload data, and start annotating. doccano includes collaborative annotation, multi-language support, mobile support, emoji support, a dark theme, and a RESTful API.

doccano can be self-hosted with pip, Docker, or Docker Compose, so teams keep their data on their own infrastructure. A public demo is available for trying the labeling interface before installing.

Key features

  • Text classification, sequence labeling, and sequence-to-sequence annotation
  • Collaborative annotation
  • Multi-language support
  • Mobile support
  • RESTful API

Details

First released
2018
Platforms
Web · Docker
Deployment
Self-hostable · Docker
Interface
Dark theme
Input
Text data uploads
Output
Labeled datasets
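
As a rough illustration of what working with a labeled dataset might look like downstream, the sketch below reads sequence-labeling records from JSON Lines text and pulls out each annotated span. The record shape (a `text` string plus a `label` list of `[start, end, tag]` character offsets), the sample data, and the helper name `extract_entities` are all assumptions made for this example; check them against an actual export from your own instance before relying on them.

```python
import json

# Hypothetical export: one JSON object per line, with a "text" string and a
# "label" list of [start, end, tag] character spans. These field names are
# an assumption; verify them against a real export.
SAMPLE_EXPORT = """\
{"text": "Alice moved to Paris.", "label": [[0, 5, "PER"], [15, 20, "LOC"]]}
{"text": "No entities here.", "label": []}
"""


def extract_entities(jsonl_text):
    """Return, for each record, a list of (surface_text, tag) pairs."""
    results = []
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue  # skip blank lines between records
        record = json.loads(line)
        text = record["text"]
        # Slice the text with each span's character offsets (end exclusive).
        spans = [(text[start:end], tag) for start, end, tag in record["label"]]
        results.append(spans)
    return results


entities = extract_entities(SAMPLE_EXPORT)
print(entities)
```

Keeping spans as character offsets, rather than as token indices, leaves the choice of tokenizer to whatever model consumes the data later.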