faster-whisper logo

faster-whisper

Whisper transcription with CTranslate2 for faster inference and lower memory use

Repository activity
  • Stars23.6k
  • Forks1.9k
  • Open Issues311
License

MIT

Languages
  • Python
  • Dockerfile
Get it:GitHub
faster-whisper screenshot

About faster-whisper

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2. It transcribes audio with the same model family while aiming for faster inference and lower memory use than openai-whisper.

It supports 8-bit quantization on CPU and GPU, and the transcribe API can return segments as a generator. Word timestamps and VAD filtering are available, and Distil-Whisper checkpoints are compatible with the package.

It requires Python 3.9 or greater. GPU use needs NVIDIA cuBLAS for CUDA 12 and cuDNN 9 for CUDA 12, and one install path uses an official NVIDIA CUDA Docker image. FFmpeg does not need to be installed separately because PyAV bundles the FFmpeg libraries.

Key features

  • Transcribes audio with OpenAI Whisper models through CTranslate2
  • 8-bit quantization on CPU and GPU
  • Word timestamps and VAD filtering
  • Generator-based segment output
  • Works with Distil-Whisper checkpoints

Details

First released
2023
Platforms
CLI
Deployment
Docker · offline-first
Python
3.9+
GPU
CUDA 12 · cuDNN 9
FFmpeg
Bundled via PyAV