Bhasha ASR demo — Hugging Face / faster-whisper / Alignment mode

Instructions:

  • Hugging Face backend: Use full model IDs like vasista22/whisper-hindi-small
  • faster-whisper backend: Use model sizes only: tiny, base, small, medium, or large
  • Alignment mode: Provide both audio and the original transcript to align words to audio.
Mode: STT or Align?
Backend (STT mode only)
5 60