ibm-granite/granite-speech-3.3-2b

Automatic Speech Recognition·ibm-granite· 522.1K· 55

transformers apache-2.0 3.0B params arxiv:2505.08699base_model:ibm-granite/granite-3.3-2b-instruct

Model Summary: Granite-speech-3.3-2b is a compact and efficient speech-language model, specifically designed for automatic speech recognition (ASR) and automatic speech translation (AST). Granite-speech-3.3-2b uses a two-pass design, unlike integrated models that combine speech and language into a single pass. Initial calls to granite-speech-3.3-2b will transcribe audio files into text. To process

Open in MLForge Sign up free Desktop app Source ↗

# pull & run locally
pip install mlforge-sdk && mlforge pull ibm-granite/granite-speech-3.3-2b

Model details

Task

Automatic Speech Recognition

Provider

ibm-granite

Framework

transformers

Parameters

3.0B

Size

7.3 GB

License

apache-2.0

Downloads

522.1K

Likes

Paper

arXiv:2505.08699

Updated

2026-04-07

About ibm-granite/granite-speech-3.3-2b

Related Automatic Speech Recognition

S pyannote/speaker-diarization-3.1 Automatic Speech Recognition 8.2M 2.5K 🤗 HF W argmaxinc/whisperkit-coreml Automatic Speech Recognition 8.1M 193 🤗 HF W openai/whisper-large-v3-turbo Automatic Speech Recognition ·808.9M params 7.1M 3.1K 🤗 HF W openai/whisper-base Automatic Speech Recognition ·72.6M params 6.2M 274 🤗 HF W jonatasgrosman/wav2vec2-large-xlsr-53-japanese Automatic Speech Recognition 6.1M 61 🤗 HF W openai/whisper-large-v3 Automatic Speech Recognition ·1.5B params 5.7M 5.9K 🤗 HF