HomeModelsAutomatic Speech Recognitionibm-granite/granite-speech-3.3-2b
G

ibm-granite/granite-speech-3.3-2b

Automatic Speech Recognition·ibm-granite· 522.1K· 55
transformers apache-2.0 3.0B params arxiv:2505.08699base_model:ibm-granite/granite-3.3-2b-instruct

Model Summary: Granite-speech-3.3-2b is a compact and efficient speech-language model, specifically designed for automatic speech recognition (ASR) and automatic speech translation (AST). Granite-speech-3.3-2b uses a two-pass design, unlike integrated models that combine speech and language into a single pass. Initial calls to granite-speech-3.3-2b will transcribe audio files into text. To process

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull ibm-granite/granite-speech-3.3-2b

Model details

Task
Automatic Speech Recognition
Provider
ibm-granite
Framework
transformers
Parameters
3.0B
Size
7.3 GB
License
apache-2.0
Downloads
522.1K
Likes
55
Paper
arXiv:2505.08699
Updated
2026-04-07

About ibm-granite/granite-speech-3.3-2b

Model Summary: Granite-speech-3.3-2b is a compact and efficient speech-language model, specifically designed for automatic speech recognition (ASR) and automatic speech translation (AST). Granite-speech-3.3-2b uses a two-pass design, unlike integrated models that combine speech and language into a single pass. Initial calls to granite-speech-3.3-2b will transcribe audio files into text. To process the transcribed text using the underlying Granite language model, users must make a second call as each step must be explicitly initiated.

Related Automatic Speech Recognition

S pyannote/speaker-diarization-3.1 Automatic Speech Recognition 8.2M 2.5K 🤗 HF W argmaxinc/whisperkit-coreml Automatic Speech Recognition 8.1M 193 🤗 HF W openai/whisper-large-v3-turbo Automatic Speech Recognition ·808.9M params 7.1M 3.1K 🤗 HF W openai/whisper-base Automatic Speech Recognition ·72.6M params 6.2M 274 🤗 HF W jonatasgrosman/wav2vec2-large-xlsr-53-japanese Automatic Speech Recognition 6.1M 61 🤗 HF W openai/whisper-large-v3 Automatic Speech Recognition ·1.5B params 5.7M 5.9K 🤗 HF