microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition · microsoft · 539.0K downloads · 1.6K likes
transformers · MIT · 5.6B params

# pull & run locally
pip install mlforge-sdk && mlforge pull microsoft/Phi-4-multimodal-instruct

Model details

Task: Automatic Speech Recognition
Provider: microsoft
Framework: transformers
Parameters: 5.6B
Size: 22 GB
License: MIT
Downloads: 539.0K
Likes: 1.6K
Paper: arXiv:2503.01743
Updated: 2025-12-10

About microsoft/Phi-4-multimodal-instruct

🎉 Phi-4: [mini-reasoning | reasoning] | [multimodal-instruct | onnx]; [mini-instruct | onnx]

Related Automatic Speech Recognition

pyannote/speaker-diarization-3.1 · Automatic Speech Recognition · 8.2M downloads · 2.5K likes
argmaxinc/whisperkit-coreml · Automatic Speech Recognition · 8.1M downloads · 193 likes
openai/whisper-large-v3-turbo · Automatic Speech Recognition · 808.9M params · 7.1M downloads · 3.1K likes
openai/whisper-base · Automatic Speech Recognition · 72.6M params · 6.2M downloads · 274 likes
jonatasgrosman/wav2vec2-large-xlsr-53-japanese · Automatic Speech Recognition · 6.1M downloads · 61 likes
openai/whisper-large-v3 · Automatic Speech Recognition · 1.5B params · 5.7M downloads · 5.9K likes