HomeModelsImage To TextSalesforce/blip-image-captioning-large
B

Salesforce/blip-image-captioning-large

Image To Text·Salesforce· 737.4K· 1.5K
transformers bsd-3-clause 469.7M params arxiv:2201.12086license:bsd-3-clauseregion:us

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull Salesforce/blip-image-captioning-large

Model details

Task
Image To Text
Provider
Salesforce
Framework
transformers
Parameters
469.7M
Size
7.0 GB
License
bsd-3-clause
Downloads
737.4K
Likes
1.5K
Paper
arXiv:2201.12086
Updated
2025-02-03

About Salesforce/blip-image-captioning-large

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Related Image To Text

B Salesforce/blip-image-captioning-base Image To Text 1.9M 863 🤗 HF P PaddlePaddle/PP-OCRv5_server_det Image To Text 587.3K 73 🤗 HF U PaddlePaddle/UVDoc Image To Text 512.8K 11 🤗 HF T microsoft/trocr-small-handwritten Image To Text 448.6K 63 🤗 HF P PaddlePaddle/PP-LCNet_x1_0_doc_ori Image To Text 445.3K 16 🤗 HF