Salesforce/blip2-opt-2.7b

Image Text To Text·Salesforce· 685.4K· 445

transformers mit 3.7B params arxiv:2301.12597license:mit

BLIP-2 model, leveraging OPT-2.7b (a large language model with 2.7 billion parameters). It was introduced in the paper BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models by Li et al. and first released in this repository.

Open in MLForge Sign up free Desktop app Source ↗

# pull & run locally
pip install mlforge-sdk && mlforge pull Salesforce/blip2-opt-2.7b

Model details

Task

Image Text To Text

Provider

Salesforce

Framework

transformers

Parameters

3.7B

Size

52 GB

License

mit

Downloads

685.4K

Likes

445

Paper

arXiv:2301.12597

Updated

2025-02-03

About Salesforce/blip2-opt-2.7b

Related Image Text To Text

G google/gemma-4-26B-A4B-it Image Text To Text ·26.5B params 13.1M 1.2K 🤗 HF G google/gemma-4-31B-it Image Text To Text ·32.7B params 11.2M 3.1K 🤗 HF Q Qwen/Qwen3.5-9B Image Text To Text ·9.7B params 9.8M 1.6K 🤗 HF Q Qwen/Qwen3.5-4B Image Text To Text ·4.7B params 9.6M 683 🤗 HF Q Qwen/Qwen2.5-VL-7B-Instruct Image Text To Text ·8.3B params 9.4M 1.6K 🤗 HF Q Qwen/Qwen3.6-35B-A3B-FP8 Image Text To Text ·36.0B params 5.8M 284 🤗 HF