google/owlv2-base-patch16-ensemble

Zero Shot Object Detection·google· 1.4M· 124

transformers apache-2.0 155.0M params arxiv:2306.09683license:apache-2.0region:us

The OWLv2 model (short for Open-World Localization) was proposed in Scaling Open-Vocabulary Object Detection by Matthias Minderer, Alexey Gritsenko, Neil Houlsby. OWLv2, like OWL-ViT, is a zero-shot text-conditioned object detection model that can be used to query an image with one or multiple text queries.

Open in MLForge Sign up free Desktop app Source ↗

# pull & run locally
pip install mlforge-sdk && mlforge pull google/owlv2-base-patch16-ensemble

Model details

Task

Zero Shot Object Detection

Provider

google

Framework

transformers

Parameters

155.0M

Size

1.2 GB

License

apache-2.0

Downloads

1.4M

Likes

124

Paper

arXiv:2306.09683

Updated

2024-10-31

About google/owlv2-base-patch16-ensemble

Related Zero Shot Object Detection

G IDEA-Research/grounding-dino-base Zero Shot Object Detection ·232.8M params 2.0M 193 🤗 HF G IDEA-Research/grounding-dino-tiny Zero Shot Object Detection ·172.3M params 1.2M 105 🤗 HF