microsoft/deberta-v3-large

Tags: transformers, arxiv:2006.03654, arxiv:2111.09543, license:mit

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

# pull & run locally
pip install mlforge-sdk && mlforge pull microsoft/deberta-v3-large

Model details

Task: Fill Mask
Provider: microsoft
Framework: transformers
Size: 7.5 GB
License: mit
Downloads: 1.2M
Likes: 281
Paper: arXiv:2006.03654
Updated: 2023-03-19

Related Fill Mask

google-bert/bert-base-uncased · Fill Mask · 110.1M params · 60.5M downloads · 2.7K likes
FacebookAI/xlm-roberta-base · Fill Mask · 278.9M params · 21.1M downloads · 853 likes
FacebookAI/roberta-base · Fill Mask · 124.7M params · 11.9M downloads · 617 likes
FacebookAI/roberta-large · Fill Mask · 355.4M params · 11.5M downloads · 301 likes
answerdotai/ModernBERT-base · Fill Mask · 149.7M params · 9.4M downloads · 1.1K likes
distilbert/distilbert-base-uncased · Fill Mask · 67.0M params · 8.9M downloads · 901 likes