microsoft/layoutlmv3-base

# pull & run locally
pip install mlforge-sdk && mlforge pull microsoft/layoutlmv3-base

Model details

Task: Other
Provider: microsoft
Framework: transformers
Parameters: 125.3M
Size: 1.9 GB
License: cc-by-nc-sa-4.0
Downloads: 830.1K
Likes: 501
Paper: arXiv:2204.08387
Updated: 2024-04-10

About microsoft/layoutlmv3-base

LayoutLMv3 is a pre-trained multimodal Transformer for Document AI with unified text and image masking. The simple unified architecture and training objectives make LayoutLMv3 a general-purpose pre-trained model. For example, LayoutLMv3 can be fine-tuned for both text-centric tasks, including form understanding, receipt understanding, and document visual question answering, and image-centric tasks such as document image classification and document layout analysis.
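
For the text-centric tasks mentioned above (for example, form or receipt understanding framed as token classification), a minimal Python sketch might look like the following. It rests on a few assumptions that are not stated on this page: that the checkpoint is loaded through the Hugging Face transformers class LayoutLMv3ForTokenClassification, and that word bounding boxes are rescaled to the 0-1000 integer grid that the transformers implementation of LayoutLMv3 expects. The helper names normalize_bbox and load_token_classifier, and the 800x1000 page size, are hypothetical and used only for illustration.

```python
from typing import List, Sequence

MODEL_ID = "microsoft/layoutlmv3-base"


def normalize_bbox(bbox: Sequence[float], width: int, height: int) -> List[int]:
    """Scale a pixel-space (x0, y0, x1, y1) box to the 0-1000 integer grid."""
    x0, y0, x1, y1 = bbox
    return [
        int(1000 * x0 / width),
        int(1000 * y0 / height),
        int(1000 * x1 / width),
        int(1000 * y1 / height),
    ]


def load_token_classifier(num_labels: int):
    """Load the base checkpoint with a freshly initialised classification head.

    transformers is imported lazily so normalize_bbox works without it.
    Calling this function downloads the weights from the Hugging Face Hub.
    """
    from transformers import LayoutLMv3ForTokenClassification

    return LayoutLMv3ForTokenClassification.from_pretrained(
        MODEL_ID, num_labels=num_labels
    )


if __name__ == "__main__":
    # One word box on a hypothetical 800x1000 pixel page scan.
    print(normalize_bbox((80, 100, 400, 150), width=800, height=1000))
```

The classification head returned by load_token_classifier starts untrained, so the model still needs fine-tuning on labelled documents before its predictions mean anything. Note also that the cc-by-nc-sa-4.0 license listed above restricts commercial use.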

Related Other

google/electra-base-discriminator · Other · 41.9M · 128 · 🤗 HF
Bingsu/adetailer · Other · 12.6M · 729 · 🤗 HF
colbert-ir/colbertv2.0 · Other · 109.6M params · 11.8M · 362 · 🤗 HF
facebook/contriever · Other · 7.5M · 93 · 🤗 HF
pyannote/wespeaker-voxceleb-resnet34-LM · Other · 7.3M · 146 · 🤗 HF
lpiccinelli/unidepth-v2-vitl14 · Other · 353.8M params · 5.4M · 34 · 🤗 HF