Task: Image-Text-to-Text
A cua-lite preprocessed version of ScaleCUA (OpenGVLab/ScaleCUA-Data + zyliu/ScaleCUA-Data-Understanding). ScaleCUA is a large-scale, multi-platform GUI dataset covering several task types: understanding, grounding:action, grounding:bbox, grounding:point, and navigation.