
mercor/apex-agents

General · mercor · 69.3K
cc-by-4.0 · 19 GB
Tags: benchmark:official, benchmark:eval-yaml, language:en, license:cc-by-4.0, size_categories:n<1K

APEX–Agents

APEX–Agents is a benchmark from Mercor for evaluating whether AI agents can execute long-horizon, cross-application professional services tasks. Tasks were created by investment banking analysts, management consultants, and corporate lawyers, and require agents to navigate realistic work environments with files and tools (e.g., docs, spreadsheets, PDFs, email, chat, calendar).

Tasks: 480 total (160 per job category)
Worlds: 33 total (10 banking, 11 consulting, 12…

See the full description on the dataset page: https://huggingface.co/datasets/mercor/apex-agents.

# download instantly
mlforge datasets pull mercor/apex-agents
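
Since the listing names Hugging Face as the source, the files can presumably also be fetched from the Hub directly. The following is a minimal sketch, not an official recipe. It assumes the `huggingface_hub` package is installed and the repository is publicly readable. The names `REPO_ID`, `dataset_url`, and `download_small_files` are hypothetical, and the file patterns are a guess intended to avoid pulling the full 19 GB in one go.

```python
REPO_ID = "mercor/apex-agents"  # repository id as shown on this listing


def dataset_url(repo_id: str) -> str:
    """Build the dataset page URL on the Hugging Face Hub."""
    return f"https://huggingface.co/datasets/{repo_id}"


def download_small_files(repo_id: str = REPO_ID) -> str:
    """Download only lightweight text files and return the local folder path.

    Assumption: the repository contains Markdown/YAML/JSON files worth
    inspecting first; the actual file layout is not described in this listing.
    """
    # Imported lazily so the URL helper works without the dependency.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=repo_id,
        repo_type="dataset",  # required: this is a dataset repo, not a model
        allow_patterns=["*.md", "*.yaml", "*.json"],  # assumed patterns
    )
```

Calling `download_small_files()` needs network access and returns the path of the local snapshot folder. Dropping `allow_patterns` would download the whole repository.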

Dataset details

Task: General
Language: en
License: cc-by-4.0
Size: 19 GB
Creator: mercor
Downloads: 69.3K
Source: huggingface_datasets
Updated: 2026-06-11