Example task
Forecast
Step 1
Time Series
Historical values
Future ground truth
Step 2
Forecast Task
Ground-Truth Evidence
Step 3
Documents
The full deep-research space is the union of documents across all benchmark tasks. This page shows one task's local slice. Toggle Show GT off to read the corpus the way an agent first sees it — undifferentiated.