How do synthetic computers help train AI agents for real work?

Synthetic computers are realistic digital environments with folder structures and documents that mimic actual workspaces. Agents simulate long-horizon tasks—like completing a month-long project—within these environments. The agent's successes and failures generate training signals that improve its ability to navigate real computer systems, coordinate with others, and produce professional outputs. This avoids relying on private user data or expensive human annotation.

Can this approach scale to cover all types of work?

The methodology can theoretically scale to billions of synthetic user worlds if compute is available, since personas and work contexts are abundant. However, the paper does not detail how to ensure diverse profession coverage or validate that synthetic tasks reflect actual productivity challenges. Scaling requires addressing cost-benefit trade-offs and determining whether synthetic diversity translates to real-world agent performance.

What makes these simulations realistic compared to simpler training data?

Each simulation runs for 8+ hours and spans 2,000+ agent turns, requiring the agent to navigate filesystems, coordinate with simulated collaborators, and produce multiple professional deliverables over time. This long-horizon, multi-step structure mirrors actual work better than isolated task examples. The agent must maintain context, adapt to obstacles, and integrate outputs—challenges that short, synthetic tasks do not capture.

ai · 4 min read · May 3, 2026

Synthetic Computers Enable Agent Training at Scale

Researchers create realistic digital workspaces to train AI agents on long-horizon productivity tasks, scaling from thousands to potentially billions of simulated user environments.

Source: arxiv/cs.LG · Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao · open original ↗

Researchers built a system to generate realistic computer environments and run month-long agent simulations to create training data for productivity AI.

— Synthetic Computers at Scale generates realistic folder structures and content-rich documents to mimic actual work environments.
— Agents simulate month-long productivity workflows, navigating filesystems, coordinating with collaborators, and producing professional deliverables.
— 1,000 synthetic computers were created; each simulation ran 8+ hours with 2,000+ agent turns on average.
— Training signals from these simulations improved agent performance on both familiar and unfamiliar productivity tasks.
— Methodology scales to billions of synthetic user worlds with sufficient compute, covering diverse professions and contexts.
— Approach positions synthetic environments as a foundation for agent self-improvement and reinforcement learning in real work scenarios.

Astrobobo tool mapping

Knowledge Capture Record the structure of your digital workspace (directories, file naming conventions, artifact types) as a baseline for understanding what synthetic training data should include.
Focus Brief Summarize the typical multi-day or multi-week projects you handle. Use this to assess whether simulated agents would need similar context windows and memory structures.
Reading Queue Add related papers on agent evaluation and synthetic data generation to track how this methodology evolves and what benchmarks emerge.

Frequently asked

Synthetic computers are realistic digital environments with folder structures and documents that mimic actual workspaces. Agents simulate long-horizon tasks—like completing a month-long project—within these environments. The agent's successes and failures generate training signals that improve its ability to navigate real computer systems, coordinate with others, and produce professional outputs. This avoids relying on private user data or expensive human annotation.

Share X LinkedIn

cite ▸

APA

Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao. (2026, May 3). Synthetic Computers Enable Agent Training at Scale. Astrobobo Content Engine (rewrite of arxiv/cs.LG). https://astrobobo-content-engine.vercel.app/article/synthetic-computers-enable-agent-training-at-scale-2cb940

MLA

Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao. "Synthetic Computers Enable Agent Training at Scale." Astrobobo Content Engine, 3 May 2026, https://astrobobo-content-engine.vercel.app/article/synthetic-computers-enable-agent-training-at-scale-2cb940. Based on "arxiv/cs.LG", https://arxiv.org/abs/2604.28181.

BibTeX

@misc{astrobobo_synthetic-computers-enable-agent-training-at-scale-2cb940_2026,
  author       = {Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao},
  title        = {Synthetic Computers Enable Agent Training at Scale},
  year         = {2026},
  url          = {https://astrobobo-content-engine.vercel.app/article/synthetic-computers-enable-agent-training-at-scale-2cb940},
  note         = {Astrobobo rewrite of arxiv/cs.LG, https://arxiv.org/abs/2604.28181},
}

#agents #synthetic-data #productivity #simulation #training

Synthetic Computers Enable Agent Training at Scale

Astrobobo tool mapping

Frequently asked

Related insights

ActiNet: Self-Supervised Model Improves Wrist Activity Classification

Mixed Precision Training Stabilizes Neural ODEs

Selective-Update RNNs Match Transformers While Using Less Memory