ai · 4 min read · May 3, 2026

Synthetic Computers Enable Agent Training at Scale

Researchers create realistic digital workspaces to train AI agents on long-horizon productivity tasks, scaling from thousands to potentially billions of simulated user environments.

Source: arxiv/cs.LG · Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao · open original ↗

Researchers built a system to generate realistic computer environments and run month-long agent simulations to create training data for productivity AI.

  • Synthetic Computers at Scale generates realistic folder structures and content-rich documents to mimic actual work environments.
  • Agents simulate month-long productivity workflows, navigating filesystems, coordinating with collaborators, and producing professional deliverables.
  • 1,000 synthetic computers were created; each simulation ran 8+ hours with 2,000+ agent turns on average.
  • Training signals from these simulations improved agent performance on both familiar and unfamiliar productivity tasks.
  • Methodology scales to billions of synthetic user worlds with sufficient compute, covering diverse professions and contexts.
  • Approach positions synthetic environments as a foundation for agent self-improvement and reinforcement learning in real work scenarios.

Astrobobo tool mapping

  • Knowledge Capture Record the structure of your digital workspace (directories, file naming conventions, artifact types) as a baseline for understanding what synthetic training data should include.
  • Focus Brief Summarize the typical multi-day or multi-week projects you handle. Use this to assess whether simulated agents would need similar context windows and memory structures.
  • Reading Queue Add related papers on agent evaluation and synthetic data generation to track how this methodology evolves and what benchmarks emerge.

Frequently asked

  • Synthetic computers are realistic digital environments with folder structures and documents that mimic actual workspaces. Agents simulate long-horizon tasks—like completing a month-long project—within these environments. The agent's successes and failures generate training signals that improve its ability to navigate real computer systems, coordinate with others, and produce professional outputs. This avoids relying on private user data or expensive human annotation.
Share X LinkedIn
cite
APA
Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao. (2026, May 3). Synthetic Computers Enable Agent Training at Scale. Astrobobo Content Engine (rewrite of arxiv/cs.LG). https://astrobobo-content-engine.vercel.app/article/synthetic-computers-enable-agent-training-at-scale-2cb940
MLA
Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao. "Synthetic Computers Enable Agent Training at Scale." Astrobobo Content Engine, 3 May 2026, https://astrobobo-content-engine.vercel.app/article/synthetic-computers-enable-agent-training-at-scale-2cb940. Based on "arxiv/cs.LG", https://arxiv.org/abs/2604.28181.
BibTeX
@misc{astrobobo_synthetic-computers-enable-agent-training-at-scale-2cb940_2026,
  author       = {Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao},
  title        = {Synthetic Computers Enable Agent Training at Scale},
  year         = {2026},
  url          = {https://astrobobo-content-engine.vercel.app/article/synthetic-computers-enable-agent-training-at-scale-2cb940},
  note         = {Astrobobo rewrite of arxiv/cs.LG, https://arxiv.org/abs/2604.28181},
}

Related insights