LLMs complement but don't replace classical hyperparameter optimization
A study comparing LLM agents to classical algorithms like CMA-ES and TPE finds hybrid approaches work best for tuning model hyperparameters under compute constraints.
Classical hyperparameter optimizers outperform pure LLM agents, but hybrid methods combining both achieve superior results.
- CMA-ES and TPE consistently beat LLM-based agents when searching fixed hyperparameter spaces.
- LLMs struggle to maintain optimization state across multiple trials and experiments.
- Allowing LLMs to edit training code directly narrows but doesn't close the performance gap.
- Centaur, a hybrid pairing CMA-ES state with an LLM, outperforms all pure methods tested.
- Even a 0.8B-parameter LLM in Centaur beats frontier models used alone.
- Classical methods lack the domain knowledge that LLMs possess about code and tuning strategies.
- Search diversity matters less than avoiding out-of-memory failures under fixed budgets.
- LLMs work best as complements to classical optimizers, not replacements.
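The division of labor described above can be illustrated with a toy loop: a simple Gaussian evolution strategy (standing in for CMA-ES) owns the search state and proposes hyperparameters, while a hook refines each candidate before evaluation. Everything here is an illustrative assumption, not the paper's implementation: the quadratic `objective` is a stand-in for validation loss, and `llm_refine` is a deterministic stub standing in for an LLM's prior over "reasonable" configurations.

```python
import random

random.seed(0)

def objective(x):
    # Toy stand-in for validation loss: minimum at (0.3, -1.2).
    return (x[0] - 0.3) ** 2 + (x[1] + 1.2) ** 2

def llm_refine(candidate):
    # Hypothetical stub for an LLM suggestion: snap values to "round"
    # settings, as a language-model prior over configs might.
    return [round(v, 1) for v in candidate]

def hybrid_search(iterations=50, popsize=8, sigma=0.5):
    mean = [0.0, 0.0]  # search state stays owned by the classical optimizer
    for _ in range(iterations):
        pop = []
        for _ in range(popsize):
            cand = [m + random.gauss(0, sigma) for m in mean]
            refined = llm_refine(cand)
            # Keep whichever variant scores better, so the refinement
            # hook can only help, never derail the search.
            pop.append(min(cand, refined, key=objective))
        pop.sort(key=objective)
        elite = pop[: popsize // 2]
        mean = [sum(v) / len(elite) for v in zip(*elite)]
        sigma *= 0.95  # simple step-size decay
    return mean

best = hybrid_search()
```

The key design point mirrored here is that the classical optimizer remains authoritative over the search distribution (mean and step size); the LLM stand-in only proposes local edits that are accepted when they improve the score.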
Astrobobo tool mapping
- Knowledge Capture: Document your current hyperparameter optimization workflow, noting which decisions are made by algorithms versus human intuition. Use this as a baseline to measure hybrid improvements.
- Focus Brief: Create a one-page summary of Centaur's architecture (CMA-ES state shared with an LLM) and share it with your team to align on hybrid design principles before implementation.
- Reading Queue: Queue the autoresearch repository and the Centaur paper for engineers who own AutoML infrastructure, prioritizing those responsible for tuning pipelines.
Frequently asked questions
- Can LLMs replace classical hyperparameter optimizers? No. The study shows classical methods like CMA-ES and TPE consistently outperform pure LLM agents on fixed hyperparameter spaces, and LLMs struggle to track optimization state across trials. However, hybrid approaches that combine classical optimizers with LLMs achieve the best results, suggesting LLMs work best as complements rather than replacements.
Fabio Ferreira, Lucca Wobbe, Arjun Krishnakumar, Frank Hutter, Arber Zela. (2026, April 21). LLMs complement but don't replace classical hyperparameter optimization. Astrobobo Content Engine (rewrite of arxiv/cs.LG, https://arxiv.org/abs/2603.24647). https://astrobobo-content-engine.vercel.app/article/llms-complement-but-don-t-replace-classical-hyperparameter-optimization-9f0e75