engineering · 8 min read · Apr 27, 2026

Sequential decision-making reduces error drift in modular digital twins

Researchers frame error propagation in digital twins as a Markov decision process, comparing model-based and model-free approaches to optimize maintenance interventions.

Source: arxiv/cs.LG · Annice Najafi, Shokoufeh Mirzaei · open original ↗

Najafi and Mirzaei use Markov decision processes to decide when and how to intervene in digital twins to prevent error accumulation.

  • Hidden Markov Models infer latent error regimes from surrogate-physics residuals in modular digital twins.
  • MDP formulation treats inferred regimes as states and corrective actions as decisions with cost-benefit rewards.
  • POMDP extension accounts for imperfect regime classification using Bayesian belief updates and confusion matrices.
  • Dynamic programming solves both MDP and POMDP; validated via Gillespie stochastic simulation.
  • Q-learning and REINFORCE tested as model-free alternatives to assess learning without explicit model knowledge.
  • MDP policy achieves highest cumulative reward; POMDP recovers 95% of MDP performance under observation noise.
  • Information value quantified: gap between MDP and POMDP guides investment in classification accuracy improvements.

Astrobobo tool mapping

  • Knowledge Capture Record your digital twin's current error regimes (e.g., sensor drift, model decay, coupling effects) and the interventions you perform. Use this as the basis for state and action definitions in an MDP.
  • Focus Brief Summarize the cost-benefit tradeoff for your system: what is the cost of downtime per hour? What is the cost of operating with reduced fidelity? This feeds the reward function.
  • Reading Queue Queue papers on POMDP solvers (e.g., POMCP, belief-space planning) and industrial digital twin case studies to deepen domain-specific context before implementing.

Frequently asked

  • An MDP assumes you observe the true error regime perfectly; a POMDP accounts for classification uncertainty by maintaining a probability distribution over regimes updated via Bayesian filtering. In the paper, POMDP recovers 95% of MDP performance under realistic noise, showing that imperfect sensing is tolerable for maintenance decisions.
Share X LinkedIn
cite
APA
Annice Najafi, Shokoufeh Mirzaei. (2026, April 27). Sequential decision-making reduces error drift in modular digital twins. Astrobobo Content Engine (rewrite of arxiv/cs.LG). https://astrobobo-content-engine.vercel.app/article/sequential-decision-making-reduces-error-drift-in-modular-digital-twins-99b844
MLA
Annice Najafi, Shokoufeh Mirzaei. "Sequential decision-making reduces error drift in modular digital twins." Astrobobo Content Engine, 27 Apr 2026, https://astrobobo-content-engine.vercel.app/article/sequential-decision-making-reduces-error-drift-in-modular-digital-twins-99b844. Based on "arxiv/cs.LG", https://arxiv.org/abs/2604.22168.
BibTeX
@misc{astrobobo_sequential-decision-making-reduces-error-drift-in-modular-digital-twins-99b844_2026,
  author       = {Annice Najafi, Shokoufeh Mirzaei},
  title        = {Sequential decision-making reduces error drift in modular digital twins},
  year         = {2026},
  url          = {https://astrobobo-content-engine.vercel.app/article/sequential-decision-making-reduces-error-drift-in-modular-digital-twins-99b844},
  note         = {Astrobobo rewrite of arxiv/cs.LG, https://arxiv.org/abs/2604.22168},
}

Related insights