Transformers learn graph connectivity selectively, not universally
New research shows transformers can infer transitive relations on grid-structured graphs but fail on fragmented ones, with scaling helping only on certain graph structures.
Transformers learn to infer transitive relations on grid-like graphs but struggle with disconnected graph structures.
- Transformers can learn to infer connectivity on grid-structured directed graphs, where nodes embed naturally in low-dimensional space (a data sketch follows this list).
- Graph dimensionality predicts learning difficulty: higher-dimensional grids are harder for transformers than low-dimensional ones.
- Larger models generalize better to connectivity inference on grid graphs as scale increases.
- Transformers fail to learn connectivity when graphs contain many disconnected components.
- Transitive reasoning ability depends on graph topology, not just on model capacity or training-data volume.
- Prior work tested in-context learning of transitivity; this study examines learning from training examples.
- The results suggest transformers rely on geometric structure rather than abstract logical rules when reasoning.
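To make the setup concrete, here is a minimal Python sketch, not the authors' code, of the kind of data such an experiment involves: a 2-D grid DAG whose edges only move right or down, plus sampled (source, target, label) connectivity queries a model could be trained on. The function names and the exact query format are illustrative assumptions.

```python
import itertools
import random

def grid_dag_edges(n):
    """Directed edges of an n x n grid DAG: each node points right and down,
    so the graph embeds exactly in two dimensions."""
    edges = []
    for r, c in itertools.product(range(n), repeat=2):
        if c + 1 < n:
            edges.append(((r, c), (r, c + 1)))  # rightward edge
        if r + 1 < n:
            edges.append(((r, c), (r + 1, c)))  # downward edge
    return edges

def reachable(u, v):
    """v is reachable from u iff it lies weakly below and to the right of u,
    since every edge increases exactly one coordinate."""
    return u != v and v[0] >= u[0] and v[1] >= u[1]

def sample_queries(n, k, seed=0):
    """Sample k labeled (source, target, is_connected) training queries."""
    rng = random.Random(seed)
    nodes = list(itertools.product(range(n), repeat=2))
    queries = []
    for _ in range(k):
        u, v = rng.sample(nodes, 2)
        queries.append((u, v, reachable(u, v)))
    return queries

edges = grid_dag_edges(4)
print(len(edges), "edges")  # 24 edges on a 4 x 4 grid
print(sample_queries(4, 3))
```

A fragmented counterpart would simply union several grids that share no nodes, which is the regime where the paper reports learning fails regardless of scale.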
Astrobobo tool mapping
- Knowledge Capture: When documenting domain knowledge, explicitly map how separate systems connect. Flag isolated components and create bridging facts before feeding the material to LLM training or prompts (a connectivity check is sketched after this list).
- Focus Brief: Before deploying an LLM for causal or logical reasoning, write a brief listing every disconnected subgraph in your domain. Identify which ones the model must reason across and plan explicit linking.
- Reading Queue: Queue follow-up papers on transformer mechanistic interpretability and graph neural networks to understand whether hybrid architectures (GNNs + transformers) solve the disconnected-graph problem.
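A minimal sketch of that "flag isolated components" check, assuming your domain knowledge is already extracted as (subject, relation, object) triples and that networkx is available; the service names below are hypothetical placeholders.

```python
import networkx as nx

# Hypothetical domain triples; substitute your own extracted facts.
TRIPLES = [
    ("auth-service", "calls", "user-db"),
    ("auth-service", "emits", "audit-log"),
    ("billing-service", "calls", "invoice-db"),
]

def isolated_subgraphs(triples):
    """Group entities into weakly connected components. More than one
    component means there are facts the corpus never bridges."""
    g = nx.DiGraph()
    g.add_edges_from((s, o, {"relation": r}) for s, r, o in triples)
    return [sorted(c) for c in nx.weakly_connected_components(g)]

components = isolated_subgraphs(TRIPLES)
if len(components) > 1:
    print(f"{len(components)} disconnected subgraphs; add bridging facts:")
    for comp in components:
        print("  -", comp)
```

Each component printed here is a gap the model would have to reason across blind; per the finding above, adding explicit bridging facts is more promising than relying on scale.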
Frequently asked
- Can transformers learn transitive reasoning? They can on grid-structured graphs where nodes embed naturally in low-dimensional space, but they struggle when graphs contain many disconnected components. Success depends on graph topology, not just model size: scaling helps on grid graphs but doesn't fix failures on fragmented structures.
Cite
Roy, A., & Saparov, A. (2026, April 23). Transformers learn graph connectivity selectively, not universally. Astrobobo Content Engine (rewrite of arxiv/cs.AI). https://astrobobo-content-engine.vercel.app/article/transformers-learn-graph-connectivity-selectively-not-universally-c1d363
Roy, Amit, and Abulhair Saparov. "Transformers learn graph connectivity selectively, not universally." Astrobobo Content Engine, 23 Apr. 2026, https://astrobobo-content-engine.vercel.app/article/transformers-learn-graph-connectivity-selectively-not-universally-c1d363. Based on "arxiv/cs.AI", https://arxiv.org/abs/2509.22343.
@misc{astrobobo_transformers-learn-graph-connectivity-selectively-not-universally-c1d363_2026,
author = {Roy, Amit and Saparov, Abulhair},
title = {Transformers learn graph connectivity selectively, not universally},
year = {2026},
url = {https://astrobobo-content-engine.vercel.app/article/transformers-learn-graph-connectivity-selectively-not-universally-c1d363},
note = {Astrobobo rewrite of arxiv/cs.AI, https://arxiv.org/abs/2509.22343},
}