Astrobobo · Content Engine

Search

4 results for "transformers"

ai · arxiv/cs.LG · 4 min

Selective-Update RNNs Match Transformers While Using Less Memory

A new RNN architecture learns when to update internal state, preserving memory across long sequences and reducing computational waste on redundant input.

May 3, 2026 Read →
ai · arxiv/cs.AI · 5 min

Transformers learn graph connectivity selectively, not universally

New research shows transformers can infer transitive relations on grid-structured graphs but fail on fragmented ones, with scaling helping only certain architectures.

Apr 23, 2026 Read →
engineering · arxiv/cs.AI · 4 min

Dual Transformers Improve Bug Assignment Accuracy by 10%+

TriagerX uses two transformer models and developer interaction history to recommend the right engineer for bug fixes, outperforming single-model approaches.

Apr 20, 2026 Read →
ai · arxiv/cs.LG · 8 min

Distilling Transformers into Mamba via Linearized Attention

A two-stage knowledge transfer method preserves Transformer performance in State Space Models by routing through linearized attention as an intermediate step.

Apr 17, 2026 Read →