Search
4 results for "transformers"
- ai · arxiv/cs.LG · 4 min
Selective-Update RNNs Match Transformers While Using Less Memory
A new RNN architecture learns when to update internal state, preserving memory across long sequences and reducing computational waste on redundant input.
May 3, 2026 Read → - ai · arxiv/cs.AI · 5 min
Transformers learn graph connectivity selectively, not universally
New research shows transformers can infer transitive relations on grid-structured graphs but fail on fragmented ones, with scaling helping only certain architectures.
Apr 23, 2026 Read → - engineering · arxiv/cs.AI · 4 min
Dual Transformers Improve Bug Assignment Accuracy by 10%+
TriagerX uses two transformer models and developer interaction history to recommend the right engineer for bug fixes, outperforming single-model approaches.
Apr 20, 2026 Read → - ai · arxiv/cs.LG · 8 min
Distilling Transformers into Mamba via Linearized Attention
A two-stage knowledge transfer method preserves Transformer performance in State Space Models by routing through linearized attention as an intermediate step.
Apr 17, 2026 Read →