Search
11 results for "architecture"
- ai · arxiv/cs.LG · 4 min
Selective-Update RNNs Match Transformers While Using Less Memory
A new RNN architecture learns when to update internal state, preserving memory across long sequences and reducing computational waste on redundant input.
May 3, 2026
- ai · hackernoon · 6 min
Continuity in AI agents requires architecture, not bigger memory stores
A solo builder argues that persistent AI identity depends on scheduled cognition cycles and narrative compression, not retrieval systems.
Apr 30, 2026
- ai · arxiv/cs.LG · 8 min
Model Architecture Controls Whether Errors Stay Hidden
Transformer design determines if internal decision signals remain observable after training, independent of output confidence metrics.
Apr 29, 2026
- engineering · arxiv/cs.LG · 8 min
Tessera: Cache-Line Encryption for Edge AI Without Bandwidth Loss
A hardware architecture that decrypts neural network weights at 64-byte granularity, hiding cryptographic overhead within DRAM fetch latency on shared-memory edge accelerators.
Apr 28, 2026
- ai · arxiv/cs.LG · 4 min
Hyperbolic neural networks outperform Euclidean models in quantum simulations
Researchers demonstrate that Poincaré and Lorentz recurrent architectures consistently beat standard neural quantum states on many-body physics benchmarks.
Apr 28, 2026
- ai · arxiv/cs.AI · 4 min
Cross-Entropy Loss Drives Neural Probe Performance, Not Architecture
Pre-registered study shows cross-entropy training inflates logit norms 15x, accounting for most K-way energy probe gains over softmax baselines.
Apr 24, 2026
- ai · arxiv/cs.AI · 5 min
Transformers learn graph connectivity selectively, not universally
New research shows transformers can infer transitive relations on grid-structured graphs but fail on fragmented ones, with scaling helping only certain architectures.
Apr 23, 2026
- ai · arxiv/cs.LG · 8 min
Concept Bottleneck Models Hit Hard Ceiling in Dermoscopy Data
Rough-set analysis reveals 16% of concept profiles in Derm7pt are internally inconsistent, capping model accuracy at 92% regardless of architecture.
Apr 22, 2026
- startups · hackernoon · 2 min
GenZVerse Builds Governance Into Architecture, Not Policy
A Polygon-based Web3 platform claims decentralisation enforced by smart contracts, not founder promises — here is what that distinction means.
Apr 19, 2026
- engineering · hackernoon · 6 min
Elegant Architecture Often Fails the Next Team
Samuel Oladipupo argues that legible, deletable code outperforms clever abstractions when maintainability is measured honestly.
Apr 19, 2026
- ai · arxiv/cs.LG · 8 min
Three-Phase Transformer: Structural Prior for Decoder Efficiency
A residual-stream architecture using cyclic channel partitioning and phase-aligned rotations achieves 7% perplexity gains with minimal parameter overhead.
Apr 17, 2026