Search
2 results for "compression"
- ai · hackernoon · 6 min
Continuity in AI agents requires architecture, not bigger memory stores
A solo builder argues that persistent AI identity depends on scheduled cognition cycles and narrative compression, not retrieval systems.
Apr 30, 2026 Read → - ai · arxiv/cs.AI · 6 min
OjaKV: Online Low-Rank Compression for LLM Key-Value Caches
A hybrid storage and adaptive subspace method reduces KV cache memory by compressing intermediate tokens while preserving critical anchors, compatible with FlashAttention.
Apr 20, 2026 Read →