Astrobobo · Content Engine

2 results for "compression"

ai · hackernoon · 6 min

Continuity in AI agents requires architecture, not bigger memory stores

A solo builder argues that persistent AI identity depends on scheduled cognition cycles and narrative compression, not retrieval systems.

Apr 30, 2026

ai · arxiv/cs.AI · 6 min

OjaKV: Online Low-Rank Compression for LLM Key-Value Caches

A hybrid storage and adaptive subspace method reduces KV cache memory by compressing intermediate tokens while preserving critical anchors, and it remains compatible with FlashAttention.

Apr 20, 2026