LLM agent memory at 0.12% of model parameters

https://images.ctfassets.net/jdtwqhzvc2n1/18dBj7HcLZwtY4fwX3tJuY/5d3ca8dcb2012b957d9305ef38f8497f/lightweight_llm_memory_adapter.jpg?w=800&q=75

AI agents forget. Every time a coding assistant loses track of a debugging thread, or a data analysis agent re-ingests the same context it already processed, the team pays in latency, token costs, and brittle workflows. The fix most teams reach for — expanding the context window or adding more RAG — is increasingly expensive and still doesn't reliably work.

To address this, researchers from Mind Lab and several universities proposed delta-mem, an efficient technique that compresses the model’s historical information into a dynamically updated matrix without changing the model itself. The resulting module adds just 0.12% of the backbone model's parameters — compared to 76.40% for one leading alternative — while outperforming it on memory-heavy benchmarks. Delta-mem allows models to continuously accumulate and reuse historical data, reducing the reliance on massive context windows or complex external retrieval modules for behavioral continuity.

The long memory challenge

The conventional solution...

Copyright of this story solely belongs to venturebeat.com. To see the full text click HERE

Read more

https://static01.nyt.com/images/2026/06/27/multimedia/00biz-chip-package-Iyer-kfhp/00biz-chip-package-Iyer-kfhp-facebookJumbo.jpg

A look at advanced chip packaging, now more reliant on TSMC and its partners in Taiwan than ever, and the efforts to address this bottleneck in the US

Sponsor Posts Fast, affordable law for startups — Soxton automates startup legal so founders can move faster and sleep better. We handle incorporation, advisor, employment and commercial contracts. Join the waitlist for early access! Stop vibe coding analytics — Equals AI turns questions about your business into auditable spreadsheet models and dashboards.