AI memory framework MeMo skips LLM retraining

https://images.ctfassets.net/jdtwqhzvc2n1/uNG5np6loL4mLiU9LKH0s/7525aad6eda1c42caffcb84af89bce26/LLM_memory_module.jpg?w=800&q=75

Enabling LLMs to acquire new knowledge after training remains a major hurdle for enterprise AI — current solutions are either too expensive, too slow, or constrained by context window limits.

MeMo, a framework from researchers at multiple universities, encodes new knowledge into a dedicated smaller memory model that operates separately from the main LLM.

The modular architecture works with both open- and closed-source models and sidesteps the complexity of RAG pipelines and full model retraining.

Experiments show that MeMo handles complex queries reliably even when retrieval pipelines are noisy. It avoids the catastrophic forgetting associated with direct fine-tuning and provides a cost-effective pathway for continuous knowledge updates.

The challenge of updating LLM memory

Large language models are frozen after training and their internal knowledge remains static until they undergo subsequent, computationally massive updates.

Currently, developers rely on three main approaches to integrate external knowledge into an LLM, each with...

Copyright of this story solely belongs to venturebeat.com. To see the full text click HERE

Read more

https://media.wired.com/photos/6a19d3576c603cc05220330d/191:100/w_1280,c_limit/Gear_HandsOnWithGeminiSparkGoogle%E2%80%99sAIAgentThatLivesinYourPhone_v1.jpg

Hands-on with Gemini Spark beta rolling out to AI Ultra subs: planned a birthday party from emails and calendar, but called a live-in boyfriend a “close friend”

Sponsor Posts Niantic Spatial: Drone Imagery to Physical AI — Niantic Spatial and Spexi Geospatial partner to turn drone imagery into city-scale 3D intelligence for physical AI — on demand, geometrically accurate, and ready for simulation and training. A smarter way to text customers — This is a guest post by HostMyText.Imagine