MemGPT (now Letta) pioneered hierarchical memory tiers — main context + archival storage, with the LLM acting as its own OS-style memory manager. REM Labs picks up where that research stopped: nine consolidation strategies, production infrastructure, federation, reactivity. Honest side-by-side.
MemGPT is a Berkeley research project that put "LLM as operating system" on the map. Without this paper, half the current memory ecosystem doesn't exist.
MemGPT gives you two memory tiers and a buffer. REM gives you nine consolidation strategies that run overnight, federation across agents, and webhook reactivity — all as a managed service or self-hosted, your choice.
MemGPT's hierarchical tiers are brilliant in theory and most teams re-implement them when they move to production. REM ships all nine strategies as a managed service (or a Docker container), plus webhooks, MCP, A2A, GDPR endpoints, and multi-agent namespaces. Infrastructure primitives, not just reference code.
MemGPT is primarily a self-host OSS project. We compared against the current release (Letta fork). Numbers on retrieval are from their own evaluation harness where available.
| Dimension | REM Labs | MemGPT / Letta |
|---|---|---|
| Category | Continuity layer for intelligence | Hierarchical agent memory framework |
| LongMemEval (500q) | 94.6% · byte-exact upstream GPT-4o judge | Not publicly reported on LongMemEval; prior self-evals on LIME / DMR |
| Consolidation strategies | 9 (Dream Engine) | 2 tiers + recall buffer (main / archival, self-managed) |
| Model-agnostic | Yes — every vendor + local | Yes — OpenAI, Anthropic, local (Ollama, vLLM) |
| Self-hostable | Yes — single Docker command | Yes — pip install, native strength |
| Open source | Partial — SDK + self-host OSS, Dream Engine hosted | Yes — BSD (full) |
| GDPR / forget API | Yes — per-memory + per-namespace + audit | Partial — delete from archival; no audit log |
| Federation across agents | Yes — shared namespaces + A2A agent card | Partial — shared blocks across agents (Letta) |
| Webhooks / reactivity | Yes — memory.created, dream.completed, contradiction.detected | No native; sleep-time agents can poll |
| MCP / A2A protocol | Yes — /.well-known/mcp.json | Partial — Letta adding MCP; A2A in roadmap |
| Multi-agent / hive | Yes — DreamHive, coordinated consolidation | Partial — multi-agent in Letta, no coordinated consolidation |
| Managed cloud | Yes — free tier, $19 Pro | Partial — Letta Cloud (closed beta) |
| Pricing start | Free (unlimited memories, 500 dreams/mo) | Free (self-host) · Letta Cloud: $0 free, paid TBD |
| Funding / project status | Solo founder, 2026 | Berkeley research → Letta (seed round, team of ~10) |
LONGMEMEVAL METHODOLOGY · /benchmarks
The two dimensions MemGPT markets hardest — and REM's actual numbers on each.
REM's core is Apache 2.0 — SDKs, extractors, self-host edition with the full Dream Engine, 9 strategies, 8 retrieval modes. Self-host in one command, ~90s, unlimited everything. On-prem or air-gapped, no closed dependency.
REM exposes typed primitives: memory namespaces, RBAC, pub/sub channels, 9 consolidation strategies, 8 retrieval modes, tournament refinement, Lamarckian inheritance, webhook events. Every layer inspectable; every layer pluggable.
You can run a MemGPT agent with REM as its consolidation layer. MemGPT handles the core/archival tiering; REM runs the nine Dream strategies on its archival corpus overnight. Both published, both documented. Email us if you want the integration recipe.
No credit card. Python & Node SDKs. Self-host in 90 seconds or try the hosted free tier.