Zep and Supermemory are both cloud-focused memory solutions, but they take different architectural approaches. Zep uses a temporal knowledge graph (Graphiti) with time as a first-class dimension, while Supermemory combines hybrid RAG with structured fact extraction and multi-modal support.
This comparison covers their architectures, benchmark performance, pricing, and ideal use cases to help you decide.
Quick Comparison
| Factor | Zep | Supermemory |
|---|
| Architecture | Temporal knowledge graph (Graphiti) | Hybrid RAG with fact extraction |
| LongMemEval* | 71.2% | 85.2% |
| Deployment | Cloud-first; self-host requires Graphiti + graph DB | Cloud API only |
| Pricing | Free / $25 / $475 / Enterprise | Not publicly disclosed |
| GitHub Stars | 4.4K (Zep) + 24.8K (Graphiti) | 21.7K |
| Funding | Not disclosed | $2.6M seed (Google/Cloudflare execs) |
What is Zep?
Zep uses Graphiti, a temporal knowledge graph where time is a first-class dimension. Every fact has valid_from, valid_to, and invalid_at markers, allowing queries like "what was true in January?" or "when did this change?"
Zep positions itself around "context engineering" rather than just memory. Graphiti, the underlying engine, has 24.8K stars and supports multiple graph backends (Neo4j, FalkorDB, Kuzu, Neptune).
Key strengths:
- Best-in-class temporal reasoning
- Multi-hop graph queries
- <200ms retrieval latency
- Graphiti is open source (24.8K stars)
- Strong enterprise features (SOC2, HIPAA)
What is Supermemory?
Supermemory uses a hybrid RAG approach that combines vector search with LLM-based fact extraction. Founded by a 19-year-old and backed by $2.6M from Google and Cloudflare executives, it focuses on fast retrieval (~50ms) and multi-modal support.
Beyond basic memory, Supermemory includes connectors for Google Drive, Gmail, Notion, OneDrive, and GitHub—positioning it as a broader knowledge integration layer.
Key strengths:
- Strong benchmark (85.2% LongMemEval)
- Multi-modal support (PDFs, images via OCR, video via transcription)
- Built-in connectors (Google Drive, Notion, GitHub)
- Fast retrieval (~50ms)
- Active development
Architecture Comparison
Zep's Approach
Zep's Graphiti engine stores facts as nodes in a knowledge graph with explicit temporal metadata. Each edge carries validity windows tracking when facts became true and when they were superseded.
This temporal awareness is native to the architecture—not a filter applied after retrieval. The tradeoff is infrastructure complexity: self-hosting requires running Graphiti plus a graph database.
Supermemory's Approach
Supermemory combines vector search with LLM-based fact extraction. The memory engine extracts facts, handles temporal changes and contradictions, and builds user profiles with ~50ms retrieval. It also processes multi-modal content: PDFs, images (OCR), videos (transcription), and code.
The hybrid approach blends RAG-style retrieval with structured memory queries, aiming for both speed and accuracy.
The Key Difference
Zep structures memory around temporal relationships; Supermemory structures memory around multi-modal integration.
When a user changes jobs, Zep's temporal edges explicitly mark the old job as superseded with invalid_at timestamps. The graph knows exactly when the change happened and can query historical state.
Supermemory's fact extraction handles the same update through contradiction resolution—detecting that the new employment fact conflicts with the old one. This is effective but less architecturally explicit than Zep's temporal model.
The tradeoff: Zep is more precise for time-based queries, while Supermemory handles a wider range of input types and retrieves faster.
| Benchmark | Zep | Supermemory |
|---|
| LongMemEval* | 71.2% | 85.2% |
Supermemory outperforms Zep by 14 percentage points on LongMemEval. Zep's 71.2% is self-reported; independent evaluation shows 63.8%, which would widen the gap to over 21 points.
Both score significantly below Hypabase (87.4%), which uses AMR-based extraction for higher retrieval accuracy.
Pricing Comparison
Zep
| Tier | Price | Limits |
|---|
| Free | $0 | 1K episodes/month |
| Flex | $25/month | 20K credits, 600 req/min |
| Flex Plus | $475/month | 300K credits, 1K req/min, webhooks |
| Enterprise | Custom | SOC2, HIPAA, dedicated support |
Supermemory
Supermemory's pricing is not publicly disclosed. Contact their team for details.
Zep's transparent pricing is an advantage for budgeting. Supermemory's opaque pricing makes cost comparison difficult, though its cloud-only model means you avoid infrastructure costs for graph databases that Zep self-hosting requires.
When to Choose Zep
Choose Zep if you:
- Need temporal queries ("what was true last month?")
- Require enterprise compliance (SOC2, HIPAA)
- Want the option to self-host (via Graphiti)
Zep's temporal graph is useful for knowledge updates, though the architecture hasn't evolved much since launch.
When to Choose Supermemory
Choose Supermemory if you:
- Need multi-modal memory (PDFs, images, video)
- Want built-in connectors to productivity tools (Google Drive, Notion)
- Prioritize retrieval speed (~50ms vs <200ms)
Supermemory scores higher on benchmarks but is cloud-only with undisclosed pricing, making it harder to evaluate total cost of ownership.
Consider Hypabase
Zep and Supermemory both rely on LLM-prompted extraction—whether through graph triples or hybrid RAG—which means the quality of stored facts depends entirely on how well the LLM prompt generalizes. When facts involve multiple attributes about a person (preferences, location, communication style), ad-hoc extraction routinely scatters them across disconnected records. Hypabase solves this with formal linguistic extraction.
| Factor | Zep | Supermemory | Hypabase |
|---|
| Extraction | LLM-based into triples | LLM-based hybrid RAG | AMR (formal linguistic framework) |
| Representation | Temporal triples | Vector + extracted facts | N-ary hyperedges |
| LongMemEval* | 71.2% | 85.2% | 87.4% |
| Personalization | — | — | 100% |
Hypabase uses Abstract Meaning Representation (AMR)—a formal framework from computational linguistics—to map sentences into structured graphs. The output uses PENMAN notation with karaka semantic roles, capturing every relationship in a single atomic fact:
"Bob prefers Slack over email and works remotely from Denver"
Ad-hoc extraction (Zep, Supermemory):
(Bob, prefers, Slack)
(Bob, dislikes, email)
(Bob, works, remotely)
(Bob, location, Denver)
— Four separate facts; "Slack *over* email" preference ranking lost
AMR extraction (Hypabase):
(prefers :subject Bob :object Slack :origin email :attribute communication)
(works :subject Bob :attribute remote :locus Denver)
Hypabase preserves the preference ranking (Slack over email) and the compound attribute (remote work from Denver) as structured hyperedges. Ad-hoc extraction flattens "prefers Slack over email" into two disconnected facts, losing the comparison entirely.
Why This Matters for User Profiles
| Benefit | How AMR + Hyperedges Deliver It |
|---|
| Preference rankings preserved | :object and :origin roles capture "X over Y" comparisons atomically |
| Multi-attribute facts | Location + work style stored together, not as separate unlinked records |
| Precise retrieval | Query "How does Bob prefer to communicate?" hits :subject Bob + :attribute communication |
| Consistent extraction | 6 karaka roles cover all semantic relationships—no ad-hoc relation types |
This is why Hypabase achieves 100% on personalization tasks—user preferences with nuanced rankings and compound attributes are precisely retrievable, even when queries use different phrasing.
Learn more about Hypabase →
FAQ
Is Zep better than Supermemory?
Supermemory scores higher on LongMemEval (85.2% vs 71.2% self-reported). Zep offers stronger temporal reasoning and enterprise compliance. For higher accuracy with structured extraction, consider Hypabase (87.4%).
Can I migrate from Zep to Supermemory?
There's no direct migration path—they store different data structures. Migration requires re-ingesting your conversation history through the new system. If you're evaluating both, consider running a small pilot before committing.
What's the main difference?
Zep optimizes for temporal accuracy with graph-based reasoning. Supermemory optimizes for multi-modal integration and retrieval speed. Hypabase optimizes for extraction quality using AMR and structured hyperedge representation.
Which is better for self-hosting?
Neither is ideal. Zep requires running Graphiti plus a graph database (Neo4j/FalkorDB/Kuzu). Supermemory is cloud-only with no self-hosting option. Hypabase runs entirely in a single SQLite file with no external database required—the simplest self-hosting option.
Conclusion
Zep adds temporal reasoning but self-reports 71.2% on LongMemEval (independent evaluation shows 63.8%). Useful for knowledge updates, though requires graph database infrastructure.
Supermemory scores 85.2% with multi-modal support and fast retrieval, but is cloud-only with undisclosed pricing—harder to evaluate for budget-conscious teams.
Hypabase achieves 87.4% through AMR-based extraction into hyperedges—structured knowledge representation that preserves relationships ad-hoc extraction fragments. 100% on personalization tasks.
All three are straightforward to integrate:
Try Hypabase →
LongMemEval scores: Zep (71.2%) self-reported; independent evaluation shows 63.8% (arxiv:2512.13564). Supermemory (85.2%) from published benchmarks. Hypabase (87.4%) from published benchmark harness.