Question 1

What is the difference between Meaning Memory Engine and Meaning Memory Studio?

Accepted Answer

Meaning Memory Engine ships the cognitive memory engine alone: STARE 5D scoring, the deterministic five-step compile pipeline, file and Postgres backends, audit-grade provenance, and a Python SDK (TypeScript SDK on roadmap). Teams with their own RAG stack (LlamaIndex, LangGraph, Vespa, pgvector, or custom) wire retrieved context in via the BYO-RAG adapter. Meaning Memory Studio bundles Engine plus Vexilon, our production hybrid retrieval layer: dense and BM25 search with RRF fusion, cross-encoder reranking, contextual chunking, a knowledge graph, and a bundled vector store. One install, one MCP surface, memory and corpus retrieval together.

Question 2

Do I need Meaning Memory Studio if I already have a RAG stack?

Accepted Answer

No. Meaning Memory Engine is designed for teams who already run a retrieval-augmented generation stack they like: LlamaIndex, LangGraph, Vespa, pgvector, or a custom implementation. The Engine ships with a BYO-RAG adapter so retrieved context flows cleanly into the consolidation pipeline. You keep your retrieval layer; we add the memory layer. Meaning Memory Studio is the convenience tier for teams who would rather not wire their own retrieval.

Question 3

What is Vexilon?

Accepted Answer

Vexilon is the production hybrid retrieval layer bundled in Meaning Memory Studio. It combines dense vector search (BGE-M3 embeddings on Qdrant), BM25 sparse search, Reciprocal Rank Fusion, cross-encoder reranking with BGE-reranker-v2-m3, contextual chunking with LLM-generated preambles, a knowledge graph for entities and relationships, and three-signal boost profiles for recency, significance, and procedural authority.
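As a rough illustration of the Reciprocal Rank Fusion step named above, here is a minimal Python sketch of how a dense ranking and a BM25 ranking can be merged. It is not Vexilon's implementation: the function name, the sample document IDs, and the constant k = 60 (a commonly used default) are assumptions made for the example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    where rank starts at 1. The value of k is an assumption, not a documented
    Vexilon setting.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from the two retrievers, best hit first.
dense_hits = ["doc_a", "doc_b", "doc_c"]
bm25_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([dense_hits, bm25_hits])
print(fused)
```

Documents that both retrievers rank highly rise to the top, which is why fusion usually runs before the more expensive cross-encoder reranking pass.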

Question 4

Can I start with Engine and upgrade to Studio later?

Accepted Answer

Yes. Both editions share the same cognitive memory engine, schema, and SDKs. Upgrading from Engine to Studio is additive: the Vexilon retrieval stack (vector store, embedding service, reranker, indexer) installs alongside the existing Engine deployment. Memory data, history, and provenance chains are preserved across the upgrade. Contact us for the upgrade path.

Question 5

Where does my data live? Do you ever see it?

Accepted Answer

Both editions are licensed self-host. You run the binary inside your own infrastructure: Docker Compose, Kubernetes via Helm, or bare metal. No data ever leaves your perimeter; we never see your memory data. We are custodians of the engine, not operators of your deployment. Industry analogs: Neo4j Enterprise, Elasticsearch under the Elastic License, Stardog.

Question 6

Is Meaning Memory open source?

Accepted Answer

No. Meaning Memory is a licensed self-host engine product. Both Engine and Studio editions are distributed as license-gated wheel artifacts, SHA-anchored and reproducibly built. The wheel is never published on public PyPI; customers receive it through a license-gated distribution portal. The Python SDK is open source today, so developers can wire the engine into their stacks cleanly; a TypeScript SDK is on the roadmap.
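For teams that want to check a downloaded wheel against its published digest before installing, a minimal sketch could look like the following. It assumes the anchor is a SHA-256 hex digest, which the answer above does not specify; the function names and the file path in the usage note are hypothetical.

```python
import hashlib
import hmac


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_artifact(path, expected_hex):
    """Compare the file's digest with the expected one (assumed SHA-256)."""
    expected = expected_hex.strip().lower()
    return hmac.compare_digest(sha256_of(path), expected)
```

Usage would be along the lines of `verify_artifact("path/to/downloaded.whl", digest_from_portal)`, installing only when it returns `True`.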

Capability	Engine	Studio
Agent coordination
Switchback agent coordination MCP server	✓	✓
Cognitive memory
STARE 5D scoring	✓	✓
Auto-STARE with bundled local embeddings + distilled Significance scorer (CPU; extraction-grade scoring uses your LLM)	✓	✓
Memory Consolidation (5-step pipeline)	✓	✓
Episode summaries (stable)	✓	✓
Narrative clustering (beta, opt-in)	✓	✓
R-graph relationships	✓	✓
Salience scoring & decay	✓	✓
Memory Seeding (bulk import for cold-start onboarding)	✓	✓
Backends & deployment
File backend (.md-native, agent-isolated)	✓	✓
Postgres backend (multi-tenant, SQL audit)	✓	✓
Idempotency primitives (Postgres)	✓	✓
Helm chart & Docker Compose	✓	✓
Licensed self-host (you run it)	✓	✓
Audit & provenance
Provenance chain on every memory	✓	✓
Audit hash-chain (Postgres)	✓	✓
OTEL-native tracing	✓	✓
Developer experience
MCP tools (mm_remember, mm_search, ...)	✓	✓
Python SDK	✓	✓
TypeScript SDK	roadmap	roadmap
Framework adapters	✓	✓
BYO-RAG adapter (LlamaIndex, LangGraph, Vespa)	✓	✓
Vexilon retrieval layer (Studio only)
Hybrid retrieval (dense + BM25, RRF fusion)	—	✓
Cross-encoder reranker (BGE-reranker-v2-m3)	—	✓
Contextual chunking (LLM-generated preambles)	—	✓
Knowledge graph (entities + relationships)	—	✓
Boost profiles (recent / significant / procedural)	—	✓
Filesystem watcher + git post-commit hooks	—	✓
Incremental reindex API	—	✓
Bundled vector store (Qdrant)	—	✓
Bundled embedding service (BGE-M3)	—	✓
Unified MCP tools across memory and corpus	—	✓
