Question 1

What is Vexilon?

Accepted Answer

Vexilon is the production hybrid retrieval layer bundled with Meaning Memory Studio. It combines dense vector search (BGE-M3 embeddings on a Qdrant vector store), BM25 sparse search, and Reciprocal Rank Fusion to unify results across both retrieval modes. A cross-encoder reranker (BGE-reranker-v2-m3) reorders the top candidates for semantic precision. Contextual chunking uses an LLM to generate context preambles before embedding, so chunks carry the document scope they came from. A lightweight knowledge graph tracks entities and relationships across the corpus. Three-signal boost profiles (recency, significance, procedural authority) let operators tune ranking per query intent.

Question 2

How does Vexilon compare to LlamaIndex, Pinecone, Vespa, or pgvector?

Accepted Answer

Most RAG stacks ship one or two of the pieces and leave you to wire the rest. LlamaIndex is a framework, you choose vector stores, rerankers, and chunking strategies. Pinecone is a managed vector database, no BM25, no reranker bundled, no contextual chunking. Vespa is a production search engine, capable but heavy, sparse-and-dense in one but no contextual preamble layer or knowledge graph. pgvector is a Postgres extension, you build everything else yourself. Vexilon bundles all of it (hybrid retrieval + RRF + reranker + contextual chunking + knowledge graph + Qdrant + embeddings) as a single MCP-native install, then integrates natively with Meaning Memory's STARE-scored cognitive memory.

Question 3

Why is Vexilon part of Studio and not bundled with Engine?

Accepted Answer

Vexilon carries genuine deployment weight, a Qdrant vector store, an embedding service running on GPU for BGE-M3 inference, and a cross-encoder reranker container. Bundling it with Engine would force every Engine customer to operate that stack whether they need corpus retrieval or not. Engine customers who already run their own RAG (LlamaIndex, LangGraph, Vespa, pgvector, or custom) wire retrieved context in via the BYO-RAG adapter; Studio customers get Vexilon bundled and ready.

Capability	Vexilon	LlamaIndex	Pinecone	Vespa	pgvector
Dense vector search	✓	via plugin	✓	✓	✓
BM25 sparse search	✓	via plugin	—	✓	—
RRF fusion (bundled)	✓	DIY	—	✓	—
Cross-encoder reranker (bundled)	✓	via integration	—	DIY	—
Contextual chunking	✓	—	—	—	—
Knowledge graph layer	✓	via integration	—	via integration	—
MCP-native interface	✓	—	—	—	—
Self-host (no SaaS lock-in)	✓	✓	SaaS only	✓	✓
Integrated with cognitive memory	✓	—	—	—	—

Hybrid retrieval, in the box.

Hybrid retrieval, fully assembled.

Hybrid retrieval

Cross-encoder reranker

Contextual chunking

Knowledge graph

Three-signal boost profiles

MCP-native

Why bundled beats assembled.

Ready to bundle retrieval with memory.

Common questions.

What is Vexilon?

How does Vexilon compare to LlamaIndex, Pinecone, Vespa, or pgvector?

Why is Vexilon part of Studio and not bundled with Engine?