Qdrant
Qdrant is a high-performance, composable vector search engine built in Rust for production-grade semantic, hybrid, and agentic workloads.
Combine dense vectors, sparse vectors, metadata filters, multi-vector representations, and custom scoring as primitives at query time. Written in Rust for memory efficiency, SIMD optimization, and predictable performance without garbage collection pauses. No wrappers, no bolt-ons, no legacy compromises — just a custom HNSW implementation and storage engine built specifically for vector workloads.
Learn more
Pinecone
The AI Knowledge Platform.
The Pinecone Database, Inference, and Assistant make building high-performance vector search apps easy. Developer-friendly, fully managed, and easily scalable without infrastructure hassles.
Once you have vector embeddings, manage and search through them in Pinecone to power semantic search, recommenders, and other applications that rely on relevant information retrieval.
Ultra-low query latency, even with billions of items. Give users a great experience. Live index updates when you add, edit, or delete data. Your data is ready right away. Combine vector search with metadata filters for more relevant and faster results.
Launch, use, and scale your vector search service with our easy API, without worrying about infrastructure or algorithms. We'll keep it running smoothly and securely.
Learn more
Letta
Create, deploy, and manage your agents at scale with Letta. Build production applications backed by agent microservices with REST APIs. Letta adds memory to your LLM services to give them advanced reasoning capabilities and transparent long-term memory (powered by MemGPT). We believe that programming agents start with programming memory. Built by the researchers behind MemGPT, introduces self-managed memory for LLMs. Expose the entire sequence of tool calls, reasoning, and decisions that explain agent outputs, right from Letta's Agent Development Environment (ADE). Most systems are built on frameworks that stop at prototyping. Letta' is built by systems engineers for production at scale so the agents you create can increase in utility over time. Interrogate the system, debug your agents, and fine-tune their outputs, all without succumbing to black box services built by Closed AI megacorps.
Learn more
CMEM Cloud
CMEM Cloud is the cloud sync layer for claude-mem, built to link AI agent memory everywhere through one private MCP link. claude-mem is the open source engine that takes notes while an agent works, and CMEM Cloud mirrors that local memory so agents can recall it across every session, machine, editor, and MCP-compatible client. Instead of making users re-explain context, paste old notes, or restart from zero, the system captures decisions, bug fixes, dead ends, environment notes, architecture choices, and other structured observations as the agent works. Those observations are stored in a temporal database, searched by meaning through vector recall, and made available through a private MCP endpoint that any compatible agent can read and write through. It starts with installing the local engine, letting a second model write structured notes out of band, syncing the local database to CMEM Cloud, and then recalling that memory anywhere.
Learn more