Blog

Engineering and product posts from the Cachly team.

7 min read

How we cut LLM costs by 80% with Semantic Cache

Every user rephrases the same question differently. Without semantic caching you pay for each rephrasing. Here's how pgvector similarity search eliminates 60–90% of LLM API calls — with real numbers and 3 lines of code.
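The core idea reduces to a nearest-neighbor lookup over query embeddings: if a new question's embedding is close enough to a cached one, return the cached answer instead of calling the LLM. A minimal in-memory sketch in TypeScript (the `SemanticCache` class and 0.9 threshold here are illustrative assumptions, not Cachly's actual API; the post itself does the same lookup in Postgres via pgvector):

```typescript
// Cosine similarity between two embedding vectors.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

class SemanticCache {
  private entries: Array<{ embedding: number[]; response: string }> = [];
  constructor(private threshold = 0.9) {}

  // Return the most similar cached response above the threshold, or null on a miss.
  get(embedding: number[]): string | null {
    let best: string | null = null;
    let bestSim = this.threshold;
    for (const { embedding: e, response } of this.entries) {
      const sim = cosine(embedding, e);
      if (sim >= bestSim) {
        best = response;
        bestSim = sim;
      }
    }
    return best;
  }

  put(embedding: number[], response: string): void {
    this.entries.push({ embedding, response });
  }
}
```

A rephrased question lands near the original in embedding space, so it hits the cache; an unrelated question falls below the threshold and goes to the LLM as usual. The threshold is the knob that trades hit rate against the risk of serving a stale or mismatched answer.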

Semantic Cache · Cost Optimization · AI
8 min read

How I Built a VS Code Extension That Shows What My AI Learned

From `yo code` to a live status bar widget showing brain health, lesson count, and token savings — the full walkthrough including every gotcha. TypeScript, zero extra dependencies.

VS Code · Tutorial · Developer Tools
10 min read

Building an IntelliJ Plugin in Kotlin: Status Bar + API

From build.gradle.kts to a live widget in IntelliJ IDEA, WebStorm, and all JetBrains IDEs — StatusBarWidgetFactory, PersistentStateComponent, Swing DialogWrapper, and the Gradle gotchas the docs don't mention.

IntelliJ · Kotlin · Tutorial
3 min read

See your AI Brain in VS Code and IntelliJ

New IDE plugins show brain health, lesson count, and recall stats directly in your status bar. VS Code and IntelliJ — zero config.

IDE Plugins · VS Code · IntelliJ
5 min read

Your AI assistant never forgets — no embeddings required

We removed the #1 barrier to AI memory: the mandatory API key. Before: your assistant forgot everything. After: it remembers in 3ms. Zero config, works offline.

AI Memory · Product · Zero Config
8 min read

We built persistent memory for Claude Code

How we gave AI coding assistants a brain that survives across sessions — session briefings, lesson recall, team knowledge, and semantic search.

AI Memory · MCP · Claude Code