cachly is a persistent AI memory platform for developers. It gives AI coding assistants like Claude Code, Cursor, GitHub Copilot and Windsurf a brain that remembers every lesson, fix, and architecture decision — forever. It connects via the MCP (Model Context Protocol) standard and includes 126 MCP tools. Free tier available. Runs on German (EU) servers.

How does cachly work?

Run 'npx @cachly-dev/mcp-server@latest autopilot' once. The wizard auto-detects every AI editor you have installed (Claude Code, Cursor, Copilot, Windsurf, Cline, Zed) and writes the correct config for each. It then reads your entire git history with brain_from_git and loads years of team knowledge into your Brain before your first session. From that point, sessions start automatically, memory is shared across all your editors simultaneously, and a git post-commit hook teaches cachly from every commit.

Does cachly auto-detect my editors?

Yes. The cachly setup wizard automatically detects Claude Code, Cursor, GitHub Copilot, Windsurf, Cline, Zed, and Continue.dev — any editor that supports MCP. It writes the correct config file for each editor in one pass. You never manually edit JSON config files.

Is memory shared across all my AI editors?

Yes. cachly uses a single Brain that all your AI editors connect to simultaneously. A lesson remembered in Claude Code is instantly available in Cursor and GitHub Copilot. If your team uses different editors, all of you share the same persistent memory pool.

What is brain_from_git?

brain_from_git is a cachly tool that reads your entire git history before your first session and extracts lessons from every commit, PR, and revert. Your AI arrives knowing years of architectural decisions, bug fixes, and team conventions — without you writing a single line of documentation. Zero onboarding.

What is causal_trace?

causal_trace is a cachly tool that traces the history of any file or bug across your entire git history in seconds — replacing 30+ minutes of manual git blame. Describe a problem in plain English. It returns the root cause, the failure chain, and the exact fix that worked — with date, command, and file path.

What is brain_predict?

brain_predict is a cachly tool that scans your Brain for failure patterns before every deploy, migration, or dependency upgrade. It returns probability-weighted warnings based on your team's actual incident history — so you catch the next incident before it happens.

Does cachly work with Claude Code, Cursor, and GitHub Copilot?

Yes. cachly works with Claude Code, Cursor, GitHub Copilot, Windsurf, Cline, Zed, and Continue.dev — anywhere that supports MCP. Run 'npx @cachly-dev/mcp-server@latest autopilot' to configure all editors in one step. Memory is shared across all editors simultaneously.

Can cachly search memory across languages?

Yes. cachly uses semantic vector embeddings, not keyword search. A lesson stored in German appears when you search in English. A fix documented in Arabic matches a Japanese query about the same bug pattern. Supported languages include English, German, French, Spanish, Italian, Portuguese, Japanese, Chinese (Simplified and Traditional), Korean, Arabic, Hebrew, and more.
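To make the idea concrete, here is a minimal sketch of embedding-based lookup. It is an illustration, not cachly's implementation: the three-dimensional vectors are hand-made stand-ins for real embeddings, and the names StoredLesson, cosineSimilarity, and nearestLessons are hypothetical. The point is only that two texts in different languages can land close together in vector space even when they share no words.

```typescript
// Toy illustration of embedding-based recall. The vectors below are invented
// stand-ins for real embeddings; nothing here is produced by cachly.
type StoredLesson = { text: string; embedding: number[] };

// Cosine similarity: 1 means same direction, 0 means unrelated.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    const x = a[i] ?? 0;
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the topK stored lessons closest to the query vector.
function nearestLessons(
  query: number[],
  stored: StoredLesson[],
  topK: number,
): StoredLesson[] {
  return stored
    .map((lesson) => ({ lesson, score: cosineSimilarity(query, lesson.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK)
    .map((entry) => entry.lesson);
}

const storedLessons: StoredLesson[] = [
  // German lesson about a connection pool running dry under load.
  { text: "Verbindungspool läuft unter Last leer", embedding: [0.9, 0.1, 0.0] },
  // Unrelated English lesson.
  { text: "CSS grid breaks in Safari", embedding: [0.0, 0.2, 0.9] },
];

// Pretend embedding of the English query "connection pool exhausted under load".
const queryEmbedding = [0.8, 0.2, 0.1];
const closest = nearestLessons(queryEmbedding, storedLessons, 1);
```

In this toy setup the English query lands nearest to the German lesson, because the comparison happens on vectors rather than on shared words.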

How is cachly different from mem0?

mem0 is a memory layer for Python LLM apps and chatbots — great for building AI products. cachly is built specifically for developer tooling: it connects to your AI editor via MCP, learns from your git history automatically, predicts failures before deploy, and gives your whole team shared memory. cachly runs on EU servers and is GDPR-native. For developers using Claude Code, Cursor, or Copilot, cachly is the right choice.

Is cachly GDPR compliant?

Yes. cachly runs exclusively on German servers (Hetzner). All data stays in the EU. No data is shared with third parties. cachly is fully GDPR compliant. An AVV (Auftragsverarbeitungsvertrag / Data Processing Agreement) is available for Business and Enterprise customers.

smart_recall: Brief Your AI Before It Starts

The blank-slate problem

Every session with an AI coding assistant starts fresh. There's no carry-over from yesterday's migration, no memory of the config quirk you documented last week, no recollection that the same package version mismatch burned two hours on a different branch. The AI is capable, but it always starts from zero.

The cost is invisible because it compounds slowly. A two-minute re-orientation here, a five-minute re-derivation there. Multiply by every task, every developer, every session — and you're paying a significant tax on knowledge your team already paid to acquire.

What smart_recall does

smart_recall is cachly's pre-task lookup. Before your AI reads a single file, it queries the Brain with the current task description and pulls back the most relevant lessons — ranked by confidence and recency.

It fires before the task begins, not after:

smart_recall(
  query: "add Redis caching layer to the auth service"
)

// → 3 lessons retrieved (confidence ≥ 0.72):
//
// [0.91] PATTERN: auth-service uses custom connection pooling;
//        standard ioredis defaults cause pool exhaustion under load.
//        FIX: set maxRetriesPerRequest: null, enableOfflineQueue: false.
//        Confirmed 3×.
//
// [0.84] GOTCHA: AUTH_REDIS_URL env var is set per-deployment;
//        local dev uses REDIS_URL. Always check both in config.
//        Last seen: 2026-05-28.
//
// [0.72] DEPENDENCY: Redis caching here interacts with session store.
//        Test session invalidation after adding cache layer.

Those three lessons take milliseconds to retrieve and seconds to apply. Without the Brain, your AI would need to discover the pool exhaustion issue through trial and error — or you'd discover it in production.
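The "ranked by confidence and recency" step can be sketched in a few lines. This is a sketch under assumptions, not cachly's code: the RecalledLesson shape and the rankLessons helper are hypothetical, recency is treated as a tie-breaker only, and every date except 2026-05-28 is made up for the example.

```typescript
// Hypothetical shape for a recalled lesson; cachly's real schema may differ.
type RecalledLesson = {
  kind: "PATTERN" | "GOTCHA" | "DEPENDENCY";
  summary: string;
  confidence: number; // 0..1
  lastSeen: string;   // ISO date, so plain string comparison orders by time
};

// Drop lessons below the cutoff, then sort by confidence (highest first),
// using the most recent lastSeen date to break ties.
function rankLessons(found: RecalledLesson[], minConfidence: number): RecalledLesson[] {
  return found
    .filter((lesson) => lesson.confidence >= minConfidence)
    .sort((a, b) => {
      if (b.confidence !== a.confidence) return b.confidence - a.confidence;
      if (a.lastSeen === b.lastSeen) return 0;
      return a.lastSeen < b.lastSeen ? 1 : -1;
    });
}

const candidates: RecalledLesson[] = [
  { kind: "DEPENDENCY", summary: "cache interacts with session store", confidence: 0.72, lastSeen: "2026-04-02" },
  { kind: "GOTCHA", summary: "AUTH_REDIS_URL vs REDIS_URL", confidence: 0.84, lastSeen: "2026-05-28" },
  { kind: "PATTERN", summary: "custom connection pooling in auth-service", confidence: 0.91, lastSeen: "2026-05-10" },
  { kind: "GOTCHA", summary: "stale note from an old branch", confidence: 0.41, lastSeen: "2025-11-19" },
];

const briefing = rankLessons(candidates, 0.72);
for (const lesson of briefing) {
  console.log(`[${lesson.confidence.toFixed(2)}] ${lesson.kind}: ${lesson.summary}`);
}
```

With the 0.72 cutoff from the example above, the 0.41 lesson never reaches the AI's context, and the rest arrive strongest first.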

Semantic, not keyword

The recall is semantic, not text search. Lessons tagged "redis connection pooling" surface when your query is "add caching to auth service" — because the Brain understands that those are related, even without the exact words overlapping. The relevance is scored by weighted term overlap across lesson content, tags, and historical accuracy signals.

This matters because developers don't file tickets in the exact format lessons were saved. Your team writes naturally; the Brain matches intelligently.

The confidence ladder

Every lesson returned by smart_recall carries a confidence score. High-confidence lessons (0.85+) are ones the team has confirmed multiple times, so they go directly into the AI's working context as facts. Medium-confidence lessons (0.60–0.84) are surfaced as things to watch for. Low-confidence lessons (below the threshold) are filtered out to avoid noise.
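As a rough sketch, the ladder maps a score to one of three treatments. The tier names and the tierFor function are hypothetical, and the sketch assumes the filter threshold sits at 0.60, the lower edge of the medium band; the actual cutoff may be configured differently.

```typescript
// Hypothetical tier labels for the three treatments described above.
type Tier = "fact" | "watch" | "filtered";

// Assumption: the filter threshold equals the bottom of the medium band (0.60).
function tierFor(confidence: number): Tier {
  if (confidence >= 0.85) return "fact";   // goes into working context as a fact
  if (confidence >= 0.60) return "watch";  // surfaced as something to watch for
  return "filtered";                       // dropped to avoid noise
}

const ladder = [0.91, 0.84, 0.72, 0.30].map((score) => ({ score, tier: tierFor(score) }));
console.log(ladder);
```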

Confidence moves. A lesson that worked four times in a row sits near 0.91. One that got invalidated by a framework upgrade erodes toward 0.05 as the Brain sees contradicting outcomes. The briefing you get tomorrow reflects what your team learned today.
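One simple way to picture confidence that moves is a step toward 1 after each confirmation and a step toward 0 after each contradiction. The sketch below is purely illustrative: the updateConfidence function, the 0.3 step rate, and the update rule itself are assumptions, not cachly's actual formula, and they are not tuned to reproduce the figures quoted above.

```typescript
type Outcome = "confirmed" | "contradicted";

// Hypothetical update rule: move a fraction `rate` of the remaining distance
// toward 1 on a confirmation, or toward 0 on a contradiction.
function updateConfidence(current: number, outcome: Outcome, rate: number = 0.3): number {
  const target = outcome === "confirmed" ? 1 : 0;
  return current + rate * (target - current);
}

// A lesson that keeps working climbs; one that keeps failing erodes.
let rising = 0.5;
for (let i = 0; i < 4; i++) rising = updateConfidence(rising, "confirmed");

let eroding = 0.91;
for (let i = 0; i < 20; i++) eroding = updateConfidence(eroding, "contradicted");

console.log({ rising, eroding });
```

Under a rule like this a score stays between 0 and 1 and never jumps on a single outcome, which fits the gradual erosion the text describes.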

Works across every tool

smart_recall is a Brain MCP tool — which means any AI tool connected to cachly gets it. Claude Code fires it automatically at session start. The OpenClaw Brain Bridge exposes it as middleware. The VS Code extension surfaces the briefing in the sidebar. The same Brain, the same lessons, regardless of which tool the developer is using that day.

What you stop paying for

The compounding value isn't dramatic in any single session — it's three minutes saved here, a caught mistake there. But across a team of five developers over six months, those minutes add up to days. And the bugs your AI doesn't introduce because it was pre-briefed with known gotchas — those are the most valuable saves of all.

That's what cachly is built around: compounding knowledge that your team already paid to learn, surfaced automatically so you never pay that tuition twice.
