cachly is a persistent AI memory platform for developers. It gives AI coding assistants like Claude Code, Cursor, GitHub Copilot and Windsurf a brain that remembers every lesson, fix, and architecture decision — forever. It connects via the MCP (Model Context Protocol) standard and includes 121 tools. Free tier available. Runs on German (EU) servers.

How does cachly work?

Run 'npx @cachly-dev/mcp-server@latest autopilot' once. The wizard auto-detects every AI editor you have installed (Claude Code, Cursor, Copilot, Windsurf, Cline, Zed) and writes the correct config for each. It then reads your entire git history with brain_from_git and loads years of team knowledge into your Brain before your first session. From that point, sessions start automatically, memory is shared across all your editors simultaneously, and a git post-commit hook teaches cachly from every commit.

Does cachly auto-detect my editors?

Yes. The cachly setup wizard automatically detects Claude Code, Cursor, GitHub Copilot, Windsurf, Cline, Zed, and Continue.dev — any editor that supports MCP. It writes the correct config file for each editor in one pass. You never manually edit JSON config files.

Is memory shared across all my AI editors?

Yes. cachly uses a single Brain that all your AI editors connect to simultaneously. A lesson remembered in Claude Code is instantly available in Cursor and GitHub Copilot. If your team uses different editors, all of you share the same persistent memory pool.

What is brain_from_git?

brain_from_git is a cachly tool that reads your entire git history before your first session and extracts lessons from every commit, PR, and revert. Your AI arrives knowing years of architectural decisions, bug fixes, and team conventions — without you writing a single line of documentation. Zero onboarding.

What is causal_trace?

causal_trace is a cachly tool that traces the history of any file or bug across your entire git history in seconds — replacing 30+ minutes of manual git blame. Describe a problem in plain English. It returns the root cause, the failure chain, and the exact fix that worked — with date, command, and file path.

What is brain_predict?

brain_predict is a cachly tool that scans your Brain for failure patterns before every deploy, migration, or dependency upgrade. It returns probability-weighted warnings based on your team's actual incident history — so you catch the next incident before it happens.

Does cachly work with Claude Code, Cursor, and GitHub Copilot?

Yes. cachly works with Claude Code, Cursor, GitHub Copilot, Windsurf, Cline, Zed, and Continue.dev — anywhere that supports MCP. Run 'npx @cachly-dev/mcp-server@latest autopilot' to configure all editors in one step. Memory is shared across all editors simultaneously.

Can cachly search memory across languages?

Yes. cachly uses semantic vector embeddings, not keyword search. A lesson stored in German appears when you search in English. A fix documented in Arabic matches a Japanese query about the same bug pattern. Supported languages include English, German, French, Spanish, Italian, Portuguese, Japanese, Chinese (Simplified and Traditional), Korean, Arabic, Hebrew, and more.

How is cachly different from mem0?

mem0 is a memory layer for Python LLM apps and chatbots — great for building AI products. cachly is built specifically for developer tooling: it connects to your AI editor via MCP, learns from your git history automatically, predicts failures before deploy, and gives your whole team shared memory. cachly runs on EU servers and is GDPR-native. For developers using Claude Code, Cursor, or Copilot, cachly is the right choice.

Is cachly GDPR compliant?

Yes. cachly runs exclusively on German servers (Hetzner). All data stays in the EU. No data is shared with third parties. cachly is fully GDPR compliant. An AVV (Auftragsverarbeitungsvertrag / Data Processing Agreement) is available for Business and Enterprise customers.

Your AI Brain Now Speaks Arabic and Hebrew

Cross-Language Bridge

Arabic and Hebrew are now full members of the cross-language retrieval network — the same network that already connects English, German, French, Japanese, Chinese, and Korean. Store a lesson in any language, recall it in any other. No translation. No configuration. No separate index.

Example — a real-world scenario:

An Arabic-speaking engineer fixes a JWT authentication issue and documents it in Arabic:

cachly learn '{
  "topic": "fix:auth",
  "outcome": "success",
  "whatWorked": "مصادقة JWT تعمل بعد إضافة المفتاح السري في متغيرات البيئة"
}'

Three days later, a teammate searches in English:

smart_recall("JWT authentication secret missing")
→ ✅ Returns the Arabic lesson, ranked correctly

The same works in reverse. An English query for port conflict deployment finds Hebrew lessons. A Japanese query finds Arabic lessons. The synonym graph is fully connected.

How It Works — The Synonym Graph

Every technical term is a node. Edges connect equivalents across languages:

"authentication"
  ↔  مصادقة  (Arabic)
  ↔  אימות   (Hebrew)
  ↔  認証    (Japanese)
  ↔  인증    (Korean)
  ↔  认证    (Chinese)
  ↔  Authentifizierung  (German)

"deploy"
  ↔  نشر          (Arabic)
  ↔  פריסה        (Hebrew)
  ↔  デプロイ     (Japanese)
  ↔  배포          (Korean)
  ↔  部署          (Chinese)
  ↔  bereitstellen (German)

When you query smart_recall("مشكلة النشر") (deployment problem), the Brain: tokenizes → removes Arabic stopwords → stems النشر → نشر → expands to deploy, deployment, 배포, デプロイ, bereitstellen → searches all stored lessons for any of those tokens → returns ranked results regardless of language.

Query	Finds lessons containing
smart_recall("authentication error")	مصادقة, אימות, 認証, autenticación, …
smart_recall("مشكلة النشر")	deploy, deployment, デプロイ, 배포, …
smart_recall("שגיאת אימות")	auth, authentication, مصادقة, 認証, …
smart_recall("تصحيح الأخطاء")	debug, debugging, איתור באגים, デバッグ, …

The RTL Challenge

Most search engines are built for left-to-right text. Arabic and Hebrew run right-to-left — a surface-level difference that hides a deeper challenge: both languages attach grammatical particles directly to words as prefixes, making naive tokenization nearly useless.

Consider Arabic: the word الخطأ(al-khaṭaʾ, "the error") fuses the definite article ال (al-) with the root خطأ (error). A naive tokenizer treats the whole thing as one opaque token. Searching for خطأ would miss الخطأ — and miss وخطأ (and-error), فالخطأ (so-the-error), and every other prefixed form. Hebrew has the same pattern.

What We Built

Unicode-aware RTL tokenization — when the Brain detects Arabic (U+0600–U+06FF) or Hebrew (U+0590–U+05FF) characters, it switches to word-level tokenization with language-specific enhancements.

Arabic light stemming — iterative, up to 3 passes, resolves stacked prefixes:

الخطأ   → خطأ   (ال = definite article stripped)
وخطأ   → خطأ   (و = conjunction stripped)
فالخطأ → خطأ   (ف + ال, two passes needed)
للنشر  → نشر   (ل + ال, two passes: للنشر → النشر → نشر)

Plus 60 Arabic + 40 Hebrew stopwords — particles, pronouns, auxiliary verbs, prepositions that carry no semantic weight, filtered before indexing and at query time.

How to Use It

No setup required. Just write lessons the way you think:

# Arabic
cachly learn '{
  "topic": "deploy:api",
  "outcome": "success",
  "whatWorked": "نشر التطبيق نجح بعد تغيير منفذ الخدمة من 8080 إلى 3000"
}'

# Hebrew
cachly learn '{
  "topic": "fix:auth",
  "outcome": "success",
  "whatWorked": "תיקון בעיית האימות על ידי הוספת המפתח הסודי לסביבת הייצור"
}'

# Search in any language
smart_recall("مشكلة النشر")     → deployment lessons in any language
smart_recall("שגיאת אימות")     → auth error lessons in any language
smart_recall("authentication")  → also finds مصادقة and אימות lessons

Upgrade

npx @cachly-dev/mcp-server@latest autopilot