cachly is a persistent AI memory platform for developers. It gives AI coding assistants like Claude Code, Cursor, GitHub Copilot and Windsurf a brain that remembers every lesson, fix, and architecture decision — forever. It connects via the MCP (Model Context Protocol) standard and includes 121 tools. Free tier available. Runs on German (EU) servers.

How does cachly work?

Run 'npx @cachly-dev/mcp-server@latest autopilot' once. The wizard auto-detects every AI editor you have installed (Claude Code, Cursor, Copilot, Windsurf, Cline, Zed) and writes the correct config for each. It then reads your entire git history with brain_from_git and loads years of team knowledge into your Brain before your first session. From that point, sessions start automatically, memory is shared across all your editors simultaneously, and a git post-commit hook teaches cachly from every commit.

Does cachly auto-detect my editors?

Yes. The cachly setup wizard automatically detects Claude Code, Cursor, GitHub Copilot, Windsurf, Cline, Zed, and Continue.dev — any editor that supports MCP. It writes the correct config file for each editor in one pass. You never manually edit JSON config files.

Is memory shared across all my AI editors?

Yes. cachly uses a single Brain that all your AI editors connect to simultaneously. A lesson remembered in Claude Code is instantly available in Cursor and GitHub Copilot. If your team uses different editors, all of you share the same persistent memory pool.

What is brain_from_git?

brain_from_git is a cachly tool that reads your entire git history before your first session and extracts lessons from every commit, PR, and revert. Your AI arrives knowing years of architectural decisions, bug fixes, and team conventions — without you writing a single line of documentation. Zero onboarding.

What is causal_trace?

causal_trace is a cachly tool that traces the history of any file or bug across your entire git history in seconds — replacing 30+ minutes of manual git blame. Describe a problem in plain English. It returns the root cause, the failure chain, and the exact fix that worked — with date, command, and file path.

What is brain_predict?

brain_predict is a cachly tool that scans your Brain for failure patterns before every deploy, migration, or dependency upgrade. It returns probability-weighted warnings based on your team's actual incident history — so you catch the next incident before it happens.

Does cachly work with Claude Code, Cursor, and GitHub Copilot?

Yes. cachly works with Claude Code, Cursor, GitHub Copilot, Windsurf, Cline, Zed, and Continue.dev — anywhere that supports MCP. Run 'npx @cachly-dev/mcp-server@latest autopilot' to configure all editors in one step. Memory is shared across all editors simultaneously.

Can cachly search memory across languages?

Yes. cachly uses semantic vector embeddings, not keyword search. A lesson stored in German appears when you search in English. A fix documented in Arabic matches a Japanese query about the same bug pattern. Supported languages include English, German, French, Spanish, Italian, Portuguese, Japanese, Chinese (Simplified and Traditional), Korean, Arabic, Hebrew, and more.

How is cachly different from mem0?

mem0 is a memory layer for Python LLM apps and chatbots — great for building AI products. cachly is built specifically for developer tooling: it connects to your AI editor via MCP, learns from your git history automatically, predicts failures before deploy, and gives your whole team shared memory. cachly runs on EU servers and is GDPR-native. For developers using Claude Code, Cursor, or Copilot, cachly is the right choice.

Is cachly GDPR compliant?

Yes. cachly runs exclusively on German servers (Hetzner). All data stays in the EU. No data is shared with third parties. cachly is fully GDPR compliant. An AVV (Auftragsverarbeitungsvertrag / Data Processing Agreement) is available for Business and Enterprise customers.

AI Memory for Asian Dev Teams: Singapore Node, CJK Support, GDPR

The problem: AI assistants have no memory

You spend 45 minutes debugging a tricky TypeScript issue with Claude Code. You find the fix, ship it, close the tab. Next day, same error in a different file. You open Claude Code — and it has no idea what you found yesterday. The context window reset.

This is the AI amnesia problem. Every developer using Claude, Cursor, or GitHub Copilot hits it. The AI is smart, but it forgets everything the moment the session ends.

For teams building in Japan, South Korea, China, or Singapore, it is even worse: most AI memory tools are built for English-speaking Western markets, run on US servers, and have no meaningful support for Japanese, Chinese, or Korean text.

cachly: persistent AI memory, built for global teams

cachly is a Redis-compatible cache with an AI Brain layer on top. The Brain stores everything your AI learns — fixes, architecture decisions, conventions, deployment lessons — and recalls them semantically on the next session.

The key tools are simple:

learn_from_attempts — Store what worked (and what failed). Called automatically by your AI after a fix or deploy.
smart_recall— Semantic search over all stored lessons. Your AI asks "how did we fix the Docker healthcheck?" and gets the exact answer from 3 weeks ago.
session_start / session_end — Automatic. Your AI picks up where it left off.

The entire setup takes under 60 seconds. Add one JSON block to your Claude Code or Cursor config. No API keys, no dashboards, no manual management.

Singapore node: 10ms to Tokyo, 15ms to Seoul

This is the part that matters for APAC teams: we run a dedicated node in Singapore ( cachly-node-singapore-4), specifically because 200ms round-trip to Germany makes AI tools feel sluggish.

Typical latencies from the Singapore node:

City	Singapore node	Germany node
🇯🇵 Tokyo	~10ms	~220ms
🇰🇷 Seoul	~15ms	~230ms
🇸🇬 Singapore	<1ms	~160ms
🇨🇳 Shanghai	~35ms	~200ms
🇦🇺 Sydney	~30ms	~280ms

smart_recall returns in under 80ms p99 globally. From Tokyo via Singapore, that is under 100ms total — fast enough that it feels instant inside Claude Code or Cursor.

Select the APAC region when you create your cachly instance. Your data stays in Singapore — it never routes through Germany or the US.

CJK language support: Japanese, Chinese, Korean

Most AI memory systems are English-first. Embedding models like OpenAI's text-embedding-3-small handle English beautifully but degrade significantly on CJK text — especially character-level languages like Chinese and Japanese where tokenization works differently.

cachly uses two approaches:

1. Key-based memory for structured facts

The simplest and most reliable path for CJK: store lessons with descriptive English keys, Japanese/Chinese/Korean content in the value.

// Store a lesson in Japanese
await brain.learn({
  topic: "fix:keycloak-auth",
  what_worked: "認証エラーはKeycloakのclient_idが間違っていたのが原因。\n" +
               "正しいclient_id: cachly-web (cachly-appではない)",
  tags: ["keycloak", "auth", "japan-team"]
})

// Recall it later in English — it still finds it
await brain.smart_recall("How did we fix the Keycloak auth issue?")

The semantic search works cross-language: you can write the lesson in Japanese and recall it in English, or vice versa. The embedding model (nomic-embed-text) handles multilingual text natively.

2. nomic-embed-text: genuinely multilingual

cachly runs nomic-embed-text on our own infrastructure (no third-party API required). It is trained on 43 languages including Japanese, Simplified Chinese, Traditional Chinese, and Korean. Unlike models fine-tuned only for English, nomic-embed-text produces meaningful embeddings for CJK text without additional configuration.

You can store and recall in any language:

# Python — store in Chinese, recall in English
brain.learn(topic="infra:docker", what_worked="Docker健康检查必须使用127.0.0.1而不是localhost")
brain.smart_recall("Docker healthcheck issue")  # finds it

Data sovereignty: why GDPR is good for Asian teams too

European GDPR is often framed as a Western regulation, but its principles align closely with Asia's own data protection frameworks:

🇯🇵Japan APPI

Act on the Protection of Personal Information — requires purpose limitation and third-party transfer consent, same as GDPR.

🇰🇷South Korea PIPA

Personal Information Protection Act — often called 'stricter than GDPR', with explicit data subject rights.

🇸🇬Singapore PDPA

Personal Data Protection Act — MAS TRM Guidelines additionally regulate financial sector data flows.

🇨🇳China PIPL + DSL

Personal Information Protection Law + Data Security Law — strict data localization requirements.

cachly's architecture satisfies all of these:

Data localization: APAC data stays in Singapore. EU data stays in Germany. No cross-region transfer by default.
No third-party AI APIs: The embedding model runs on our infrastructure. Your code and your lessons never pass through OpenAI, Google, or AWS.
Encryption in transit: TLS enforced on all connections.
Encryption at rest: AES-256 on Business and Enterprise plans.
DPA/AVV auto-generated: We generate a Data Processing Agreement automatically at sign-up — legally valid under EU GDPR and accepted by Japanese/Korean enterprise procurement.

Team Brain: shared memory across the whole team

The most powerful use case for Asian dev teams is shared team memory. In Japan especially, there is a strong culture of knowledge sharing (知識共有, chishiki kyōyū) — lessons should not be siloed in one person's chat history.

With a shared cachly instance, every developer's AI editor reads from and writes to the same Brain:

# All 5 team members point to the same instance_id in their MCP config:
{
  "mcpServers": {
    "cachly": {
      "command": "npx",
      "args": ["@cachly-dev/mcp-server@latest"],
      "env": {
        "CACHLY_INSTANCE_ID": "shared-team-brain-uuid",
        "CACHLY_JWT": "your-jwt-token"
      }
    }
  }
}

When Hiroshi fixes a deployment issue at 9am Tokyo time, that lesson is in the shared Brain by 9:01am — available to Yuki in Seoul and Wei in Shanghai immediately, in whatever language they recall it in.

Setup in 60 seconds

The entire setup — account, Singapore instance, MCP configuration — takes under 60 seconds:

Go to cachly.dev/sign-up → create an account (10 seconds, no credit card)
Create an instance, select APAC (Singapore) region (30 seconds)
Add one JSON block to your Claude Code or Cursor MCP config (20 seconds)

The free tier (25 MB, 1 instance) is enough for an individual developer. Teams upgrade to Pro or Business for shared instances and more memory.

What Claude remembers after cachly

Here is a real example of what a Tokyo-based team's Brain contains after one month of use:

fix:docker-healthcheck

dockerclickhouseinfra

ClickHouse healthcheck MUST use 127.0.0.1 not localhost — IPv6 disabled on servers causes DNS resolution failure.

deploy:api-go

deploygoapi

Always run `go build ./...` before rsync to catch compile errors locally. SSH port is 2222 not 22.

fix:keycloak-auth

keycloakauthfrontend

Keycloak client_id for the web app is 'cachly-web', not 'cachly-app'. Realm: cachly.

infra:postgres-pgvector

postgrespgvectormigrations

pgvector extension must be installed before running migrations. `CREATE EXTENSION IF NOT EXISTS vector;`

Every new session, Claude Code pre-loads these lessons automatically via session_start. The AI arrives knowing your infrastructure, your conventions, and your past mistakes — before you type a single prompt.

What 'Claude forgets' costs APAC teams

Let's put a number on it. A senior developer in Tokyo earns roughly ¥15–20M/year — about ¥8,000–10,000 per hour. A typical "re-solving a known problem" episode takes 45–90 minutes. At ¥9,000/hour, that is ¥6,750–13,500 wasted — per incident.

A 5-person team hitting this twice a week: ¥5.4M–10.8M per year in wasted developer time. The cachly Business plan (for teams) is €199/month — less than the cost of one wasted afternoon.

Getting started

cachly is free to try — no credit card, no time limit, no OpenAI API key required.

For APAC teams: select the Singapore region when creating your first instance. The MCP server handles everything else — authentication, memory, recall — automatically.

AI Memory for Asian Dev Teams: Singapore Node, CJK Support, and Why GDPR Is Good for Asia Too