A memory layer your AI app can plug into in 5 minutes. Multi-tenant, semantic, audited. Runs on Cloudflare.
your app sends raw transcripts, we extract typed memories (preference / fact / goal / event / relationship) and embed them so semantic recall just works.
POST /v1/recall takes a query and an end-user id, returns ranked memories. Drop them at the top of your prompt — that's it.
workspaces, API keys, per-tenant rate limits, audit log, end-user scoping. Your customers' users are isolated by default.
768-dim embeddings via Workers AI, namespaced per workspace. Sub-100ms recall at the edge. No infra to provision.
when memories contradict ("lives in NYC" vs "lives in SF"), we resolve and version. Old low-confidence memories decay nightly.
email / phone / card detection on every write. One-call data export and delete per end-user.
| Hacker | free |
| Startup | $29/mo |
| Scale | $199/mo |
| Enterprise | contact |
all tiers include unlimited end-users.
curl -X POST https://api.northcontext.dev/v1/memories \
-H "Authorization: Bearer $NC_KEY" \
-d '{
"end_user_id": "user_42",
"text": "I work at Acme as a senior PM."
}'