Skip to content

docs(self-hosting): document local embedding memory behavior#1094

Open
chetanunadkat wants to merge 1 commit into
supermemoryai:mainfrom
chetanunadkat:docs/self-hosting-embedding-memory
Open

docs(self-hosting): document local embedding memory behavior#1094
chetanunadkat wants to merge 1 commit into
supermemoryai:mainfrom
chetanunadkat:docs/self-hosting-embedding-memory

Conversation

@chetanunadkat

Copy link
Copy Markdown

What

Adds a Memory considerations subsection to the self-hosting embedding docs explaining that each local embedding worker's WebAssembly heap only grows (up to ~4 GB) and is reclaimed only when the pool goes fully idle.

Why

Self-hosters on small hosts hit unbounded RSS growth (20–50 GB) under continuous ingestion because peak memory scales as roughly POOL_SIZE × ~4 GB and the only reclamation path (idle shutdown) never fires while the server is busy. The current docs present POOL_SIZE purely as a throughput knob, with no memory-cost guidance. This adds the missing budgeting note and the safe-default recommendation for ≤16 GB hosts.

Documents the user-facing/operational side of #1093. The worker/pool implementation lives in the closed binary, so this PR only covers the docs; the issue proposes the code-level fix (recycle busy workers after N batches / a memory ceiling).

Change

One subsection added to apps/docs/self-hosting/configuration.mdx, right after the embedding-performance table. Docs-only, no code changes.

Each local embedding worker's WASM linear memory only grows (up to ~4 GB)
and is reclaimed only when the pool goes fully idle. Add a 'Memory
considerations' note so self-hosters can budget peak memory
(~POOL_SIZE x 4 GB) and avoid OOM on small/continuously-ingesting hosts.

Refs supermemoryai#1093

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
@ishaanxgupta

Copy link
Copy Markdown
Contributor

@chetanunadkat I see some conflicts here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants