Models
Models and Keys
Choose between local memory mode, optional on-device models, and optional Anthropic API keys for richer extraction, page synthesis, recaps, and graph work.
At a glance
01
Wenlan works without downloading a local model or adding an API key.
02
Optional models and keys add heavier language work such as extraction, page synthesis, recaps, and richer graph work.
01
Three setup levels
Start with local memory mode unless you know you need automated language-heavy maintenance. It gives you storage, embeddings, dedupe, hybrid search, MCP recall, and local artifacts.
Add an on-device model or Anthropic key when you want richer daemon-side extraction, page synthesis, recaps, and knowledge graph work.
Setup options
wenlan setup --basic
wenlan model install qwen3-4b
wenlan key set anthropic
wenlan model status
wenlan key status
wenlan doctor02
Local memory mode
Local memory mode is the no-model default. It is enough for the core work loop: brief, capture, recall, handoff, distill from the agent side, and inspect local artifacts.
This mode is also the right starting point when you want the smallest install path or when a machine cannot run local model inference comfortably.
03
On-device model
An on-device model lets the daemon perform more language work locally. It can help with classification, extraction, title generation, recaps, and other maintenance tasks without sending those daemon-side prompts to an external API.
Local model capability is still bounded by hardware and context limits. For large distillation jobs, an API provider can be stronger when you choose to configure one.
04
Anthropic key
A configured Anthropic key is bring-your-own-key for daemon-side language tasks. It is optional and should be treated as an explicit tradeoff: stronger language work in exchange for sending the relevant task prompt to the API provider.
The CLI can read ANTHROPIC_API_KEY through the key setup path. Your connected AI client may already call its own provider during normal chat; this page only describes Wenlan's daemon-side model and key paths.
05
Agent-side fallback
Claude Code skills can classify captures, write handoffs, and synthesize pages from inside the active agent session. That is why Wenlan remains useful before you configure a daemon model.
The daemon stays the store and retrieval layer. The agent can supply language judgment when it is already in the work context.
Local mode fallback
Pick memory_type daemon classifier /capture chooses type
Extract entities daemon extraction /capture posts entities
Synthesize pages daemon distill cycle /distill writes page
Expand/rerank recall daemon rerank/expand /recall rewrites queryNext
Advanced Retrieval Status
Understand Wenlan's shipped retrieval path and the opt-in main-branch experiments behind newer retrieval work.
Read next