> For the complete documentation index, see [llms.txt](/llms.txt).
> A full single-fetch corpus is available at [llms-full.txt](/llms-full.txt).
---
title: Prompts
description: Manage prompt text as a first-class artifact, run it from Python or TypeScript, and keep production prompt versions tied to code.
last_verified: 2026-07-24
---


A **Prompt** is a managed LLM prompt that can be drafted and tested in AGNT5, then committed with your application code for production. Use Prompts when the prompt itself is an artifact you want to version, evaluate, and deploy, instead of burying instructions inside function bodies.

In development, a Prompt can resolve through AGNT5's prompt store so you can test drafts quickly. In production, AGNT5 resolves Prompts from files bundled with the deployed code artifact. That keeps production behavior reproducible: the code version and prompt version move together.


**Go:**

The managed Prompt artifact described on this page (`prompts/<id>.mdx`, versioning, runtime overrides) is **not available in the Go SDK yet** — there's no `Prompt` type, prompt store, or file-based prompt resolution. The Go SDK only exposes the lower-level prompt-caching knob on `GenerateRequest`/`Agent`; see [Prompt caching](/docs/build/prompt-caching.md) for what is available. For now, keep prompt text as plain Go string constants or template files you manage yourself, and pass the rendered text as `Messages` on `GenerateRequest`.


---

## Run a Prompt


**Python:**

```python
from agnt5 import lm
from agnt5.lm import Prompt

response = await lm.generate(
    model="openai/gpt-4o-mini",
    prompt=Prompt(
        id="support_reply",
        variables={
            "customer": {"name": "Ada"},
            "topic": "shipping",
        },
    ),
)

print(response.text)
```


**TypeScript:**

```typescript
const lm = LM.openai();

const response = await lm.generate({
  model: "openai/gpt-4o-mini",
  prompt: {
    id: "support_reply",
    variables: {
      customer: { name: "Ada" },
      topic: "shipping",
    },
  },
});

console.log(response.text);
```


**Go:**

Not available — see the note above.


<Callout type="info">`prompt` replaces inline `messages` for the call. If you provide a Prompt, do not also pass raw prompt text or message arrays for the same generation request.</Callout>

---

## Commit a production Prompt

Create a prompt file under `prompts/<id>.mdx` in your application repo:

```mdx
---
id: support_reply
version: 3
version_id: 018f0000-0000-7000-8000-000000000003
model: openai/gpt-4o-mini
temperature: 0.2
max_tokens: 600
variables:
  - customer.name
  - topic
response_format: text
---

<System>
You are a concise support agent.
</System>

<User>
Reply to {{customer.name}} about {{topic}}.
</User>
```

AGNT5 parses the front matter for routing and generation settings, then renders the body into ordered chat messages. The body supports `<System>`, `<User>`, and `<Assistant>` blocks. If no block is present, the body is treated as a single user message.

Production Prompt resolution looks in this order:

1. `AGNT5_PROMPT_OVERRIDE`
2. `AGNT5_PROMPTS_MANIFEST`
3. `prompts/<id>.mdx`
4. `prompts/<id>.md`
5. AGNT5 prompt-run API fallback for non-production draft and test workflows

Prompt files must be Markdown or MDX. AGNT5 does not load prompt text from `prompts.lock` or JSON files; prompts are meant to be readable authored artifacts.

<Callout type="warning">In production environments, the Prompt must be bundled with the deployed artifact. Missing Prompts fail closed and do not fall back to control-plane state.</Callout>

---

## Select a version

Use `version` when constructing the Prompt object, or pass the SDK's version option for the call.


**Python:**

```python
response = await lm.generate(
    model="openai/gpt-4o-mini",
    prompt=Prompt(id="support_reply", version="version-3"),
)
```


**TypeScript:**

```typescript
const response = await lm.generate({
  model: "openai/gpt-4o-mini",
  prompt: { id: "support_reply", version: "version-3" },
});
```


**Go:**

Not available — see the note above.


The version can match either the prompt file's `version` or `version_id`.

---

## Override runtime settings

Use runtime LLM overrides when you want to test the same prompt with a different model or generation settings. This is useful in the Playground, experiments, and one-off comparisons because workflow code can stay unchanged.


**Python:**

```python
ctx.runtime.llm.model = "openai/gpt-4o"
ctx.runtime.llm.temperature = 0.6
ctx.runtime.llm.max_tokens = 800
ctx.runtime.llm.top_p = 0.9

response = await lm.generate(
    model="openai/gpt-4o-mini",
    prompt=Prompt(
        id="support_reply",
        variables={"topic": "shipping"},
    ),
)
```


**TypeScript:**

```typescript
ctx.runtime.llm.model = "openai/gpt-4o";
ctx.runtime.llm.temperature = 0.6;
ctx.runtime.llm.maxOutputTokens = 800;
ctx.runtime.llm.topP = 0.9;

const response = await lm.generate({
  model: "openai/gpt-4o-mini",
  prompt: {
    id: "support_reply",
    variables: { topic: "shipping" },
  },
});
```


**Go:**

Not available — see the note above.


For workflows with multiple prompts, set prompt-specific overrides by prompt id:


**Python:**

```python
ctx.runtime.prompts["draft"] = LLMRuntimeOptions(
    model="anthropic/claude-3-5-haiku-20241022",
    temperature=0.7,
)
ctx.runtime.prompts["review"] = LLMRuntimeOptions(
    model="openai/gpt-4o",
    temperature=0.3,
)

await lm.generate(model="openai/gpt-4o-mini", prompt=Prompt(id="classify"))
await lm.generate(model="openai/gpt-4o-mini", prompt=Prompt(id="draft"))
await lm.generate(model="openai/gpt-4o-mini", prompt=Prompt(id="review"))
```


**TypeScript:**

```typescript
ctx.runtime.prompts['draft'] = {
  model: 'anthropic/claude-3-5-haiku-20241022',
  temperature: 0.7,
};
ctx.runtime.prompts['review'] = {
  model: 'openai/gpt-4o',
  temperature: 0.3,
};

await lm.generate({ model: 'openai/gpt-4o-mini', prompt: { id: 'classify' } });
await lm.generate({ model: 'openai/gpt-4o-mini', prompt: { id: 'draft' } });
await lm.generate({ model: 'openai/gpt-4o-mini', prompt: { id: 'review' } });
```


**Go:**

Not available — see the note above.


The Prompt file remains the source of truth for prompt text. Runtime overrides only change model execution settings for the current run. Prompt-specific overrides win over the global runtime default.

---

## Use Prompts inside workflows

Prompts work anywhere you can call the language model SDK. Inside workflows, keep the LLM call inside a checkpointed step so replay can reuse the completed result.


**Python:**

```python
from agnt5 import function, FunctionContext, lm
from agnt5.lm import Prompt

@function
async def draft_reply(ctx: FunctionContext, customer_name: str, topic: str) -> str:
    response = await lm.generate(
        model="openai/gpt-4o-mini",
        prompt=Prompt(
            id="support_reply",
            variables={
                "customer": {"name": customer_name},
                "topic": topic,
            },
        ),
    )
    return response.text
```

When this function is called through `ctx.step()` from a workflow, the generated reply is checkpointed with the rest of the step result.


**TypeScript:**

```typescript
const lm = LM.openai();

export const draftReply = fn('draft_reply').run(
  async (ctx: Context, input: { customerName: string; topic: string }) => {
    const response = await lm.generate({
      model: 'openai/gpt-4o-mini',
      prompt: {
        id: 'support_reply',
        variables: {
          customer: { name: input.customerName },
          topic: input.topic,
        },
      },
    });
    return response.text;
  },
);
```

Because `fn(...).run(...)` checkpoints automatically when called with `ctx` from a workflow, the generated reply is checkpointed with the rest of the step result — no separate `ctx.step()` call needed.


**Go:**

Not available — see the note above. For a plain (non-managed) prompt string, wrap the `ctx.Generate` call in `agnt5.Step` as usual so the result is checkpointed.


---

## Compatibility

Older SDK code may still use `PromptRef`, `prompt_ref`, or `promptRef`. Those names remain supported for compatibility, but new docs and new code should use **Prompt** as the concept:

```python
# Supported, but prefer prompt=Prompt(...)
await lm.generate(
    model="openai/gpt-4o-mini",
    prompt_ref="support_reply",
    variables={"topic": "shipping"},
)
```

```ts
// Supported, but prefer prompt: { id: "support_reply" }
await lm.generate({
  model: "openai/gpt-4o-mini",
  promptRef: "support_reply",
  variables: { topic: "shipping" },
});
```

Use `PromptRef` only when maintaining older code that already uses that terminology.