Deploy an AI Agent or LLM App

Varity Team Core Contributors Updated June 2026

Varity automatically detects AI agent apps and configures the right hosting for them. You write the code, run one command, and your agent is live.

One-Click Agent Templates

Don’t want to write code? Varity ships ready-to-deploy certified templates for popular open-source tools including Agent Zero, Flowise, Open WebUI, and Uptime Kuma. The certified list is gateway-owned and changes as templates pass live deploy and browser verification.

The proven path for these templates is the Developer Portal. Open the template gallery at developer.store.varity.so/dashboard/deploy and click deploy on any template. That’s it. One click, live URL. If a template needs environment variables (like an API key), the portal prompts you for them before deploying.

If you prefer the terminal or your AI editor, deploy the same certified templates by template ID:

CLI

Use the portal or the MCP tools to find the certified template ID, then deploy it from the CLI. If the template requires environment variables, pass each one with its own --env flag:

varitykit app deploy --template flowise --name my-flowise

MCP (AI coding tools)

With the Varity MCP server installed, ask your AI editor to use the template tools:

“Show details for Agent Zero, then deploy Agent Zero on Varity”

Your editor should call varity_template_info first, then varity_deploy_template with template: "agent-zero". To deploy a local project or Docker image instead of a template, use varity_deploy.

The rest of this guide covers deploying an agent you wrote yourself.

Prerequisites

Python 3.11+ or Node.js 20+
varitykit CLI installed:
```
pipx install varitykit
```
(pipx is recommended because it installs the CLI into an isolated environment. pip install varitykit also works.)
A Varity account with a deploy key. Run varitykit login if you have not logged in yet.

Option 1: Python AI Agent (FastAPI)

This example builds a simple agent endpoint that accepts a prompt and returns a response using the OpenAI API.

Create your agent

from fastapi import FastAPI
from pydantic import BaseModel
import openai
import os

app = FastAPI()
# Use the async client so the OpenAI call does not block FastAPI's event loop.
client = openai.AsyncOpenAI(api_key=os.environ["OPENAI_API_KEY"])

class PromptRequest(BaseModel):
    prompt: str

@app.post("/run")
async def run_agent(request: PromptRequest):
    # Forward the prompt to the model and return the first completion.
    response = await client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": request.prompt}],
    )
    return {"result": response.choices[0].message.content}

@app.get("/health")
async def health():
    # Lightweight endpoint for checking that the app is up.
    return {"status": "ok"}

Add your dependencies
requirements.txt
```
fastapi
uvicorn
openai
```
Add your API key

Create a .env file in your project root:
.env
```
OPENAI_API_KEY=sk-...
```
Varity reads this file at deploy time and injects the variable into your app’s runtime environment.
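Because the key only exists in the runtime environment, it can help to fail at startup with a clear message when it is missing, rather than on the first request. The following is a minimal sketch, not part of varitykit or the OpenAI SDK; `require_env` is a hypothetical helper name, and the optional `env` parameter exists only to make the function easy to test.

```python
import os
from typing import Mapping, Optional


def require_env(name: str, env: Optional[Mapping[str, str]] = None) -> str:
    """Return a non-empty environment variable or raise a clear error.

    Hypothetical helper for illustration; pass `env` to test without
    touching the real process environment.
    """
    source = os.environ if env is None else env
    value = source.get(name, "").strip()
    if not value:
        raise RuntimeError(
            f"{name} is not set. Add it to your .env file before deploying."
        )
    return value


# Usage at startup (assumed placement, before creating the OpenAI client):
#   api_key = require_env("OPENAI_API_KEY")
```

Calling the helper once at import time surfaces a missing key in the deployment logs instead of as a failed request later.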
Deploy
```
varitykit app deploy
```
Varity detects FastAPI, configures dynamic compute hosting, and deploys your agent.

Your agent is live

Detected: FastAPI (Python)
Hosting: dynamic compute
Deploying...
Your agent is live at: https://your-agent.varity.app/

Send a request to test it:

curl -X POST https://your-agent.varity.app/run \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Summarize the Varity docs in one sentence."}'
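If you would rather call the agent from Python than from curl, the same request can be sketched with the standard library alone. This assumes the /run endpoint and the {"result": ...} response shape from the example agent above; the URL is the placeholder from the sample output, and the function names are illustrative.

```python
import json
import urllib.request

# Placeholder URL from the sample output; replace it with your own.
AGENT_URL = "https://your-agent.varity.app"


def build_run_request(base_url: str, prompt: str) -> urllib.request.Request:
    """Build the same POST /run request that the curl command sends."""
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/run",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_agent(base_url: str, prompt: str, timeout: float = 30.0) -> str:
    """Send the prompt and return the "result" field of the JSON response."""
    request = build_run_request(base_url, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return payload["result"]


# Usage:
#   print(run_agent(AGENT_URL, "Summarize the Varity docs in one sentence."))
```

Keeping request construction separate from sending makes it possible to inspect the URL, headers, and body without making a network call.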

Option 2: Node.js AI Agent (Express)

This example builds an Express API that wraps the OpenAI API.

Create your agent

const express = require('express');
const OpenAI = require('openai');

const app = express();
app.use(express.json());

const client = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

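app.post('/run', async (req, res) => {
  const { prompt } = req.body;
  if (typeof prompt !== 'string' || prompt.length === 0) {
    return res.status(400).json({ error: 'prompt must be a non-empty string' });
  }
  try {
    const response = await client.chat.completions.create({
      model: 'gpt-4o-mini',
      messages: [{ role: 'user', content: prompt }]
    });
    res.json({ result: response.choices[0].message.content });
  } catch (err) {
    // Express 4 does not catch rejected promises from async handlers.
    console.error(err);
    res.status(500).json({ error: 'Model request failed' });
  }
});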

app.get('/health', (_req, res) => res.json({ status: 'ok' }));

const port = process.env.PORT || 3000;
app.listen(port, '0.0.0.0', () => {
  console.log(`Agent running on port ${port}`);
});

Add your dependencies
package.json

{
  "name": "my-agent",
  "version": "1.0.0",
  "scripts": {
    "start": "node index.js"
  },
  "dependencies": {
    "express": "^4.20.0",
    "openai": "^4.0.0"
  }
}

Add your API key
.env
```
OPENAI_API_KEY=sk-...
```
Deploy
```
varitykit app deploy
```
Varity detects Express, configures dynamic compute hosting, and deploys your agent.

Your agent is live

Detected: Express (Node.js)
Hosting: dynamic compute
Deploying...
Your agent is live at: https://your-agent.varity.app/

Using a Local Language Model

If your agent runs a language model locally instead of calling an external API, add the ollama package to your dependencies. Varity will detect it and automatically provision a model server alongside your app.

Python

requirements.txt:

fastapi
uvicorn
ollama

import os
import ollama
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()
ollama_url = os.environ.get("OLLAMA_URL", "http://localhost:11434")
# The server address is set on the client; chat() itself takes no host argument.
client = ollama.AsyncClient(host=ollama_url)

class PromptRequest(BaseModel):
    prompt: str

@app.post("/run")
async def run_agent(request: PromptRequest):
    # Await the async client so the request does not block the event loop.
    response = await client.chat(
        model="llama3",
        messages=[{"role": "user", "content": request.prompt}],
    )
    return {"result": response["message"]["content"]}

Node.js

package.json:

{
  "dependencies": {
    "express": "^4.20.0",
    "@langchain/ollama": "^0.1.0"
  }
}

const express = require('express');
const { Ollama } = require('@langchain/ollama');

const app = express();
app.use(express.json());

const llm = new Ollama({
  baseUrl: process.env.OLLAMA_URL || 'http://localhost:11434',
  model: 'llama3'
});

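app.post('/run', async (req, res) => {
  const result = await llm.invoke(req.body.prompt);
  res.json({ result });
});

// Start the server, as in the Express example earlier in this guide.
const port = process.env.PORT || 3000;
app.listen(port, '0.0.0.0', () => {
  console.log(`Agent running on port ${port}`);
});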

When you deploy, Varity:

Detects ollama or @langchain/ollama in your dependencies
Provisions a model server alongside your app
Injects OLLAMA_URL into your runtime environment automatically

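Your app reads OLLAMA_URL from the environment to connect: process.env.OLLAMA_URL in Node.js or os.environ.get("OLLAMA_URL") in Python. The examples above fall back to http://localhost:11434, so the same code also works against an Ollama server running on your own machine.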
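A minimal sketch of resolving the model server address in Python, assuming you want the injected OLLAMA_URL when it is present and Ollama's default local address otherwise. `resolve_ollama_url` is a hypothetical helper, not part of the ollama package or varitykit, and the URL validation is an added safety check rather than something Varity requires.

```python
import os
from typing import Mapping, Optional
from urllib.parse import urlparse

# Ollama's default address when running on your own machine.
LOCAL_OLLAMA_URL = "http://localhost:11434"


def resolve_ollama_url(env: Optional[Mapping[str, str]] = None) -> str:
    """Return OLLAMA_URL if set, else the local default, without a trailing slash.

    Hypothetical helper for illustration; raises ValueError for values that
    are not http(s) URLs so misconfiguration shows up at startup.
    """
    source = os.environ if env is None else env
    url = source.get("OLLAMA_URL", "").strip() or LOCAL_OLLAMA_URL
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"OLLAMA_URL is not a valid http(s) URL: {url!r}")
    return url.rstrip("/")


# Usage (the ollama package's clients accept a host argument):
#   client = ollama.AsyncClient(host=resolve_ollama_url())
```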

Check Deployment Status

After deploying, check on your agent:

# See if it is running
varitykit app status

Or ask your AI coding tool (if you have the Varity MCP installed):

“Check the status of my Varity deployment”

“Show me the logs for my last deployment”
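You can also probe the agent directly. Both example agents in this guide expose GET /health, so a script can poll that endpoint until it answers. The sketch below uses only the standard library; the function names, retry count, and delay are illustrative choices, not Varity defaults.

```python
import time
import urllib.request
from typing import Callable


def check_health(base_url: str, timeout: float = 5.0) -> bool:
    """Return True if GET <base_url>/health answers with HTTP 200."""
    url = base_url.rstrip("/") + "/health"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # URLError, HTTPError (for example a 502), and timeouts are all
        # OSError subclasses, so any of them counts as "not healthy yet".
        return False


def wait_until_healthy(
    base_url: str,
    attempts: int = 6,
    delay_seconds: float = 5.0,
    probe: Callable[[str], bool] = check_health,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Probe the agent up to `attempts` times, pausing between failures."""
    for attempt in range(1, attempts + 1):
        if probe(base_url):
            return True
        if attempt < attempts:
            sleep(delay_seconds)
    return False


# Usage (placeholder URL from the sample output):
#   wait_until_healthy("https://your-agent.varity.app")
```

With the defaults shown, the script keeps trying for roughly half a minute, which gives a slow-starting app time to come up before you treat a failed response as a real problem.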

Troubleshooting

“ProjectDetectionError: Framework not supported”

Check that your project has a requirements.txt (Python) or package.json (Node.js) with a supported framework listed. See Supported Frameworks.
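A quick local check along these lines can be sketched in Python. It is not how Varity performs detection; it only looks for a manifest file and for the two frameworks used in this guide (FastAPI and Express). The function names are hypothetical, and the full supported list lives on the Supported Frameworks page.

```python
import json
import re
from pathlib import Path
from typing import Optional

# Only the frameworks shown in this guide; extend from Supported Frameworks.
PYTHON_FRAMEWORKS = {"fastapi"}
NODE_FRAMEWORKS = {"express"}


def python_requirements(text: str) -> set[str]:
    """Extract lowercase package names from requirements.txt content."""
    names = set()
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if not line or line.startswith("-"):
            continue  # skip blanks, comments, and pip options such as -r or -e
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.add(match.group(0).lower())
    return names


def node_dependencies(text: str) -> set[str]:
    """Extract package names from the "dependencies" of package.json content."""
    return set(json.loads(text).get("dependencies", {}))


def find_framework(project_dir: str) -> Optional[str]:
    """Return the first known framework declared in the project, or None."""
    root = Path(project_dir)
    requirements = root / "requirements.txt"
    package_json = root / "package.json"
    if requirements.is_file():
        found = python_requirements(requirements.read_text()) & PYTHON_FRAMEWORKS
        if found:
            return sorted(found)[0]
    if package_json.is_file():
        found = node_dependencies(package_json.read_text()) & NODE_FRAMEWORKS
        if found:
            return sorted(found)[0]
    return None


# Usage: find_framework(".") returns "fastapi", "express", or None.
```

A result of None means either the manifest file is missing or the framework is not listed in it, which are the two cases this error message points at.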

Agent returns 502 after deploy

Your app may be taking a few extra seconds to start. Wait 30 seconds and try again. If it persists, check your logs by asking your AI coding tool: