Cerebras
Cerebras offers the fastest LLM inference available, significantly faster than GPU-based providers, which makes it ideal for real-time applications.
Setup
Get an API key at cloud.cerebras.ai.

    const agent = new Agent({
      name: 'my-agent',
      model: {
        provider: 'cerebras',
        model: 'llama3.3-70b',
      },
      systemPrompt: 'You are a helpful assistant.',
    });

    npx wrangler secret put CEREBRAS_API_KEY

Recommended models
| Model | Best for |
|---|---|
| llama3.3-70b | Best quality (recommended) |
| llama3.1-8b | Fastest, lightweight tasks |
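As a rough illustration of how the table might be applied in code, the sketch below picks a model ID based on whether a task needs the best quality. The `CerebrasModel` type and the `pickCerebrasModel` helper are hypothetical names introduced here, not part of any SDK; only the two model IDs and the `provider`/`model` fields come from this page.

```typescript
// Model IDs copied from the table above.
type CerebrasModel = 'llama3.3-70b' | 'llama3.1-8b';

// Hypothetical helper (an assumption, not a library function):
// returns the recommended model for quality-sensitive work,
// and the smaller model for fast, lightweight tasks.
function pickCerebrasModel(needsBestQuality: boolean): CerebrasModel {
  return needsBestQuality ? 'llama3.3-70b' : 'llama3.1-8b';
}

// The resulting object has the same shape as the `model` field
// passed to the agent in the Setup section.
const lightweightModelConfig = {
  provider: 'cerebras',
  model: pickCerebrasModel(false),
};
```

Keeping the choice in one function means a later change to the recommended model only has to be made in a single place.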
Environment variable

    interface Env {
      CEREBRAS_API_KEY: string;
    }
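A missing secret usually shows up only when the first request fails, so it can help to check for it early. The following is a minimal sketch of such a check, assuming the `Env` interface above; `requireCerebrasKey` is a hypothetical helper name introduced for illustration, not something the framework provides.

```typescript
interface Env {
  CEREBRAS_API_KEY: string;
}

// Hypothetical guard (an assumption, not a framework API):
// returns the key if it is a non-empty string, otherwise throws
// an error that points at the wrangler command from the Setup section.
function requireCerebrasKey(env: Partial<Env>): string {
  const key = env.CEREBRAS_API_KEY;
  if (typeof key !== 'string' || key.trim() === '') {
    throw new Error(
      'CEREBRAS_API_KEY is not set. Run: npx wrangler secret put CEREBRAS_API_KEY'
    );
  }
  return key;
}
```

Calling `requireCerebrasKey(env)` at the top of a request handler turns a confusing downstream authentication failure into a clear configuration error.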