Agents

Agents are large language models (LLMs) that use tools in a loop to accomplish tasks.

These components work together:

LLMs process input and decide the next action
Tools extend capabilities beyond text generation (reading files, calling APIs, writing to databases)
Loop orchestrates execution through:
- Context management - Maintaining conversation history and deciding what the model sees (input) at each step
- Stopping conditions - Determining when the loop (task) is complete

Agent Class

The Agent class handles these three components. Here's an agent that uses multiple tools in a loop to accomplish a task:

import { Experimental_Agent as Agent, stepCountIs, tool } from 'ai';
import { z } from 'zod';

const weatherAgent = new Agent({
  model: 'openai/gpt-4o',
  tools: {
    weather: tool({
      description: 'Get the weather in a location (in Fahrenheit)',
      inputSchema: z.object({
        location: z.string().describe('The location to get the weather for'),
      }),
      execute: async ({ location }) => ({
        location,
        temperature: 72 + Math.floor(Math.random() * 21) - 10,
      }),
    }),
    convertFahrenheitToCelsius: tool({
      description: 'Convert temperature from Fahrenheit to Celsius',
      inputSchema: z.object({
        temperature: z.number().describe('Temperature in Fahrenheit'),
      }),
      execute: async ({ temperature }) => {
        const celsius = Math.round((temperature - 32) * (5 / 9));
        return { celsius };
      },
    }),
  },
  stopWhen: stepCountIs(20),
});

const result = await weatherAgent.generate({
  prompt: 'What is the weather in San Francisco in celsius?',
});

console.log(result.text); // agent's final answer
console.log(result.steps); // steps taken by the agent

The agent automatically:

Calls the weather tool to get the temperature in Fahrenheit
Calls convertFahrenheitToCelsius to convert it
Generates a final text response with the result

The Agent class handles the loop, context management, and stopping conditions.

Why Use the Agent Class?

The Agent class is the recommended approach for building agents with the AI SDK because it:

Reduces boilerplate - Manages loops and message arrays
Improves reusability - Define once, use throughout your application
Simplifies maintenance - Single place to update agent configuration

For most use cases, start with the Agent class. Use core functions (generateText, streamText) when you need explicit control over each step for complex structured workflows.

Structured Workflows

Agents are flexible and powerful, but non-deterministic. When you need reliable, repeatable outcomes with explicit control flow, use core functions with structured workflow patterns combining:

Conditional statements for explicit branching
Standard functions for reusable logic
Error handling for robustness
Explicit control flow for predictability

Explore workflow patterns to learn more about building structured, reliable systems.

Next Steps

Building Agents - Guide to creating agents with the Agent class
Workflow Patterns - Structured patterns using core functions for complex workflows
Loop Control - Execution control with stopWhen and prepareStep