GuideBeginnerAgents2026-05-22

Getting Started with the Claude API: From First Call to Production

Learn how to integrate Claude into your applications using the Messages API, Managed Agents, and SDKs. Includes Python code examples, model selection tips, and best practices for production.

Quick Answer

This guide walks you through setting up the Claude API, making your first call with Python, choosing between Messages and Managed Agents, selecting the right model, and following best practices for production deployment.

Claude APIPython SDKMessages APIManaged AgentsQuickstart

Introduction

Claude is more than just a conversational AI—it's a powerful engine you can integrate into your own applications, workflows, and tools. Whether you're building a customer support chatbot, a code assistant, or an automated content generator, the Claude API gives you direct access to the same models that power claude.ai.

This guide will take you from zero to production: getting your API key, making your first request, understanding the two primary development surfaces (Messages API and Managed Agents), and choosing the right model for your use case. By the end, you'll have a solid foundation for building with Claude.

Prerequisites

A Claude API account (sign up for free)
Basic familiarity with Python (or TypeScript, Go, Java, Ruby, PHP, C#)
Python 3.8+ installed locally (for code examples)

Step 1: Get Your API Key

Log in to the Anthropic Console.
Navigate to API Keys in the left sidebar.
Click Create Key and give it a name (e.g., "My First App").
Copy the key immediately—you won't be able to see it again.

Security tip: Never hardcode your API key in source code. Use environment variables or a secrets manager.

Step 2: Install the SDK

Anthropic provides official SDKs for multiple languages. For Python, install the anthropic package:

pip install anthropic

For TypeScript/JavaScript:

npm install @anthropic-ai/sdk

Step 3: Make Your First API Call

Here's the simplest possible call using the Python SDK. This sends a single user message and prints Claude's response.

import anthropic
client = anthropic.Anthropic(
    api_key="YOUR_API_KEY"  # Replace with your actual key or use env var
)
message = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    messages=[
        {"role": "user", "content": "Hello, Claude!"}
    ]
)
print(message.content[0].text)

What's happening?

model: Specifies which Claude model to use (we'll cover model selection shortly).
max_tokens: Limits the length of Claude's response (1 token ≈ 0.75 words).
messages: An array of conversation turns. Each turn has a role ("user" or "assistant") and content.

TypeScript Equivalent

import Anthropic from '@anthropic-ai/sdk';
const client = new Anthropic({
  apiKey: process.env.ANTHROPIC_API_KEY,
});
async function main() {
  const message = await client.messages.create({
    model: 'claude-sonnet-4-6',
    max_tokens: 1024,
    messages: [{ role: 'user', content: 'Hello, Claude!' }],
  });
  console.log(message.content[0].text);
}
main();

Step 4: Choose Your Development Surface

Claude offers two primary ways to build:

Messages API (Direct Model Access)

With the Messages API, you have full control. You construct every turn of the conversation, manage state yourself, and write your own tool loop. This is ideal for:

Custom chat interfaces
Complex multi-turn workflows
Applications where you need fine-grained control over context

When to use: You want maximum flexibility and are comfortable managing conversation state.

Managed Agents (Fully Managed Infrastructure)

Managed Agents provide a higher-level abstraction. You define an agent (with instructions, tools, and model choice), and Anthropic handles conversation state, session persistence, and event history. This is perfect for:

Autonomous task execution
Long-running background agents
Applications where you don't want to manage state yourself

When to use: You want to deploy an agent quickly without building infrastructure.

Pro tip: You can start with the Messages API for prototyping, then migrate to Managed Agents when you need persistent sessions and event history.

Step 5: Pick the Right Model

Claude comes in three tiers. Choosing the right one balances performance, cost, and speed.

Model	ID	Best For	Speed
Opus 4.7	`claude-opus-4-7`	Complex analysis, deep reasoning, creative tasks	Slower
Sonnet 4.6	`claude-sonnet-4-6`	General production workloads, balanced intelligence & speed	Fast
Haiku 4.5	`claude-haiku-4-5`	High-volume, latency-sensitive apps	Fastest

Recommendations:

Start with Sonnet for most use cases—it's the best balance.
Switch to Opus when you need deeper reasoning (e.g., code review, legal analysis).
Use Haiku for simple tasks like classification, summarization, or customer support triage.

Step 6: Explore Advanced Features

Once you're comfortable with basic calls, try these powerful capabilities:

Extended Thinking

Claude can "think" before responding, improving reasoning on complex tasks. Enable it with the thinking parameter:

message = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=2048,
    thinking={"type": "enabled", "budget_tokens": 1024},
    messages=[{"role": "user", "content": "Solve this logic puzzle..."}]
)

Vision (Image Input)

Claude can analyze images. Pass base64-encoded image data:

import base64
with open("chart.png", "rb") as f:
    image_data = base64.b64encode(f.read()).decode()
message = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "What does this chart show?"},
            {"type": "image", "source": {"type": "base64", "media_type": "image/png", "data": image_data}}
        ]
    }]
)

Tool Use (Function Calling)

Claude can call external functions. Define tools as JSON schema:

tools = [
    {
        "name": "get_weather",
        "description": "Get current weather for a city",
        "input_schema": {
            "type": "object",
            "properties": {
                "location": {"type": "string"}
            },
            "required": ["location"]
        }
    }
]
message = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    tools=tools,
    messages=[{"role": "user", "content": "What's the weather in Tokyo?"}]
)

Prompt Caching

Reduce costs and latency for repeated system prompts by caching them:

message = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    system=[
        {
            "type": "text",
            "text": "You are a helpful assistant...",
            "cache_control": {"type": "ephemeral"}
        }
    ],
    messages=[{"role": "user", "content": "Hello"}]
)

Step 7: Evaluate and Ship

Before going to production, follow these best practices:

Prompting Best Practices

Be specific and clear in your instructions.
Use system prompts for role-setting (e.g., "You are a customer support agent...").
Provide examples (few-shot prompting) for complex tasks.

Run Evals

Create a test set of inputs and expected outputs. Use the Workbench to iterate on prompts before deploying.

Safety & Guardrails

Implement content filtering for sensitive use cases.
Use Claude's built-in safety features (e.g., refuse harmful requests).
Monitor for prompt injection attempts.

Rate Limits & Errors

Handle 429 (rate limit) errors with exponential backoff.
Set up monitoring for 500 errors and latency spikes.
Use the Usage Monitoring dashboard to track costs.

Cost Optimization

Use Haiku for simple tasks, Sonnet for most, Opus only when needed.
Enable prompt caching for repeated system prompts.
Set appropriate max_tokens—don't over-allocate.

Next Steps

Try the Workbench to experiment with prompts without writing code.
Explore the Cookbook for ready-to-use code samples.
Deploy a starter app from the Quickstarts page.
Learn about Claude Code for an agentic coding assistant in your terminal.

Key Takeaways

Get started in minutes: Sign up, grab an API key, install the SDK, and make your first call with just a few lines of code.
Choose the right surface: Use the Messages API for full control, Managed Agents for hands-off state management.
Pick the right model: Sonnet for balance, Opus for deep reasoning, Haiku for speed and cost savings.
Leverage advanced features: Extended thinking, vision, tool use, and prompt caching unlock powerful capabilities.
Follow production best practices: Run evals, implement guardrails, handle errors gracefully, and monitor costs.