GuideBeginnerBest Practices2026-05-17

Getting Started with the Claude API: A Practical Guide for Developers

Learn how to integrate Claude AI into your applications using the Anthropic API. Covers authentication, messaging, streaming, and best practices for Python and TypeScript.

Quick Answer

This guide walks you through setting up the Claude API, making your first request, handling streaming responses, and following best practices for production use.

Claude APIPythonTypeScriptIntegrationStreaming

Introduction

Claude, developed by Anthropic, is a powerful AI assistant that can be integrated into your own applications via the Anthropic API. Whether you're building a chatbot, content generator, or data analysis tool, the Claude API provides a straightforward way to leverage Claude's capabilities.

This guide covers everything you need to get started: from authentication and making your first request to streaming responses and following best practices. By the end, you'll be ready to build your own Claude-powered application.

Prerequisites

Before you begin, make sure you have:

An Anthropic account (sign up is free)
An API key (found in the API Keys section of your account dashboard)
Basic familiarity with Python or TypeScript
Node.js (v18+) installed if using TypeScript, or Python 3.8+ installed if using Python

Step 1: Authentication

Every API request must include your API key. The key is passed via the x-api-key header. For security, never hardcode your API key in your source code. Use environment variables instead.

Setting up your environment variable

export ANTHROPIC_API_KEY="sk-ant-..."

Python example

import os
from anthropic import Anthropic
client = Anthropic(
    api_key=os.environ.get("ANTHROPIC_API_KEY")
)

TypeScript example

import Anthropic from '@anthropic-ai/sdk';
const client = new Anthropic({
  apiKey: process.env['ANTHROPIC_API_KEY'],
});

Step 2: Making Your First API Call

The core endpoint is the Messages API. You send a list of messages and Claude responds with a message.

Python

message = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=[
        {"role": "user", "content": "Hello, Claude!"}
    ]
)
print(message.content[0].text)

TypeScript

async function main() {
  const message = await client.messages.create({
    model: 'claude-3-5-sonnet-20241022',
    max_tokens: 1024,
    messages: [
      { role: 'user', content: 'Hello, Claude!' }
    ],
  });
console.log(message.content[0].text);
}
main();

Expected output:

Hello! How can I assist you today?

Step 3: Understanding the Request Structure

A Messages API request has these key fields:

model: The Claude model to use (e.g., claude-3-5-sonnet-20241022, claude-3-haiku-20240307).
max_tokens: The maximum number of tokens to generate in the response.
messages: An array of message objects, each with a role (user or assistant) and content.
system (optional): A system prompt to set Claude's behavior.
temperature (optional): Controls randomness (0.0 to 1.0). Lower values make output more deterministic.

Example with system prompt

message = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    system="You are a helpful assistant that speaks like a pirate.",
    messages=[
        {"role": "user", "content": "What is the capital of France?"}
    ]
)
print(message.content[0].text)
Output: "Arrr, the capital o' France be Paris, me hearty!"

Step 4: Streaming Responses

For a better user experience, you can stream Claude's response token by token. This is especially useful for chat applications where you want to show the response as it's being generated.

Python streaming

stream = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=[
        {"role": "user", "content": "Tell me a short story about a robot."}
    ],
    stream=True
)
for event in stream:
    if event.type == "content_block_delta":
        print(event.delta.text, end="", flush=True)

TypeScript streaming

const stream = await client.messages.create({
  model: 'claude-3-5-sonnet-20241022',
  max_tokens: 1024,
  messages: [
    { role: 'user', content: 'Tell me a short story about a robot.' }
  ],
  stream: true,
});
for await (const event of stream) {
  if (event.type === 'content_block_delta') {
    process.stdout.write(event.delta.text);
  }
}

Step 5: Handling Multi-turn Conversations

To maintain a conversation, include the full message history in each request. The API is stateless, so you must send all previous messages.

conversation = [
    {"role": "user", "content": "What is the tallest mountain?"},
    {"role": "assistant", "content": "Mount Everest, at 8,848 meters (29,029 feet)."},
    {"role": "user", "content": "Which continent is it on?"}
]
message = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=conversation
)
print(message.content[0].text)
Output: "Mount Everest is located in Asia, on the border between Nepal and Tibet (China)."

Step 6: Error Handling

Always handle potential errors gracefully. Common errors include:

401 Unauthorized: Invalid or missing API key
429 Too Many Requests: Rate limit exceeded
500 Internal Server Error: Temporary server issue

Python error handling

from anthropic import APIError, APIConnectionError, RateLimitError
try:
    message = client.messages.create(
        model="claude-3-5-sonnet-20241022",
        max_tokens=1024,
        messages=[{"role": "user", "content": "Hello"}]
    )
except RateLimitError as e:
    print(f"Rate limited: {e}")
except APIError as e:
    print(f"API error: {e}")
except APIConnectionError as e:
    print(f"Connection error: {e}")

Best Practices

Use environment variables for your API key – never hardcode it.
Implement retry logic with exponential backoff for transient errors.
Set appropriate max_tokens to control costs and response length.
Use streaming for interactive applications to reduce perceived latency.
Keep conversation history within token limits (Claude's context window varies by model; for claude-3-5-sonnet, it's 200K tokens).
Monitor your usage in the Anthropic Console to avoid unexpected charges.

Next Steps

Now that you have the basics, explore more advanced features:

Tool use: Let Claude call external APIs or functions.
Vision: Send images for Claude to analyze.
JSON mode: Get structured JSON responses.
Batch processing: Send multiple requests efficiently.

Check the official Anthropic documentation for full details.

Key Takeaways

The Claude API uses a simple Messages endpoint – send messages, get responses.
Always authenticate with your API key via the x-api-key header or client library.
Streaming provides real-time token-by-token output, ideal for chat apps.
Maintain conversation context by sending the full message history with each request.
Follow best practices: use environment variables, implement retry logic, and monitor your usage.