GuideBeginnerAgents2026-05-15

Mastering Claude's Stop Reasons: Build Robust API Applications

Learn how to interpret and handle Claude's stop_reason field in the Messages API. Includes code examples for end_turn, tool_use, max_tokens, and error handling strategies.

Quick Answer

Learn to interpret Claude's stop_reason field (end_turn, tool_use, max_tokens, stop_sequence) and handle each case correctly in your application, including preventing empty responses and managing tool call flows.

Messages APIstop_reasonerror handlingtool usebest practices

Mastering Claude's Stop Reasons: Build Robust API Applications

When building applications with Claude's Messages API, understanding why the model stopped generating its response is essential for creating reliable, production-ready systems. The stop_reason field in every API response tells you exactly why Claude finished—and knowing how to handle each case can mean the difference between a smooth user experience and a broken workflow.

In this guide, you'll learn what each stop reason means, how to handle them in code, and how to avoid common pitfalls like empty responses.

What Is `stop_reason`?

The stop_reason field is part of every successful Messages API response. Unlike errors (which indicate something went wrong with your request), stop_reason tells you why Claude successfully completed its response generation. It's your signal for what to do next.

Here's a typical response structure:

{
  "id": "msg_01234",
  "type": "message",
  "role": "assistant",
  "content": [
    {
      "type": "text",
      "text": "Here's the answer to your question..."
    }
  ],
  "stop_reason": "end_turn",
  "stop_sequence": null,
  "usage": {
    "input_tokens": 100,
    "output_tokens": 50
  }
}

The Four Stop Reasons

Claude can stop generating for four distinct reasons. Let's explore each one.

`end_turn` – Natural Completion

What it means: Claude finished its response naturally. The model decided it had said everything needed for the current turn. When it happens: This is the most common stop reason. It occurs when Claude has provided a complete answer, asked a clarifying question, or simply finished its thought. How to handle it: In most cases, you can process the response and wait for the next user input.

from anthropic import Anthropic
client = Anthropic()
response = client.messages.create(
    model="claude-sonnet-4-20250514",
    max_tokens=1024,
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
if response.stop_reason == "end_turn":
    # Process the complete response
    print(response.content[0].text)

#### ⚠️ The Empty Response Gotcha

Sometimes Claude returns an empty response (exactly 2–3 tokens with no content) with stop_reason: "end_turn". This typically happens in tool-use scenarios when:

You add text blocks immediately after tool results (Claude learns to expect the user to always insert text after tool results, so it ends its turn to follow the pattern)
You send Claude's completed response back without adding anything (Claude already decided it's done, so it remains done)

How to prevent empty responses:

# INCORRECT: Adding text immediately after tool_result
messages = [
    {"role": "user", "content": "Calculate the sum of 1234 and 5678"},
    {"role": "assistant", "content": [
        {
            "type": "tool_use",
            "id": "toolu_123",
            "name": "calculator",
            "input": {"operation": "add", "a": 1234, "b": 5678}
        }
    ]},
    {"role": "user", "content": [
        {"type": "tool_result", "tool_use_id": "toolu_123", "content": "6912"},
        {"type": "text", "text": "Here's the result"}  # ❌ Don't add text after tool_result
    ]}
]
CORRECT: Send tool results directly without additional text
messages = [
    {"role": "user", "content": "Calculate the sum of 1234 and 5678"},
    {"role": "assistant", "content": [
        {
            "type": "tool_use",
            "id": "toolu_123",
            "name": "calculator",
            "input": {"operation": "add", "a": 1234, "b": 5678}
        }
    ]},
    {"role": "user", "content": [
        {"type": "tool_result", "tool_use_id": "toolu_123", "content": "6912"}
    ]}  # ✅ Just the tool_result, no additional text
]

If you still get empty responses after fixing the above, use a continuation prompt:

def handle_empty_response(client, messages):
    response = client.messages.create(
        model="claude-opus-4-7",
        max_tokens=1024,
        messages=messages
    )
    
    if response.stop_reason == "end_turn" and not response.content:
        # Add a continuation prompt in a NEW user message
        messages.append({"role": "user", "content": "Please continue"})
        response = client.messages.create(
            model="claude-opus-4-7",
            max_tokens=1024,
            messages=messages
        )
    
    return response

`tool_use` – Claude Wants to Use a Tool

What it means: Claude decided to call one or more tools you've provided. The response content will contain tool_use blocks instead of (or in addition to) text. When it happens: When you've defined tools and Claude determines it needs to perform an action—like looking up data, running a calculation, or calling an external API. How to handle it: You must execute the tool, append the result as a tool_result block, and send the conversation back to Claude.

import json
from anthropic import Anthropic
client = Anthropic()
Define a simple calculator tool
tools = [
    {
        "name": "calculator",
        "description": "Perform a mathematical operation",
        "input_schema": {
            "type": "object",
            "properties": {
                "operation": {"type": "string", "enum": ["add", "subtract", "multiply", "divide"]},
                "a": {"type": "number"},
                "b": {"type": "number"}
            },
            "required": ["operation", "a", "b"]
        }
    }
]
messages = [
    {"role": "user", "content": "What is 1234 + 5678?"}
]
response = client.messages.create(
    model="claude-sonnet-4-20250514",
    max_tokens=1024,
    tools=tools,
    messages=messages
)
Check for tool use
if response.stop_reason == "tool_use":
    # Append Claude's response to messages
    messages.append({"role": "assistant", "content": response.content})
    
    # Process each tool use block
    for block in response.content:
        if block.type == "tool_use":
            # Execute the tool (in a real app, you'd call your actual function)
            if block.name == "calculator":
                a = block.input["a"]
                b = block.input["b"]
                if block.input["operation"] == "add":
                    result = a + b
                
                # Append the tool result
                messages.append({
                    "role": "user",
                    "content": [
                        {
                            "type": "tool_result",
                            "tool_use_id": block.id,
                            "content": str(result)
                        }
                    ]
                })
    
    # Send back to Claude for the final response
    final_response = client.messages.create(
        model="claude-sonnet-4-20250514",
        max_tokens=1024,
        tools=tools,
        messages=messages
    )
    print(final_response.content[0].text)

`max_tokens` – Token Limit Reached

What it means: Claude reached the max_tokens limit you set before completing its response. The response was cut off mid-thought. When it happens: When the model needed more tokens than you allocated to finish its response. How to handle it: You have two options:

Increase max_tokens – If you consistently hit this limit, raise the value in your request.
Continue the conversation – Send Claude's partial response back and ask it to continue.

response = client.messages.create(
    model="claude-sonnet-4-20250514",
    max_tokens=1024,  # Try increasing this if you hit the limit
    messages=messages
)
if response.stop_reason == "max_tokens":
    # Option 1: Increase max_tokens and retry
    # Option 2: Continue the conversation
    messages.append({"role": "assistant", "content": response.content})
    messages.append({"role": "user", "content": "Please continue from where you left off."})
    
    response = client.messages.create(
        model="claude-sonnet-4-20250514",
        max_tokens=2048,  # Increased limit
        messages=messages
    )

`stop_sequence` – Custom Stop Sequence Triggered

What it means: Claude encountered one of the custom stop_sequences you specified in your API request. When it happens: When you've defined specific strings (like "\n\nHuman:" or "<END>") that signal Claude to stop generating. How to handle it: The response is complete up to the stop sequence. The stop_sequence field in the response will tell you which sequence was triggered.

response = client.messages.create(
    model="claude-sonnet-4-20250514",
    max_tokens=1024,
    stop_sequences=["<END>", "\n\nHuman:"],
    messages=messages
)
if response.stop_reason == "stop_sequence":
    print(f"Stopped at sequence: {response.stop_sequence}")
    # The response content is complete up to the stop sequence
    print(response.content[0].text)

Building a Complete Handler

Here's a robust handler that manages all stop reasons in a single function:

def handle_claude_response(client, messages, tools=None, max_tokens=1024):
    """
    Handle Claude's response and manage all stop reasons.
    Returns the final response after processing any tool calls.
    """
    response = client.messages.create(
        model="claude-sonnet-4-20250514",
        max_tokens=max_tokens,
        tools=tools,
        messages=messages
    )
    
    while True:
        if response.stop_reason == "end_turn":
            # Natural completion
            if not response.content:
                # Handle empty response
                messages.append({"role": "user", "content": "Please continue"})
                response = client.messages.create(
                    model="claude-sonnet-4-20250514",
                    max_tokens=max_tokens,
                    tools=tools,
                    messages=messages
                )
                continue
            return response
            
        elif response.stop_reason == "tool_use":
            # Process tool calls
            messages.append({"role": "assistant", "content": response.content})
            for block in response.content:
                if block.type == "tool_use":
                    # Execute tool (implement your tool execution logic)
                    result = execute_tool(block.name, block.input)
                    messages.append({
                        "role": "user",
                        "content": [{
                            "type": "tool_result",
                            "tool_use_id": block.id,
                            "content": str(result)
                        }]
                    })
            # Continue the conversation
            response = client.messages.create(
                model="claude-sonnet-4-20250514",
                max_tokens=max_tokens,
                tools=tools,
                messages=messages
            )
            continue
            
        elif response.stop_reason == "max_tokens":
            # Token limit reached - continue
            messages.append({"role": "assistant", "content": response.content})
            messages.append({"role": "user", "content": "Please continue."})
            response = client.messages.create(
                model="claude-sonnet-4-20250514",
                max_tokens=max_tokens * 2,  # Increase limit
                tools=tools,
                messages=messages
            )
            continue
            
        elif response.stop_reason == "stop_sequence":
            # Custom stop sequence triggered
            return response

Best Practices Summary

Always check stop_reason – Don't assume the response is complete. Each reason requires different handling.
Never add text after tool_result – This is the most common cause of empty responses.
Use continuation prompts for truncated responses – When max_tokens or empty end_turn occurs, ask Claude to continue.
Log the stop_reason – In production, log this value to debug unexpected behavior and optimize your application.
Test all four scenarios – Ensure your handler works correctly for end_turn, tool_use, max_tokens, and stop_sequence.

Key Takeaways

stop_reason tells you why Claude stopped – end_turn (natural completion), tool_use (wants to call a tool), max_tokens (response was cut off), or stop_sequence (custom stop triggered).
Empty responses with end_turn are preventable – Never add text blocks immediately after tool_result blocks, and use continuation prompts if needed.
tool_use requires a multi-turn flow – Execute the tool, append the result, and send the conversation back to Claude for the final response.
max_tokens means your response was truncated – Increase the limit or continue the conversation to get the complete answer.
Build a unified handler – A single function that processes all stop reasons will make your application more robust and maintainable.

Mastering Claude's Stop Reasons: Build Robust API Applications

What Is stop_reason?

The Four Stop Reasons

end_turn – Natural Completion

CORRECT: Send tool results directly without additional text

tool_use – Claude Wants to Use a Tool

Define a simple calculator tool

Check for tool use

max_tokens – Token Limit Reached

stop_sequence – Custom Stop Sequence Triggered

Building a Complete Handler

Best Practices Summary

Key Takeaways

What Is `stop_reason`?

`end_turn` – Natural Completion

`tool_use` – Claude Wants to Use a Tool

`max_tokens` – Token Limit Reached

`stop_sequence` – Custom Stop Sequence Triggered