Mastering Claude's Stop Reasons: A Practical Guide for API Developers
This guide explains the four stop_reason values in Claude's API responses—end_turn, max_tokens, tool_use, and stop_sequence—and shows you how to handle each one in your application logic, including how to prevent and recover from empty responses.
When you build applications with Claude's Messages API, every successful response includes a stop_reason field. This small but critical piece of data tells you why Claude stopped generating its response. Understanding these values is essential for creating robust, production-ready applications that handle different scenarios gracefully.
In this guide, we'll break down each stop reason, show you how to handle them in code, and cover common pitfalls—especially the tricky "empty response" problem that can occur with tool use.
What Is stop_reason?
The stop_reason field appears in every successful API response from Claude. Unlike errors (which indicate something went wrong processing your request), stop_reason tells you why Claude successfully finished generating its response. It's your application's signal for what to do next.
Here's a typical response structure:
{
  "id": "msg_01234",
  "type": "message",
  "role": "assistant",
  "content": [
    {
      "type": "text",
      "text": "Here's the answer to your question..."
    }
  ],
  "stop_reason": "end_turn",
  "stop_sequence": null,
  "usage": {
    "input_tokens": 100,
    "output_tokens": 50
  }
}
The Four Stop Reason Values
Claude can return one of four stop_reason values. Let's examine each one.
1. end_turn – Natural Completion
What it means: Claude finished its response naturally. The model decided it had said everything it needed to say and ended its turn.
When it occurs: This is the most common stop reason. You'll see it for simple Q&A, single-turn conversations, or when Claude completes a task without needing additional tools.
How to handle it: In most cases, you can process the response content directly and present it to the user.
from anthropic import Anthropic

client = Anthropic()

response = client.messages.create(
    model="claude-sonnet-4-20250514",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}]
)

if response.stop_reason == "end_turn":
    # Process the complete response
    print(response.content[0].text)
2. max_tokens – Token Limit Reached
What it means: Claude stopped because it reached the max_tokens limit you set in your request. The response may be cut off mid-sentence.
When it occurs: Your max_tokens value is too low for the expected response length, or Claude is generating a very long response.
How to handle it: You should continue the conversation by sending Claude's response back along with a prompt to continue. This is often called a "continuation loop."
def handle_max_tokens(client, messages, response):
    if response.stop_reason == "max_tokens":
        # Add Claude's response to the conversation
        messages.append({"role": "assistant", "content": response.content})
        # Add a continuation prompt
        messages.append({"role": "user", "content": "Please continue."})
        # Make a new request
        return client.messages.create(
            model="claude-sonnet-4-20250514",
            max_tokens=1024,
            messages=messages
        )
    return response
3. tool_use – Tool Call Requested
What it means: Claude wants to use a tool (function call). The response content will contain one or more tool_use blocks.
When it occurs: You've provided tools in your request, and Claude determined it needs to call one to complete the task.
How to handle it: You must execute the tool, then send the results back to Claude in a tool_result block.
def handle_tool_use(client, messages, response, tools):
    if response.stop_reason == "tool_use":
        # Add Claude's response to messages
        messages.append({"role": "assistant", "content": response.content})
        # Process each tool use block
        tool_results = []
        for block in response.content:
            if block.type == "tool_use":
                # execute_tool is your own dispatch function
                result = execute_tool(block.name, block.input)
                tool_results.append({
                    "type": "tool_result",
                    "tool_use_id": block.id,
                    "content": str(result)
                })
        # Send results back to Claude
        messages.append({"role": "user", "content": tool_results})
        # Continue the conversation; re-send the tool definitions, since
        # requests containing tool blocks must include the tools parameter
        return client.messages.create(
            model="claude-sonnet-4-20250514",
            max_tokens=1024,
            messages=messages,
            tools=tools
        )
    return response
4. stop_sequence – Custom Stop Sequence Triggered
What it means: Claude encountered one of the custom stop_sequences you specified in your API request.
When it occurs: You've defined stop sequences (e.g., ["\n\nHuman:", "\n\nAssistant:"]) and Claude generated one of them.
How to handle it: The response is complete up to the stop sequence. You can process it as-is or continue if needed.
response = client.messages.create(
    model="claude-sonnet-4-20250514",
    max_tokens=1024,
    messages=[{"role": "user", "content": "List 5 facts about Mars."}],
    stop_sequences=["\n\nHuman:"]
)

if response.stop_reason == "stop_sequence":
    print(f"Stopped at sequence: {response.stop_sequence}")
    print(response.content[0].text)
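If you want to resume generation past the point where the sequence fired, one approach is to append Claude's partial text as an assistant turn and ask it to continue. The matched stop sequence itself is not included in the response content. This is a sketch of the message construction only (the helper name is ours, and no API call is made here):

```python
# Sketch: building a follow-up request after a stop_sequence fired.
# Pure message construction, no API call.

def build_continuation(messages, partial_text):
    """Return a new message list that resumes a stopped response."""
    continued = list(messages)  # don't mutate the caller's list
    continued.append({"role": "assistant", "content": partial_text})
    continued.append({
        "role": "user",
        "content": "Please continue from where you stopped."
    })
    return continued
```

You would then pass the returned list to a fresh messages.create call, with or without the same stop_sequences depending on whether you want the sequence to fire again.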
Handling Empty Responses with end_turn
One of the most confusing scenarios developers encounter is when Claude returns an empty response (2-3 tokens, no content) with stop_reason: "end_turn". This typically happens in tool-use workflows.
Why It Happens
Claude learns from patterns. If you consistently add text blocks immediately after tool_result blocks, Claude learns to expect that pattern and may end its turn prematurely, assuming you'll insert text.
How to Prevent It
Don't add text after tool results. Send only the tool_result block without any additional text:
# INCORRECT: Adding text after tool_result
messages = [
    {"role": "user", "content": "Calculate 1234 + 5678"},
    {"role": "assistant", "content": [{
        "type": "tool_use",
        "id": "toolu_123",
        "name": "calculator",
        "input": {"operation": "add", "a": 1234, "b": 5678}
    }]},
    {"role": "user", "content": [
        {"type": "tool_result", "tool_use_id": "toolu_123", "content": "6912"},
        {"type": "text", "text": "Here's the result"}  # Don't do this!
    ]}
]
# CORRECT: Just the tool_result
messages = [
    {"role": "user", "content": "Calculate 1234 + 5678"},
    {"role": "assistant", "content": [{
        "type": "tool_use",
        "id": "toolu_123",
        "name": "calculator",
        "input": {"operation": "add", "a": 1234, "b": 5678}
    }]},
    {"role": "user", "content": [
        {"type": "tool_result", "tool_use_id": "toolu_123", "content": "6912"}
    ]}
]
How to Recover from Empty Responses
If you still get empty responses after fixing the pattern above, use a continuation prompt in a new user message:
def handle_empty_response(client, messages):
    response = client.messages.create(
        model="claude-opus-4-20250514",
        max_tokens=1024,
        messages=messages
    )
    if response.stop_reason == "end_turn" and not response.content:
        # Add a continuation prompt in a NEW user message
        messages.append({
            "role": "user",
            "content": "Please provide your response now."
        })
        return client.messages.create(
            model="claude-opus-4-20250514",
            max_tokens=1024,
            messages=messages
        )
    return response
Important: Don't just retry with the same messages—Claude already decided it's done, so it will remain done. You need to add new context.
Building a Complete Handler
For production applications, you'll want a function that handles all stop reasons:
def handle_claude_response(client, messages, tools=None):
    # Only pass tools when we actually have them; an omitted parameter
    # is not the same as an explicit None to the SDK.
    kwargs = {"tools": tools} if tools else {}
    response = client.messages.create(
        model="claude-sonnet-4-20250514",
        max_tokens=1024,
        messages=messages,
        **kwargs
    )

    if response.stop_reason == "end_turn":
        if not response.content:
            # Handle empty response
            messages.append({
                "role": "user",
                "content": "Please provide your response."
            })
            return handle_claude_response(client, messages, tools)
        return response.content[0].text

    elif response.stop_reason == "max_tokens":
        messages.append({"role": "assistant", "content": response.content})
        messages.append({"role": "user", "content": "Please continue."})
        return handle_claude_response(client, messages, tools)

    elif response.stop_reason == "tool_use":
        messages.append({"role": "assistant", "content": response.content})
        tool_results = []
        for block in response.content:
            if block.type == "tool_use":
                result = execute_tool(block.name, block.input)
                tool_results.append({
                    "type": "tool_result",
                    "tool_use_id": block.id,
                    "content": str(result)
                })
        messages.append({"role": "user", "content": tool_results})
        return handle_claude_response(client, messages, tools)

    elif response.stop_reason == "stop_sequence":
        return response.content[0].text
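One caveat: the recursive handler above has no depth limit, so a misbehaving tool or repeated truncation could loop indefinitely. A defensive pattern is to cap the number of rounds. Here's a minimal sketch; `get_response` is a stand-in for the client.messages.create call (not a real SDK function), and responses are shown as plain dicts for brevity:

```python
# Sketch: bounding how many rounds a response loop may run, so a
# continuation loop that never reaches a final answer fails loudly.

def run_with_cap(get_response, messages, max_rounds=5):
    for _ in range(max_rounds):
        response = get_response(messages)
        reason = response["stop_reason"]
        if reason in ("end_turn", "stop_sequence") and response["content"]:
            return response["content"][0]["text"]
        # For brevity, treat any other outcome as "ask Claude to continue";
        # a real handler would branch per stop reason as shown above.
        messages.append({"role": "user", "content": "Please continue."})
    raise RuntimeError(f"no final response after {max_rounds} rounds")
```

The cap doesn't change the per-reason logic; it simply converts a silent infinite loop into an explicit error you can alert on.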
Best Practices
- Always check stop_reason – Don't assume end_turn means you have content. Always verify the content array isn't empty.
- Implement continuation loops – For max_tokens, always continue the conversation rather than truncating responses.
- Keep tool results clean – Never add text after tool_result blocks. Let Claude generate its own commentary.
- Log stop reasons – In production, log stop_reason values to monitor for unexpected patterns.
- Set appropriate max_tokens – Estimate response length and set max_tokens generously to avoid unnecessary continuation calls.
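To make the logging practice concrete, here is a minimal sketch using only the standard library (the helper name and the truncation warning are illustrative choices, not an official pattern):

```python
from collections import Counter

# Sketch: tally stop_reason values so unexpected patterns (e.g. a spike
# in max_tokens truncations) surface in monitoring dashboards or logs.
stop_reason_counts = Counter()

def record_stop_reason(reason):
    """Count one stop_reason occurrence and flag truncations."""
    stop_reason_counts[reason] += 1
    if reason == "max_tokens":
        print("warning: response truncated; consider raising max_tokens")
```

In production you would feed these counts into your metrics system instead of a module-level Counter, but the idea is the same: record every stop_reason, and alert when the distribution shifts.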
Key Takeaways
- Four stop reasons exist: end_turn (natural completion), max_tokens (limit reached), tool_use (tool call requested), and stop_sequence (custom sequence triggered).
- Handle each reason differently: Continue on max_tokens, execute tools on tool_use, process normally on end_turn and stop_sequence.
- Empty responses with end_turn are usually caused by adding text after tool_result blocks—send only the tool result.
- Never retry the same messages for empty responses; add a continuation prompt in a new user message.
- Build a unified handler that recursively processes responses until you get a final end_turn with content.