Models¶

Available Claude models for your local proxy.

Supported Models¶

Model ID	Description	Context Length	Use Case
`claude-sonnet-4-20250514`	Claude 4 Sonnet	200K tokens	General purpose, latest model
`claude-3-5-sonnet-20241022`	Claude 3.5 Sonnet	200K tokens	High-quality reasoning
`claude-3-7-sonnet-20250219`	Claude 3.7 Sonnet	200K tokens	Enhanced capabilities
`claude-opus-4-20250514`	Claude 4 Opus	200K tokens	Maximum capability
`claude-3-5-haiku-20241022`	Claude 3.5 Haiku	200K tokens	Fast responses

Model Selection Guide¶

For General Use¶

claude-sonnet-4-20250514: Latest and most capable model
claude-3-5-sonnet-20241022: Reliable choice for most tasks

For Speed¶

claude-3-5-haiku-20241022: Fastest response times
claude-3-7-sonnet-20250219: Good balance of speed and quality

For Complex Tasks¶

claude-opus-4-20250514: Maximum reasoning capability
claude-sonnet-4-20250514: Latest advanced features

Model Capabilities¶

All Models Support¶

Text generation and analysis
Code writing and debugging
Mathematical reasoning
Creative writing
Language translation
Summarization
Question answering

Advanced Features¶

Image Analysis: Support for JPEG, PNG, GIF, WebP
Document Processing: Text extraction and analysis
Code Understanding: Multi-language code analysis
Streaming: Real-time response streaming

Usage Examples¶

Model Selection in Requests¶

{
  "model": "claude-sonnet-4-20250514",
  "messages": [
    {
      "role": "user",
      "content": "Write a Python function to calculate fibonacci numbers"
    }
  ],
  "max_tokens": 1000
}

Dynamic Model Selection¶

# Choose model based on task complexity
def get_model_for_task(task_type):
    if task_type == "simple_qa":
        return "claude-3-5-haiku-20241022"
    elif task_type == "code_generation":
        return "claude-3-5-sonnet-20241022"
    elif task_type == "complex_reasoning":
        return "claude-opus-4-20250514"
    else:
        return "claude-sonnet-4-20250514"  # Default to latest

Model Availability¶

Models are available through your Claude subscription. The proxy automatically handles: - Model validation - Subscription checking - Capability routing

Performance Characteristics¶

Model	Speed	Quality	Cost Efficiency
claude-3-5-haiku-20241022	⭐⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐⭐
claude-3-7-sonnet-20250219	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐
claude-3-5-sonnet-20241022	⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐
claude-sonnet-4-20250514	⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐
claude-opus-4-20250514	⭐⭐	⭐⭐⭐⭐⭐	⭐⭐

Error Handling¶

Model Not Found¶

{
  "error": {
    "type": "not_found_error",
    "message": "Model 'invalid-model' not found"
  }
}

Model Unavailable¶

{
  "error": {
    "type": "service_unavailable_error",
    "message": "Model 'claude-opus-4-20250514' temporarily unavailable"
  }
}

Best Practices¶

Start with Haiku for testing and development
Use Sonnet for production applications
Reserve Opus for complex reasoning tasks
Monitor usage to optimize costs
Cache responses when appropriate

Model Updates¶

The proxy automatically supports new Claude models as they become available. Check the /v1/models endpoint for the latest list of supported models.