The Anthropic API provides programmatic access to Claude, Anthropic's family of advanced large language models, enabling developers to integrate state-of-the-art AI capabilities into applications. Available through direct API access, AWS Bedrock, Google Vertex AI, and Microsoft Foundry, the API supports text generation, vision, function calling, streaming responses, server-side tools, and agentic workflows. As of April 2026, Claude Opus 4.7 is the flagship model with a 1M token context window, adaptive thinking, and step-change gains in agentic coding; the 1M context window is now standard pricing with no surcharge across all Claude 4.x models. The API follows a stateless message-based architecture, and new offerings like Claude Managed Agents (public beta April 2026) provide full hosted infrastructure for production agent deployments at $0.08 per session-hour.
What This Cheat Sheet Covers
This topic spans 14 focused tables and 112 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.
Table 1: Claude Models
| Model | Example | Description |
|---|---|---|
model: "claude-opus-4-7"1M context, 128K output | β’ Current flagship GA model (released April 16, 2026) β’ step-change in agentic coding, vision, and knowledge work β’ adaptive thinking only (no manual budget_tokens) β’ temperature/top_p/top_k removed (400 error if set) β’ new tokenizer uses up to 35% more tokens than prior models β’ 5/25 per million input/output tokens. | |
model: "claude-sonnet-4-6"1M context, 64K output | β’ Best speed/intelligence balance; 1M context GA as of March 13, 2026 β’ 30β50% faster than Sonnet 4.5 β’ supports adaptive thinking and extended thinking β’ 3/15 per million tokensβ’ default for most production use cases. | |
model: "claude-haiku-4-5"200K context, 64K output | β’ Fastest and most cost-effective at 1/5 per million tokensβ’ supports extended thinking β’ no prompt injection protection β review before agentic use β’ ideal for high-volume, low-latency tasks. | |
model: "claude-opus-4-6"1M context, 128K output | β’ Previous flagship; still available at same price (5/25)β’ supports adaptive + manual extended thinking β’ fast mode beta available (6Γ price) β’ superseded by Opus 4.7 for most use cases. |