
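New: Claude Sonnet 5 - released June 30, 2026

Claude Sonnet 5: a mid-tier model built for agentic coding

The definitive guide to claude-sonnet-5, the new default model on the Free and Pro plans. 1M context, 128K output, and performance that Anthropic describes as close to Opus 4.8 at roughly 40% lower price.

#Sonnet 5 #claude-sonnet-5 #agents #coding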

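Claude Sonnet 5 at a glance

claude-sonnet-5

Model ID

Pinned dateless snapshot, released June 30, 2026. No -v1 suffix.

1M / 128K

Context and max output

1M-token context (the default is also the maximum); max output 128K (up to 300K with the batches beta header).

$2 / $10

Introductory pricing

$2 input / $10 output per MTok through August 31, 2026; standard $3/$15 from September 1, 2026.

≈ Opus 4.8

Positioning

Mid-tier model that Anthropic describes as close to Opus 4.8, sitting between Haiku 4.5 and Opus 4.8.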

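At a glance: full specifications

The verified facts about claude-sonnet-5 in one table, taken directly from Anthropic's model documentation.

Spec	Claude Sonnet 5
Model ID	claude-sonnet-5 (Bedrock: anthropic.claude-sonnet-5; OpenRouter: anthropic/claude-sonnet-5-20260630)
Release date	June 30, 2026
Context window	1,000,000 tokens (both default and maximum; no smaller variant)
Max output	128K tokens (up to 300K with the output-300k-2026-03-24 Message Batches beta header)
Introductory pricing (through August 31, 2026)	Input $2 / MTok, output $10 / MTok; cache reads $0.20 / MTok
Standard pricing (from September 1, 2026)	Input $3 / MTok, output $15 / MTok; cache reads $0.30 / MTok
Thinking and effort	Adaptive thinking on by default; effort levels low / medium / high / xhigh / max (default high). Manual thinking or non-default temperature/top_p/top_k returns HTTP 400.
Knowledge cutoff	January 2026
Availability	Default model on Free and Pro; Max/Team/Enterprise; Claude Code; Claude API; Amazon Bedrock; Google Cloud Vertex AI; Microsoft Foundry; GitHub Copilot; OpenRouter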
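The context and output limits in the table can be turned into a quick pre-flight check. The following is a minimal sketch, not an official validator: the function name is hypothetical, "128K" and "300K" are assumed to mean 128,000 and 300,000 tokens, and it assumes that input plus requested output must fit inside the context window.

```python
CONTEXT_WINDOW = 1_000_000       # tokens; default and maximum per the table
MAX_OUTPUT = 128_000             # assumed exact value of "128K"
MAX_OUTPUT_BATCH_BETA = 300_000  # assumed exact value of "300K" (batches beta header)


def check_limits(input_tokens: int, max_tokens: int, batch_beta: bool = False) -> list[str]:
    """Return a list of problems; an empty list means the request fits."""
    problems = []
    output_cap = MAX_OUTPUT_BATCH_BETA if batch_beta else MAX_OUTPUT
    if max_tokens > output_cap:
        problems.append(f"max_tokens {max_tokens} exceeds output cap {output_cap}")
    # Assumption: input and requested output share the same context window.
    if input_tokens + max_tokens > CONTEXT_WINDOW:
        problems.append("input plus requested output exceeds the context window")
    return problems


if __name__ == "__main__":
    print(check_limits(10_000, 8_000))
    print(check_limits(10_000, 200_000))
    print(check_limits(10_000, 200_000, batch_beta=True))
```

Running it locally before sending a request is cheaper than discovering the limit from an API error, but the API remains the source of truth for the exact thresholds.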

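Where Sonnet 5 is strong - and where Opus 4.8 still leads

Anthropic positions Sonnet 5 as the best combination of speed and intelligence. It approaches Opus 4.8 but does not replace it. Here is an honest breakdown.

Where Sonnet 5 is strong

High-throughput agentic coding

Fast tool loops, refactors and multi-file edits where you need high quality at a lower per-token cost than Opus.

Speed-sensitive interactive work

Chat, pair programming and IDE assistance, where low latency matters as much as reasoning depth.

Long-context tasks on a budget

The full 1M-token window is available by default, so large codebases and documents fit without paying Opus prices.

Everyday default workhorse

As the new default on Free and Pro, it handles most general and coding requests well.

Where Opus 4.8 still leads

The hardest coding tasks

Deep, ambiguous, long-horizon engineering problems are still better served by Opus 4.8's extra headroom.

High-stakes judgment

Nuanced reasoning, difficult trade-offs and careful review benefit from Opus 4.8's top-tier depth.

Cybersecurity and adversarial work

Opus 4.8 keeps its lead on the most demanding security and red-team-style reasoning.

Absolute peak quality

When you need the best answer regardless of price, Opus 4.8 remains the flagship.

Anthropic has not published exact benchmark numbers for Sonnet 5; it says only that performance is qualitatively close to Opus 4.8. Sonnet 5 approaches Opus 4.8; it does not match or beat it.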

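Pricing summary

Always plan for both the introductory and the standard price. The introductory price is temporary, not permanent.

Through August 31, 2026

$2 / $10

Introductory price

$2 per million input tokens and $10 per million output tokens. Cache reads are $0.20 per million tokens. This is a launch promotion, not the long-term price.

From September 1, 2026

$3 / $15

Standard price

$3 per million input tokens and $15 per million output tokens. Cache reads are $0.30 per million tokens. Budget at this rate for any workload that runs past August.

Cache reads

$0.20 / $0.30

Prompt caching

Cache reads are $0.20 (intro) / $0.30 (standard) per million tokens. A 5-minute cache write costs 1.25x base input; a 1-hour cache write costs 2x base input.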
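To see how the two price periods and the cache multipliers combine, here is a small cost estimator. It is a sketch built only from the numbers in this summary; the function and parameter names are hypothetical, and it assumes the switch happens at the calendar-date boundary (the exact time zone of the changeover is not stated here).

```python
from datetime import date

# USD per million tokens, from the pricing summary.
INTRO = {"input": 2.00, "output": 10.00, "cache_read": 0.20}
STANDARD = {"input": 3.00, "output": 15.00, "cache_read": 0.30}
INTRO_END = date(2026, 8, 31)  # last day of introductory pricing


def rates_on(day: date) -> dict:
    """Pick the rate card that applies on a given calendar day."""
    return INTRO if day <= INTRO_END else STANDARD


def request_cost(day: date, input_tokens: int = 0, output_tokens: int = 0,
                 cache_read_tokens: int = 0, cache_write_5m_tokens: int = 0,
                 cache_write_1h_tokens: int = 0) -> float:
    """Estimated cost in USD for one request."""
    r = rates_on(day)
    total = (
        input_tokens * r["input"]
        + output_tokens * r["output"]
        + cache_read_tokens * r["cache_read"]
        + cache_write_5m_tokens * r["input"] * 1.25  # 5-minute cache write
        + cache_write_1h_tokens * r["input"] * 2.0   # 1-hour cache write
    )
    return total / 1_000_000


if __name__ == "__main__":
    for day in (date(2026, 8, 31), date(2026, 9, 1)):
        cost = request_cost(day, input_tokens=1_000_000, output_tokens=100_000)
        print(day.isoformat(), f"${cost:.2f}")
```

The same 1M-input, 100K-output request costs $3.00 on the last introductory day and $4.50 the day after, which is why budgets for work past August belong on the standard rate card.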

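⚠️

Tokenizer warning - read before comparing against Sonnet 4.6

Sonnet 5 uses a new tokenizer: the same text consumes roughly 30% more tokens than on Sonnet 4.6. On identical text, the $2/$10 introductory price is therefore roughly on par with Sonnet 4.6's $3/$15: it works out to about $2.60/$13 per million Sonnet 4.6-equivalent tokens, around 13% cheaper rather than the 33% the list price suggests. At the standard $3/$15 rate, the same text costs about 30% more than on Sonnet 4.6. Compare real request costs, not list prices.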
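The tokenizer adjustment is simple arithmetic, sketched below. The 1.30 factor is the approximate figure quoted above, not a measured constant, and the helper names are hypothetical; measure the ratio on your own prompts before relying on it.

```python
TOKEN_INFLATION = 1.30  # approx. Sonnet 5 tokens per Sonnet 4.6 token on the same text


def effective_price(list_price_per_mtok: float, inflation: float = TOKEN_INFLATION) -> float:
    """List price restated per million Sonnet 4.6-equivalent tokens."""
    return list_price_per_mtok * inflation


def change_vs_sonnet_46(sonnet5_price: float, sonnet46_price: float) -> float:
    """Relative cost change on identical text (negative means cheaper)."""
    return effective_price(sonnet5_price) / sonnet46_price - 1.0


if __name__ == "__main__":
    print(f"intro input:    {change_vs_sonnet_46(2.00, 3.00):+.0%}")
    print(f"standard input: {change_vs_sonnet_46(3.00, 3.00):+.0%}")
```

Under the 30% assumption, the introductory input price lands about 13% below Sonnet 4.6 on the same text, and the standard price about 30% above it. The output side scales the same way because the $10 and $15 rates keep the same ratio to Sonnet 4.6's $15.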

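Which Claude model fits which workload

A simple routing framework for the current Claude lineup. Match the model to the task instead of defaulting to the biggest one.

Haiku 4.5

Fastest and cheapest

Choose Haiku 4.5 for high-throughput, latency-sensitive, low-complexity tasks: classification, extraction, routing and simple edits.

⭐ Sonnet 5

Best combination of speed and intelligence

Sonnet 5 is the default workhorse for most agentic coding and general work: strong quality, fast, and close to Opus at roughly 40% lower price.

Opus 4.8

Peak reasoning

Escalate to Opus 4.8 ($5/$25) when you need the flagship: the hardest coding, high-stakes judgment and cybersecurity.

Fable 5

Specialist flagship

Fable 5 ($10/$50, 1M context, 128K output) targets its own specialist workloads. Use it when its particular strengths fit.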
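The routing framework above can be expressed as a small lookup. This is an illustrative sketch only: the task labels are hypothetical categories invented for the example, the function returns display names rather than API model ids (ids for models other than Sonnet 5 are not given here), and Fable 5 is left as an explicit opt-in because its specialist workloads are not spelled out.

```python
# Hypothetical task labels, grouped by the tiers described above.
HAIKU_TASKS = {"classification", "extraction", "routing", "simple_edit"}
OPUS_TASKS = {"hardest_coding", "high_stakes_judgment", "cybersecurity"}


def pick_model(task: str, use_specialist: bool = False) -> str:
    """Return the display name of the model tier suggested for a task label."""
    if use_specialist:
        return "Fable 5"      # explicit opt-in only; criteria are workload-specific
    if task in HAIKU_TASKS:
        return "Haiku 4.5"    # fastest and cheapest
    if task in OPUS_TASKS:
        return "Opus 4.8"     # peak reasoning
    return "Sonnet 5"         # default workhorse for everything else


if __name__ == "__main__":
    for task in ("extraction", "multi_file_refactor", "cybersecurity"):
        print(task, "->", pick_model(task))
```

Defaulting to Sonnet 5 and escalating only on named categories mirrors the advice to match the model to the task rather than reaching for the largest one.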

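How to use Sonnet 5 on QCode

QCode gives you claude-sonnet-5 through a single API, with no separate Anthropic account to manage.

Point at claude-sonnet-5

Set model to claude-sonnet-5 in any Anthropic-compatible request. The dateless snapshot means there is no date suffix to track.

Adaptive thinking by default

Thinking turns on automatically at effort=high. Tune it with the effort levels low / medium / high / xhigh / max; do not send a manual thinking block.

Use it in Claude Code

Select Sonnet 5 as your Claude Code model for fast agentic loops, then escalate to Opus 4.8 only for the hardest steps.

Skip unsupported parameters

Omit temperature, top_p and top_k. Non-default values return HTTP 400, the same rule as on Opus 4.7+.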

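python - Anthropic-compatible call

# Minimal Sonnet 5 request
import anthropic

# Reads ANTHROPIC_API_KEY from the environment; pass base_url=... for a compatible gateway
client = anthropic.Anthropic()

response = client.messages.create(
    model="claude-sonnet-5",
    max_tokens=8000,
    messages=[{"role": "user", "content": "Refactor this module"}],
    # adaptive thinking is ON by default (effort="high")
    # do NOT pass thinking={"type": "enabled"} or temperature -> HTTP 400
)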
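If request parameters are assembled elsewhere in your code, one defensive option is to strip the rejected keys before calling the API. The helper below is a sketch with a hypothetical name; it drops the keys unconditionally, which is stricter than the "non-default values" rule described above but matches the advice to omit them.

```python
REJECTED_KEYS = ("thinking", "temperature", "top_p", "top_k")


def sanitize_params(params: dict) -> dict:
    """Return a copy of params without the keys Sonnet 5 rejects with HTTP 400."""
    return {key: value for key, value in params.items() if key not in REJECTED_KEYS}


if __name__ == "__main__":
    legacy = {
        "model": "claude-sonnet-5",
        "max_tokens": 8000,
        "temperature": 0.2,
        "thinking": {"type": "enabled"},
        "messages": [{"role": "user", "content": "Refactor this module"}],
    }
    print(sorted(sanitize_params(legacy)))
```

The original dictionary is left untouched, so the same parameters can still be sent to an older model that accepts them.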

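Migrating from Sonnet 4.6

Sonnet 5 is the recommended successor and the new default, but there is no forced migration and no deadline.

Sonnet 4.6 is not retired

claude-sonnet-4-6 is Active, with a tentative retirement date no earlier than February 17, 2027. You can keep using it; Sonnet 5 is simply the recommended successor when you decide to migrate.

1. Swap the model id

Change claude-sonnet-4-6 to claude-sonnet-5, then recheck your output token budgets: the new tokenizer produces roughly 30% more tokens for the same text.

2. Remove legacy parameters

Remove any manual thinking block and any non-default temperature/top_p/top_k. Rely on adaptive thinking and the effort parameter instead.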
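Step 1 can be sketched as a small transformation on a request dictionary. This is an assumption-laden illustration rather than a migration tool: the helper name is hypothetical, the 30% factor is the approximate tokenizer figure quoted above, and the 128,000 cap assumes "128K" means exactly that. It covers the id swap and budget rescale only and leaves every other key as it was.

```python
OLD_MODEL = "claude-sonnet-4-6"
NEW_MODEL = "claude-sonnet-5"
MAX_OUTPUT = 128_000  # assumed exact value of the 128K output cap


def migrate_request(params: dict) -> dict:
    """Swap the model id and scale max_tokens by ~1.3x (rounded up, capped)."""
    migrated = dict(params)  # shallow copy; the caller's dict is not modified
    if migrated.get("model") != OLD_MODEL:
        return migrated
    migrated["model"] = NEW_MODEL
    if "max_tokens" in migrated:
        # Integer ceil of max_tokens * 13 / 10 avoids float rounding surprises.
        scaled = (migrated["max_tokens"] * 13 + 9) // 10
        migrated["max_tokens"] = min(scaled, MAX_OUTPUT)
    return migrated


if __name__ == "__main__":
    print(migrate_request({"model": OLD_MODEL, "max_tokens": 8000}))
```

Scaling the budget keeps a response that fit in 8,000 Sonnet 4.6 tokens from being truncated under the new tokenizer; whether you want the larger budget or a shorter response is a cost decision the sketch does not make for you.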

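FAQ

Is Claude Sonnet 5 available on the free plan?

Yes. Claude Sonnet 5 is the default model on the Free and Pro plans. It is also available on the Max, Team and Enterprise plans, and in Claude Code, the Claude API, Amazon Bedrock, Google Cloud Vertex AI, Microsoft Foundry, GitHub Copilot and OpenRouter.

How much does Claude Sonnet 5 cost per token?

Introductory pricing is $2 per million input tokens and $10 per million output tokens through August 31, 2026. From September 1, 2026 the standard rate is $3 input and $15 output per million tokens. Cache reads are $0.20 (intro) / $0.30 (standard) per million tokens; a 5-minute cache write costs 1.25x base input and a 1-hour cache write costs 2x base input.

When does the Claude Sonnet 5 introductory price end?

The introductory $2/$10 pricing runs through August 31, 2026. On September 1, 2026 pricing moves to the standard $3 input / $15 output per million tokens. Plan any budget that extends past August around the standard rate.

What is the Claude Sonnet 5 context window?

Claude Sonnet 5 has a 1,000,000 (1M) token context window. That figure is both the default and the maximum; there is no smaller-context variant to opt into.

What is the maximum output for Claude Sonnet 5?

Maximum output is 128K tokens, or up to 300K tokens when you send the Message Batches beta header output-300k-2026-03-24.

Is Claude Sonnet 5 better than Opus 4.8?

No. Anthropic positions Sonnet 5 as a mid-tier model whose performance is close to Opus 4.8 at roughly 40% lower price, but Opus 4.8 still leads on the hardest coding, judgment and cybersecurity tasks. Sonnet 5 approaches Opus 4.8; it does not match or beat it.

What is the Claude Sonnet 5 model id?

The model id is claude-sonnet-5, a pinned dateless snapshot with no -v1 suffix. On Amazon Bedrock it is anthropic.claude-sonnet-5, and on OpenRouter the slug is anthropic/claude-sonnet-5-20260630.

Can I use Claude Sonnet 5 in Claude Code?

Yes. Claude Sonnet 5 is available in Claude Code as well as the Claude API and every major cloud platform. Adaptive thinking is on by default, with effort levels low, medium, high, xhigh and max (default high). Manual extended thinking and non-default temperature/top_p/top_k return HTTP 400, so omit them.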

Start building with Claude Sonnet 5 on QCode

Get claude-sonnet-5 and the full Claude lineup (Haiku 4.5, Opus 4.8 and Fable 5) through one simple API.
