
Qwen 3.7 Plus

Qwen 3.7 Plus is the Plus-tier model in the Qwen 3.7 generation of Alibaba's Qwen 3 family. It succeeds Qwen3.6-Plus with stronger reasoning, vision input, and a context window of 1M tokens.

Reasoning, Tool Use, Implicit Caching, File Input, Vision (Image)
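index.ts
import { streamText } from 'ai'

// Stream a completion from Qwen 3.7 Plus through AI Gateway.
const result = streamText({
  model: 'alibaba/qwen3.7-plus',
  prompt: 'Why is the sky blue?'
})

// Print the text as it arrives.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}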

Playground

Try out Qwen 3.7 Plus by Alibaba. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

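| Provider | Context | Latency | Throughput | Input | Output | Cache | Web Search (Per Query) | Capabilities | ZDR | No Training | Release Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Alibaba (Legal: Terms, Privacy) | 1M | 2.8s | 55tps | $0.40/M Free | $1.60/M Free | Read: $0.08/M Free; Write: $0.5/M Free | | +3 | | | 06/01/2026 |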
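
As a rough illustration of how the listed rates translate into spend, the sketch below estimates the cost of a single request. It assumes the list prices from the Alibaba row ($0.40/M input, $1.60/M output, $0.08/M cache read) and ignores the "Free" label. It also assumes that cached input tokens are billed at the cache-read rate in place of the input rate. `estimateCostUSD` and the `Usage` shape are hypothetical names for this example, not part of any SDK.

```typescript
// Hypothetical helper: estimate request cost from the list prices above.
// Rates are USD per one million tokens, copied from the Alibaba provider row.
const RATES_PER_MILLION = {
  input: 0.4,
  output: 1.6,
  cacheRead: 0.08,
} as const;

interface Usage {
  inputTokens: number;        // total prompt tokens, including cached ones
  outputTokens: number;       // generated tokens
  cachedInputTokens?: number; // prompt tokens served from cache (assumed subset of inputTokens)
}

function estimateCostUSD(usage: Usage): number {
  const cached = usage.cachedInputTokens ?? 0;
  const uncached = usage.inputTokens - cached;
  // Assumption: cached tokens are billed at the cache-read rate instead of the input rate.
  const total =
    uncached * RATES_PER_MILLION.input +
    cached * RATES_PER_MILLION.cacheRead +
    usage.outputTokens * RATES_PER_MILLION.output;
  return total / 1_000_000;
}

// Example: a 200K-token prompt, half of it cached, with a 2K-token answer.
const example = estimateCostUSD({
  inputTokens: 200_000,
  outputTokens: 2_000,
  cachedInputTokens: 100_000,
});
console.log(example.toFixed(4)); // prints "0.0512"
```

Actual billing should be read from the AI Gateway cost dashboard; this arithmetic is only a planning aid.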
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by Alibaba

| Model | Context | Latency | Throughput | Input | Output | Cache | Web Search (Per Query) | Capabilities | Providers | ZDR | No Training | Release Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| | 991K | 2.7s | 95tps | $1.25/M Free | $3.75/M Free | Read: $0.25/M Free; Write: $1.56/M Free | | +3 | Alibaba | | | 05/21/2026 |
| | 1M | 1.1s | 109tps | $0.50/M | $3.00/M | Read: $0.1/M; Write: $0.63/M | | +3 | Alibaba, Fireworks | | | 04/02/2026 |
| | 1M | 0.7s | 255tps | $0.10/M | $0.40/M | Read: $0.0/M; Write: $0.13/M | | +3 | Alibaba | | | 02/24/2026 |
| | 131K | 0.3s | 307tps | $0.10/M | $0.30/M | Read: $0.14/M; Write: | | | Alibaba, Bedrock, DeepInfra, +1 | | | 04/01/2025 |
| | 262K | 0.3s | 82tps | $0.07/M | $0.46/M | | | | DeepInfra, Novita, Vertex | | | 04/01/2025 |
| | 41K | 0.3s | 52tps | $0.12/M | $0.24/M | | | | DeepInfra | | | 04/01/2025 |

About Qwen 3.7 Plus

Qwen 3.7 Plus is the Plus-tier release in the Qwen 3.7 generation, succeeding Qwen3.6-Plus in Alibaba's production Qwen 3 lineup. It ships with a context window of 1M tokens and is available through Alibaba on AI Gateway.

The Qwen Plus line targets workloads that need deeper reasoning and stronger instruction following than the Flash tier provides, while keeping per-token cost below the Max tier. Qwen 3.7 Plus continues that positioning and adds vision input, making it the multimodal entry in the 3.7 line. The model is described as a balanced version focused on reasoning and logical expression, with the broader toolchain rolling out over time.

Like earlier Plus releases, Qwen 3.7 Plus supports structured outputs and tool calling, letting the model invoke registered functions or external APIs during multi-turn sessions. For teams already using Qwen3.6-Plus, Qwen 3.7 Plus offers an incremental upgrade path that slots into existing integrations without changes to prompt structure or tool-calling configuration.
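
To make the tool-calling flow concrete, here is a minimal sketch of one round trip using the OpenAI-style Chat Completions tool format. It assumes the gateway's Chat Completions API accepts that format unchanged. The `get_weather` tool, the `registry` map, and `runToolCall` are hypothetical names invented for this example.

```typescript
// A tool definition in the OpenAI-style Chat Completions format.
// `get_weather` is a hypothetical tool used only for illustration.
const tools = [
  {
    type: 'function' as const,
    function: {
      name: 'get_weather',
      description: 'Look up the current weather for a city.',
      parameters: {
        type: 'object',
        properties: { city: { type: 'string' } },
        required: ['city'],
      },
    },
  },
];

// Request body that registers the tool with the model.
const requestBody = {
  model: 'alibaba/qwen3.7-plus',
  messages: [{ role: 'user' as const, content: 'What is the weather in Hangzhou?' }],
  tools,
};

// Shape of a tool call as it appears in an assistant message.
interface ToolCall {
  id: string;
  type: 'function';
  function: { name: string; arguments: string }; // arguments is a JSON string
}

// Local implementations, keyed by tool name.
const registry: Record<string, (args: Record<string, unknown>) => string> = {
  get_weather: (args) => `Sunny in ${String(args['city'])}`,
};

// Run one tool call and build the `tool` message to send back on the next turn.
function runToolCall(call: ToolCall) {
  const impl = registry[call.function.name];
  if (!impl) throw new Error(`Unknown tool: ${call.function.name}`);
  const args = JSON.parse(call.function.arguments) as Record<string, unknown>;
  return { role: 'tool' as const, tool_call_id: call.id, content: impl(args) };
}

// Simulated tool call, shaped like one the model might return.
const reply = runToolCall({
  id: 'call_1',
  type: 'function',
  function: { name: 'get_weather', arguments: '{"city":"Hangzhou"}' },
});
console.log(reply.content); // prints "Sunny in Hangzhou"
```

In a real session, the returned `tool` message is appended to `messages` and the conversation is sent again so the model can use the result.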

You can integrate Qwen 3.7 Plus through AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python.
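
For callers who prefer plain HTTP over AI SDK, a Chat Completions request might look like the sketch below. The endpoint URL is an assumption based on the gateway's OpenAI-compatible API and should be checked against the AI Gateway docs. `buildChatRequest` and `askQwen` are hypothetical helper names.

```typescript
// Assumed OpenAI-compatible endpoint; verify against the AI Gateway docs.
const GATEWAY_URL = 'https://ai-gateway.vercel.sh/v1/chat/completions';

interface ChatMessage {
  role: 'system' | 'user' | 'assistant';
  content: string;
}

// Build the fetch arguments without sending anything, so the payload can be inspected.
function buildChatRequest(apiKey: string, messages: ChatMessage[]) {
  return {
    url: GATEWAY_URL,
    init: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${apiKey}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({ model: 'alibaba/qwen3.7-plus', messages }),
    },
  };
}

// Send the request and return the first choice's text (empty string if none).
async function askQwen(apiKey: string, prompt: string): Promise<string> {
  const { url, init } = buildChatRequest(apiKey, [{ role: 'user', content: prompt }]);
  const res = await fetch(url, init);
  if (!res.ok) throw new Error(`Gateway returned HTTP ${res.status}`);
  const data = (await res.json()) as { choices: { message: { content: string } }[] };
  return data.choices[0]?.message.content ?? '';
}
```

Keeping request construction separate from the network call makes the payload easy to log or unit test before any tokens are billed.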

What To Consider When Choosing a Provider

  • Configuration: Qwen 3.7 Plus is a newer Plus-tier release than Qwen3.6-Plus, so monitor the AI Gateway cost dashboard to compare per-token spend and confirm that the quality uplift justifies any pricing difference for your workload.
  • Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
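
The authentication point above can be sketched as a small credential resolver. The environment variable names `AI_GATEWAY_API_KEY` and `VERCEL_OIDC_TOKEN`, the preference for the API key over the OIDC token, and sending either one as a Bearer token are all assumptions of this example; `resolveAuthHeader` is a hypothetical helper.

```typescript
type Env = Record<string, string | undefined>;

// Pick a credential for AI Gateway: API key first, then OIDC token.
// Variable names and ordering are assumptions for this sketch.
function resolveAuthHeader(env: Env): { Authorization: string } {
  const token = env['AI_GATEWAY_API_KEY'] || env['VERCEL_OIDC_TOKEN'];
  if (!token) {
    throw new Error('Set AI_GATEWAY_API_KEY or VERCEL_OIDC_TOKEN');
  }
  return { Authorization: `Bearer ${token}` };
}

// Example with an explicit environment object; in Node.js you might pass process.env.
const header = resolveAuthHeader({ VERCEL_OIDC_TOKEN: 'oidc-token-value' });
console.log(header.Authorization); // prints "Bearer oidc-token-value"
```

No provider credentials appear anywhere in this flow, which matches the point that the gateway manages them for you.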

When to Use Qwen 3.7 Plus

Best For

  • Deliberate Multi-Step Reasoning: Analytical workflows, structured document processing, and multi-constraint problem solving
  • Multimodal Reasoning Tasks: Pairing text instructions with screenshot analysis, chart interpretation, or UI grounding
  • High-Fidelity Code Work: Refactoring and code generation where instruction adherence outweighs raw throughput
  • Multi-Turn Agentic Pipelines: Tool-calling across many turns where the model plans and invokes external APIs
  • Long-Context Workloads: Passing full documents or codebases using the window of 1M tokens
  • Upgrading From Qwen3.6-Plus: Teams moving to Qwen 3.7 Plus without changing their integration
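
For long-context workloads, it can help to check a prompt against the window before sending it. The sketch below assumes "1M" means 1,000,000 tokens and uses a crude estimate of about four characters per token, which is a heuristic rather than the model's tokenizer. `roughTokenCount` and `fitsInContext` are hypothetical helpers.

```typescript
// Assumption: "1M" is treated as 1,000,000 tokens; the exact limit may differ.
const CONTEXT_WINDOW = 1_000_000;

// Crude heuristic: about 4 characters per token. Real tokenizer counts will vary.
function roughTokenCount(text: string): number {
  return Math.ceil(text.length / 4);
}

// The window covers input and output combined, so reserve room for the reply.
function fitsInContext(documents: string[], maxOutputTokens: number): boolean {
  const inputTokens = documents.reduce((sum, doc) => sum + roughTokenCount(doc), 0);
  return inputTokens + maxOutputTokens <= CONTEXT_WINDOW;
}

const smallFits = fitsInContext(['a'.repeat(2_000_000)], 8_000); // ~500,000 + 8,000 tokens
const largeFits = fitsInContext(['a'.repeat(4_000_000)], 8_000); // ~1,000,000 + 8,000 tokens
console.log(smallFits); // prints true
console.log(largeFits); // prints false
```

Because the estimate is approximate, leaving a safety margin below the limit is a reasonable precaution.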

Consider Alternatives When

  • Throughput And Cost First: A Flash-tier model is a closer fit when deep reasoning isn't required and latency and price dominate
  • Flagship Agent Workloads: Qwen3.7-Max is the flagship in the 3.7 line for long-horizon agent tasks that need the highest reasoning ceiling
  • Text-Only Pipelines: A dedicated text model offers lower cost per token when vision input is never used
  • Video Or Image Generation: A generation-class model fits tasks that produce pixels rather than read them

Conclusion

Qwen 3.7 Plus extends the Qwen Plus tier with the Qwen 3.7 generation's reasoning improvements and adds vision input as the multimodal entry in the 3.7 line. Routing through AI Gateway gives teams already on Qwen3.6-Plus a direct upgrade path with the same unified API, provider routing, and billing.

Frequently Asked Questions

  • How does Qwen 3.7 Plus relate to Qwen3.6-Plus?

    Qwen 3.7 Plus is the next generation in the Plus tier, succeeding Qwen3.6-Plus. It builds on the same architectural lineage with stronger reasoning and adds vision input as the multimodal entry in the 3.7 line.

  • What is the context window for Qwen 3.7 Plus?

    The context window is 1M tokens. This applies to the combined input and output token length.

  • Can Qwen 3.7 Plus accept image input?

    Yes. Qwen 3.7 Plus is the multimodal entry in the Qwen 3.7 line, accepting image input alongside text within a single request.

  • Does Qwen 3.7 Plus support tool calling and agentic workflows?

    Yes. Like other Qwen 3 Plus-tier models, Qwen 3.7 Plus supports structured tool calling, letting the model invoke registered functions or APIs during multi-turn sessions.

  • Can I switch from Qwen3.6-Plus to Qwen 3.7 Plus without changing my integration?

    Yes. Update the model identifier in your AI Gateway request to `alibaba/qwen3.7-plus`. No changes to prompt structure or tool-calling configuration are required.

  • When should I use Qwen3.7-Max instead of Qwen 3.7 Plus?

    Use Qwen3.7-Max for long-horizon agent workloads that need the highest reasoning ceiling in the 3.7 line, including multi-file software engineering and extended tool-calling sessions. Qwen 3.7 Plus is a closer fit when you need a balanced reasoning model with vision input and lower per-token cost than the Max tier.

  • How do I access Qwen 3.7 Plus through AI Gateway?

    Authenticate with an AI Gateway API key or OIDC token and reference `alibaba/qwen3.7-plus` as the model. You can call Qwen 3.7 Plus through AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python.

  • Does Qwen 3.7 Plus support zero data retention?

    Zero Data Retention is not currently available for this model. Zero Data Retention is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.
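
To illustrate the image-input answer above, the sketch below builds a request body that pairs a text question with an image, using OpenAI-style content parts. It assumes the gateway's Chat Completions API accepts `image_url` parts for this model. `buildImageQuestion` is a hypothetical helper and the image URL is a placeholder.

```typescript
// OpenAI-style content parts for a mixed text and image message.
type ContentPart =
  | { type: 'text'; text: string }
  | { type: 'image_url'; image_url: { url: string } };

// Build a Chat Completions body with one user message containing text and an image.
function buildImageQuestion(question: string, imageUrl: string) {
  const content: ContentPart[] = [
    { type: 'text', text: question },
    { type: 'image_url', image_url: { url: imageUrl } },
  ];
  return {
    model: 'alibaba/qwen3.7-plus',
    messages: [{ role: 'user' as const, content }],
  };
}

// Placeholder URL for illustration only.
const body = buildImageQuestion(
  'What does this chart show?',
  'https://example.com/chart.png',
);
console.log(JSON.stringify(body, null, 2));
```

The resulting body can be sent with the same authentication and endpoint as a text-only request.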