ModelQwen (Alibaba Cloud)Qwen (Alibaba Cloud)published Apr 21, 2026seen 5d

Qwen/Qwen3.6-27B

Open original ↗

Captured source

source ↗
published Apr 21, 2026seen 5dcaptured 15hhttp 200method plaintask image-text-to-textlicense apache-2.0library transformersparams 28Bdownloads 5277klikes 1.7k

Qwen3.6-27B

> [!Note] > This repository contains model weights and configuration files for the post-trained model in the Hugging Face Transformers format. > > These artifacts are compatible with Hugging Face Transformers, vLLM, SGLang, KTransformers, etc.

Following the February release of the Qwen3.5 series, we're pleased to share the first open-weight variant of Qwen3.6. Built on direct feedback from the community, Qwen3.6 prioritizes stability and real-world utility, offering developers a more intuitive, responsive, and genuinely productive coding experience.

Qwen3.6 Highlights

This release delivers substantial upgrades, particularly in

  • Agentic Coding: the model now handles frontend workflows and repository-level reasoning with greater fluency and precision.
  • Thinking Preservation: we've introduced a new option to retain reasoning context from historical messages, streamlining iterative development and reducing overhead.

!Benchmark Results

For more details, please refer to our blog post Qwen3.6-27B.

Model Overview

  • Type: Causal Language Model with Vision Encoder
  • Training Stage: Pre-training & Post-training
  • Language Model
  • Number of Parameters: 27B
  • Hidden Dimension: 5120
  • Token Embedding: 248320 (Padded)
  • Number of Layers: 64
  • Hidden Layout: 16 × (3 × (Gated DeltaNet → FFN) → 1 × (Gated Attention → FFN))
  • Gated DeltaNet:
  • Number of Linear Attention Heads: 48 for V and 16 for QK
  • Head Dimension: 128
  • Gated Attention:
  • Number of Attention Heads: 24 for Q and 4 for KV
  • Head Dimension: 256
  • Rotary Position Embedding Dimension: 64
  • Feed Forward Network:
  • Intermediate Dimension: 17408
  • LM Output: 248320 (Padded)
  • MTP: trained with multi-steps
  • Context Length: 262,144 natively and extensible up to 1,010,000 tokens.

Benchmark Results

Language

Qwen3.5-27BQwen3.5-397B-A17BGemma4-31BClaude 4.5 OpusQwen3.6-35B-A3BQwen3.6-27B

Coding Agent

SWE-bench Verified 75.0 76.2 52.0 80.9 73.4 77.2

SWE-bench Pro 51.2 50.9 35.7 57.1 49.5 53.5

SWE-bench Multilingual 69.3 69.3 51.7 77.5 67.2 71.3

Terminal-Bench 2.0 41.6 52.5 42.9 59.3 51.5 59.3

SkillsBench Avg5 27.2 30.0 23.6 45.3 28.7 48.2

QwenWebBench 1068 1186 1197 1536 1397 1487

NL2Repo 27.3 32.2 15.5 43.2 29.4 36.2

Claw-Eval Avg 64.3 70.7 48.5 76.6 68.7 72.4

Claw-Eval Pass^3 46.2 48.1 25.0 59.6 50.0 60.6

QwenClawBench 52.2 51.8 41.7 52.3 52.6 53.4

Knowledge

MMLU-Pro 86.1 87.8 85.2 89.5 85.2 86.2

MMLU-Redux 93.2 94.9 93.7 95.6 93.3 93.5

SuperGPQA 65.6 70.4 65.7 70.6 64.7 66.0

C-Eval 90.5 93.0 82.6 92.2 90.0 91.4

STEM & Reasoning

GPQA Diamond 85.5 88.4 84.3 87.0 86.0 87.8

HLE 24.3 28.7 19.5 30.8 21.4 24.0

LiveCodeBench v6 80.7 83.6 80.0 84.8 80.4 83.9

HMMT Feb 25 92.0 94.8 88.7 92.9 90.7 93.8

HMMT Nov 25 89.8 92.7 87.5 93.3 89.1 90.7

HMMT Feb 26 84.3 87.9 77.2 85.3 83.6 84.3

IMOAnswerBench 79.9 80.9 74.5 84.0 <td style="padding:7px 7px;text-align:center;bord

Notability

notability 9.0/10

Massive download count for a major model release.