Bluesky Thread

Qwen3-4B Instruct & Thinking

August 06, 2025 View original thread

Qwen3-4B Instruct & Thinking

uuuh, guys this isn’t a boring model

This crushes all the agentic benchmarks, even beating out the already-impressive qwen3-30b-a4b

It’s hanging with some already impressive mid-sized models at only 4B

55 4

huggingface.co/Qwen/Qwen3-4...

huggingface.co

Qwen/Qwen3-4B-Thinking-2507 · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

4 1

More like this