Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

arxiv: 2501.15383

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

74

Full-text search

Active filters: 2501.15383

unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF

Text Generation • 80B • Updated 3 days ago • 95.9k • 99

unsloth/Qwen3-Next-80B-A3B-Thinking-GGUF

Text Generation • 80B • Updated 3 days ago • 43.7k • 47

Qwen/Qwen3-30B-A3B-Instruct-2507

Text Generation • 31B • Updated Sep 17 • 578k • • 686

Qwen/Qwen3-Next-80B-A3B-Instruct

Text Generation • 81B • Updated Sep 17 • 2.94M • • 902

Qwen/Qwen3-Next-80B-A3B-Instruct-GGUF

Text Generation • 80B • Updated 7 days ago • 2.41k • 9

Qwen/Qwen3-Next-80B-A3B-Thinking-GGUF

Text Generation • 80B • Updated 7 days ago • 841 • 6

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17 • 75.4k • • 383

Qwen/Qwen3-Next-80B-A3B-Thinking

Text Generation • 81B • Updated Sep 15 • 177k • • 454

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • 15B • Updated Jan 29 • 12.2k • • 330

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated Sep 17 • 127k • • 730

Qwen/Qwen3-Next-80B-A3B-Thinking-FP8

Text Generation • Updated Sep 22 • 421k • 39

Qwen/Qwen3-30B-A3B-Thinking-2507

Text Generation • 31B • Updated Aug 17 • 560k • • 323

unsloth/Qwen3-Next-80B-A3B-Instruct

Text Generation • 81B • Updated 4 days ago • 820 • 86

cpatonn/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit

Text Generation • Updated 16 days ago • 35.8k • 49

Qwen/Qwen3-Next-80B-A3B-Instruct-FP8

Text Generation • Updated Sep 22 • 809k • 62

unsloth/Qwen3-Next-80B-A3B-Thinking

Text Generation • 81B • Updated 12 days ago • 248 • 6

Qwen/Qwen2.5-7B-Instruct-1M

Text Generation • 8B • Updated Jan 29 • 40.5k • • 358

async0x42/Qwen2.5-7B-Instruct-1M-exl2_4.65bpw

Text Generation • Updated Jan 29 • 7

async0x42/Qwen2.5-14B-Instruct-1M-exl2_4.65bpw

Text Generation • Updated Jan 29 • 7

ZeroXClem/Qwen2.5-7B-CelestialHarmony-1M

Text Generation • 8B • Updated Feb 8 • 20 • 7

remymenard/Qwen2.5-7B-Instruct-1M-ct2-int8

Text Generation • Updated Feb 3 • 9

QuantFactory/Qwen2.5-14B-Instruct-1M-GGUF

Text Generation • 15B • Updated Feb 8 • 369 • 3

QuantFactory/Qwen2.5-7B-Instruct-1M-GGUF

Text Generation • 8B • Updated Feb 9 • 232 • 3

professorf/Qwen2.5-7B-Instruct-1M-gguf

Text Generation • 8B • Updated Feb 17 • 22 • 1

AightBits/Qwen2.5-14B-Instruct-1M-8.0bpw-h8-exl2

Text Generation • Updated Feb 19 • 5

AightBits/Qwen2.5-7B-Instruct-1M-8.0bpw-h8-exl2

Text Generation • Updated Feb 19 • 8

Mungert/Qwen2.5-14B-Instruct-1M-GGUF

Text Generation • 15B • Updated Sep 24 • 8.2k • 6

Mungert/Qwen2.5-7B-Instruct-1M-GGUF

Text Generation • 8B • Updated Sep 24 • 782 • 6

duyntnet/Qwen2.5-14B-Instruct-1M-imatrix-GGUF

Text Generation • 15B • Updated Mar 25 • 251

RichardErkhov/Qwen_-_Qwen2.5-7B-Instruct-1M-4bits

4B • Updated Mar 27 • 4