Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

7,288

Full-text search

Active filters: gptq

TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ

Text Generation • 33B • Updated Sep 27, 2023 • 134k • 599

TheBloke/MythoMax-L2-13B-GPTQ

Text Generation • Updated Sep 27, 2023 • 549 • 219

hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4

Text Generation • 8B • Updated Aug 7, 2024 • 19.4k • 41

Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8

Image-Text-to-Text • 2B • Updated Sep 21, 2024 • 105 • 16

fbaldassarri/meta-llama_Llama-3.2-11B-Vision-Instruct-OpenVino

Text Generation • Updated Nov 11, 2024 • 9 • 1

QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix

Text Generation • 534B • Updated Sep 5, 2025 • 42 • 7

QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix

Text Generation • 253B • Updated Sep 5, 2025 • 21 • 3

thomasip/Qwen3-Omni-30B-A3B-Instruct-GPTQ-4bit

35B • Updated Dec 10, 2025 • 566 • 2

tencent/HY-MT1.5-7B-GPTQ-Int4

Translation • 8B • Updated Jan 1 • 1.23k • 9

krishhx/Hymba-1.5B-Eigen-Hybrid-4bit

Updated Dec 31, 2025 • 1

FayeQuant/GLM-4.7-Flash-GPTQ-4bit

Text Generation • 30B • Updated 1 day ago • 1.92k • 1

Ubuku/Qwen2.5-Math-72B-Instruct-GPTQ-Int4-TP2

73B • Updated 8 days ago • 9 • 1

elinas/alpaca-13b-lora-int4

Text Generation • Updated Apr 5, 2023 • 14 • 41

elinas/alpaca-30b-lora-int4

Text Generation • Updated Apr 5, 2023 • 18 • 68

mayaeary/pygmalion-6b-4bit-128g

Text Generation • Updated Mar 28, 2023 • 9 • 40

mayaeary/pygmalion-6b_dev-4bit-128g

Text Generation • Updated Mar 28, 2023 • 9 • 121

mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g

Text Generation • Updated Mar 31, 2023 • 3 • 2

mayaeary/PPO_Pygway-6b-Mix-4bit-128g

Text Generation • Updated Mar 31, 2023 • 2 • 2

elinas/vicuna-13b-4bit

Text Generation • Updated Apr 5, 2023 • 8 • 45

TheBloke/koala-7B-GPTQ

Text Generation • 7B • Updated Aug 21, 2023 • 42 • 31

TheBloke/koala-7B-HF

Text Generation • Updated Jun 5, 2023 • 1.22k • 21

TheBloke/koala-13B-HF

Text Generation • Updated Jun 5, 2023 • 1.24k • 41

TheBloke/koala-13B-GPTQ

Text Generation • 13B • Updated Aug 21, 2023 • 15 • 38

TheBloke/galpaca-30B-GPTQ

Text Generation • Updated Aug 21, 2023 • 7 • 48

Ancestral/Dolly_Shygmalion-6b-4bit-128g

Text Generation • Updated Apr 9, 2023 • 19 • 5

Ancestral/PPO_Shygmalion-6b-4bit-128g

Text Generation • Updated Apr 9, 2023 • 3

TheBloke/vicuna-7B-v0-GPTQ

Text Generation • 7B • Updated Aug 21, 2023 • 15 • 15

Ancestral/Dolly_Malion-6b-4bit-128g

Text Generation • Updated Apr 10, 2023 • 3 • 1

4bit/pygmalion-6b-4bit-128g

Text Generation • Updated Apr 13, 2023 • 1 • 3

TheBloke/gpt4-alpaca-lora-30B-GPTQ

Text Generation • 33B • Updated Aug 21, 2023 • 16 • 20