-
-
-
-
-
-
Inference Providers
Active filters:
gptq
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ
Text Generation
•
33B
•
Updated
•
134k
•
599
TheBloke/MythoMax-L2-13B-GPTQ
Text Generation
•
Updated
•
549
•
219
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4
Text Generation
•
8B
•
Updated
•
19.4k
•
41
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
2B
•
Updated
•
105
•
16
fbaldassarri/meta-llama_Llama-3.2-11B-Vision-Instruct-OpenVino
Text Generation
•
Updated
•
9
•
1
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix
Text Generation
•
534B
•
Updated
•
42
•
7
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
•
253B
•
Updated
•
21
•
3
thomasip/Qwen3-Omni-30B-A3B-Instruct-GPTQ-4bit
35B
•
Updated
•
566
•
2
tencent/HY-MT1.5-7B-GPTQ-Int4
Translation
•
8B
•
Updated
•
1.23k
•
9
krishhx/Hymba-1.5B-Eigen-Hybrid-4bit
FayeQuant/GLM-4.7-Flash-GPTQ-4bit
Text Generation
•
30B
•
Updated
•
1.92k
•
1
Ubuku/Qwen2.5-Math-72B-Instruct-GPTQ-Int4-TP2
73B
•
Updated
•
9
•
1
elinas/alpaca-13b-lora-int4
Text Generation
•
Updated
•
14
•
41
elinas/alpaca-30b-lora-int4
Text Generation
•
Updated
•
18
•
68
mayaeary/pygmalion-6b-4bit-128g
Text Generation
•
Updated
•
9
•
40
mayaeary/pygmalion-6b_dev-4bit-128g
Text Generation
•
Updated
•
9
•
121
mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g
Text Generation
•
Updated
•
3
•
2
mayaeary/PPO_Pygway-6b-Mix-4bit-128g
Text Generation
•
Updated
•
2
•
2
Text Generation
•
Updated
•
8
•
45
Text Generation
•
7B
•
Updated
•
42
•
31
Text Generation
•
Updated
•
1.22k
•
21
Text Generation
•
Updated
•
1.24k
•
41
Text Generation
•
13B
•
Updated
•
15
•
38
TheBloke/galpaca-30B-GPTQ
Text Generation
•
Updated
•
7
•
48
Ancestral/Dolly_Shygmalion-6b-4bit-128g
Text Generation
•
Updated
•
19
•
5
Ancestral/PPO_Shygmalion-6b-4bit-128g
Text Generation
•
Updated
•
3
TheBloke/vicuna-7B-v0-GPTQ
Text Generation
•
7B
•
Updated
•
15
•
15
Ancestral/Dolly_Malion-6b-4bit-128g
Text Generation
•
Updated
•
3
•
1
4bit/pygmalion-6b-4bit-128g
Text Generation
•
Updated
•
1
•
3
TheBloke/gpt4-alpaca-lora-30B-GPTQ
Text Generation
•
33B
•
Updated
•
16
•
20