Active filters: vLLM
Text Generation • 358B • Updated • 17.2k • 17
QuantTrio/MiniMax-M2.1-AWQ • Text Generation • 229B • Updated • 4.3k • 8
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4 • Text Generation • 4B • Updated • 1.78k • 1
Text Generation • 229B • Updated • 366k • 9
QuantTrio/MiniMax-M2-REAP-162B-A10B-AWQ • Text Generation • 162B • Updated • 567 • 3
QuantTrio/DeepSeek-V3.2-AWQ • Text Generation • 685B • Updated • 3.01k • 9
model-scope/glm-4-9b-chat-GPTQ-Int4 • Text Generation • 9B • Updated • 53 • 6
model-scope/glm-4-9b-chat-GPTQ-Int8 • Text Generation • 9B • Updated • 22 • 2
tclf90/qwen2.5-72b-instruct-gptq-int4 • Text Generation • 73B • Updated • 59 • 2
tclf90/qwen2.5-72b-instruct-gptq-int3 • Text Generation • 69B • Updated • 57
prithivMLmods/Nu2-Lupi-Qwen-14B • Text Generation • 15B • Updated • 4 • 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF • 15B • Updated • 70 • 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF • 15B • Updated • 90 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int4 • Text Generation • 0.6B • Updated • 428 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int8 • Text Generation • 0.6B • Updated • 20
JunHowie/Qwen3-1.7B-GPTQ-Int4 • Text Generation • 2B • Updated • 455 • 1
JunHowie/Qwen3-1.7B-GPTQ-Int8 • Text Generation • 2B • Updated • 16
JunHowie/Qwen3-32B-GPTQ-Int4 • Text Generation • 33B • Updated • 803 • 3
JunHowie/Qwen3-32B-GPTQ-Int8 • Text Generation • 33B • Updated • 268 • 3
JunHowie/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 5B • Updated • 27 • 1
JunHowie/Qwen3-14B-GPTQ-Int8 • Text Generation • 15B • Updated • 89 • 1
JunHowie/Qwen3-14B-GPTQ-Int4 • Text Generation • 15B • Updated • 680 • 4
JunHowie/Qwen3-8B-GPTQ-Int8 • Text Generation • 8B • Updated • 103
JunHowie/Qwen3-8B-GPTQ-Int4 • Text Generation • 8B • Updated • 1.37k • 4
JunHowie/Qwen3-4B-GPTQ-Int4 • Text Generation • 4B • Updated • 239 • 1
JunHowie/Qwen3-4B-GPTQ-Int8 • Text Generation • 4B • Updated • 7
JunHowie/Qwen3-30B-A3B-GPTQ-Int8 • Text Generation • 8B • Updated • 7.27k
QuantTrio/Qwen3-235B-A22B-GPTQ-Int8 • Text Generation • 235B • Updated • 48
BeastyZ/Qwen2.5-3B-ConvSearch-R1-TopiOCQA • 3B • Updated • 5
QuantTrio/DeepSeek-R1-0528-Qwen3-8B-GPTQ-Int4-Int8Mix • Text Generation • 11B • Updated • 88 • 3