Inference Providers
Active filters: vLLM
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 54.1k
• 369
QuantTrio/Qwen3.6-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 32.1k
• 6
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 31.6k
• 66
QuantTrio/Qwen3.6-27B-AWQ-6Bit
Image-Text-to-Text
• 28B • Updated • 3.77k
• 5
Text Generation
• 754B • Updated • 656
• 4
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 264k
• 10
QuantTrio/Qwen3.6-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 142k
• 15
selode-ai/Qwen-3.6-35B-A3B-VRAP-4-bit-AWQ-21.2GB
Image-Text-to-Text
• 29B • Updated • 200
• 12
QuantTrio/MiniMax-M2.7-AWQ
Text Generation
• 229B • Updated • 26.8k
• 6
QuantTrio/Qwen3.5-122B-A10B-AWQ
Image-Text-to-Text
• 125B • Updated • 69.8k
• 26
unsloth/Mistral-Small-4-119B-2603
119B • Updated • 237
• 5
QuantTrio/Qwopus3.5-27B-v3-AWQ
Image-Text-to-Text
• 27B • Updated • 24.3k
• 10
QuantTrio/gemma-4-31B-it-AWQ-6Bit
Image-Text-to-Text
• 31B • Updated • 14.5k
• 8
Xingyu-Zheng/Qwopus3.5-9B-v3.5-INT4-FOEM
Image-Text-to-Text
• 9B • Updated • 122
• 1
Xingyu-Zheng/Qwopus3.6-27B-v1-preview-INT8-FOEM
Image-Text-to-Text
• 27B • Updated • 16
• 1
mistralai/Mistral-Small-4-119B-2603-eagle
Updated • 277
• 48
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 97
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 5
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 94
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 71
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 4
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 151
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 317
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 262
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 17
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 152
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 18
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 28.5k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 295
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 49
• 1