Nemotron models that have been converted and/or quantized to work well in vLLM
-
mgoin/Nemotron-4-340B-Instruct-hf-FP8
Text Generation • 341B • Updated • 4.03k • 3 -
mgoin/Nemotron-4-340B-Base-hf-FP8
Text Generation • 341B • Updated • 34 • 2 -
mgoin/Nemotron-4-340B-Instruct-hf
Text Generation • 341B • Updated • 45 • 4 -
mgoin/Nemotron-4-340B-Base-hf
Text Generation • 341B • Updated • 16 • 1