
TheBloke/Llama-2-70B-GPTQ

Tags: Text Generation · Transformers · Safetensors · PyTorch · English · llama · facebook · meta · llama-2 · text-generation-inference · 4-bit precision · gptq
Community discussions (14)

  • Fine-tuning? (#14, opened over 2 years ago by OSK-Creative-Tech)
  • The model is not responding. (#13, opened over 2 years ago by PhelixZhen)
  • Model responses not good. (1 reply; #12, opened over 2 years ago by muneerhanif7)
  • How to quantize the Llama 2 70B model with AutoGPTQ (4 replies; #11, opened over 2 years ago by zhaohb)
  • Wrong shape when loading with Peft-AutoGPTQ (2 replies; #10, opened over 2 years ago by tridungduong16)
  • Long waiting time (14 replies; #9, opened over 2 years ago by wempoo)
  • Context Length Differences (👍 2; #7, opened over 2 years ago by zacharyrs)
  • Problems with temperature when using from Python code (3 replies; #6, opened over 2 years ago by matchaslime)
  • Should we expect GGML soon? (3 replies; #5, opened over 2 years ago by yehiaserag)
  • Issue with 64g version? (#4, opened over 2 years ago by AARon99)
  • The `main` branch for TheBloke/Llama-2-70B-GPTQ appears borked (👍 1; 11 replies; #3, opened over 2 years ago by Aivean)
  • I found an fp16 model if it helps (1 reply; #2, opened over 2 years ago by rombodawg)
  • ❤️❤️❤️❤️ (1 reply; #1, opened over 2 years ago by SinanAkkoyun)