Show HN: Llama-dl – high-speed download of LLaMA, Facebook’s 65B GPT model

HN discussion | Twitter announcement

UPDATE (9:51 AM CST): HN user MacsHeadroom left a valuable comment:

I'm running LLaMA-65B on a single A100 80GB with 8bit quantization. $1.5/hr on vast.ai. The output is at least as good as davinci. I think some early results are using bad repetition penalty and/or temperature settings. I had to…
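
As a rough sanity check on the memory claim in the comment quoted above, the arithmetic can be sketched in a few lines of Python. This is only a back-of-the-envelope estimate: the parameter count is assumed to be a round 65e9, the helper name `weight_memory_gb` is hypothetical, and activations, the KV cache, and framework overhead are ignored, so real usage will be higher.

```python
# Back-of-the-envelope check: do the model weights alone fit in GPU memory?
# Assumptions (not from the original post): a round 65e9 parameters, and
# decimal gigabytes (1 GB = 1e9 bytes). Activations, KV cache and framework
# overhead are ignored, so actual memory use will be higher.

def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Return the storage needed for the weights, in decimal gigabytes."""
    return num_params * bits_per_param / 8 / 1e9

NUM_PARAMS = 65e9      # assumed round figure for LLaMA-65B
GPU_MEMORY_GB = 80     # A100 80GB, as mentioned in the comment

for bits in (16, 8):
    needed = weight_memory_gb(NUM_PARAMS, bits)
    verdict = "fits" if needed < GPU_MEMORY_GB else "does not fit"
    print(f"{bits}-bit weights: {needed:.0f} GB -> {verdict} in {GPU_MEMORY_GB} GB")
```

Under these assumptions, 16-bit weights would need about 130 GB, while 8-bit weights need about 65 GB, which is consistent with the comment's report of running on a single 80 GB card.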