Code Llama 70b on text-generation-webui?

librecat@lemmy.basedcount.com · 9 months ago

Code Llama 70b on text-generation-webui?

noneabove1182@sh.itjust.works · 9 months ago

If you’re using text generation webui there’s a bug where if your max new tokens is equal to your prompt truncation length it will remove all input and therefore just generate nonsense since there’s no prompt

Reduce your max new tokens and your prompt should actually get passed to the backend. This is more noticable in models with only 4k context (since a lot of people default max new tokens to 4k)

shootwhatsmyname@lemm.ee · edit-2 9 months ago

I’m just using Ollama with Ollama WebUI. You’ll have to use the right tag when installing Llama to make sure you get 70b.