You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
You will have to add #198 and set threshold probably between 1 and 1.5 and limit context to where you don't get NaN error or out of memory. For llama 13b I can pull this off, for pythia 12b I cannot.
run
python server.py --share --listen --chat --model llama-7b
.Everything is okay.
However, when run ```` python server.py --share --listen --chat --model llama-13b ```,
It would report an error:
NameError: name 'cuda_setup' is not defined. Did you mean: 'CUDASetup'?
The text was updated successfully, but these errors were encountered: