
Why do you need 2 GPUs to run Qwen2.5 3B? #33

Open
rpking1107 opened this issue Feb 1, 2025 · 4 comments

Comments

@rpking1107

Sorry if this is a stupid question, but my 4090 has 24 GB of VRAM, which should be more than enough to run a 3B model, right? Can I use Qwen2.5 3B and change the parameter to GPU=1?

Thanks, guys

@Superskyyy

Training is different from inference; it needs far more VRAM. It's safe to say your 4090 is not enough.
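To see why training needs so much more memory than inference, here is a back-of-envelope sketch (my own rough model, not from this repo): inference only holds the bf16 weights, while mixed-precision AdamW training also holds bf16 gradients plus fp32 master weights and two fp32 optimizer moments, before counting activations or CUDA overhead.

```python
def vram_gb(n_params: float, mode: str) -> float:
    """Lower-bound VRAM estimate (GB) for model states only.

    Ignores activations, KV cache, and framework overhead, so real
    usage is higher. Assumes bf16 weights/gradients and a
    mixed-precision AdamW with fp32 master weights + two fp32 moments.
    """
    GB = 1024 ** 3
    if mode == "inference":
        per_param = 2                    # bf16 weights only
    elif mode == "train":
        per_param = 2 + 2 + 4 + 4 + 4    # weights + grads + master + m + v
    else:
        raise ValueError(f"unknown mode: {mode}")
    return n_params * per_param / GB

# Treating Qwen2.5 3B as roughly 3e9 parameters:
print(round(vram_gb(3e9, "inference"), 1))  # ~5.6 GB
print(round(vram_gb(3e9, "train"), 1))      # ~44.7 GB
```

Even this optimistic lower bound for full fine-tuning already exceeds a 24 GB card, which is why memory-saving tricks (smaller batches, gradient checkpointing, offloading) come up in the comments below.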

@zacksiri

zacksiri commented Feb 2, 2025

@rpking1107 I was able to get training running on my A4500 with 20 GB of VRAM; you just need to adjust the training parameters. It will take longer because of the smaller batch size, but it does work.
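One common way to "adjust the training parameters" without changing the optimization itself (a general technique, not necessarily what this repo's scripts expose) is to shrink the per-device batch so it fits in memory and raise gradient accumulation so the effective batch per optimizer step stays the same. The specific numbers below are hypothetical:

```python
def effective_batch(per_device_batch: int, grad_accum_steps: int,
                    num_gpus: int = 1) -> int:
    """Effective batch size per optimizer step.

    Lowering per_device_batch (to fit a smaller GPU) can be offset by
    raising grad_accum_steps, keeping the optimization trajectory
    comparable at the cost of more forward/backward passes per step.
    """
    return per_device_batch * grad_accum_steps * num_gpus

# A hypothetical 2-GPU recipe vs. a single-GPU variant
# with the same effective batch of 64:
two_gpu = effective_batch(8, 4, num_gpus=2)
one_gpu = effective_batch(4, 16, num_gpus=1)
print(two_gpu, one_gpu)  # 64 64
```

The trade-off zacksiri mentions falls out directly: the single-GPU variant runs 16 accumulation passes per step instead of 4, so wall-clock time per step grows even though the effective batch is unchanged.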

@carlos-aguayo

@zacksiri Can you share what parameters you used? I tried lowering the batch sizes and a few other settings and still ran out of memory.

@zacksiri

zacksiri commented Feb 2, 2025

@carlos-aguayo You can see my parameters here.

#5 (comment)

It works with Qwen2.5 1.5B. I'm not sure I have the best configuration or output yet; I need to try a few more experiments.

I'm experimenting with the 3B model, but I think I'm pushing my luck.
