[`fix`] Change eval dataloader to use eval_batch_size #2847

akashd-2 · 2024-07-18T09:44:33Z

The evaluation dataloader is currently initialized using the train_batch_size which seems to be a bug. This PR fixes this.

into pr-2847

I think eval is more similar than train here

The former is identical for single-GPU and DDP, but has a higher batch size for DP (which is the expected behaviour).

tomaarsen · 2024-09-10T10:09:35Z

Hello!

Apologies for the delay, I've been recovering from a surgery this last month.
Well spotted! I had another look at the other batch sizes as well, and I noticed that I'm using train for the get_test_dataloader currently: I think eval is better (there is no per_device_test_batch_size). This also matches what transformers does.

Additionally, I started using ..._batch_size everywhere instead of per_device_..._batch_size: The former is identical for single-GPU and DDP, but has a higher batch size for DP (which is the expected behaviour). I think after this PR, everything should be as expected again!

Tom Aarsen

into pr-2847

akashd-2 and others added 4 commits July 18, 2024 10:41

Change eval_dataloader to use eval_batch_size

de8c80a

Merge branch 'master' of https://github.com/UKPLab/sentence-transformers

7cc8f70

into pr-2847

train -> eval for the test dataloader

6d7637a

I think eval is more similar than train here

Use ..._batch_size rather than per_device_..._batch_size

5f91adb

The former is identical for single-GPU and DDP, but has a higher batch size for DP (which is the expected behaviour).

tomaarsen changed the title ~~Change eval dataloader to use eval_batch_size~~ [fix] Change eval dataloader to use eval_batch_size Sep 10, 2024

Merge branch 'master' of https://github.com/UKPLab/sentence-transformers

7494472

into pr-2847

tomaarsen merged commit 02fb5f8 into UKPLab:master Sep 10, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[`fix`] Change eval dataloader to use eval_batch_size #2847

[`fix`] Change eval dataloader to use eval_batch_size #2847

akashd-2 commented Jul 18, 2024

tomaarsen commented Sep 10, 2024

[fix] Change eval dataloader to use eval_batch_size #2847

[fix] Change eval dataloader to use eval_batch_size #2847

Conversation

akashd-2 commented Jul 18, 2024

tomaarsen commented Sep 10, 2024

[`fix`] Change eval dataloader to use eval_batch_size #2847

[`fix`] Change eval dataloader to use eval_batch_size #2847