This repository has been archived by the owner on Nov 1, 2024. It is now read-only.

Questions about GPU memory usage in non local blocks. #70

Open
seominseok0429 opened this issue Oct 14, 2019 · 0 comments

Comments

@seominseok0429

I am an undergraduate student studying deep learning in Korea. I was impressed by your paper and am running experiments based on it.

I thought GPU memory usage depended only on the number of model parameters, but adding non-local blocks uses far more GPU memory than I expected.

After adding two non-local blocks to a model with the I3D Inception structure, the total number of parameters increased by only about 7%.

However, GPU memory usage increased significantly. For example, with the plain I3D Inception structure a batch size of six fit on each GPU, but after adding two non-local blocks the maximum batch size is two per GPU.

Can you tell me whether this is expected, or whether I made a mistake in my code?

(Input size = 224x224, time = 64 frames)
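
To make the comparison concrete, here is a rough back-of-the-envelope sketch of why memory can grow much faster than the parameter count. A non-local block compares every space-time position with every other one, so its affinity matrix has (T*H*W)^2 entries per sample, while its 1x1x1 convolutions add relatively few parameters. The feature-map size (16x14x14) and channel counts below are assumptions for illustration only, not the actual shapes in my model, and the function names are hypothetical. The estimate covers only the forward-pass affinity matrix in float32 and ignores any subsampling inside the block.

```python
def affinity_matrix_bytes(t, h, w, batch_size=1, bytes_per_element=4):
    """Bytes needed to store the pairwise affinity matrix of one
    non-local block in the forward pass (float32 by default)."""
    n = t * h * w  # number of space-time positions in the feature map
    return batch_size * n * n * bytes_per_element


def conv1x1x1_param_count(c_in, c_out):
    """Parameters of one 1x1x1 convolution with bias."""
    return c_in * c_out + c_out


if __name__ == "__main__":
    # Assumed (hypothetical) feature map at the insertion point: T=16, H=W=14.
    per_sample = affinity_matrix_bytes(16, 14, 14)
    print(f"affinity matrix per sample: {per_sample / 2**20:.1f} MiB")

    # Assumed (hypothetical) channel sizes: 512 in, 256 inside the block.
    params = conv1x1x1_param_count(512, 256)
    print(f"parameters of one 1x1x1 conv: {params}")
```

The activation cost scales with the batch size and with the square of the number of positions, whereas the parameter cost is independent of both, which would be consistent with a small parameter increase and a large drop in the batch size that fits.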
