Questions about GPU memory usage in non local blocks. #70

seominseok0429 · 2019-10-14T02:14:46Z

I am an undergraduate student studying Deep Learning in Korea. I am impressed by your paper and doing experiments.

I thought gpu memory usage was only related to the parameters of the model, but using non-local blocks would use too much gpu memory.

Adding two non-local blocks to the model of the i3d inception structure, the total number of parameters increased by about 7%.

However, gpu memory usage has increased significantly. For example, in the i3d inception structure, six batches were available for each gpu, but adding two non-local blocks maximized two batches for each gpu.

Can you tell me if this is common or if I miscoded it?

(input size=224x224, time=64)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about GPU memory usage in non local blocks. #70

Questions about GPU memory usage in non local blocks. #70

seominseok0429 commented Oct 14, 2019

Questions about GPU memory usage in non local blocks. #70

Questions about GPU memory usage in non local blocks. #70

Comments

seominseok0429 commented Oct 14, 2019