Suggestion: Improved memory use #568

Closed
johannesulf opened this issue Jun 27, 2016 · 16 comments
Comments

@johannesulf (Contributor)
The following code, which continuously repopulates the same halo catalog with galaxies, uses a lot of memory. Most importantly, the memory usage keeps increasing even after the first hundred iterations and only reaches a maximum after around 300 iterations, at roughly 3 times the memory used after one iteration. It's not obvious why halotools' memory usage should increase at all with each iteration.

Obviously, re-populating the same halo catalog over and over again is important for MCMC, but there one commonly has plenty of memory anyway. However, re-populating the same halo catalog is also needed when plotting the results of an MCMC chain, and it would be desirable for that to work on a computer with less than 16 GB of RAM. It would be great if the memory usage of halotools could be reduced in such scenarios.

from halotools.empirical_models import PrebuiltHodModelFactory
from halotools.sim_manager import CachedHaloCatalog

halocat = CachedHaloCatalog(simname='bolplanck', redshift=0)
model = PrebuiltHodModelFactory('zheng07')
model.param_dict['sigma_logM'] = 1.0

for i in range(1000):
    try:
        # repopulate the existing mock in place
        model.mock.populate()
    except AttributeError:
        # first iteration: model.mock does not exist yet
        model.populate_mock(halocat)
@aphearin (Contributor)

Yes, agreed, this code snippet should definitely not require so much memory. Will look into this soon.

@aphearin aphearin modified the milestones: v0.3, v0.4 Jun 27, 2016
aphearin pushed a commit to aphearin/halotools that referenced this issue Jun 27, 2016
…first attempt to eliminate obvious sources of the memory leak.
@aphearin (Contributor)

@johannesulf - when I run your code snippet on my machine, I do not get what you get. I added the following line to the code snippet you emailed me:

mem_arr[i] = float(os.popen("ps u -p %d | awk '{sum=sum+$6}; END {print sum/1024}'" % osid).read())
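For a self-contained way to take the same kind of measurement without shelling out to `ps` (a sketch using only the standard library; note that `resource` is Unix-only, and `ru_maxrss` reports the peak rather than the current resident set size, so it shows the high-water mark instead of the sawtooth):

```python
import resource
import sys

def peak_rss_mb():
    """Peak resident set size of this process in MB (Unix-only sketch)."""
    rss = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    if sys.platform == 'darwin':
        rss /= 1024.0  # macOS reports bytes; Linux reports kilobytes
    return rss / 1024.0

# e.g. record one sample per iteration of the repopulation loop
print('peak RSS so far: %.1f MB' % peak_rss_mb())
```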
Then I made the following plot:

[plot: memory usage vs. iteration number]

Probably we should be using a more sophisticated tool like heapy or even just pympler to do this with greater reproducibility, but in any case I'm not sure what the source of the discrepancy is.

I started a new branch on my private fork to begin to address this. First commit is here: aphearin@0afe3f7

If I can solve this within the next day or so, I'll include the fix in the v0.3 release that I'll push up to pip on Wednesday. Otherwise, implementing this fix into a public release will have to wait for v0.4 (which would be perfectly fine by me).

@aphearin aphearin modified the milestones: v0.4, v0.3 Jun 28, 2016
@johannesulf (Contributor, Author)

I wanted to wait and see whether the issue still occurs under Ubuntu 16.04, and it does. I also tried your proposed fixes, but they did not change anything. So in my case halotools seems to use 7 times more memory than necessary.

[plot: memory usage vs. iteration number]

@aphearin (Contributor)

Thanks for following up on this @johannesulf. I won't be able to address this in the coming release, but will revisit this after finishing v0.4.

@aphearin aphearin modified the milestones: next-release, v0.4 Jul 25, 2016
@johannesulf (Contributor, Author)

So it really is a problem with garbage collection: when I call gc.collect() after each iteration, the memory stays low the whole time. But this adds a small runtime overhead (~10%).
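To illustrate (independently of halotools) why an explicit gc.collect() can matter: objects caught in reference cycles are invisible to reference counting and sit around until the cyclic collector runs. A minimal sketch:

```python
import gc

class Node:
    """Tiny object we can link into a reference cycle."""
    def __init__(self):
        self.partner = None

def make_cycle():
    a, b = Node(), Node()
    a.partner, b.partner = b, a  # cycle: refcounts never drop to zero

gc.disable()          # pretend the automatic collector has not fired yet
make_cycle()          # the two Nodes are now unreachable garbage
freed = gc.collect()  # force a full collection, like after mock.populate()
gc.enable()
print('unreachable objects found:', freed)
```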

@johannesulf (Contributor, Author)

Using pympler to diagnose this apparently doesn't work: when it is in use, the increased memory consumption is gone.

@aphearin (Contributor)

Ahh, very interesting observation about gc.collect(). A 10% overhead in CPU runtime seems like a very small sacrifice for a ~7x improvement in memory efficiency.

What do you think, would it be worth it to include an optional keyword that allows users to run in "low-memory mode", or something like that? This way, users like you who would prefer to run MCMCs on their laptops would have the option to do so. I think that any such user would not be bothered by a 10% runtime overhead. Let me know if you have an opinion on this.

@surhudm (Contributor) commented Jul 25, 2016

Quick question: is garbage collection by Python dependent upon the memory of the machine? If so, should we not let Python figure out what the best thing to do is?

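For reference, CPython's cyclic collector is triggered by object allocation counts, not by how much RAM the machine has; what can look machine-dependent is whether freed memory is actually returned to the OS by the platform allocator. The collector's thresholds can be inspected and tuned with the stdlib gc module (a sketch; the default values vary between CPython versions):

```python
import gc

# Generation 0 is collected once allocations minus deallocations exceed
# the first threshold; the later thresholds count collections of the
# next-younger generation. None of this depends on free RAM.
print(gc.get_threshold())  # e.g. (700, 10, 10) on many CPython versions

# Collections can be made more aggressive without sprinkling
# gc.collect() calls by hand:
gc.set_threshold(100, 5, 5)
print(gc.get_threshold())
```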


@aphearin (Contributor)

Garbage collection is definitely machine-dependent, yes. It is not a coincidence that @johannesulf sees the equilibrium value of the sawtooth pattern oscillate at a value near the maximum RAM of his laptop. But I don't think I quite follow your point. What is the downside of giving users the option to force Python to clean up after itself? This reduces the memory footprint by ~5x in @johannesulf's case.

@surhudm (Contributor) commented Jul 25, 2016

Hi Andrew,

The question would be: where would you put the garbage collection statements?


@aphearin (Contributor)

@johannesulf - where did you put your call to gc.collect()? Just at the end of mock.populate()?

@surhudm (Contributor) commented Jul 25, 2016

My guess is that he put it in the main loop; it makes no sense to put it at the end of mock.populate().

import gc

from halotools.empirical_models import PrebuiltHodModelFactory
from halotools.sim_manager import CachedHaloCatalog

halocat = CachedHaloCatalog(simname='bolplanck', redshift=0)
model = PrebuiltHodModelFactory('zheng07')
model.param_dict['sigma_logM'] = 1.0

for i in range(1000):
    try:
        model.mock.populate()
    except AttributeError:
        model.populate_mock(halocat)

    # **** garbage collect here ****
    gc.collect()


@aphearin (Contributor)

It would be nice to limit the proliferation of keyword arguments. Maybe the best option is to simply add a Notes section to the docstring of mock.populate notifying users that they can call gc.collect() after calling mock.populate() if they need to limit memory usage, since this particular example happens to be so effective.
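As a sketch of what such a note might look like (hypothetical wording, not the actual halotools docstring):

```python
def populate(self):
    """Repopulate the mock galaxy catalog in place.

    Notes
    -----
    Repeatedly calling this method in a loop can cause memory usage to
    grow for several hundred iterations before leveling off. To limit
    the memory footprint (e.g. for running MCMCs on a laptop), call
    ``gc.collect()`` after each call; in the case reported in issue
    #568 this kept memory flat at the cost of roughly 10% runtime
    overhead.
    """
```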

@aphearin (Contributor)

I see, yes, good point, the namespace of mock.populate() would need to be exited in order for this to make sense.

@johannesulf (Contributor, Author)

> It is not a coincidence that @johannesulf sees the equilibrium value of the sawtooth pattern oscillate at a value near the maximum RAM of his laptop.

It actually is. For different model parameters it will eventually start to write into the swap and slow down the entire computation. So it seems like in this case Python doesn't know what's best.

But given that it can be easily prevented by gc.collect(), I would close the issue and maybe add a note to mock.populate().

@aphearin (Contributor)

Ok then @johannesulf - in that case I think we agree that this is best handled with documentation rather than a new feature. Would you mind adding a few sentences to the part(s) of the code where typical users would most commonly look for information about reducing the memory footprint? If you think a dedicated doc page is warranted, I can handle it, and we can put hyperlinks to that page in a few conspicuous places. But if one or two docstring notes can do the trick, could you handle it? You can submit a PR if you're comfortable, or otherwise just email me the code modifications as you have before.
