
Outline memory required for tedana to run #267

Closed
jbteves opened this issue Apr 22, 2019 · 8 comments
Labels: documentation (issues related to improving documentation for the project), hackathon (issues to tackle in the NIH hackathon), testing (issues related to improving testing in the project)


jbteves commented Apr 22, 2019

Summary

In several issues and in informal conversations, users have reported large RAM usage. With no formal guidelines, it is difficult for a user to know what to expect for their data. We should outline memory requirements for various datasets, most notably high-resolution data.

Additional Detail

In issues #254 and #144, as well as in informal conversation, users have noted large RAM usage. While @tsalo has done refactoring that should ameliorate this problem, and we should have a release that reduces usage, it would be good to have guidelines for how RAM usage scales with a dataset's:

  1. Spatial Resolution
  2. Temporal Resolution
  3. Echo Number

We should bear in mind that peak usage can be especially problematic: once tedana consumes all available RAM, the system starts thrashing, as described in #254 (the operating system spends all of its time swapping memory to and from disk because RAM is exhausted, and the problem cascades to every running program as the system struggles with the I/O load).

Next Steps

  • Run datasets of varying resolution and length and record their memory usage at different steps
  • Create plots that demonstrate the memory usage over time, dependent on the parameters (like this); one way to collect and plot those numbers is sketched below
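
A minimal sketch of the recording step: sample RAM while a workflow runs and plot the samples. memory-profiler, the file paths, and the echo times below are assumptions for illustration, not tedana's own tooling.

```python
# Sketch: record tedana's RAM usage over time and plot it.
# Assumes `pip install memory-profiler matplotlib`; paths and echo times are placeholders.
import matplotlib.pyplot as plt
from memory_profiler import memory_usage
from tedana.workflows import tedana_workflow

def run():
    tedana_workflow(
        data=["echo1.nii.gz", "echo2.nii.gz", "echo3.nii.gz"],
        tes=[14.5, 38.5, 62.5],
    )

interval = 0.5  # seconds between samples
samples = memory_usage((run, (), {}), interval=interval)  # list of MiB values
plt.plot([i * interval for i in range(len(samples))], samples)
plt.xlabel("time (s)")
plt.ylabel("RAM (MiB)")
plt.savefig("tedana_memuse.png")
```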
@jbteves added the documentation label on Apr 22, 2019

jbteves commented Apr 22, 2019

@dowdlelt ran a quick test and indicated that memory usage matches expectations, with increased voxel count causing rapid memory consumption and scaling with echo number that appears faster than linear (see the figure below).
[Image: faster_than_linear_with_echo_memuse — memory usage vs. echo count]


jbteves commented Apr 22, 2019

According to @dowdlelt, a user-supplied mask can drastically reduce the number of voxels processed and thus drastically reduce memory requirements; he notes that an AFNI EPI mask reduced a 3 mm (resampled to 2.5 mm) isotropic dataset from 540k voxels to 120k voxels, an ~80% reduction in required memory. This should be ameliorated in the next release thanks to @tsalo's contributions in #226, especially given that a comparison between the nilearn mask and the AFNI mask yielded only a ~5k voxel difference.
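
For illustration, a minimal sketch of the masking step using nilearn; the file path is a placeholder, and compute_epi_mask stands in for whichever mask a user supplies:

```python
# Sketch: a restrictive EPI mask shrinks the working array from the full
# grid to only in-mask voxels, cutting memory roughly proportionally.
from nilearn.masking import compute_epi_mask, apply_mask

func_img = "echo1.nii.gz"  # placeholder path
mask_img = compute_epi_mask(func_img)  # or load an AFNI-generated mask instead

# apply_mask returns an array of shape (n_timepoints, n_voxels_in_mask)
masked = apply_mask(func_img, mask_img)
print(masked.shape, f"~{masked.nbytes / 1e9:.2f} GB for this copy")
```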

@jbteves added this to the documentation milestone on May 24, 2019
@jbteves added the hackathon and testing labels on Oct 29, 2019

jbteves commented Oct 29, 2019

With more datasets coming in, this should be more easily testable.

dowdlelt commented

As it seems very deterministic (some function of n_voxels x n_echoes), it may be possible to add a runtime check comparing estimated RAM usage to available RAM and to warn the user. Or even just print the estimated RAM usage, so that if things don't work, they can scroll through the output and see it as a potential problem, you know, for user friendliness.


jbteves commented Oct 31, 2019

That's a great idea! It's probably not too hard to estimate the required memory and add 10% as a buffer.
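
A minimal sketch of such a check, assuming psutil is available (it is not a tedana dependency) and that the byte estimate is supplied by the caller; the helper name and the 10% buffer are illustrative:

```python
# Sketch: warn at startup if the estimated RAM need exceeds available memory.
import warnings
import psutil  # assumption: psutil is installed

def check_memory(estimated_bytes: int, buffer: float = 0.10) -> None:
    """Compare an estimate (plus a 10% buffer) against available RAM."""
    needed = estimated_bytes * (1 + buffer)
    available = psutil.virtual_memory().available
    if needed > available:
        warnings.warn(
            f"Estimated memory use ({needed / 1e9:.1f} GB) exceeds available "
            f"RAM ({available / 1e9:.1f} GB); tedana may slow to a crawl."
        )
```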

rmarkello commented

I believe the data are being loaded into memory as float32 (assuming NIfTI-1), which means that the number of bytes used will be:

nbytes = 4 * x * y * z * t * e

where x, y, and z are the spatial matrix dimensions, t is the number of timepoints, and e is the number of echoes.

If the data are being loaded as float64 then substitute 8 for the 4 in the equation.
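
For example, that lower bound could be read straight from a NIfTI header; a sketch, assuming nibabel and placeholder file names:

```python
# Sketch: estimate bytes for one float32 copy of the multi-echo data.
import numpy as np
import nibabel as nib

echo_files = ["echo1.nii.gz", "echo2.nii.gz", "echo3.nii.gz"]  # placeholders
x, y, z, t = nib.load(echo_files[0]).shape  # assumes a 4D image per echo
e = len(echo_files)

nbytes = np.dtype(np.float32).itemsize * x * y * z * t * e  # 4 * x*y*z*t*e
print(f"one copy of the data: {nbytes / 1e9:.2f} GB")
```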

The trickier part is determining how many copies of the data are being made in memory during computations... You could use memory-profiler to try and make some estimates, but as a lower bound that's a good start.
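
memory-profiler's line-by-line mode can help count those copies; a sketch with made-up dimensions:

```python
# Sketch: line-by-line memory report showing where array copies appear.
# The @profile decorator prints per-line memory increments when demo() runs.
import numpy as np
from memory_profiler import profile

@profile
def demo(x=64, y=64, z=32, t=200, e=3):
    data = np.zeros((x * y * z, t, e), dtype=np.float32)  # first copy
    centered = data - data.mean(axis=1, keepdims=True)    # second copy
    return centered

if __name__ == "__main__":
    demo()
```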


jbteves commented Oct 31, 2019

Yeah, it's also hard to measure memory usage instantaneously. We'll add this to the things to look out for under Testing & Validation. I'll try to familiarize myself with this tool; thanks @rmarkello.

@tsalo changed the title from "Outline Memory Required for Tedana to run" to "Outline memory required for tedana to run" on Nov 18, 2019

stale bot commented Feb 16, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions to tedana! :tada:

stale bot added the stale label on Feb 16, 2020
stale bot closed this as completed on Feb 23, 2020