-
Notifications
You must be signed in to change notification settings - Fork 116
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
stargz-snapshotter uses up all available disk space #1349
Comments
@bodgit Thanks for reporting this. Snapshots are automatically cleaned up when the image is removed. You can also manually remove images (using What contents does consume large space under |
Hi @ktock Here's the output from
The nodes have a 50 GB disk, 12 GB of that is consumed by I tried running |
@bodgit Thanks for the info.
What does consume the large space under this directory? Are there many snapshot dirs or is there a large snapshot dir (or a file)?
You need
Are there active snapshot mounts ( |
Lots of snapshot directories. All of them are under 1 GB but there are about 6-700 of them.
Ah, that worked. Running On this particular host, there 612 snapshots. 117 of them are "Active", 495 of them are "Committed". Some of the committed snapshots don't have a parent SHA256.
That's picking up any mount that has
They're all "overlay" mounts and they seem to vary by the number of Is it a case of cleaning up the committed snapshots and keeping the active ones? Assuming the number of active mounts seems roughly the same as the number of overlay mounts? To be clear, we're not (yet) trying to use any stargz images, I just installed the snapshotter on the EKS nodes to make sure everything still worked as before with our existing workloads. Everything seems to be working fine, but it's now using more disk space and it seems relative to how long the node has been running. So eventually, the node runs out of disk space and needs to be recycled, which isn't ideal. |
I think I've found the problem. I noticed we were getting this message logged often:
Every five minutes I was also seeing this:
On a hunch I manually created the
The disk usage had gone from 94% down to 40% I've gone through the install documentation and I can't see any mention of having to create this missing directory, but it seems critical that it exists otherwise image garbage collection stops working. Is it just a case of manually creating it or should it be being created automatically? Here's the contents of
The other |
Thanks for finding the root cause and the workaround. That directory should be handled by containerd (or cri plugin) so I think we need to fix containerd for completely fixing this issue. |
The same problem. |
Is there any other issues where this problem is being tracked? I'm seeing the same problem. |
I have version 0.14.3 of the snapshotter installed on some EKS nodes, some of which have been running for around 16 days. They have started to run out of disk space and it seems the majority of this is consumed by
/var/lib/containerd-stargz-grpc/snapshotter/snapshots
.Is there a way to prune/clean this up automatically?
The text was updated successfully, but these errors were encountered: