-
Notifications
You must be signed in to change notification settings - Fork 847
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WSL filesystem keeps accumulating corrupted file and directory entries #1662
Comments
I've seen this too on build 15025, haven't noticed it before. Exactly the same symptoms. After a couple of minutes, and several attempts to delete the inaccessible file both from bash and Windows Explorer, the file disappeared of its own accord. Reverted to 15019 because of another problem (that's apparently been fixed for the next release). |
Just close bash I think it should clean these up by itself . iirc @Manouchehri had this problem and they just disappeared after closing bash and maybe rebooting. |
No, closing bash does not resolve the problem. |
Try rebooting. |
The poison directory entries do not disappear on reboot either, but after rebooting and ensuring LXSS is not running, it is generally possible to delete them from the cmd side of the house. |
Still seeing this on build 15048, it's extremely disruptive. Any update? |
For what it's worth, this has never happened to me. I've used DrvFs quite a bit on each of the builds listed above. Do y'all have a reproducer? Ideally a script or series of commands that consistently (possibly after a few retries) results in a corrupt file. |
Yes, 100% repro any time I try to delete a directory. $ mkdir ./foo Same repro if I use Happy to collect any relevant logs. |
Hm... That works fine for me:
I unfortunately don't know what logs would be relevant. Hopefully one of the WSL devs will chime in? One theory: Do you happen to have any sort of third-party antivirus or disk-indexing tool installed (or any other program that might try to read on-disk content as soon as it's written)? If so, could you try either disabling it temporarily, or telling it to ignore some directory and trying this tests again in that directory, to see if that fixes the issue? |
Ok I've narrowed down the issue actually - the file trees in which this repros is being watched by a file system watcher used by source control, it uses the inotify syscall that WSL added a couple months ago. Best guess would be the inotify implementation is hanging on to a reference to the underlying directory object after it's been deleted, causing it to hang around until LxssManager is stopped. |
Adding @JasonLinMS to see if he can help here. |
Thanks Sunil! |
Inotify on DrvFs does hang on to the watched file, which means that when the file is deleted, it will be in this "limbo state" where it still shows up in Explorer and gives an access denied when you try to open it, but the file should disappear and be cleaned up after the inotify watch on the file is stopped (and all other open handles to the file closed). Worst case you would need to close Bash for the files to be cleaned up properly. The fact that you need to shutdown LxssManager or even reboot is strange, do you have any repro instructions for easily getting some files corrupted? |
The file watcher I'm using is Watchman (https://facebook.github.io/watchman/) My repro steps would be:
This is a 100% repro for me. I think DrvFS needs to let go of its handle to the watched file if the file is deleted, this isn't the behavior you'd see on Linux. These types of fs watchers don't typically exit, so the file would never be let go of. Exiting all instances of bash usually (but not always) gets it out of this state, but that is super painful because bootstrapping this filesystem watcher takes a really long time for a large directory (like on the order of 10 minutes!) and I have to do that every time I perform an operation that deletes a directory. Occasionally when I get into this state, and exit all instances of bash, when I try to start bash again I get error 0x8000005. In those cases I have to reboot to get WSL working again. Thanks! |
I just got into this state without watchman running actually. Exiting bash and stopping LxssManager did not resolve the issue, forcing me to reboot. Are there any logs I can collect for you? |
Do you mean that this issue still happened even without you running any file watching program? |
Yes, that's correct - this time it happened without any file watching program. |
I tried to repro your issue with watchman, but couldn't unfortunately. |
Just throwing ideas out here, have you ever tried re-installing your entire Bash installation? |
Now I'm wondering if I'm falsely blaming Watchman (seemed like too much of a coincidence that it reproed for dirs under the watch root, but not elsewhere). I have not tried reinstalling Bash, might be worth a shot. |
Well, the issue you are seeing is definitely due to Bash hanging on to file handles, and inotify does open a lot of handles. If you are seeing this issue so often, it may be worth tar-ing up all your files in /home, then doing a fresh re-install of Bash. But I can't guarantee that your issue will be resolved. |
Fair enough, thanks for your help! |
No problem, hopefully it can resolve this issue. The interesting thing is that only an extremely small number of users have reported issues similar to this. |
@JasonLinMS For what it's worth, I'm hitting this as well - "this" being the issue of open file handles after DrvFs file manipulations, which lead to inaccessible and undeletable files. The "manipulation" in my case is doing a gradle build and then cleaning it. I'm on build 15048. In contrast to the inotify-related issue, WSL sees no relevant open handles (using So far, only a reboot solves the issue for me. I've never tried restarting LxxsManager before, but now that I did, it actually hangs in the "stopping" state. Rebooting in the middle of your work is pretty frustrating - anything we can try or do to help solve this? |
@lephyrus Please help answer the following: So have you been using any inotify-based file watchers when this issue repros? Also, do you have any third-party file system related drivers/filters/apps installed or any third-party anti-virus software installed? Have you reinstalled WSL recently, or have you been using the same WSL installation across many Insider builds. |
@JasonLinMS I'm afraid I can't repro the problem currently, so I can't really comment with certainty. I'm pretty sure there was no watcher running - as I said, the effects of that looked different and I could solve them "inside" WSL, not needing to reboot. I used to run Novell Filr (cloud sync), which actually caused BSODs in connection with WSL - I've reported this issue and am not running it since. I'm not aware of having installed any other possibly relevant software and am only using Windows Defender. I've used this WSL installation across 2 or 3 slow ring builds. Not being able to repro reliably, I can't really make clear-cut statements... sorry. |
Just to add more info to this - I ran into this problem with a directory in /mnt/d/ and after about 30 minutes of troubleshooting, I realized that I had it added as a project folder in Atom (running at the time). I closed Atom and it then allowed me to remove the directory. |
@joelebeau - Thank you! Removing the project folder from Atom also resolved this issue in my case. |
i am getting the same error. i file MPC.cpp i renamed to MPC_cpp for a backup (ie. not in use by any program), and now either of them not letting themselves be deleted, with file permissions showing as '-?????????' and all file properties in listing as '?' what do i do? |
Reboot. |
Is there an alternative for this by now? |
Please try this on Windows Build 1803 and reopen if you run into the same issue |
I'm on version
and this is still an issue. really crappy to have to constantly restart my computer because some program (i.e. Android Studio) wont work properly because it can't delete a directory... that has nothing in it |
@tara-raj pls help . I am working on a react native app and using android studio between Multiple times :( One or the other is bound to hang or leave some phantom deleted file UPDATE!: Running watchman in the foreground Immediately killing killing it if I android studio starts being unable to delete things seems to consistently mitigate. Still would be nice to not have to do that. |
Err, how to reopen it? Still happens on the newest fully patched builds. |
Yes, WebStorm, but the issue remains after webstorm is closed. And all bash windows are closed.
No
Recent fresh install. Once more: Sysinternals Process Explorer can not find any process that holds these files open. |
OS needs a reboot to clean corrupted files sometimes (e.g. above scenario - when link count gets zero). Detailed explanation here: https://superuser.com/a/1441014/629535 |
Hey, I'm also seeing this frequently. I'm using Github Desktop, Visual Studio and Visual Studio code. If I touch stuff in both the VSCode WSL instance and Visual Studio things tend to go wonky, even though I'm not (afaik) using any file watchers. The result for me is undeleteable folders, from either Windows or WSL. |
Still a problem for me. |
On WSL 2 and still have this issue sometimes two different PC. Files in WSL file system, accessing with editor and gitkraken over network mapped drive if that helps at all. |
anyway to fixes it in wsl1? |
I keep ending up with inaccessible files and directories, usually after trying to perform a source control operations that delete or edit a bunch of files. I'm manipulating the directories from bash, and the files are all on my windows file system under /mnt/c/Users/...
From the bash side, I see something like this:
ls -la
ls: cannot access 'contrib': No such file or directory
ls: cannot access 'hooks': No such file or directory
total 12
drwxrwxrwx 0 root root 0 Feb 3 17:48 .
drwxrwxrwx 0 root root 0 Feb 3 17:47 ..
d????????? ? ? ? ? ? contrib
drwxrwxrwx 0 root root 0 Feb 3 17:48 .git
d????????? ? ? ? ? ? hooks
On the Windows side, trying to navigate to the directory in Explorer returns access denied. Trying to view or change the permissions also results in access denied. Restarting LxssManager usually clears up the issue, but sometimes I've had to reboot. My IDE on Windows was trying to read one of these files and ends up just hanging forever. One time when this happened, I saw the CreateFile call from the IDE fail with DELETE_PENDING in SysInternals process explorer, but that error does not always occur.
The text was updated successfully, but these errors were encountered: