Plugins should be killed on exit #234

Mygod · 2020-05-04T17:49:49Z

https://stackoverflow.com/a/30540177/2245107

zonyitoo · 2020-05-05T01:46:16Z

Yes, they are killed on exit. Plugins are held in a Plugins structure, so when it is dropped, all child processes are killed.

https://github.com/shadowsocks/shadowsocks-rust/blob/master/src/plugin/mod.rs#L49
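
A minimal sketch of the kill-on-drop idea described above, assuming plain `std::process::Child` handles. The `Plugins` struct, its `children` field, and the `spawn` helper are hypothetical stand-ins for illustration, not the shadowsocks-rust code. `std::process::Child` does not kill the process when it is dropped, so the sketch does it explicitly in a `Drop` impl:

```rust
use std::io;
use std::process::{Child, Command};

/// Hypothetical container that owns every spawned plugin process.
pub struct Plugins {
    children: Vec<Child>,
}

impl Plugins {
    pub fn new() -> Self {
        Plugins { children: Vec::new() }
    }

    /// Spawns a plugin process, keeps its handle, and returns its pid.
    pub fn spawn(&mut self, program: &str, args: &[&str]) -> io::Result<u32> {
        let child = Command::new(program).args(args).spawn()?;
        let pid = child.id();
        self.children.push(child);
        Ok(pid)
    }
}

impl Drop for Plugins {
    fn drop(&mut self) {
        for child in &mut self.children {
            // `Child::kill` forcibly stops the process (SIGKILL on Unix).
            // Errors are ignored because the child may already have exited.
            let _ = child.kill();
            // Reap the child so it does not linger as a zombie.
            let _ = child.wait();
        }
    }
}
```

With this shape, letting a `Plugins` value go out of scope (or calling `drop` on it) stops every process it spawned.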

Mygod · 2020-05-05T02:18:43Z

Hmm, on Android they're not being terminated correctly. Maybe we need some other mechanism, like signaling? @madeye

madeye · 2020-05-05T02:28:46Z

Yeah, we send SIGTERM to shadowsocks-rust, which should be handled correctly.

Let me double check locally.

zonyitoo · 2020-05-05T02:29:51Z

kill() is called when Child is dropped: https://github.com/tokio-rs/tokio/blob/master/tokio/src/process/mod.rs#L691

madeye · 2020-05-05T02:53:55Z

When I send SIGTERM to sslocal on Linux, the subprocess is not killed.

It looks like we need to handle SIGTERM in shadowsocks-rust properly.

madeye · 2020-05-05T02:55:22Z

Steps to reproduce:

sslocal --server-addr 127.0.0.1:6001 --local-addr 127.0.0.1:1080 -k password -m rc4-md5 --plugin v2ray-plugin --plugin-opts "host=example.com"
killall sslocal
ps aux | grep v2ray

zonyitoo · 2020-05-05T02:55:34Z

It works correctly in macOS (launchd) and Debian 9 (with systemd).

madeye · 2020-05-05T02:57:26Z

I tested on Ubuntu 18.04... It's interesting that we see different behavior.

madeye · 2020-05-05T02:58:21Z

And here is the output of shadowsocks-rust:

2020-05-05T10:52:11.522+08:00 INFO  shadowsocks 1.8.11
2020-05-05T10:52:11.549+08:00 INFO  started plugin "v2ray-plugin" on 127.0.0.1:43065 <-> shcompute-pva3.nvidia.com:6001
2020/05/05 10:52:11 V2Ray 4.22.1 (V2Fly, a community-driven edition of V2Ray.) Custom (go1.13.4 linux/amd64)
2020/05/05 10:52:11 A unified platform for anti-censorship.
2020/05/05 10:52:11 [Warning] v2ray.com/core: V2Ray 4.22.1 started
2020-05-05T10:52:11.751+08:00 INFO  shadowsocks TCP listening on 127.0.0.1:1081
2020-05-05T10:52:29.802+08:00 INFO  received SIGTERM, exiting

zonyitoo · 2020-05-05T03:01:45Z

Ah, unfortunately, yes. v2ray is not killed! I can reproduce that.

zonyitoo · 2020-05-05T03:21:17Z

It should be fixed by this commit.

zonyitoo · 2020-05-05T04:19:44Z

Yeah, I can also reproduce it in Ubuntu 18.04. I should release a new version to fix this bug.

Mygod · 2020-05-05T04:53:05Z

According to the docs, kill sends SIGKILL, but ideally we want SIGTERM to allow cleanup.

https://stackoverflow.com/a/58156963/2245107

zonyitoo · 2020-05-05T04:57:59Z

That's because std only exposes kill(): https://doc.rust-lang.org/std/process/struct.Child.html#method.kill
Is there an equivalent of SIGTERM on Windows?

Mygod · 2020-05-05T04:59:35Z

Yeah, that does not seem to be cross-platform... For now, we can just use a Unix-only block that asks the process to terminate first and then kills it after a timeout, maybe. Something like this: https://github.com/shadowsocks/shadowsocks-android/blob/7fdcee61216ff35427bf0719d3c542b557ea1f79/core/src/main/java/com/github/shadowsocks/bg/GuardedProcessPool.kt#L93-L108

if unix {
    child.terminate()
    waitForExitOrTimeout(1000)
}
child.kill()

Related: rust-lang/rust#41822
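
The pseudocode above could be sketched in Rust roughly as follows. This is an illustration under stated assumptions, not the project's implementation: the names `send_sigterm` and `terminate_then_kill` are hypothetical, and SIGTERM is sent through the shell's builtin `kill` only to keep the sketch free of external crates (real code would more likely call `libc::kill`). The wait is a simple poll on `Child::try_wait`:

```rust
use std::io;
use std::process::{Child, Command, ExitStatus};
use std::thread;
use std::time::{Duration, Instant};

/// Sends SIGTERM to `pid` via `sh -c "kill -TERM <pid>"`.
/// Assumption: a POSIX `sh` is on PATH.
fn send_sigterm(pid: u32) -> io::Result<()> {
    let status = Command::new("sh")
        .arg("-c")
        .arg(format!("kill -TERM {}", pid))
        .status()?;
    if status.success() {
        Ok(())
    } else {
        Err(io::Error::new(io::ErrorKind::Other, "kill -TERM failed"))
    }
}

/// Asks the child to exit, waits up to `timeout`, then kills it.
/// The returned bool is true when the child exited before the deadline.
pub fn terminate_then_kill(
    child: &mut Child,
    timeout: Duration,
) -> io::Result<(ExitStatus, bool)> {
    if cfg!(unix) {
        // Best effort: if the signal cannot be sent we fall through to kill().
        let _ = send_sigterm(child.id());
        let deadline = Instant::now() + timeout;
        while Instant::now() < deadline {
            // try_wait() returns Ok(None) while the child is still running.
            if let Some(status) = child.try_wait()? {
                return Ok((status, true));
            }
            thread::sleep(Duration::from_millis(10));
        }
    }
    // Timed out (or not on Unix): fall back to a forced kill and reap.
    child.kill()?;
    let status = child.wait()?;
    Ok((status, false))
}
```

Because it only blocks the calling thread, a function like this does not need an async runtime, at the cost of stalling the caller for up to `timeout`.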

zonyitoo · 2020-05-05T05:17:43Z

Implementing "kill and wait then timeout" is not an easy task, because we are running this in a drop() function, which has no Runtime available to drive timers and polling.

zonyitoo · 2020-05-05T05:25:40Z

There is a simple solution, pseudocode:

for plugin in &mut self.plugins {
    plugin.terminate();
}
// Blocks process for 500ms
sleep(500ms);
// Kills all of them
for plugin in &mut self.plugins {
    plugin.kill();
}
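
A runnable approximation of this pseudocode, assuming `std::process::Child` handles. `GracefulPlugins` and `request_exit` are hypothetical names, and the shell's builtin `kill` stands in for a direct `libc::kill` call so the sketch needs no external crates:

```rust
use std::process::{Child, Command};
use std::thread;
use std::time::Duration;

/// Hypothetical owner of the plugin processes.
pub struct GracefulPlugins {
    children: Vec<Child>,
}

impl GracefulPlugins {
    pub fn new(children: Vec<Child>) -> Self {
        GracefulPlugins { children }
    }
}

/// Best-effort SIGTERM via `sh -c "kill -TERM <pid>"` (assumes a POSIX `sh`).
fn request_exit(pid: u32) {
    let _ = Command::new("sh")
        .arg("-c")
        .arg(format!("kill -TERM {}", pid))
        .status();
}

impl Drop for GracefulPlugins {
    fn drop(&mut self) {
        // First ask every plugin to exit gracefully.
        for child in &self.children {
            request_exit(child.id());
        }
        // Block the current thread for a fixed grace period.
        thread::sleep(Duration::from_millis(500));
        // Then force-kill and reap whatever is left.
        for child in &mut self.children {
            let _ = child.kill();
            let _ = child.wait();
        }
    }
}
```

The grace period is paid in full on every drop, even when all plugins exit within a few milliseconds.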

Mygod · 2020-05-05T05:30:22Z

I guess we only terminate plugins on exit/being terminated so blocking the entire process/thread is probably acceptable.

Mygod · 2020-05-05T05:51:06Z

Hmm 67de124 waits 500ms unconditionally but clean up might be much faster... Maybe consider some sort of waitpid? https://stackoverflow.com/a/20173592/2245107 (I guess this is kind of complicated and really sets up another "Runtime")

madeye · 2020-05-05T06:51:40Z

Verified locally on Android. The latest fix works well.

zonyitoo · 2020-05-05T13:15:14Z

It is unwise to do so much work in drop(). Huh, is there another way...?

Mygod · 2020-05-05T16:55:13Z

How about Runtime::block_on or something similar? https://docs.rs/tokio/0.2.20/tokio/runtime/struct.Runtime.html#method.block_on

zonyitoo · 2020-05-06T00:51:09Z

Nope. Runtime is also dropping.

Mygod · 2020-05-06T00:56:52Z

Runtime should be dropped after everything else right...? We can also release stuff in Runtime's drop handler I guess?

zonyitoo · 2020-05-06T06:09:01Z

Nope. I tried. Spawning into a panicking Runtime will cause another panic.

Mygod · 2020-05-06T08:04:03Z

Hmm if this is really that difficult we can try to clean up the child processes from JVM. A 500ms delay seems undesirable to me.

madeye · 2020-05-06T08:25:20Z

I think we'd better handle this in shadowsocks-rust, as it's not a problem only on Android.

Given we already use unsafe code to send SIGTERM through libc::kill, it may be acceptable to use libc::waitpid as well.

zonyitoo · 2020-05-06T09:12:12Z

Yes, this problem should be handled in shadowsocks-rust.

It seems that waitpid is the simplest option.

for plugin in &self.plugins {
    let mut status: libc::c_int = 0;
    // Blocks until this plugin process has exited.
    unsafe {
        libc::waitpid(plugin.id() as libc::pid_t, &mut status, 0);
    }
}

Another option is to use sigtimedwait to wait for SIGCHLD. But we have lots of subprocesses; is it guaranteed that the master process receives every SIGCHLD even when it is not waiting in sigtimedwait?

A very tricky solution is to use alarm() to trigger an EINTR. Hmm..

zonyitoo · 2020-05-06T14:42:02Z

How about this.

Mygod · 2020-05-06T18:14:35Z

I see the current solution is a busy spin -- sigtimedwait would probably be better but no hurries. 👍

zonyitoo · 2020-05-07T01:37:00Z

sigtimedwait cannot specify pid. That's why I didn't use it.

Mygod · 2020-05-07T01:58:53Z

Well, the code I quoted simply spawns (or actually reuses) another thread that runs waitpid, and waits with a timeout for it to finish.
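
A rough Rust sketch of that helper-thread idea, offered as an illustration rather than a translation of the quoted Kotlin code. The names `signal` and `stop_with_waiter_thread` are hypothetical, `Child::wait` plays the role of waitpid, and signals are sent through the shell's builtin `kill` so the sketch needs no external crates:

```rust
use std::io;
use std::process::{Child, Command, ExitStatus};
use std::sync::mpsc;
use std::thread;
use std::time::Duration;

/// Best-effort `kill -<name> <pid>` through the shell (assumes a POSIX `sh`).
fn signal(pid: u32, name: &str) {
    let _ = Command::new("sh")
        .arg("-c")
        .arg(format!("kill -{} {}", name, pid))
        .status();
}

/// Sends SIGTERM, waits for the child on a helper thread for up to
/// `timeout`, and falls back to SIGKILL if it has not exited by then.
pub fn stop_with_waiter_thread(mut child: Child, timeout: Duration) -> io::Result<ExitStatus> {
    let pid = child.id();
    let (tx, rx) = mpsc::channel();
    // The helper thread owns the child and blocks in wait() until it exits.
    thread::spawn(move || {
        let _ = tx.send(child.wait());
    });
    signal(pid, "TERM");
    match rx.recv_timeout(timeout) {
        // The child exited (or wait() failed) within the timeout.
        Ok(result) => result,
        Err(_) => {
            // Still running: force-kill by pid, then collect the status
            // from the helper thread. There is a small window in which the
            // child could be reaped just before this signal is sent.
            signal(pid, "KILL");
            rx.recv()
                .map_err(|_| io::Error::new(io::ErrorKind::Other, "waiter thread vanished"))?
        }
    }
}
```

The blocking wait returns as soon as the child exits, so there is no polling interval, but each call costs one extra thread.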

zonyitoo · 2020-05-07T05:23:00Z

That is probably slower. I tested on my laptop, and the v2ray-plugin processes exited in about 2ms. Busy waiting is not a bad idea.

Mygod · 2020-05-07T05:57:11Z

Yeah I guess I will take this fix for now. Thanks for bearing with me! 🤣

Mygod · 2020-05-08T23:10:16Z

Maybe the way to do it is instead to catch SIGINT/SIGTERM signal, perform clean ups when Runtime is still functioning, and then shutdown?

zonyitoo · 2020-05-09T11:42:37Z

If any panic! occurs, the Plugins instance will be destructed.

Mygod · 2020-05-09T17:37:59Z

Well we can always use another macro for panic but you are right...

zonyitoo · 2020-07-05T12:24:16Z

Some related discussions about Asynchronous Destruction.

Mygod · 2020-07-05T16:52:07Z

I think scoped concurrency might help, as that way we can destruct objects while blocking the runtime, before the Runtime destructs itself.

zonyitoo added a commit that referenced this issue May 5, 2020

[#234] Ensure plugin subprocesses are killed when server is exited

40c188b

zonyitoo added a commit that referenced this issue May 5, 2020

[#234] Send SIGTERM for plugins to exit gracefully

67de124

zonyitoo added a commit that referenced this issue May 6, 2020

[#234] Actively wait for plugin subprocesses to exit gracefully

8a28a1f

zonyitoo added a commit that referenced this issue May 6, 2020

[#234] Actively wait for plugin subprocesses to exit gracefully

5e18394

zonyitoo added a commit that referenced this issue May 6, 2020

[#234] Actively wait for plugin subprocesses to exit gracefully

91ccfb1

zonyitoo added a commit that referenced this issue May 6, 2020

[#234] Actively wait for plugin subprocesses to exit gracefully

11f543b

zonyitoo added a commit that referenced this issue May 6, 2020

[#234] Actively wait for plugin subprocesses to exit gracefully

b6c3fb0

Mygod closed this as completed May 7, 2020

Mygod mentioned this issue May 7, 2020

Gracefully clean up Child on drop instead of SIGKILL? tokio-rs/tokio#2504

Closed

wyzdot mentioned this issue May 14, 2020

How to get the latest build from CircleCI? #251

Closed
