
Add support for listening gRPC over UNIX socket #1159

Merged

Conversation

thevilledev (Contributor)

One typical deployment model for TensorFlow Serving is to run it as a sidecar container. With this approach the model is often served over HTTP through a loopback interface. For performance reasons it would make sense to offer a way to access Serving over UNIX sockets. This would remove TCP overhead and reduce context switching.

This PR adds a new CLI flag, --grpc_socket_path. If set, Serving will listen on a UNIX domain socket at that path, which may be relative or absolute. Note that abstract UNIX sockets are not supported by gRPC; there is an open issue about this at grpc/grpc#4677.
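
For illustration, here is a minimal client-side sketch (not the PR's exact code; the socket path is hypothetical) of what reaching Serving over the new socket would look like from a gRPC C++ client:

```cpp
#include <memory>

#include "grpcpp/grpcpp.h"
#include "tensorflow_serving/apis/prediction_service.grpc.pb.h"

int main() {
  // Assumes the model server was started with a hypothetical socket path:
  //   tensorflow_model_server --grpc_socket_path=/tmp/tfserving.sock ...
  // gRPC resolves "unix:<path>" targets to AF_UNIX sockets, so this
  // connection bypasses the TCP stack entirely.
  auto channel = grpc::CreateChannel("unix:/tmp/tfserving.sock",
                                     grpc::InsecureChannelCredentials());
  auto stub = tensorflow::serving::PredictionService::NewStub(channel);

  // From here on the stub is used exactly as over TCP: build a
  // PredictRequest and call stub->Predict(&context, request, &response).
  return 0;
}
```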

netfs (Collaborator) commented Oct 29, 2018

Thanks for the change! I will review this sometime this week.

Do you have any benchmarks to showcase improvements in latency (or throughput) when going over UNIX domain sockets vs. TCP/IP for ML inference? I'd be curious to see the results.

thevilledev (Contributor, Author)

Sorry, I had too much fun with TensorFlow and it took a while to get back here. :)

Here are some benchmark results from a project that runs TensorFlow Serving on GPUs on Google Kubernetes Engine. Our client app uses Applifier/go-tensorflow to interface with TensorFlow Serving over gRPC, and we built a benchmark tool with the same library. We did two separate runs with the benchmark tool: first calling TensorFlow Serving over a UNIX domain socket, then over the default TCP socket, with a small pause between the runs.
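
The benchmark tool itself is built on the Go library mentioned above, but at the gRPC level the only variable between the two runs is the channel target. A hypothetical C++ illustration of that difference (the socket path is a placeholder; 8500 is Serving's default gRPC port):

```cpp
#include <memory>
#include <string>

#include "grpcpp/grpcpp.h"

// The two runs differ only in how the channel is created; the stubs
// and requests above the transport layer are identical.
std::shared_ptr<grpc::Channel> MakeChannel(bool use_uds) {
  const std::string target =
      use_uds ? "unix:/tmp/tfserving.sock"  // run 1: UNIX domain socket
              : "localhost:8500";           // run 2: default TCP port
  return grpc::CreateChannel(target, grpc::InsecureChannelCredentials());
}
```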

This first graph shows the average rate of successful predictions per second.

[Graph: pr1159-predictions-per-second]

The second graph shows the average endpoint latency seen by the gRPC client.

[Graph: pr1159-grpc-endpoint-latency]

There's a huge difference in latency with the default TCP socket, which is actually why our test run eventually failed to finish. Peak median latency increased from 30 ms to 135 ms, while p99 latency increased from 260 ms to 480 ms.

Unfortunately I don't have a pathological, reproducible example of this.

orktes commented Jan 20, 2019

@netfs Do you still need more info?

(Review threads on tensorflow_serving/model_servers/server.cc — all resolved.)
netfs (Collaborator) commented Jan 22, 2019

> @netfs Do you still need more info?

Nope, this looks good. Thanks for the change; started to take a look/review!

netfs (Collaborator) left a comment


Looks great. Minor nit and we should be done. :-)

Thanks for the change!

tensorflow-copybara merged commit dbab51d into tensorflow:master on Jan 28, 2019.
tensorflow-copybara pushed a commit that referenced this pull request on Jan 28, 2019.
ndeepesh

Hi @vtorhonen,
Do you have the code for the benchmark you performed? I am trying a similar thing and p99 latency is worse over the UNIX domain socket. I just wanted to understand whether you ran your benchmark with some specific configuration.
