feat: do not send blob from server when it is loaded in client #804
Conversation
Codecov Report
@@ Coverage Diff @@
## main #804 +/- ##
==========================================
+ Coverage 81.52% 84.10% +2.57%
==========================================
Files 21 21
Lines 1440 1466 +26
==========================================
+ Hits 1174 1233 +59
+ Misses 266 233 -33
Force-pushed from 6edbfde to 4d61e28
client/clip_client/client.py (Outdated)

@@ -119,6 +121,11 @@ def encode(self, content, **kwargs):
            **self._get_post_payload(content, kwargs),
            on_done=partial(self._gather_result, results=results),
        )

        for c in content:
            if isinstance(c, Document) and c.tags.pop('__loaded_by_CAS__', False):
Suggested change:
- if isinstance(c, Document) and c.tags.pop('__loaded_by_CAS__', False):
+ if hasattr(c, 'tags') and c.tags.pop('__loaded_by_CAS__', False):
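For reference, a minimal sketch of the client-side bookkeeping this hunk implements. The helper names and the docarray `load_uri_to_blob()` call are illustrative assumptions, not the PR's actual code: when the client itself loads a blob from a uri, it marks the Document with a private tag so that, after posting, it can strip the blob again and leave the caller's input unmodified.

```python
from docarray import Document


def _mark_and_load(doc: Document) -> Document:
    # If the user supplied only a uri, load the bytes on the client side and
    # remember that *we* added the blob (hypothetical helper, for illustration).
    if doc.uri and not doc.blob:
        doc.load_uri_to_blob()                # assumed docarray helper
        doc.tags['__loaded_by_CAS__'] = True
    return doc


def _restore_inputs(content) -> None:
    # After posting, drop only the blobs the client added; pop() also removes
    # the marker, so user-set tags are left untouched.
    for c in content:
        if isinstance(c, Document) and c.tags.pop('__loaded_by_CAS__', False):
            c.blob = None
```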
This is a breaking change.
The benefits of the change above:
- reduces the memory footprint (to address OOM when processing a large number of documents);
- leaves the user's inputs unmodified.
A better long-term approach would be to support in-place updates in encode(docs).
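As a purely hypothetical illustration of that idea (not part of this PR), an in-place `encode(docs)` would copy only the embeddings from the response back onto the caller's own Documents, so neither blobs nor duplicated Documents need to travel back:

```python
from docarray import DocumentArray


def _update_in_place(original_docs: DocumentArray, response_docs: DocumentArray) -> None:
    # Copy only the (small) embedding field back; everything else stays exactly
    # as the caller created it.
    for src, dst in zip(response_docs, original_docs):
        dst.embedding = src.embedding
```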
Please add a unit test to check the server-side behavior.
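One possible shape for such a test, as a sketch only (the `flow` fixture, endpoint, and file name are assumptions, not the repo's actual test setup):

```python
from docarray import Document, DocumentArray


def test_server_does_not_return_blob(flow):  # 'flow' is an assumed pytest fixture serving the encoder
    da = DocumentArray([Document(uri='test.jpg')])
    with flow:
        results = flow.post(on='/', inputs=da)
    for d in results:
        assert d.embedding is not None  # embeddings are returned
        assert not d.blob               # blobs are not echoed back by the server
```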
LGTM
If the user encodes a document that has a uri but no blob, the current implementation loads the blob in the client and sends it to the server.
This PR saves bandwidth by not sending the blob back from the server.
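Conceptually, the server-side half could look like the sketch below, assuming a jina Executor with an illustrative class name (this is not the actual clip_server code): embeddings are computed as usual, and the large image bytes are cleared before the response is returned.

```python
from jina import Executor, requests
from docarray import DocumentArray


class CLIPEncoderSketch(Executor):
    @requests
    def encode(self, docs: DocumentArray, **kwargs):
        # ... compute embeddings into each doc.embedding ...
        for d in docs:
            d.blob = None  # strip the image bytes so they are not sent back
```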
I ran some tests to measure the benefit of this change.
Server config: default minibatch_size=32, model: ViT-L/14@336px, replicas: 3
Client config: default batch_size=8, input DocumentArray of 80 Documents, each with a URI pointing to an image of ~1.5 MB
Before: send 90 MB, recv 90 MB, takes 3-13 min (server hosted on a Beijing GPU: 3-7 min / a Berlin GPU: 6-13 min)
After: send 90 MB, recv 250 KB, takes 1-3 min
In general, the time varies because of remote network communication, but the received payload size drops substantially.