docs: add disk usage / memory usage benchmark table #751

ZiniuYu · 2022-06-13T02:28:05Z

No description provided.

codecov · 2022-06-13T02:31:34Z

Codecov Report

Merging #751 (d2d7be2) into main (4d069a8) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##             main     #751   +/-   ##
=======================================
  Coverage   81.78%   81.78%           
=======================================
  Files          17       17           
  Lines        1208     1208           
=======================================
  Hits          988      988           
  Misses        220      220

Flag	Coverage Δ
cas	`81.78% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 96923f1...d2d7be2. Read the comment docs.

docs/user-guides/server.md

hanxiao · 2022-06-13T10:07:04Z

docs/user-guides/server.md

+| Model          | PyTorch | ONNX | TensorRT | Output Dimension | Disk Usage (MB) | Peak Memory Usage (GB) | 
+|----------------|---------|------|----------|------------------|-----------------|------------------------|
+| RN50           | ✅       | ✅    | ✅        | 1024             | 256             | 4.25                   |
+| RN101          | ✅       | ✅    | ✅        | 512              | 292             | 4.28                   |
+| RN50x4         | ✅       | ✅    | ✅        | 640              | 422             | 6.92                   |
+| RN50x16        | ✅       | ✅    | ❌        | 768              | 661             | 13.01                  |
+| RN50x64        | ✅       | ✅    | ❌        | 1024             | 1382            | 20.26                  |
+| ViT-B/32       | ✅       | ✅    | ✅        | 512              | 351             | 2.19                   |
+| ViT-B/16       | ✅       | ✅    | ✅        | 512              | 354             | 3.90                   |
+| ViT-L/14       | ✅       | ✅    | ✅        | 768              | 933             | 5.38                   |
+| ViT-L/14-336px | ✅       | ✅    | ❌        | 768              | 934             | 11.36                  |


what about VRAM? thats very important.

This is in fact the VRAM. Maybe change the title. Do we need both RAM and VRAM statistics?

hanxiao

need a column on VRAM

hanxiao · 2022-06-13T18:46:25Z

docs/user-guides/server.md

+|----------------|---------|------|----------|------------------|-----------------|---------------------|---------------------|
+| RN50           | ✅       | ✅    | ✅        | 1024             | 256             | 2.97                | 4.25                |
+| RN101          | ✅       | ✅    | ✅        | 512              | 292             | 3.05                | 4.28                |
+| RN50x4         | ✅       | ✅    | ✅        | 640              | 422             | 3.18                | 6.92                |
+| RN50x16        | ✅       | ✅    | ❌        | 768              | 661             | 3.55                | 13.01               |
+| RN50x64        | ✅       | ✅    | ❌        | 1024             | 1382            | 3.95                | 20.26               |
+| ViT-B/32       | ✅       | ✅    | ✅        | 512              | 351             | 3.18                | 2.19                |
+| ViT-B/16       | ✅       | ✅    | ✅        | 512              | 354             | 3.17                | 3.90                |
+| ViT-L/14       | ✅       | ✅    | ✅        | 768              | 933             | 3.64                | 5.38                |
+| ViT-L/14-336px | ✅       | ✅    | ❌        | 768              | 934             | 3.67                | 11.36               |


on RAM and vRAM, either the number is wrong or the column names are wrong. Please check again

I quickly reran the test and noticed that the RAM usage is under-calculated by ~20%. The table is updated

docs/user-guides/server.md

hanxiao · 2022-06-14T06:22:41Z

docs/user-guides/server.md

+| ViT-B/32       | ✅       | ✅    | ✅        | 512              | 351             | 3.66                | 2.19                 |
+| ViT-B/16       | ✅       | ✅    | ✅        | 512              | 354             | 3.58                | 3.90                 |
+| ViT-L/14       | ✅       | ✅    | ✅        | 768              | 933             | 4.11                | 5.38                 |
+| ViT-L/14-336px | ✅       | ✅    | ❌        | 768              | 934             | 5.07                | 11.36                |


This number is impossible. In general, I believe the whole column is wrong.

We run 2 ViT-L/14-336px on Berlin GPU for the last 2 months and together they take 5GB VRAM.

see also similar usage on Google Colab https://jina-ai.slack.com/archives/CTH1CMP8V/p1655126173106729

In fact, the VRAM data is not wrong. I think the reason I got a large VRAM value is that I was using a mini-batch of size 256 in clip-server. If I stick to the default value, the VRAM indeed drops to ~4GB.

I think I should clarify on that in the docs

No, you should clarify it in the benchmark docs.

You should use the default one for benchmark otherwise it is meaningless.

Imagine the reader install clip-as-service and start to use it in the first time. Your table do not explain how much VRAM they would expect when running it out-of-the-box. People read it, "oh, 11GB that much? oh it uses batch_size 256, then what is the default batch_size and what is the default cost?" see? your benchmark does not solve the first question that users ask. In fact, it raises two questions for the readers: what is the default_batch and how much does it cost by default?

Think you buy oranges in the grocery store. The price is always measured on the basic unit. One orange 1 usd, then people can derive a dozen of oranges is probably at 12$ or so. You don't often see that the shop only tells you the price of a dozen, and you have to infer it by yourself. which is basically what you are doing right now. You tell the reader the cost of batch_size 512, and user have to infer the vram cost of batch_size 8 (default, aka "the basic unit")

I see. Using a different batch size indeed introduces some confusion. Will update the table

hanxiao

please fix as my comment
please resolve conflict

numb3r3 · 2022-06-15T05:25:44Z

@ZiniuYu please fix the commit message issue.

Error: You have commit messages with errors
⧗   input: docs: RAM usage
✖   subject must not be sentence-case, start-case, pascal-case, upper-case [subject-case]
✖   found 1 problems, 0 warnings

docs/user-guides/server.md

github-actions · 2022-06-15T12:44:08Z

📝 Docs are deployed on https://ft-docs_benchmark_table--jina-docs.netlify.app 🎉

hanxiao

looks like a test failing

ZiniuYu · 2022-06-15T13:33:53Z

looks like a test failing

That's weird. Fixed by reran the failed one. Anyway, learned a lot of lessons from this task🥹

github-actions bot added size/s area/docs labels Jun 13, 2022

ZiniuYu linked an issue Jun 13, 2022 that may be closed by this pull request

complete benchmark table #742

Closed

numb3r3 requested changes Jun 13, 2022

View reviewed changes

docs/user-guides/server.md Outdated Show resolved Hide resolved

ZiniuYu marked this pull request as ready for review June 13, 2022 09:10

ZiniuYu requested a review from numb3r3 June 13, 2022 09:15

hanxiao reviewed Jun 13, 2022

View reviewed changes

hanxiao requested changes Jun 13, 2022

View reviewed changes

ZiniuYu requested a review from hanxiao June 13, 2022 11:54

hanxiao reviewed Jun 13, 2022

View reviewed changes

docs/user-guides/server.md Outdated Show resolved Hide resolved

hanxiao reviewed Jun 14, 2022

View reviewed changes

ZiniuYu requested a review from hanxiao June 14, 2022 08:33

hanxiao requested changes Jun 14, 2022

View reviewed changes

ZiniuYu force-pushed the docs_benchmark_table branch 2 times, most recently from 50d3484 to 18e8796 Compare June 15, 2022 03:18

ZiniuYu force-pushed the docs_benchmark_table branch from 18e8796 to 5a227a1 Compare June 15, 2022 06:01

hanxiao reviewed Jun 15, 2022

View reviewed changes

docs/user-guides/server.md Outdated Show resolved Hide resolved

hanxiao reviewed Jun 15, 2022

View reviewed changes

docs/user-guides/server.md Outdated Show resolved Hide resolved

ZiniuYu added 9 commits June 15, 2022 20:41

docs: add disk usage and peak memory usage to benchmark table

12cc4c7

docs: add disk usage

01f1445

docs: benchmark_table

789fa73

docs: benchmark_table

7747a23

docs: disk usage

f30ce8e

docs: peak memory usage

418111d

docs: peak memory usage

1e66ad2

docs: peak memory usage

fe9ff41

docs: peak memory usage

b03f47b

ZiniuYu added 8 commits June 15, 2022 20:41

docs: benchmark table

c79ac3b

docs: add RAM usage

b77fae2

docs: add RAM usage

7789ae3

docs: update RAM usage

61db23c

docs: update RAM usage

c687fe3

docs: update narrative

882cad0

docs: use default config in benchmark

5946c75

docs: correct link

d2d7be2

ZiniuYu force-pushed the docs_benchmark_table branch from dc07e40 to d2d7be2 Compare June 15, 2022 12:42

hanxiao approved these changes Jun 15, 2022

View reviewed changes

numb3r3 merged commit 9d872f2 into main Jun 15, 2022

numb3r3 deleted the docs_benchmark_table branch June 15, 2022 15:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add disk usage / memory usage benchmark table #751

docs: add disk usage / memory usage benchmark table #751

ZiniuYu commented Jun 13, 2022

codecov bot commented Jun 13, 2022 •

edited

Loading

hanxiao Jun 13, 2022

ZiniuYu Jun 13, 2022

hanxiao left a comment

hanxiao Jun 13, 2022

ZiniuYu Jun 14, 2022

hanxiao Jun 14, 2022

ZiniuYu Jun 14, 2022

ZiniuYu Jun 14, 2022

hanxiao Jun 14, 2022 •

edited

Loading

ZiniuYu Jun 15, 2022

hanxiao left a comment

numb3r3 commented Jun 15, 2022

github-actions bot commented Jun 15, 2022

hanxiao left a comment

ZiniuYu commented Jun 15, 2022

docs: add disk usage / memory usage benchmark table #751

docs: add disk usage / memory usage benchmark table #751

Conversation

ZiniuYu commented Jun 13, 2022

codecov bot commented Jun 13, 2022 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hanxiao left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hanxiao Jun 14, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hanxiao left a comment

Choose a reason for hiding this comment

numb3r3 commented Jun 15, 2022

github-actions bot commented Jun 15, 2022

hanxiao left a comment

Choose a reason for hiding this comment

ZiniuYu commented Jun 15, 2022

codecov bot commented Jun 13, 2022 •

edited

Loading

hanxiao Jun 14, 2022 •

edited

Loading