
docs: clip benchmark on zeroshot classification and retrieval tasks #832

Merged
34 commits merged into main on Oct 10, 2022

Conversation

ZiniuYu (Member) commented Sep 27, 2022

No description provided.

@ZiniuYu ZiniuYu marked this pull request as draft September 27, 2022 08:00
@ZiniuYu ZiniuYu changed the title docs: CLIP benchmark on zeroshot classification and retrieval tasks docs: clip benchmark on zeroshot classification and retrieval tasks Sep 27, 2022
@github-actions github-actions bot added size/m and removed size/s labels Sep 27, 2022
codecov bot commented Sep 27, 2022

Codecov Report

Merging #832 (9839451) into main (2ba8a4f) will decrease coverage by 2.79%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main     #832      +/-   ##
==========================================
- Coverage   84.38%   81.58%   -2.80%     
==========================================
  Files          21       21              
  Lines        1575     1575              
==========================================
- Hits         1329     1285      -44     
- Misses        246      290      +44     
Flag Coverage Δ
cas 81.58% <ø> (-2.80%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
server/clip_server/model/clip_onnx.py 72.72% <ø> (-5.46%) ⬇️
server/clip_server/model/pretrained_models.py 98.41% <ø> (ø)
server/clip_server/model/model.py 69.85% <0.00%> (-9.12%) ⬇️
server/clip_server/executors/clip_onnx.py 81.94% <0.00%> (-2.78%) ⬇️


@github-actions github-actions bot added size/l and removed size/m labels Sep 30, 2022
@github-actions github-actions bot added size/m and removed size/l labels Sep 30, 2022
@ZiniuYu ZiniuYu marked this pull request as ready for review October 9, 2022 06:50
@ZiniuYu ZiniuYu requested a review from a team October 9, 2022 06:55
docs/user-guides/benchmark.rst (outdated review thread)
| ViT-g-14::laion2b_s12b_b42k | 0.696 | **0.811** | **0.851** | 0.839 | **0.682** | 0.776 | 0.943 | **0.962** | **0.603** | 0.648 | 0.718 | 0.560 | 0.580 | **0.332** | 0.175 | 0.036 | 0.031 | 0.060 | 0.115 | 0.190 | 0.138 |
+----------------------------------+------------+-----------+------------+-----------+-----------+------------+-----------+-----------+-----------+-----------+-----------+----------------+-------------+-------------+----------------+-------------------+----------------------+-------------------+---------------------+-----------+----------------+

From the table, we observe that the ViT models still outperform the RN models in most tasks, except for the Patch Camelyon dataset where ``RN50::openai`` has the best top-1 accuracy of 0.636, and the KITTI/distance dataset where ``RN50::yfcc15m`` has the best result of 0.336.
Contributor commented:
There are some cases where RN models beat ViT, such as Patch Camelyon and KITTI/distance. Is there any reason why they perform this way?

From the table, we observe that the ViT models still outperform the RN models in most tasks, except for the Patch Camelyon dataset where ``RN50::openai`` has the best top-1 accuracy of 0.636, and the KITTI/distance dataset where ``RN50::yfcc15m`` has the best result of 0.336.
Similar to retrieval results, the ``ViT-H-14::laion2b_s32b_b79k`` model and ``ViT-g-14::laion2b_s12b_b42k`` model still have the best or close to the best results on 12/21 zero-shot classification tasks.
All models tend to perform well on ImageNetV2, VOC2007, VTAB natural and VTAB specialized (except for Retinopathy) datasets, whereas they perform poorly on VTAB structured datasets.
We do not observe any significant difference between the ViT models of the same base model.
Contributor commented:

What does this mean?

ZiniuYu (Member, Author) replied:

For ViT models in retrieval tasks, results for the same base model are better when pre-trained on larger datasets (e.g., ViT-B-32::openai vs. ViT-B-32::laion400m_e31 vs. ViT-B-32::laion2b-s34b-b79k).
This is not the case in classification tasks.
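For reference, here is a minimal sketch of the kind of zero-shot comparison discussed above, using the open_clip package directly rather than the exact harness behind these benchmark docs. The class prompts and the image path are placeholders, and the pre-training tags (openai, laion400m_e31, laion2b_s34b_b79k) are the ones mentioned in the reply; running it will download the corresponding checkpoints.

```python
import torch
import open_clip
from PIL import Image

# Placeholder prompt set and image; the real benchmark uses per-dataset prompt templates.
class_prompts = ["a photo of a cat", "a photo of a dog"]
image = Image.open("example.jpg")  # hypothetical input image

# Same ViT-B-32 base model, three different pre-training datasets.
for pretrained in ("openai", "laion400m_e31", "laion2b_s34b_b79k"):
    model, _, preprocess = open_clip.create_model_and_transforms(
        "ViT-B-32", pretrained=pretrained
    )
    tokenizer = open_clip.get_tokenizer("ViT-B-32")
    model.eval()

    image_input = preprocess(image).unsqueeze(0)
    text_input = tokenizer(class_prompts)

    with torch.no_grad():
        image_features = model.encode_image(image_input)
        text_features = model.encode_text(text_input)
        # Cosine similarity between the image and each class prompt.
        image_features /= image_features.norm(dim=-1, keepdim=True)
        text_features /= text_features.norm(dim=-1, keepdim=True)
        probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

    print(pretrained, probs.squeeze().tolist())
```

The numbers in the benchmark table come from averaging this kind of prediction over whole test sets with dataset-specific prompt templates, so the sketch only illustrates how the same base model is evaluated under different pre-training tags.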

ZiniuYu and others added 3 commits October 9, 2022 22:57
* chore: update benchmark intro

* chore: minor revision

* chore: minor revision

* chore: minor revision

* chore: minor revision

* chore: minor revision

* chore: minor revision
@github-actions

📝 Docs are deployed on https://ft-clip-benchmark--jina-docs.netlify.app 🎉

@numb3r3 numb3r3 merged commit 7ee58c8 into main Oct 10, 2022
@numb3r3 numb3r3 deleted the clip-benchmark branch October 10, 2022 06:27