-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: add tensorrt support #688
Conversation
Codecov Report
@@ Coverage Diff @@
## main #688 +/- ##
==========================================
+ Coverage 78.15% 80.27% +2.11%
==========================================
Files 12 16 +4
Lines 943 1100 +157
==========================================
+ Hits 737 883 +146
- Misses 206 217 +11
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
efed459
to
f972419
Compare
📝 Docs are deployed on https://ft-feat-tensorrt--jina-docs.netlify.app 🎉 |
* fix: drafts tempt * feat: support tensorrt backend * fix: offline tensorrt loading * fix: remove draft Dockerfile * fix: setup * fix: s3 bucket for tensorrt * fix: polish codes * fix: int64 input tensors * fix: rebase * fix: refactor preprocess * fix: deprecate _preprocess_blob * fix: refacotr * fix: available model for tensorrt * fix: warning message * fix: imports * fix: draft tensorrt tests * fix: tensrrt output * fix: trt test * fix: tensorrt deps * fix: runtimeerrr re-initialized torch in pytest * fix: compute capability * fix: dynamic trt converting * fix: ci workflow * fix: import onnxruntime * fix: errors * fix: upload cov file * fix: upload cov file * fix: minor revision * chore: update ocs * fix: try to fix cov uploader * fix: temp diable netfily * fix: temp diable netfily * fix: revert netifly * fix: gpu test * fix: runs on * fix: upgrade cov action * fix: use cov bash uploader * fix: action checkout fetch-depth * fix: rebase conflict * fix: cov uploader * fix: setup script
TODO:
onnx
totensorrt
takes a while (10 mins per model), which is not friendly in production.onnx_simpler