Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
PawelGorny authored Mar 31, 2022
1 parent 61d0699 commit 7677fa2
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -89,6 +89,7 @@ Test card: RTX3060 (eGPU!) with 224 BLOCKS & 512 BLOCK_THREADS (program default
| RTX 3060 eGPU | 10000 | 1520 (224/512/20000)|
| RTX 3090 | 29500 | 3950 (656/640/5000) |
| RTX 3080TI | | 4090 (640/640/5000) |
| RTX A6000 | | 4070 (588/640/5000) |
| GTX 1080TI | 6000 | 750 |

Please consult official Nvidia Occupancy Calculator (https://docs.nvidia.com/cuda/cuda-occupancy-calculator/index.html) to see how to select desired amount of threads/block (shared memory=0, registers per thread = 48). Adjust number of steps per thread to obtain the optimal performance.
Expand Down

0 comments on commit 7677fa2

Please sign in to comment.