Prune CUDA static numerical libraries for specific compute capability #3234

umar456 · 2022-03-22T21:33:37Z

Prune CUDA static libraries so that the binary size of the final executable
is smaller.

Description

Prune CUDA static libraries so that the binary size of the final executable
is smaller. This commit runs the nvprune utility on the cublasLt, cublas,
cusolver, and cusparse static libraries to remove device code for
architectures that are not targeted. The resulting binary is significantly
smaller when targeting a single compute capability.

For example, when targeting only compute capability 7.5, the resulting binary
is 545 MB, as opposed to more than 1100 MB without pruning.
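
The pruning step described above can be sketched roughly as follows. This is a minimal illustration, not the CMake logic added by this PR: it only builds the `nvprune` command lines for the four libraries named in the description. The helper names (`sm_arch`, `nvprune_command`), the `lib<name>_static.a` file names, and the directories are assumptions for illustration; the `-arch` and `-o` options are the ones documented for `nvprune`.

```python
from pathlib import PurePosixPath

# Static libraries named in the PR description. The "lib<name>_static.a"
# file names are an assumption based on the usual Linux toolkit layout.
STATIC_LIBS = ["cublasLt", "cublas", "cusolver", "cusparse"]


def sm_arch(compute_capability):
    """Convert a compute capability such as "7.5" to an arch name such as "sm_75"."""
    major, minor = compute_capability.split(".")
    return f"sm_{major}{minor}"


def nvprune_command(lib_dir, out_dir, name, compute_capability):
    """Build (but do not run) an nvprune command for one static library.

    Hypothetical helper: keeps only device code for the given compute
    capability and writes the pruned archive to out_dir.
    """
    src = PurePosixPath(lib_dir) / f"lib{name}_static.a"
    dst = PurePosixPath(out_dir) / f"lib{name}_static.a"
    return ["nvprune", "-arch", sm_arch(compute_capability), str(src), "-o", str(dst)]


if __name__ == "__main__":
    # Assumed paths; adjust to the local CUDA installation.
    for name in STATIC_LIBS:
        print(" ".join(nvprune_command("/usr/local/cuda/lib64", "pruned", name, "7.5")))
```

Each printed command could then be executed (for example with `subprocess.run`) before linking against the pruned archives instead of the originals.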

Changes to Users

N/A

Checklist

Rebased on latest master
Code compiles
Tests pass
~~[ ] Functions added to unified API~~
~~[ ] Functions documented~~

syurkevi approved these changes Mar 22, 2022

umar456 merged commit 453cdc3 into arrayfire:master Mar 23, 2022
