AA V4 I2 Speed Up Simulation With GPU PDF
Speed Up Simulations
with a GPU
A new feature in ANSYS Mechanical leverages graphics
processing units to significantly lower solution times for
large analysis problem sizes.
By Jeff Beisheim, Senior Software Developer, ANSYS, Inc.
• Only NVIDIA® Tesla™ GPU is currently supported for use when accelerating ANSYS structural mechanics simulations. Only the more powerful 20-series (Fermi) GPUs are recommended, as these are the most computationally powerful and, therefore, the most likely to produce faster solution times.
• The GPU accelerator capability is not currently supported when using Distributed ANSYS.

Activating the New Feature
For commercial license users, the GPU accelerator capability is enabled using the ANSYS HPC Pack licensing model. For academic license users, the GPU capability is included within the base ANSYS Academic product (that provides access to ANSYS Mechanical or higher capability) and no add-on Academic HPC licenses are required. Engineers can use a GPU to accelerate computations on conventional multicore processors without any additional GPU-specific licensing required. During structural mechanics simulations, ANSYS Mechanical APDL software makes use of only a single GPU per simulation.

ANSYS Mechanical APDL users can activate the accelerator capability simply by selecting the High-Performance Setup tab in the launcher and then checking the GPU Accelerator Capability box. Alternatively, -acc nvidia can be added to the list of arguments supplied on the ANSYS Mechanical APDL command line. ANSYS Workbench users can choose to activate the GPU accelerator capability during solution by modifying the GPU acceleration option on the Advanced Properties page of the Solve Process Settings.

Once the GPU accelerator capability is activated, when ANSYS Mechanical APDL is launched it should accelerate the solution, when possible, without requiring input from the user. For cases in which it does not apply, this new feature will simply have no effect on the program behavior.

[Image caption: NVIDIA Tesla 20-series GPUs such as the C2050 and C2070 are the most computationally powerful and, therefore, most likely to produce faster solution times in ANSYS simulations.]

Optional Control Settings
A new ACCOPTION command is available for users who want additional control over various settings related to the GPU accelerator capability:
• Activate to control which analysis will use/not use the GPU accelerator capability
• MinSzThresh, a threshold parameter to determine when the sparse direct solver data size is large enough to justify using the GPU
• SPkey to control the use of single- or double-precision math operations when running the sparse direct solver on the GPU

In addition, some hardware settings for NVIDIA GPU cards can be useful under certain scenarios:
• Environment variables are available in ANSYS Mechanical APDL to help avoid over-subscribing the GPU hardware for users with multiple GPU cards or users who run in a multi-user environment, such as a server.
• NVIDIA GPU users can consider switching their hardware to exclusive mode, which allows only one process (for example, one ANSYS Mechanical simulation) to be run at a time on the GPU.
• Another hardware setting for NVIDIA Tesla 20-series GPUs disables error correcting (ECC) memory to make use of all the memory on the GPU card as well as to increase overall memory bandwidth and GPU performance. To ensure FEA result accuracy, however, it is recommended that users keep the default setting of ECC memory enabled.

When to Use a GPU
The amount of acceleration achievable when using the GPU accelerator capability will vary greatly depending on the hardware being used and the model being simulated. The following guidelines can help determine whether use of the GPU accelerator capability will provide a performance boost. In general, the capability provides the greatest reductions in overall simulation time when the following conditions are met:
• The simulation spends most of its time on the numerical analysis solution rather than other tasks, such as pre- and post-processing. Only the operation of the solver is accelerated with a GPU, including analyses that use the sparse direct or PCG/JCG iterative solvers (including block Lanczos and PCG Lanczos eigensolvers).
• The problem size is in the following ranges:
– 500K to 5,000K DOFs for the sparse direct solver
– 500K to 3,000K DOFs for PCG/JCG iterative solvers

Size guidelines listed above represent the general range many users now routinely work within and are based on the NVIDIA Tesla C2050,