Tags
gpgpu
gpu
cuda
university
lecture
programming
numerical simulation
finite difference method
shared memory
parallel processing
fdm
education
vector addition
cpu
architecture
cuda fortran
gnuplot
gpu accelerated library
python
constant memory
euler method
cusparse
cublas
cfd
time integration
linear simultaneous equations
optimization
double buffering
cavity flow
computational fluid dynamics
global memory
memory hierarchy
box filter
image processing
naive implementation
parallel reduction
matrix-matrix multiplication
cluster
fortran
diffusion equation
modern fortran
openmp
fortran 2003
bank conflict
convection equation
educational material
goal orientation
opencl
object-oriented
jetson
curriculum
multicore
tk1
streamfunction
processor
particle
vorticity
thread
hierarchy
laplacian
curand
monte carlo
mosaic
negative
thrust
n-body problem
uchar4
bitmap
gaussian blur
blur
grayscale
order of accuracy
runge-kutta method
modified euler method
open source
hardware
software
accelerator
co-processor
process
mpi
tesla m2050
lectura
fermi
universitiy
multi-thread
flip
memory
opencv
tegra
embedded platform
openacc
moving average
profiling
profiler
occupancy
incompressible flow
project-based learning
micro intelligent robot system
numazu national collage of technology
nagaoka university of technology
lbm
d2q9 model
lattice boltzmann method
bounceback
pycuda
stream
overlap
asynchronous
cooperative processing
concurrent processing
multi-gpu
uva
gpu direct
unified virtual addressing
marching
software development
vscode
computational science
visual studio code
lagrange polynomial
sympy
array of characters
string
power approximation
excel
scipy
best practices
generation
schematic diagram
multiple gpu
fortran 95
fortran 90
cylinder
fem
performance
pinned memory
zero-copy
page-locked memory
transpose
warp
branch divergence
branch
memory access
stride access
coalesce access
cuda event
pi
csr
library
cufft
all-pair
loop unroll
interaction
template
iso_c_binding
porting
fluid dynamics
cpu implementation
vorticity equation
taylor-green vortex
laplace equation
conjugate gradient method
poisson equation
residual
red-black ordering
sor method
rotating cone
fastmath
atomic operation
flops
compute-bound
roofline
flop/byte
memory-bound
See more
- Presentations
- Documents
- Infographics