Compute Unified Device Architecture NVIDIA CUDA
14,453 Followers
Recent papers in Compute Unified Device Architecture NVIDIA CUDA
Nowadays modern computer GPU (Graphic Processing Unit) became widely used to improve the performance of a computer, which is basically for the GPU graphics calculations, are now used not only for the purposes of calculating the graphics... more
The computational epidemiology is the development and use of computational models that aims to understand the proliferation of diseases of the dynamic point of view. The computational models are capable to simulate the behavior of an... more
This Research aims to compare the execution time of processing raw data (K-Space raw data) into images on CPUs that are processed in serial and processing on GPU processed in parallel. There is one method on the serial implementation of... more
CUDA (Compute Unified Device Architecture) is a parallel computing platform developed by Nvidia which provides the ability of using GPUs to run computationally intensive programs. This presentation provides a brief overview of CUDA,... more
This thesis focuses on the development, implementation and optimization of pattern-matching algorithms in two different, yet closely-related research fields: malicious code detection in intrusion detection systems and digital forensics... more
The aim of this master thesis is to develop, implement and adapt a neural model for bio-inspired segmentation of color images. This model is based on BCS/FCS and previous works developed by the research group, but incorporating... more
In this paper, we investigate computing systems and network architectures, dedicated to high frequency trading applications and evaluate their performances. Both a high processing speed and low network latency are important for... more
Graphics processing units - or GPUs as they are more commonly known - are specialized circuits historically designed to efficiently handle computer graphics. They are highly parallel computers which can process large amounts of data... more
Stereo matching is an important research topic in virtual reality.Existing research mainly focuses on improving accuracy,concerning less on run time of the algorithm.In order to facilitate the speed comparison of stereo matching... more
This paper presents a parallel implementation of the hybrid BiCGStab(2) (bi-conjugate gradient stabilized) iterative method in a GPU (graphics processing unit) for solution of large and sparse linear systems. This implementation uses the... more
This GPU book teaches both CUDA and CPU Parallel Programming using pThreads.
Graphics Processing Units (GPUs) offer tremendous computational power. CUDA (Compute Unified Device Architecture) provides a multi-threaded parallel programming model, facilitating high performance implementations of general-purpose... more
GPGPUs have recently emerged as powerful vehicles for generalpurpose high-performance computing. Although a new Compute Unified Device Architecture (CUDA) programming model from NVIDIA offers improved programmability for general... more
Graphic processing Units (GPUs) are gaining ground in high-performance computing. CUDA (an extension to C) is most widely used parallel programming framework for general purpose GPU computations. However, the task of writing optimized... more
Zur Entstehung dieser Studie haben viele Kollegen und Freunde beigetragen. Mein erster Dank gilt meinem Doktorvater Eckart Conze. Er hat die Arbeit nicht nur von Anfang an konzeptionell begleitet, sondern ist mir darüber hinaus stets... more
Recent development in Graphics Processing Units (GPUs) has enabled inexpensive high performance computing for general-purpose applications. Compute Unified Device Architecture (CUDA) programming model provides the programmers adequate C... more
Статья посвящена разработке алгоритма взвешенного перекрывающегося сложения (weightedoverlap-add‑WOLA) для обработки векторных сигналов при проведении радиомониторинга в широком частотном диапазоне (ШЧД). Алгоритм предназначен для... more
A Smoothed Particle Hydrodynamics (SPH) method for free surface flows has been implemented on a graphical processing unit (GPU) using the Compute Unified Device Architecture (CUDA) developed by Nvidia, resulting in tremendous speed-ups.... more
The coding efficiency of the H.264/AVC standard makes the decoding process computationally demanding. This has limited the availability of cost-effective, high-performance solutions. Modern computers are typically equipped with powerful... more
This paper show an advanced computer graphic techniques for laser range finder (LRF) simulation. The LRF is the common sensor for unmanned ground vehicle, autonomous mobile robot and security applications. The cost of the measurement... more
Con metodo Monte Carlo su GPU Cuda
Processing of human faces finds application in various domains like law enforcement and surveillance, entertainment (interactive video games), information security, smart cards etc. Several of these applications are interactive and... more
In this paper, we present a multi-level programming model for recent GPU-based high performance computing systems. Involving cooperative stream threads and symmetric multiprocessing threads our model gives a computational framework that... more
a b s t r a c t Granular flows are extremely important for the pharmaceutical and chemical industry, as well as for other scientific areas. Thus, the understanding of the impact of particle size and related effects on the mean, as well as... more
Recently, graphics processing units (GPUs) have had great success in accelerating many numerical computations. We present their application to computations on unstructured meshes such as those in finite element methods. Multiple... more
Overview • Intro to Mathematica and its API • CUDA + Mathematica • Some examples
Cycles count in a graph is an NP-complete problem. This work minimizes the execution time to solve the problem compared to the other traditional serial, CPU based one. It reduces the hardware resources needed to a single commodity GPU. We... more
This work has the goal to study how an efficient deep packet inspection (DPI) algorithm may be implemented using the graphical processing unit (GPU) CUDA (Computer Unified Device Architecture) enabled boards existing in personal... more
Data mining place viral aspect in many of the applications like market –basket analysis, fraud detection etc. In data mining association rule mining and frequent pattern mining, both are key feature of market-basket analysis. In a given... more