Since the advent of programmable graphics processors (GPUs) their computational powers have been utilized for general purpose
computation. Initially by “exploiting” graphics APIs and recently through dedicated parallel computation frameworks such as
the Compute Unified Device Architecture (CUDA) from Nvidia. This paper investigates multiple implementations of volumetric
Mass-Spring-Damper systems in CUDA. The obtained performance is compared to previous implementations utilizing the GPU through
the OpenGL graphics API. We find that both performance and optimization strategies differ widely between the OpenGL and CUDA
implementations. Specifically, the previous recommendation of using implicitly connected particles is replaced by a recommendation
that supports unstructured meshes and run-time topological changes with an insignificant performance reduction.
Keywords Mass-Spring-Damper Models - GPGPU and Deformable Models