Lecture Notes in Computer Science, 2001, Volume 2150/2001, 678-686, DOI: 10.1007/3-540-44681-8_97

Performance of the Complex Streamed Instruction Set on Image Processing Kernels

Dmitri Tcheressiz, Ben Juurlink, Stamatis Vassiliadis and Harry Wijshoff

Abstract

The Complex Streamed Instruction (CSI) set is an architectural paradigm designed to accelerate multimedia applications. These applications are characterized by streaming operations on small-width data elements such as 8-bit pixels or 16-bit audio samples. CSI instructions operate on two-dimensional data streams in a SIMD fashion and can process streams of arbitrary length. In this paper we evaluate the performance of the CSI architecture on a set of important image processing kernels. These kernels exhibit little data reuse, which results in poor cache performance. Simulation results show that CSI achieves a speedup of up to 3.98 (2.60 on average) over VIS, Sun’s media ISA extension. We also analyze how VIS and CSI scale with memory bandwidth. The results show that CSI scales much better than VIS as bandwidth increases.
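
To make the workload concrete, the following is a minimal scalar sketch of the kind of kernel the abstract describes: an element-wise saturating add over a two-dimensional stream of 8-bit pixels. It is an illustration only and does not reproduce CSI or VIS instruction semantics. The function name `saturating_add_2d` and the `stride` parameter are hypothetical and are not taken from the paper.

```python
def saturating_add_2d(src_a, src_b, width, height, stride):
    """Add two 8-bit images element by element, clamping each sum to 255.

    src_a, src_b: flat sequences of ints in 0..255, stored row by row.
    width, height: size of the region to process, in pixels.
    stride: distance in elements between the starts of consecutive rows
            (hypothetical parameter for this sketch; must be >= width).
    """
    if stride < width:
        raise ValueError("stride must be at least width")
    out = []
    for row in range(height):
        base = row * stride
        for col in range(width):
            # Each input pixel is read exactly once, so there is no data
            # reuse for a cache to exploit.
            total = src_a[base + col] + src_b[base + col]
            out.append(min(total, 255))
    return out


# A 3x2 region stored with a row stride of 4; the last element of each
# row is padding and is skipped.
a = [10, 200, 250, 0, 5, 128, 255, 0]
b = [20, 100, 10, 0, 5, 128, 1, 0]
result = saturating_add_2d(a, b, width=3, height=2, stride=4)
```

Every pixel is loaded once, combined, and stored once, which is the access pattern behind the poor cache performance noted above. The region's width, height, and row stride together describe a two-dimensional stream.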