This section describes an efficient implementation of ray casting volume rendering on the DASH shared memory architecture [9]. The algorithm is a variation of the optimized ray tracer of Levoy [7,8]. First, a small introduction to the DASH architecture [33] is given, followed by the description of the algorithm and some performance results.