Document Server@UHasselt >
Research publications >
Please use this identifier to cite or link to this item:
|Title: ||A High-level Kernel Transformation Rule Set for Efficient Caching on Graphics Hardware - Increasing Streaming Execution Performance with Minimal Design Effort|
|Authors: ||ROGMANS, Sammy|
|Issue Date: ||2009|
|Publisher: ||INSTICC Press|
|Citation: ||Proceedings of the International Conference on Signal Processing and Multimedia Applications. p. 38-43.|
|Abstract: ||This paper proposes a high-level rule set that allows algorithmic designers to optimize their implementation on graphics hardware, with minimal design effort. The rules suggest possible kernel splits and merges to transform the kernels of the original design, resulting in an inter-kernel rather then low-level intra-kernel optimization. The rules consider both traditional texture caches and next-gen shared memory – which are used in the abstract stream-centric paradigms such as CUDA and Brook+ – and can therefore be implicitly applied in most generic streaming applications on graphics hardware.|
|ISI #: ||000282246300011|
|Type: ||Proceedings Paper|
|Validation: ||ecoom, 2011|
|Appears in Collections: ||Research publications|
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.