Suitability of Tile-Based Rendering for Low-Power 3D Graphics Accelerators

DISSERTATION
for obtaining the degree of doctor
at Delft University of Technology,
on Monday 29 October 2007 at 12:30

by

Iosif ANTOCHI
engineer (inginer), Universitatea Politehnica București
born in Bucharest, Romania

ISBN: 978-90-807957-6-1
Keywords: 3D Graphics Accelerators, Tile-based Rendering, Low-Power Graphics Architectures

This dissertation is dedicated to Claudia and my family, for all their understanding and support over the years.

The purpose of these accelerators is to relieve the burden of graphical computations from the main processor and also to achieve a better energy efficiency than can be achieved by executing these computations on the main processor. Since external data traffic is a major source of power consumption, and because the rasterization stage of the 3D graphics pipeline usually requires the highest amount of data traffic, in this dissertation we focus especially on this stage of the graphics pipeline. Another reason for focusing on the rasterization stage is that it requires more processing power than the other stages because its operations are pixel-based. A promising technique to reduce the external data traffic in the rasterization stage of the graphics pipeline is tile-based rendering. This technique decomposes a scene into tiles and renders the tiles one by one. This allows the color components and z-values of one tile to be stored in small, on-chip buffers, so that only the pixels visible in the final scene need to be stored in the external framebuffer. Tile-based accelerators, however, require large scene buffers to store the primitives to be rendered. While there have been studies related to the tile-based rendering paradigm for high-performance systems, we specifically discuss the suitability of tile-based 3D graphics accelerators for low-power devices. In order to evaluate various low-power 3D graphics architectures, we first present GraalBench, a set of 3D graphics workloads representative of contemporary and emerging mobile devices. Furthermore, we propose several scene and state management algorithms for tile-based renderers. Thereafter, we analyze the performance of tile-based renderers compared to that of traditional renderers, and we also determine the influence of the tile size on the amount of data traffic required for the rasterization stage of a tile-based renderer.
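As an illustration of the tile decomposition mentioned above, the following minimal Python sketch assigns screen-space triangles to the tiles they may touch. It is not one of the scene management algorithms proposed in this dissertation: the function name `bin_triangles`, the bounding-box overlap test, and the example screen and tile sizes are all assumptions chosen for illustration.

```python
import math
from collections import defaultdict


def bin_triangles(triangles, width, height, tile_size):
    """Return {(tile_x, tile_y): [triangle indices]} using bounding boxes.

    Hypothetical helper: the bounding-box test is conservative, so a
    triangle may be listed for a tile that it does not actually cover.
    """
    bins = defaultdict(list)
    max_tx = (width - 1) // tile_size   # index of the last tile column
    max_ty = (height - 1) // tile_size  # index of the last tile row
    for index, triangle in enumerate(triangles):
        xs = [x for x, _ in triangle]
        ys = [y for _, y in triangle]
        # Discard triangles whose bounding box lies entirely off-screen.
        if max(xs) < 0 or max(ys) < 0 or min(xs) >= width or min(ys) >= height:
            continue
        # Clamp the bounding box to the screen and convert it to an
        # inclusive range of tile indices.
        tx0 = max(math.floor(min(xs)) // tile_size, 0)
        tx1 = min(math.floor(max(xs)) // tile_size, max_tx)
        ty0 = max(math.floor(min(ys)) // tile_size, 0)
        ty1 = min(math.floor(max(ys)) // tile_size, max_ty)
        for ty in range(ty0, ty1 + 1):
            for tx in range(tx0, tx1 + 1):
                bins[(tx, ty)].append(index)
    return dict(bins)


# Example: a 64x64 screen split into 32x32 tiles (assumed sizes).
triangles = [
    ((1, 1), (10, 1), (1, 10)),      # fits inside the top-left tile
    ((20, 20), (40, 20), (20, 40)),  # bounding box spans all four tiles
    ((-5, -5), (-1, -5), (-5, -1)),  # entirely off-screen, skipped
]
bins = bin_triangles(triangles, width=64, height=64, tile_size=32)
for tile in sorted(bins):
    print(tile, bins[tile])
```

A renderer built on such a structure would process one tile's list at a time, keeping that tile's color and z-values in small local buffers and writing only the finished tile to the external framebuffer. The per-tile lists also hint at why the scene buffer can grow large: a primitive that overlaps several tiles is recorded once per tile.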
To further reduce the data traffic between the main memory and graphics accelerators, and to exploit the high temporal and spatial locality of texture accesses, we have also investigated several cache structures. Our results show that the proposed algorithms for tile-based renderers can effectively decrease the data traffic and computational requirements for the rasterization stage of the 3D graphics pipeline.

Acknowledgments

During the time that I was performing the research described in this dissertation, I came across many people who have supported and assisted me, without whom it would have been much harder to produce this dissertation.