Stencil Buffer Trick for Volume Deferred Shading - Daqi's Blog

文章推薦指數: 80 %
投票人數:10人

Here is a nice stencil buffer trick introduced by Yuriy O'Donnell (2009). What it do is basically using the stencil buffer as a counter to ... Togglenavigation Daqi'sBlog Home Publications Music Photo About Deferredshadinghavebeenknowntolargelyboosttheefficiencyofrenderingwhentreatingalargeamountoflights.Themechanismisverysimple:separatethegeometrycomplexityfromlighting.Todothis,G-bufferswhichformanarrayoftextures,oftenincludingposition,materialcolor,normalofthepointstoshade,arerenderedinthefirstpassfromthepointofview.Then,anorthographiccameraareusedtorenderaquadthatcoversthewholescreen.Thenormalized(0-1)screencoordinatesarethenusedtoretrievedthegeometry/materialdataatthepointofthescreen,whichisfedintothelightingfunction.Insuchaway,weavoidproducingtonsoffragmentsfromtheprojectedscenegeometry,instead,onlyrenderthosewhicharevisible. However,imaginewehavealargegroupoflights.We’llstillhavetogothroughthewholelistoflightsforeachscreenpixel.Withamorephysicallybasedlightingmodel,inwhicheachlighthasainfluentialradius(resultedfromthephysicalfactthathasanidealpointlightsourcehasinversesquareddrop-off),fragmentsthatareoutsideacertainlight’sinfluentialradiuswouldwastetimeonwaitingotherfragmentsinthesamebatchtogotoadifferentbranch.WeknowthatbranchingisbadforGPU.Thisleadstoaseveretimepenalty.Manytechniqueshavebeenproposedtoalleviatethisproblem.Tileddeferredshadingisaverypopularmethod,probablymostofyouhaveheardof.Itpartitionthescreenintotilesandcreatea“lightlist”foreachtileusingonlythelightsthatintersectwiththetiles.Thisisofcourseanelegantmethod.However,wewillalwaysneedtodosomepreprocessingbeforegeneratinganewgroupoflights(ifthereisaneedto). Asimplersolutionisvolumedeferredshading.Wejustneedtorender“lightvolumes”foreachlight,which,asyoumightguess,canbeaspherewitharadiusequaltothelight’sinfluentialradius.Forexample,inOpenGL,wejustneedtocreatealistofvertices/indicesofasphereandprepareamodelmatrixforeachlight(whichissimplyscalingandtranslation).Whilerendering,weperformthedrawcallntimes,wherenisthenumberoflights.Onesuchlightvolumewillproducefragmentsthatcoversthespecificregiononthescreenwherethefragmentsarepossibletobeshadedbythelight.Ofcourse,bydoingthiswearelosingthedepthdimension.Wehavetoexplicitlytestthefragmentsandmakesuretheyareataboutthesamedepthregionofthelight(whichisonlyanecessarycondition).Ofcourse,tilerenderingalsorequiresuchtesting,butifwehavelightsscatteringeverywhereinavery“deep”scene,thelightstobetestedaresignificantlylesser.However,becausenopreprocessingarerequired,volumedeferredshadinghavequitecompetitiveperformanceinmostcases. Wait!Weshouldnotrendernpasses!Instead,abetterwayistousetheinstancingfunctionwhichissupportedoneverymoderngraphicscardtoavoidthelatencycausedbylotsofcommunicationbetweenCPUandGPU.Alsoanotherimportantthing,depthwriteshouldbedisabledandadditiveblendingshouldbeenabled.Thereasonthatdepthwriteshouldbedisabledisthatlightvolumesarenotrealgeometry.Whiletwolightvolumesareclosetoeachotherandareilluminatingthesameregionofthescene,wedon’twantthemtooccludeeachothersuchthatsomepartareonlyshadedbyoneofthelightvolumes. Ifyoudothevolumedeferredshadingdescribedabovedirectly,wewillimmediatelydiscoverthatsomethinggoeswrong.Whenyouapproachesalight’slocation(withamovingcamera),atsomepointthescreenwillsuddenlybedarkened.Thisisbecausenomatteryouturnbackfacecullingon/off,youwillfallinadilemmathatyoueitherrenderthepixelstwiceasbrightwhenyouareoutofthelightvolume,ornotrenderanythingwhenyouareinsidethelightvolume. Itturnsoutthatthissituationcanbeeasilysolvedbyswitchingthecullingmodetofront-faceculling.However,thisisnotgoodenough.WecanactuallykeeptheZbuffercreatedbyG-buffersrenderingandusethisinformationperformsomerejectionoffragmentsthatarenotintersectedbylightvolumes.HereisanicestencilbuffertrickintroducedbyYuriyO’Donnell(2009).Whatitdoisbasicallyusingthestencilbufferasacountertorecordwhetherthefrontfaceofalightvolumeisbehindthevisiblescenegeometry(sothatithasnochancetoshadethepixel).Thisisachievedbyonlyrenderingfrontfaces(withcolorbufferwritedisabled)inthefirstpassandadd1tothestenciloftheZ-failedpixels.Anothersituationisthatthebackfaceofalightvolumeisbeforethevisiblescenegeometry,whichissolvedbythesecondpass-renderingonlybackfacesanduseaGreater/Equalz-testtocontinuefilterthefinalpixelsfromthepixelswithazerostencilvalue(alreadypassthefirsttest).Sothatwecankeeponlythelightvolumepixelsthat“fail”thez-test(theoriginal“LESS”one),whichintuitivelycorrespondstothescenegeometry-generatedpixelsthatintersectswiththelightvolume.Noticethistrickalsoworkswhenyouareinsidealightvolume,inwhichcasefrontfaceswon’tgetrendered(itisillogicalthatthefactthatweareinsidealightvolumeandthatthefrontfaceisbehindthevisiblegeometryholdatthesametime!),leavingazerostencilvaluethatallowustouseGreater/Equaldepthtestonlytofilterthepixelstobeshaded.Ofcourse,ineitherpassweneedtodisableZwrite.Certainlywedon’twantthelightvolumesbumpingintoeachother. TheoriginaldiagramusedbyYuriyO'Donnell. Whilethistrickdefinitelyincreasestheefficiencyespeciallywhenthelightingsituationisverycomplex,wecandosomethingbetter.Oftenmodelingadetailedsphereaspolygoncreateslargenumberofverticesthatcramupthepipeline.Instead,wecanuseaverycoarsepolygon-sphere(e.g.,with4vertical/horizontalslices)withaslightlylargerradiustoensurethatthelightvolumeisboundedcorrectly.Wecanevenusejustaboundingcubeofthelightvolume!Ofcourse,theleastthingwecandoisjustuseaquad.However,thatgivesupsomeaforementioneddepthtestbenefitsanditalsoinvolvessomecomplexprojectivegeometry.Justforfun,Ialsopreparedaderivationoftheaxis-alignedscreenspaceboundingquadoftheprojectedsphere. ←PreviousPost NextPost→



請為這篇文章評分?