SlideShare a Scribd company logo
Practical Occlusion Culling in Killzone 3
Practical Occlusion Culling in
          Killzone 3
           Michal Valient
      Lead Tech, Guerrilla B.V.
Talk takeaway
Occlusion culling system used in Killzone 3
The reasons why to use software rasterization
(Some) technical details
How to pick good occluders
Q&A
Talk takeaway
Occlusion culling system used in Killzone 3
You’ll love software rasterization
(Some) technical details
How to pick good occluders
Q&A
Talk takeaway
Occlusion culling system used in Killzone 3
You’ll love software rasterization
(Some) technical details
How to pick good occluders
Q&A



                             Huge environments
Talk takeaway
Occlusion culling system used in Killzone 3
You’ll love software rasterization
(Some) technical details
How to pick good occluders
Q&A



                             Armed JetPacks
Talk takeaway
Occlusion culling system used in Killzone 3
You’ll love software rasterization
(Some) technical details
How to pick good occluders
Q&A



                             Giant Spider Robots
Killzone 3 visibility solution
Software rasterization running on SPUs
Render occluders into depth buffer
 –Use simplified version of the scene geometry
Conservatively scale down
 –To make it fit into SPU memory
 –To make it faster to test against
Test all objects against small depth buffer
 –Test bounding boxes
Why software rasterization
Previous solution did not scale well
 –Manually placed portals
Why software rasterization
Previous solution did not scale well
 –Manually placed portals
Works automatically
 –Can be enabled early in production
Why software rasterization
Previous solution did not scale well
 –Manually placed portals
Works automatically
 –Can be enabled early in production
Completely dynamic solution
 –Any object can become an occluder
Why software rasterization
Previous solution did not scale well
 –Manually placed portals
Works automatically
 –Can be enabled early in production
Completely dynamic solution
 –Any object can become an occluder
Maps well to SPUs
 –No sync issues
 –No GPU costs related to visibility testing
Practical Occlusion Culling in Killzone 3
Practical Occlusion Culling in Killzone 3
Practical Occlusion Culling in Killzone 3
Practical Occlusion Culling in Killzone 3
Implementation overview
Occluder setup
First stage, not parallel
Outputs clipped + projected triangles
 –One list of triangle data
 –One “index” DMA list per rasterizer job
Caches are important at this stage
 –2KB vertex array cache (90% hit rate)
 –32-entry post-transform cache (60% hit rate)
 –Various double-buffered output caches
Rasterization
Split 640x360p depth buffer into 16 pixel-high strips
 –Rasterize in parallel, one SPU job per strip
 –Load triangles using the prepared DMA list
Traditional scanline rasterizer
 –Fill internal 640x16 floating point depth buffer
Vectorization is the key
 –Set up three or four edges at once
 –Generate 4x1 pixels at once
 –Optimize in assembly
Rasterization
Compress depth buffer
 –One output pixel is maximum depth of 16x16 block.
  –But patch single pixel holes first.
 –Encode as uint16, reserve 0xffffu for infinity
Output single scanline of 40x23 occlusion buffer
Occlusion tests
Tests happen in parallel
Each object consists of one or more parts
First test object bounding box
 –Skip for objects visible last frame
Then test individual parts
Continue with submesh culling
 –Small Spatial Kd-Tree inside most meshes
 –Allows for culling arbitrarily small mesh chunks
Occlusion tests
Accurate tests
 –Bounding box rasterization and depth test
 –Working on small depth buffer, be conservative
Fast bounding sphere tests
 –Precomputed hierarchical reject data
 –Constant time test for small spheres
 –Only used for fast reject
Generating good occluders
Where to get occluders
Aiming for automated solution
Originally wanted to use scene geometry
 –Reduced polygon count
 –Too many errors, in general does not work
 Now using physics mesh
 –Closed, low polygon meshes
 –Not always conservative in the right sense
  –Visual mesh can be inside physics mesh causing drops
Where to get occluders
How to select good occluders
Simple heuristics to identify good occluders
 –Discard anything which is small
 –Discard by meta data
  –clutter, set dressing, foliage, railings…
 –Discard if surface area is significantly smaller than
 bounding box surface area
Artists can override the process
 –Still creating the best occluders by hand
Artist generated occluders
Artist generated occluders
Conclusion
Software rasterization is great
 –Fast on SPUs
 –Easy to integrate
 –Very accurate (if occluders are accurate)
Creating occluders is hard
 –Automatic system was not enough
 –Plan for this in content creation
 –Define workflow for finding and fixing leaks
Voxelization anyone?
Conclusion
Special thanks to Will Vale (Second Intention Ltd) for
implementing this system for us.
Statistics
100 occluders, 1500 triangles
Test 1000 objects, 2700 parts
Timings
 –Setup job: 0.5ms
 –Rasterize job: 2.0ms (on 5 SPUs)
 –Query job: 4.5ms (on 5 SPUs)
 –Overall latency: ~2ms

More Related Content

PPT
Secrets of CryENGINE 3 Graphics Technology
Tiago Sousa
 
PPTX
The Rendering Technology of Killzone 2
Guerrilla
 
PPTX
Rendering Technologies from Crysis 3 (GDC 2013)
Tiago Sousa
 
PPTX
Moving Frostbite to Physically Based Rendering
Electronic Arts / DICE
 
PPTX
Stochastic Screen-Space Reflections
Electronic Arts / DICE
 
PPTX
DirectX 11 Rendering in Battlefield 3
Electronic Arts / DICE
 
PDF
Rendering AAA-Quality Characters of Project A1
Ki Hyunwoo
 
PDF
Forward+ (EUROGRAPHICS 2012)
Takahiro Harada
 
Secrets of CryENGINE 3 Graphics Technology
Tiago Sousa
 
The Rendering Technology of Killzone 2
Guerrilla
 
Rendering Technologies from Crysis 3 (GDC 2013)
Tiago Sousa
 
Moving Frostbite to Physically Based Rendering
Electronic Arts / DICE
 
Stochastic Screen-Space Reflections
Electronic Arts / DICE
 
DirectX 11 Rendering in Battlefield 3
Electronic Arts / DICE
 
Rendering AAA-Quality Characters of Project A1
Ki Hyunwoo
 
Forward+ (EUROGRAPHICS 2012)
Takahiro Harada
 

What's hot (20)

PDF
Screen Space Reflections in The Surge
Michele Giacalone
 
PPT
A Bit More Deferred Cry Engine3
guest11b095
 
PDF
Rendering Tech of Space Marine
Pope Kim
 
PDF
Killzone Shadow Fall Demo Postmortem
Guerrilla
 
PDF
The Rendering Technology of 'Lords of the Fallen' (Game Connection Europe 2014)
Philip Hammer
 
PDF
Graphics Gems from CryENGINE 3 (Siggraph 2013)
Tiago Sousa
 
PDF
An introduction to Realistic Ocean Rendering through FFT - Fabio Suriano - Co...
Codemotion
 
PPT
Paris Master Class 2011 - 07 Dynamic Global Illumination
Wolfgang Engel
 
PDF
Dissecting the Rendering of The Surge
Philip Hammer
 
PPTX
Optimizing the Graphics Pipeline with Compute, GDC 2016
Graham Wihlidal
 
PPT
Frostbite Rendering Architecture and Real-time Procedural Shading & Texturing...
repii
 
PPT
Crysis Next-Gen Effects (GDC 2008)
Tiago Sousa
 
PDF
Lighting Shading by John Hable
Naughty Dog
 
PPTX
Physically Based and Unified Volumetric Rendering in Frostbite
Electronic Arts / DICE
 
PPT
Visibility Optimization for Games
Umbra
 
PPTX
Hable John Uncharted2 Hdr Lighting
ozlael ozlael
 
PPTX
High dynamic range
changehee lee
 
PPTX
A Certain Slant of Light - Past, Present and Future Challenges of Global Illu...
Electronic Arts / DICE
 
PDF
DD18 - SEED - Raytracing in Hybrid Real-Time Rendering
Electronic Arts / DICE
 
PPTX
FrameGraph: Extensible Rendering Architecture in Frostbite
Electronic Arts / DICE
 
Screen Space Reflections in The Surge
Michele Giacalone
 
A Bit More Deferred Cry Engine3
guest11b095
 
Rendering Tech of Space Marine
Pope Kim
 
Killzone Shadow Fall Demo Postmortem
Guerrilla
 
The Rendering Technology of 'Lords of the Fallen' (Game Connection Europe 2014)
Philip Hammer
 
Graphics Gems from CryENGINE 3 (Siggraph 2013)
Tiago Sousa
 
An introduction to Realistic Ocean Rendering through FFT - Fabio Suriano - Co...
Codemotion
 
Paris Master Class 2011 - 07 Dynamic Global Illumination
Wolfgang Engel
 
Dissecting the Rendering of The Surge
Philip Hammer
 
Optimizing the Graphics Pipeline with Compute, GDC 2016
Graham Wihlidal
 
Frostbite Rendering Architecture and Real-time Procedural Shading & Texturing...
repii
 
Crysis Next-Gen Effects (GDC 2008)
Tiago Sousa
 
Lighting Shading by John Hable
Naughty Dog
 
Physically Based and Unified Volumetric Rendering in Frostbite
Electronic Arts / DICE
 
Visibility Optimization for Games
Umbra
 
Hable John Uncharted2 Hdr Lighting
ozlael ozlael
 
High dynamic range
changehee lee
 
A Certain Slant of Light - Past, Present and Future Challenges of Global Illu...
Electronic Arts / DICE
 
DD18 - SEED - Raytracing in Hybrid Real-Time Rendering
Electronic Arts / DICE
 
FrameGraph: Extensible Rendering Architecture in Frostbite
Electronic Arts / DICE
 
Ad

Similar to Practical Occlusion Culling in Killzone 3 (20)

PPT
Visibility Optimization for Games
Sampo Lappalainen
 
PDF
Software Engineering Advice from Google's Jeff Dean for Big, Distributed Systems
adrianionel
 
ODP
Garbage Collection in Hotspot JVM
jaganmohanreddyk
 
PDF
The Diabolical Developers Guide to Performance Tuning
jClarity
 
PPTX
SnappyData Ad Analytics Use Case -- BDAM Meetup Sept 14th
SnappyData
 
PPT
Data Applications and Infrastructure at LinkedIn__HadoopSummit2010
Yahoo Developer Network
 
PPTX
Explore big data at speed of thought with Spark 2.0 and Snappydata
Data Con LA
 
PDF
MRT 2018: reflecting on the past and the present with temporal graph models
Antonio García-Domínguez
 
PPTX
Deep learning to the rescue - solving long standing problems of recommender ...
Balázs Hidasi
 
PDF
The Art Of Performance Tuning - with presenter notes!
Jonathan Ross
 
PDF
Distributed applications using Hazelcast
Taras Matyashovsky
 
PDF
The Art Of Performance Tuning
Jonathan Ross
 
PPTX
IMAGE PROCESSING
ABHISHEK MAURYA
 
PDF
Performance Tuning - Memory leaks, Thread deadlocks, JDK tools
Haribabu Nandyal Padmanaban
 
PDF
10 Ways To Improve Your Code
ConSanFrancisco123
 
PDF
Structured Forests for Fast Edge Detection [Paper Presentation]
Mohammad Shaker
 
PPT
Infrastructure as Code to Maintain your Sanity
Dewey Sasser
 
PPTX
Dock ir incident response in a containerized, immutable, continually deploy...
Shakacon
 
PDF
Introduction occlusion
VisCircle
 
PPTX
Parallel run selenium tests in a good way
COMAQA.BY
 
Visibility Optimization for Games
Sampo Lappalainen
 
Software Engineering Advice from Google's Jeff Dean for Big, Distributed Systems
adrianionel
 
Garbage Collection in Hotspot JVM
jaganmohanreddyk
 
The Diabolical Developers Guide to Performance Tuning
jClarity
 
SnappyData Ad Analytics Use Case -- BDAM Meetup Sept 14th
SnappyData
 
Data Applications and Infrastructure at LinkedIn__HadoopSummit2010
Yahoo Developer Network
 
Explore big data at speed of thought with Spark 2.0 and Snappydata
Data Con LA
 
MRT 2018: reflecting on the past and the present with temporal graph models
Antonio García-Domínguez
 
Deep learning to the rescue - solving long standing problems of recommender ...
Balázs Hidasi
 
The Art Of Performance Tuning - with presenter notes!
Jonathan Ross
 
Distributed applications using Hazelcast
Taras Matyashovsky
 
The Art Of Performance Tuning
Jonathan Ross
 
IMAGE PROCESSING
ABHISHEK MAURYA
 
Performance Tuning - Memory leaks, Thread deadlocks, JDK tools
Haribabu Nandyal Padmanaban
 
10 Ways To Improve Your Code
ConSanFrancisco123
 
Structured Forests for Fast Edge Detection [Paper Presentation]
Mohammad Shaker
 
Infrastructure as Code to Maintain your Sanity
Dewey Sasser
 
Dock ir incident response in a containerized, immutable, continually deploy...
Shakacon
 
Introduction occlusion
VisCircle
 
Parallel run selenium tests in a good way
COMAQA.BY
 
Ad

More from Guerrilla (20)

PPTX
Horizon Zero Dawn: An Open World QA Case Study
Guerrilla
 
PPTX
Horizon Zero Dawn: A Game Design Post-Mortem
Guerrilla
 
PPTX
Putting the AI Back Into Air: Navigating the Air Space of Horizon Zero Dawn
Guerrilla
 
PPTX
Decima Engine: Visibility in Horizon Zero Dawn
Guerrilla
 
PDF
Building Non-Linear Narratives in Horizon Zero Dawn
Guerrilla
 
PPTX
Player Traversal Mechanics in the Vast World of Horizon Zero Dawn
Guerrilla
 
PDF
The Real-time Volumetric Cloudscapes of Horizon Zero Dawn
Guerrilla
 
PDF
The Production and Visual FX of Killzone Shadow Fall
Guerrilla
 
PPTX
Out of Sight, Out of Mind: Improving Visualization of AI Info
Guerrilla
 
PDF
The Next-Gen Dynamic Sound System of Killzone Shadow Fall
Guerrilla
 
PDF
Killzone Shadow Fall: Creating Art Tools For A New Generation Of Games
Guerrilla
 
PDF
Taking Killzone Shadow Fall Image Quality Into The Next Generation
Guerrilla
 
PDF
Lighting of Killzone: Shadow Fall
Guerrilla
 
PDF
A Hierarchically-Layered Multiplayer Bot System for a First-Person Shooter
Guerrilla
 
PPTX
Practical Occlusion Culling on PS3
Guerrilla
 
PPT
Release This! Tools for a Smooth Release Cycle
Guerrilla
 
PPT
Killzone 2 Multiplayer Bots
Guerrilla
 
PPTX
Automatic Annotations in Killzone 3 and Beyond
Guerrilla
 
PDF
The Creation of Killzone 3
Guerrilla
 
PDF
The PlayStation®3’s SPUs in the Real World: A KILLZONE 2 Case Study
Guerrilla
 
Horizon Zero Dawn: An Open World QA Case Study
Guerrilla
 
Horizon Zero Dawn: A Game Design Post-Mortem
Guerrilla
 
Putting the AI Back Into Air: Navigating the Air Space of Horizon Zero Dawn
Guerrilla
 
Decima Engine: Visibility in Horizon Zero Dawn
Guerrilla
 
Building Non-Linear Narratives in Horizon Zero Dawn
Guerrilla
 
Player Traversal Mechanics in the Vast World of Horizon Zero Dawn
Guerrilla
 
The Real-time Volumetric Cloudscapes of Horizon Zero Dawn
Guerrilla
 
The Production and Visual FX of Killzone Shadow Fall
Guerrilla
 
Out of Sight, Out of Mind: Improving Visualization of AI Info
Guerrilla
 
The Next-Gen Dynamic Sound System of Killzone Shadow Fall
Guerrilla
 
Killzone Shadow Fall: Creating Art Tools For A New Generation Of Games
Guerrilla
 
Taking Killzone Shadow Fall Image Quality Into The Next Generation
Guerrilla
 
Lighting of Killzone: Shadow Fall
Guerrilla
 
A Hierarchically-Layered Multiplayer Bot System for a First-Person Shooter
Guerrilla
 
Practical Occlusion Culling on PS3
Guerrilla
 
Release This! Tools for a Smooth Release Cycle
Guerrilla
 
Killzone 2 Multiplayer Bots
Guerrilla
 
Automatic Annotations in Killzone 3 and Beyond
Guerrilla
 
The Creation of Killzone 3
Guerrilla
 
The PlayStation®3’s SPUs in the Real World: A KILLZONE 2 Case Study
Guerrilla
 

Recently uploaded (20)

PPTX
attack on titan anime designAttack on Titan premiered as an anime adaptation ...
timesidiomasaulas
 
PPTX
Theatre of the Absurd: Understanding the Philosophy Behind Absurdist Drama
maxmag791
 
PPTX
Bill and Lalu prasad yadav jokes to laugh.pptx
PRAKASHBHATTARAI32
 
PDF
-----Rk Narayan's THE GUIDE.ppt.pdf-----
jayparmar101626
 
PPTX
90's Kallinvhvhvugguygubibiytggjrbkg.pptx
bhaswatideka11
 
PPTX
购买英国毕业证|补办贝尔法斯特女王大学毕业证|补办QUB文凭国外学位认证
mookxk3
 
PPTX
TDXFYIYILFXDGFFGUYIFXGFXGFHYUYIOFHXFHFYIOIUYYFDHFGUYILPPT.pptx
dilludcruz
 
DOCX
Jumping Jacks_ The Timeless Powerhouse of Fitness.docx
Custom Printing Boxes
 
PPTX
CODE REDPOIUYTRESDAZDXFGHJKLIUYTRDF.pptx
dilludcruz
 
PPTX
PPP;O'P;IYHFDSDFGYUIKHTLIUYHGDFGHJILUYGT.pptx
dilludcruz
 
PPTX
wavegamhjhvhjghjfyufffghfghing_finaldeck.pptx
sampleacc835
 
PDF
LectureNotes1PPTv5.pdfLectureNotes1PPTv5.pdf
ZarinTatak
 
PDF
Regarding honorarium for the year 2025-26 human resources(XV-FC) approved und...
khankhan307705
 
PPTX
Superman Template Design “Superman,” DC Studios’ first feature film to hit th...
timesidiomasaulas
 
PPTX
EXP 401- On farm Advisory for Soil Health, Water Quality and Plant Nutrition....
AbedhSabu
 
PDF
Keep It Short: India's Talent Launchpad for Filmmakers
Cinystore Technologies
 
DOCX
The Unseen Guardian_ Why a Sports Safety Helmet is Essential for Every Child'...
Custom Printing Boxes
 
PDF
Wondla_204_55_version11_Editedforfolio_.pdf
Giovanny Cardenas
 
PPTX
633930017-FINAL-The-Caterbury-Tales-Slides.pptx
dermiconmendesmatos
 
PDF
Breaking into Television Writing: A Modern Guide for Aspiring Creators
All Writers Destination
 
attack on titan anime designAttack on Titan premiered as an anime adaptation ...
timesidiomasaulas
 
Theatre of the Absurd: Understanding the Philosophy Behind Absurdist Drama
maxmag791
 
Bill and Lalu prasad yadav jokes to laugh.pptx
PRAKASHBHATTARAI32
 
-----Rk Narayan's THE GUIDE.ppt.pdf-----
jayparmar101626
 
90's Kallinvhvhvugguygubibiytggjrbkg.pptx
bhaswatideka11
 
购买英国毕业证|补办贝尔法斯特女王大学毕业证|补办QUB文凭国外学位认证
mookxk3
 
TDXFYIYILFXDGFFGUYIFXGFXGFHYUYIOFHXFHFYIOIUYYFDHFGUYILPPT.pptx
dilludcruz
 
Jumping Jacks_ The Timeless Powerhouse of Fitness.docx
Custom Printing Boxes
 
CODE REDPOIUYTRESDAZDXFGHJKLIUYTRDF.pptx
dilludcruz
 
PPP;O'P;IYHFDSDFGYUIKHTLIUYHGDFGHJILUYGT.pptx
dilludcruz
 
wavegamhjhvhjghjfyufffghfghing_finaldeck.pptx
sampleacc835
 
LectureNotes1PPTv5.pdfLectureNotes1PPTv5.pdf
ZarinTatak
 
Regarding honorarium for the year 2025-26 human resources(XV-FC) approved und...
khankhan307705
 
Superman Template Design “Superman,” DC Studios’ first feature film to hit th...
timesidiomasaulas
 
EXP 401- On farm Advisory for Soil Health, Water Quality and Plant Nutrition....
AbedhSabu
 
Keep It Short: India's Talent Launchpad for Filmmakers
Cinystore Technologies
 
The Unseen Guardian_ Why a Sports Safety Helmet is Essential for Every Child'...
Custom Printing Boxes
 
Wondla_204_55_version11_Editedforfolio_.pdf
Giovanny Cardenas
 
633930017-FINAL-The-Caterbury-Tales-Slides.pptx
dermiconmendesmatos
 
Breaking into Television Writing: A Modern Guide for Aspiring Creators
All Writers Destination
 

Practical Occlusion Culling in Killzone 3

  • 2. Practical Occlusion Culling in Killzone 3 Michal Valient Lead Tech, Guerrilla B.V.
  • 3. Talk takeaway Occlusion culling system used in Killzone 3 The reasons why to use software rasterization (Some) technical details How to pick good occluders Q&A
  • 4. Talk takeaway Occlusion culling system used in Killzone 3 You’ll love software rasterization (Some) technical details How to pick good occluders Q&A
  • 5. Talk takeaway Occlusion culling system used in Killzone 3 You’ll love software rasterization (Some) technical details How to pick good occluders Q&A Huge environments
  • 6. Talk takeaway Occlusion culling system used in Killzone 3 You’ll love software rasterization (Some) technical details How to pick good occluders Q&A Armed JetPacks
  • 7. Talk takeaway Occlusion culling system used in Killzone 3 You’ll love software rasterization (Some) technical details How to pick good occluders Q&A Giant Spider Robots
  • 8. Killzone 3 visibility solution Software rasterization running on SPUs Render occluders into depth buffer –Use simplified version of the scene geometry Conservatively scale down –To make it fit into SPU memory –To make it faster to test against Test all objects against small depth buffer –Test bounding boxes
  • 9. Why software rasterization Previous solution did not scale well –Manually placed portals
  • 10. Why software rasterization Previous solution did not scale well –Manually placed portals Works automatically –Can be enabled early in production
  • 11. Why software rasterization Previous solution did not scale well –Manually placed portals Works automatically –Can be enabled early in production Completely dynamic solution –Any object can become an occluder
  • 12. Why software rasterization Previous solution did not scale well –Manually placed portals Works automatically –Can be enabled early in production Completely dynamic solution –Any object can become an occluder Maps well to SPUs –No sync issues –No GPU costs related to visibility testing
  • 18. Occluder setup First stage, not parallel Outputs clipped + projected triangles –One list of triangle data –One “index” DMA list per rasterizer job Caches are important at this stage –2KB vertex array cache (90% hit rate) –32-entry post-transform cache (60% hit rate) –Various double-buffered output caches
  • 19. Rasterization Split 640x360p depth buffer into 16 pixel-high strips –Rasterize in parallel, one SPU job per strip –Load triangles using the prepared DMA list Traditional scanline rasterizer –Fill internal 640x16 floating point depth buffer Vectorization is the key –Set up three or four edges at once –Generate 4x1 pixels at once –Optimize in assembly
  • 20. Rasterization Compress depth buffer –One output pixel is maximum depth of 16x16 block. –But patch single pixel holes first. –Encode as uint16, reserve 0xffffu for infinity Output single scanline of 40x23 occlusion buffer
  • 21. Occlusion tests Tests happen in parallel Each object consists of one or more parts First test object bounding box –Skip for objects visible last frame Then test individual parts Continue with submesh culling –Small Spatial Kd-Tree inside most meshes –Allows for culling arbitrarily small mesh chunks
  • 22. Occlusion tests Accurate tests –Bounding box rasterization and depth test –Working on small depth buffer, be conservative Fast bounding sphere tests –Precomputed hierarchical reject data –Constant time test for small spheres –Only used for fast reject
  • 24. Where to get occluders Aiming for automated solution Originally wanted to use scene geometry –Reduced polygon count –Too many errors, in general does not work  Now using physics mesh –Closed, low polygon meshes –Not always conservative in the right sense –Visual mesh can be inside physics mesh causing drops
  • 25. Where to get occluders
  • 26. How to select good occluders Simple heuristics to identify good occluders –Discard anything which is small –Discard by meta data –clutter, set dressing, foliage, railings… –Discard if surface area is significantly smaller than bounding box surface area Artists can override the process –Still creating the best occluders by hand
  • 29. Conclusion Software rasterization is great –Fast on SPUs –Easy to integrate –Very accurate (if occluders are accurate) Creating occluders is hard –Automatic system was not enough –Plan for this in content creation –Define workflow for finding and fixing leaks Voxelization anyone?
  • 30. Conclusion Special thanks to Will Vale (Second Intention Ltd) for implementing this system for us.
  • 31. Statistics 100 occluders, 1500 triangles Test 1000 objects, 2700 parts Timings –Setup job: 0.5ms –Rasterize job: 2.0ms (on 5 SPUs) –Query job: 4.5ms (on 5 SPUs) –Overall latency: ~2ms

Editor's Notes

  • #2: \n
  • #3: \n
  • #4: I’ll first describe the occlusion system in Killzone 3\n and the reasons why we chose it and why should you\n\nThen I’ll talk about some technical details\n and heuristics for picking good occluders.\n\nAnd I hope we’ll have some time for questions.\n\n... but let’s first take look at what Killzone 3 actually is.\n
  • #5: Killzone 3 is a first person shooter released \n earlier this year exclusively for Playstation 3.\n\nAnd it's essentially about space marines trying to escape \n from the planet full of space Nazis.\n
  • #6: The game is set in huge detailed outdoor environments…\n Where you can fly around with an armed jetpack...\n And you get to fight giant spider robot.\n Twice.\n\nKillzone 3 is a big game\n and we needed good object visibility solution \n that works well with these diverse settings.\n\n
  • #7: The game is set in huge detailed outdoor environments…\n Where you can fly around with an armed jetpack...\n And you get to fight giant spider robot.\n Twice.\n\nKillzone 3 is a big game\n and we needed good object visibility solution \n that works well with these diverse settings.\n\n
  • #8: The game is set in huge detailed outdoor environments…\n Where you can fly around with an armed jetpack...\n And you get to fight giant spider robot.\n Twice.\n\nKillzone 3 is a big game\n and we needed good object visibility solution \n that works well with these diverse settings.\n\n
  • #9: \n
  • #10: We chose to try software rasterization running completely on SPUs. \n \nWe render the simplified version of the level geometry \n into a depth buffer (these are the occluders)\n\nAnd we then use scaled down version of this depth buffer \n to test visibility of all objects or lights in the current view frustum.\n The test itself uses bounding box of the objects.\n
  • #11: I'll try to summarize the reasons why we chose this solution \n and I'm sure you recognize some of your own experiences here.\n\nFirst and foremost we saw that our solution based on portals \n does not scale well with the big outdoor levels.\n\nEspecially since our portals were hand placed by artists and we were \n running into production stalls.\n\n Very important issue if you try to create the game in two years.\n
  • #12: The process of occluder creation can be made largely \n automatic and can be enabled from the early days of production.\n\nThe whole concept of occluders is very similar to building \n of regular geometry and it's easy to understand by artists. \n\nThis makes it very easy to step in and manually create occluders where needed.\n\n
  • #13: Unlike most other approaches, this one is completely dynamic.\n\nAny sufficiently large object on screen can serve as a good occluder. \n If you're hiding behind a destructible barrel or a metal plate in our \n game, it is an occluder.\n\nDoors can open and close and they perfectly block the visibility \n without you having to write special code for such case.\n\n
  • #14: Software rasterization maps well onto SPUs - it's easy to distribute \n and SPUs are generic enough to allow us to run complicated object culling logic.\n\nAnd unlike GPU based solutions, you get exact results \n within the same frame without complicated synchronization logic.\n
  • #15: Let’s look at the example of how the occlusion system works.\n\n
  • #16: If we look from the side, you see we don’t \n render anything behind the closed door.\n\nBut as soon as the door open...\n
  • #17: ...we start to render the rest of the visible scene. \n As I mentioned earlier, this \n happens entirely automatically.\n\nAnd since I forgot to record the occlusion \n depth buffer, you’ll have to trust on this one.\n
  • #18: We implemented the system as series of SPU jobs.\n Most run in parallel to do the heavy lifting.\n\n
  • #19: The first SPU job loads visible occluder primitives and \n outputs clipped and projected triangles for rasterization. \n\nA single list of triangles is shared between the rasterizer jobs, and each job\n has its own DMA list pointing to the subset of triangles it needs to draw.\n\nThe occluder primitives are identical to visual meshes, with vertex and index arrays.\n Therefore we introduced several caches to \n reduce bandwidth and vertex transformation costs. \n This setup is very similar to what you find in a GPU.\n
  • #20: We split our depth buffer into 16-pixel-high strips and\n run one rasterize job per strip in parallel.\n\nEach rasterize job loads the list of triangles that intersect with its\n strip using the DMA list prepared by the setup job.\n\nWe perform standard scanline rasterization \n into an internal floating point buffer.\n \nThe rasterize jobs are compute-bound, and the code is \n extensively vectorized to improve throughput.\n Inner loops are written in SPA assembler.\n\n
  • #21: For output, we conservatively compress the depth buffer.\n Each output pixel is the maximum depth value of a 16x16 pixel tile in the depth buffer.\n\nBefore compression, we patch single pixel holes to avoid leaks \n when the occluders are not water-tight.\n This is a bit of a cheat, but it’s necessary otherwise\n such hole pushes the tile way into the background.\n\nThe last step encodes depth into 16 bits, \n Occluder frustum is shorter than visual frustum so we \n reserve one bit for points behind occluder far plane.\n\n
  • #22: The last step is the actual visibility testing.\n We have one job that gathers all objects in the camera frustum\n and then spawns an occlusion test job for each batch of objects.\n\nWe have a two level hierarchy of objects and meshes\n objects live in the scene, and meshes are what we send to the GPU.\n We test objects first to avoid testing meshes.\n\nIf a mesh has many triangles, we can continue with submesh culling.\n This uses the mesh’s Kd-tree to cull away whole ranges of mesh triangles.\n Visible meshes form new primitive sent to GPU.\n\nThe output of these jobs is the final result of the occlusion query, and is sent for rendering.\n\n
  • #23: We have two kinds of visibility test we can use to cull objects and parts.\n\nThe basic test is the most accurate, but also the most expensive.\n It rasterizes a bounding box with depth testing against the small occlusion buffer.\n\nWe also have constant-time tests for small objects.\n We precompute several versions of the occlusion buffer by \n conservatively dilating the depth values.\n This allows us to perform very quick bounding sphere tests.\n\nIf an object passes the fast test, or if it is too large, we do the accurate test.\n\n
  • #24: In the final part of the presentation I’d like \n to explain how we create occluders.\n\n
  • #25: We didn’t want artists to hand-make occluders \n for the entire level, so we looked for an automatic solution.\n\nWe experimented with using visual meshes,\n but the good polygon reduction proved \n to be difficult to get right.\n\nPhysics mesh is much better choice.\n It’s available for each mesh in the game\n and it’s cleaned and sufficiently low polygon count.\n\nUnfortunately the physics meshes can be slightly larger \n than visual meshes causing objects to disappear.\n Worst offenders have to be fixed manually.\n
  • #26: Here’s an example of occluders \n generated automatically from physics mesh.\n You can notice there’s some unnecessary detail, \n but in general the quality is pretty good.\n
  • #27: Even using physics mesh there was too much occluder geometry.\n\nWe needed to reject occluders that were unlikely to \n contribute much to the occlusion buffer.\n Our simple heuristics rejects all small objects or \n objects where the name suggests that they are not good occluders.\n\nWe also reject meshes whose surface area suggests\n that they are thin or with too many holes.\n\nArtists can of course step in and override the heuristics\n or provide their custom occluders for difficult cases and optimization.\n
  • #28: Here’s an example of KZ3 multiplayer level where the \n automatic heuristics did not work well.\n
  • #29: And here’s the highly optimized occluder mesh created by artist.\n
  • #30: We like this system, it’s simple, efficient and fits well with our pipeline.\n\nUnfortunately the content creation proved to be a problem.\n The automated solution did not work well enough in some cases\n and artists had to create custom occluders for entire levels.\n Luckily the geometry is easy to create.\n\nUsing scene voxelization might be a good way \n to generate simple, robust occluders automatically.\n
  • #31: I’d like to thank Will Vale for implementing the \n system for us and help with this presentation.\n
  • #32: Questions?\n