More Web Proxy on the site http://driver.im/

research-article

Architecture considerations for tracing incoherent rays

Authors:

Tero KarrasAuthors Info & Claims

HPG '10: Proceedings of the Conference on High Performance Graphics

Pages 113 - 122

Published: 25 June 2010 Publication History

Abstract

This paper proposes a massively parallel hardware architecture for efficient tracing of incoherent rays, e.g. for global illumination. The general approach is centered around hierarchical treelet subdivision of the acceleration structure and repeated queueing/postponing of rays to reduce cache pressure. We describe a heuristic algorithm for determining the treelet subdivision, and show that our architecture can reduce the total memory bandwidth requirements by up to 90% in difficult scenes. Furthermore the architecture allows submitting rays in an arbitrary order with practically no performance penalty. We also conclude that scheduling algorithms can have an important effect on results, and that using fixed-size queues is not an appealing design choice. Increased auxiliary traffic, including traversal stacks, is identified as the foremost remaining challenge of this architecture.

References

[1]

{AL09} Aila T., Laine S.: Understanding the efficiency of ray traversal on GPUs. In Proc. High-Performance Graphics 2009 (2009), pp. 145--149.

Digital Library

[2]

{DHK08} Dammertz H., Hanika J., Keller A.: Shallow bounding volume hierarchies for fast SIMD ray tracing of incoherent rays. Comp. Graph. Forum 27, 4 (2008), 1225--1234.

Digital Library

[3]

{Hal60} Halton J.: On the efficiency of certain quasi-random sequences of points in evaluating multi-dimensional integrals. Numerische Mathematik 2, 1 (1960), 84--90.

[4]

{HSHH07} Horn D. R., Sugerman J., Houston M., Hanrahan P.: Interactive k-d tree GPU raytracing. In Proc. Symposium on Interactive 3D graphics and games (2007), ACM, pp. 167--174.

Digital Library

[5]

{Kar07} Karrenberg R.: Memory aware realtime ray tracing: The bounding plane hierarchy. Bachelor thesis, Saarland University, 2007.

[6]

{KMKY09} Kim T.-J., Moon B., Kim D., Yoon S.-E.: RACBVHs: random-accessible compressed bounding volume hierarchies. In ACM SIGGRAPH '09: Posters (2009), pp. 1--1.

Digital Library

[7]

{Kra07} Krashinsky R. M.: Vector-Thread Architecture And Implementation. PhD thesis, MIT, 2007.

Digital Library

[8]

{Lal09} Lalonde P.: Innovating in a software graphics pipeline. ACM SIGGRAPH 2009 course: Beyond programmable shading, 2009.

[9]

{LYM07} Lauterbach C., Yoon S.-E., Manocha D.: Raystrips: A compact mesh representation for interactive ray tracing. In Proc. IEEE Symposium on Interactive Ray Tracing 2007 (2007), pp. 19--26.

Digital Library

[10]

{NFLM07} Navratil P. A., Fussell D. S., Lin C., Mark W. R.: Dynamic ray scheduling to improve ray coherence and bandwidth utilization. In Proc. IEEE Symposium on Interactive Ray Tracing 2007 (2007), pp. 95--104.

Digital Library

[11]

{NVI08} NVIDIA: NVIDIA CUDA Programming Guide Version 2.1. 2008.

[12]

{NVI10} NVIDIA: Nvidia's next generation CUDA compute architecture: Fermi. Whitepaper, 2010.

[13]

{PKGH97} Pharr M., Kolb C., Gershbein R., Hanrahan P.: Rendering complex scenes with memory-coherent ray tracing. In Proc. ACM SIGGRAPH 97 (1997), pp. 101--108.

Digital Library

[14]

{RGD09} Ramani K., Gribble C. P., Davis A.: Streamray: a stream filtering architecture for coherent ray tracing. In ASPLOS '09: Proc. 14th International Conference on Architectural Support for Programming Languages and Operating Systems (2009), ACM, pp. 325--336.

Digital Library

[15]

{SFB*09} Sugerman J., Fatahalian K., Boulos S., Akeley K., Hanrahan P.: Gramps: A programming model for graphics pipelines. ACM Trans. Graph. 28, 1 (2009), 1--11.

Digital Library

[16]

{Smi98} Smits B.: Efficiency issues for ray tracing. J. Graph. Tools 3, 2 (1998), 1--14.

Digital Library

[17]

{SWS02} Schmittler J., Wald I., Slusallek P.: Saarcor: a hardware architecture for ray tracing. In Proc. ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware (2002), pp. 27--36.

Digital Library

[18]

{SWW*04} Schmittler J., Woop S., Wagner D., Paul W. J., Slusallek P.: Realtime ray tracing of dynamic scenes on an FPGA chip. In Proc. ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware (2004), pp. 95--106.

Digital Library

[19]

{WBB08} Wald I., Benthin C., Boulos S.: Getting rid of packets: Efficient SIMD single-ray traversal using multi-branching BVHs. In Proc. IEEE/Eurographics Symposium on Interactive Ray Tracing 2008 (2008), pp. 49--57.

[20]

{WK06} Wächter C., Keller A.: Instant ray tracing: The bounding interval hierarchy. In Proc. Eurographics Symposium on Rendering 2006 (2006), pp. 139--149.

Digital Library

[21]

{WSS05} Woop S., Schmittler J., Slusallek P.: RPU: a programmable ray processing unit for realtime ray tracing. ACM Trans. Graph. 24, 4 (2005), 434--444.

Digital Library

Cited By

Feng YLin WLiu ZLeng JGuo MZhao HHou XZhao JZhu Y(2024)Potamoi: Accelerating Neural Rendering via a Unified Streaming ArchitectureACM Transactions on Architecture and Code Optimization10.1145/368934021:4(1-25)Online publication date: 20-Nov-2024
https://dl.acm.org/doi/10.1145/3689340
Fouladi SShacklett BPoms FArora AOzdemir ARaghavan DHanrahan PFatahalian KWinstein K(2022)R2E2ACM Transactions on Graphics10.1145/3528223.353017141:4(1-12)Online publication date: 22-Jul-2022
https://dl.acm.org/doi/10.1145/3528223.3530171
Zhu YLee JAgrawal KSpear M(2022)RTNNProceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming10.1145/3503221.3508409(76-89)Online publication date: 2-Apr-2022
https://dl.acm.org/doi/10.1145/3503221.3508409
Show More Cited By

Index Terms

Architecture considerations for tracing incoherent rays
1. Computing methodologies
  1. Computer graphics
    1. Graphics systems and interfaces
      1. Graphics processors
    2. Rendering
      1. Ray tracing

Recommendations

Faster incoherent rays: Multi-BVH ray stream tracing
HPG '09: Proceedings of the Conference on High Performance Graphics 2009

High fidelity rendering via ray tracing requires tracing incoherent rays for global illumination and other secondary effects. Recent research show that the performance benefits from fast packet traversal schemes that exploit high coherence are lost when ...
Exploiting coherence of shadow rays
AFRIGRAPH '01: Proceedings of the 1st international conference on Computer graphics, virtual reality and visualisation

We present independent method to reduce the number of shadow ray tests. It can be used with standard acceleration ray tracing algorithms. Our method is conservative and produces the same results. We test just one shadow ray in modified scene insted of ...
Distributed ray tracing

Ray tracing is one of the most elegant techniques in computer graphics. Many phenomena that are difficult or impossible with other techniques are simple with ray tracing, including shadows, reflections, and refracted light. Ray directions, however, ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

HPG '10: Proceedings of the Conference on High Performance Graphics

June 2010

189 pages

General Chairs:
Justin Hensley
AMD
,
Philipp Slusallek
DFKI & Saarland University, Germany
,
Program Chairs:
David McAllister
NVIDIA
,
Christiaan Gribble
Grove City College

Sponsors

SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
EUROGRAPHICS: The European Association for Computer Graphics

Publisher

Eurographics Association

Goslar, Germany

Publication History

Published: 25 June 2010

Check for updates

Qualifiers

Research-article

Conference

HPG'10

Sponsor:

SIGGRAPH
EUROGRAPHICS

HPG'10: High Performance Graphics

June 25 - 27, 2010

Saarbrucken, Germany

Acceptance Rates

Overall Acceptance Rate 15 of 44 submissions, 34%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

40
Total Citations
View Citations
333
Total Downloads

Downloads (Last 12 months)18
Downloads (Last 6 weeks)1

Reflects downloads up to 11 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Feng YLin WLiu ZLeng JGuo MZhao HHou XZhao JZhu Y(2024)Potamoi: Accelerating Neural Rendering via a Unified Streaming ArchitectureACM Transactions on Architecture and Code Optimization10.1145/368934021:4(1-25)Online publication date: 20-Nov-2024
https://dl.acm.org/doi/10.1145/3689340
Fouladi SShacklett BPoms FArora AOzdemir ARaghavan DHanrahan PFatahalian KWinstein K(2022)R2E2ACM Transactions on Graphics10.1145/3528223.353017141:4(1-12)Online publication date: 22-Jul-2022
https://dl.acm.org/doi/10.1145/3528223.3530171
Zhu YLee JAgrawal KSpear M(2022)RTNNProceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming10.1145/3503221.3508409(76-89)Online publication date: 2-Apr-2022
https://dl.acm.org/doi/10.1145/3503221.3508409
Jaroš MŘíha LStrakoš PŠpeťko M(2021)GPU Accelerated Path Tracing of Massive ScenesACM Transactions on Graphics10.1145/344780740:2(1-17)Online publication date: 27-Apr-2021
https://dl.acm.org/doi/10.1145/3447807
Vaidyanathan KWoop SBenthin C(2019)Wide BVH traversal with a short stackProceedings of the Conference on High-Performance Graphics10.2312/hpg.20191190(15-19)Online publication date: 8-Jul-2019
https://dl.acm.org/doi/10.2312/hpg.20191190
Lin DShkurko KMallett IYuksel CAndrews STatarchuk N(2019)Dual-split treesProceedings of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games10.1145/3306131.3317028(1-9)Online publication date: 21-May-2019
https://dl.acm.org/doi/10.1145/3306131.3317028
Koskela MImmonen KMäkitalo MFoi AViitanen TJääskeläinen PKultala HTakala J(2019)Blockwise Multi-Order Feature Regression for Real-Time Path-Tracing ReconstructionACM Transactions on Graphics10.1145/326997838:5(1-14)Online publication date: 17-Jun-2019
https://dl.acm.org/doi/10.1145/3269978
Lier AStamminger MSelgrad KMolnar S(2018)CPU-style SIMD ray traversal on GPUsProceedings of the Conference on High-Performance Graphics10.1145/3231578.3231583(1-4)Online publication date: 10-Aug-2018
https://dl.acm.org/doi/10.1145/3231578.3231583
Chitalu FDubach CKomura TMcGuire MNowrouzezahari D(2018)Bulk-synchronous parallel simultaneous BVH traversal for collision detection on GPUsProceedings of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games10.1145/3190834.3190848(1-9)Online publication date: 15-May-2018
https://dl.acm.org/doi/10.1145/3190834.3190848
Vasiou EShkurko KMallett IBrunvand EYuksel C(2018)A detailed study of ray tracing performanceThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-018-1532-834:6-8(875-885)Online publication date: 1-Jun-2018
https://dl.acm.org/doi/10.1007/s00371-018-1532-8
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents