[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1921479.1921497acmconferencesArticle/Chapter ViewAbstractPublication PageshpgConference Proceedingsconference-collections
research-article

Architecture considerations for tracing incoherent rays

Published: 25 June 2010 Publication History

Abstract

This paper proposes a massively parallel hardware architecture for efficient tracing of incoherent rays, e.g. for global illumination. The general approach is centered around hierarchical treelet subdivision of the acceleration structure and repeated queueing/postponing of rays to reduce cache pressure. We describe a heuristic algorithm for determining the treelet subdivision, and show that our architecture can reduce the total memory bandwidth requirements by up to 90% in difficult scenes. Furthermore the architecture allows submitting rays in an arbitrary order with practically no performance penalty. We also conclude that scheduling algorithms can have an important effect on results, and that using fixed-size queues is not an appealing design choice. Increased auxiliary traffic, including traversal stacks, is identified as the foremost remaining challenge of this architecture.

References

[1]
{AL09} Aila T., Laine S.: Understanding the efficiency of ray traversal on GPUs. In Proc. High-Performance Graphics 2009 (2009), pp. 145--149.
[2]
{DHK08} Dammertz H., Hanika J., Keller A.: Shallow bounding volume hierarchies for fast SIMD ray tracing of incoherent rays. Comp. Graph. Forum 27, 4 (2008), 1225--1234.
[3]
{Hal60} Halton J.: On the efficiency of certain quasi-random sequences of points in evaluating multi-dimensional integrals. Numerische Mathematik 2, 1 (1960), 84--90.
[4]
{HSHH07} Horn D. R., Sugerman J., Houston M., Hanrahan P.: Interactive k-d tree GPU raytracing. In Proc. Symposium on Interactive 3D graphics and games (2007), ACM, pp. 167--174.
[5]
{Kar07} Karrenberg R.: Memory aware realtime ray tracing: The bounding plane hierarchy. Bachelor thesis, Saarland University, 2007.
[6]
{KMKY09} Kim T.-J., Moon B., Kim D., Yoon S.-E.: RACBVHs: random-accessible compressed bounding volume hierarchies. In ACM SIGGRAPH '09: Posters (2009), pp. 1--1.
[7]
{Kra07} Krashinsky R. M.: Vector-Thread Architecture And Implementation. PhD thesis, MIT, 2007.
[8]
{Lal09} Lalonde P.: Innovating in a software graphics pipeline. ACM SIGGRAPH 2009 course: Beyond programmable shading, 2009.
[9]
{LYM07} Lauterbach C., Yoon S.-E., Manocha D.: Raystrips: A compact mesh representation for interactive ray tracing. In Proc. IEEE Symposium on Interactive Ray Tracing 2007 (2007), pp. 19--26.
[10]
{NFLM07} Navratil P. A., Fussell D. S., Lin C., Mark W. R.: Dynamic ray scheduling to improve ray coherence and bandwidth utilization. In Proc. IEEE Symposium on Interactive Ray Tracing 2007 (2007), pp. 95--104.
[11]
{NVI08} NVIDIA: NVIDIA CUDA Programming Guide Version 2.1. 2008.
[12]
{NVI10} NVIDIA: Nvidia's next generation CUDA compute architecture: Fermi. Whitepaper, 2010.
[13]
{PKGH97} Pharr M., Kolb C., Gershbein R., Hanrahan P.: Rendering complex scenes with memory-coherent ray tracing. In Proc. ACM SIGGRAPH 97 (1997), pp. 101--108.
[14]
{RGD09} Ramani K., Gribble C. P., Davis A.: Streamray: a stream filtering architecture for coherent ray tracing. In ASPLOS '09: Proc. 14th International Conference on Architectural Support for Programming Languages and Operating Systems (2009), ACM, pp. 325--336.
[15]
{SFB*09} Sugerman J., Fatahalian K., Boulos S., Akeley K., Hanrahan P.: Gramps: A programming model for graphics pipelines. ACM Trans. Graph. 28, 1 (2009), 1--11.
[16]
{Smi98} Smits B.: Efficiency issues for ray tracing. J. Graph. Tools 3, 2 (1998), 1--14.
[17]
{SWS02} Schmittler J., Wald I., Slusallek P.: Saarcor: a hardware architecture for ray tracing. In Proc. ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware (2002), pp. 27--36.
[18]
{SWW*04} Schmittler J., Woop S., Wagner D., Paul W. J., Slusallek P.: Realtime ray tracing of dynamic scenes on an FPGA chip. In Proc. ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware (2004), pp. 95--106.
[19]
{WBB08} Wald I., Benthin C., Boulos S.: Getting rid of packets: Efficient SIMD single-ray traversal using multi-branching BVHs. In Proc. IEEE/Eurographics Symposium on Interactive Ray Tracing 2008 (2008), pp. 49--57.
[20]
{WK06} Wächter C., Keller A.: Instant ray tracing: The bounding interval hierarchy. In Proc. Eurographics Symposium on Rendering 2006 (2006), pp. 139--149.
[21]
{WSS05} Woop S., Schmittler J., Slusallek P.: RPU: a programmable ray processing unit for realtime ray tracing. ACM Trans. Graph. 24, 4 (2005), 434--444.

Cited By

View all
  • (2024)Potamoi: Accelerating Neural Rendering via a Unified Streaming ArchitectureACM Transactions on Architecture and Code Optimization10.1145/368934021:4(1-25)Online publication date: 20-Nov-2024
  • (2022)R2E2ACM Transactions on Graphics10.1145/3528223.353017141:4(1-12)Online publication date: 22-Jul-2022
  • (2022)RTNNProceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming10.1145/3503221.3508409(76-89)Online publication date: 2-Apr-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
HPG '10: Proceedings of the Conference on High Performance Graphics
June 2010
189 pages

Sponsors

Publisher

Eurographics Association

Goslar, Germany

Publication History

Published: 25 June 2010

Check for updates

Qualifiers

  • Research-article

Conference

HPG'10
Sponsor:
HPG'10: High Performance Graphics
June 25 - 27, 2010
Saarbrucken, Germany

Acceptance Rates

Overall Acceptance Rate 15 of 44 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)1
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Potamoi: Accelerating Neural Rendering via a Unified Streaming ArchitectureACM Transactions on Architecture and Code Optimization10.1145/368934021:4(1-25)Online publication date: 20-Nov-2024
  • (2022)R2E2ACM Transactions on Graphics10.1145/3528223.353017141:4(1-12)Online publication date: 22-Jul-2022
  • (2022)RTNNProceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming10.1145/3503221.3508409(76-89)Online publication date: 2-Apr-2022
  • (2021)GPU Accelerated Path Tracing of Massive ScenesACM Transactions on Graphics10.1145/344780740:2(1-17)Online publication date: 27-Apr-2021
  • (2019)Wide BVH traversal with a short stackProceedings of the Conference on High-Performance Graphics10.2312/hpg.20191190(15-19)Online publication date: 8-Jul-2019
  • (2019)Dual-split treesProceedings of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games10.1145/3306131.3317028(1-9)Online publication date: 21-May-2019
  • (2019)Blockwise Multi-Order Feature Regression for Real-Time Path-Tracing ReconstructionACM Transactions on Graphics10.1145/326997838:5(1-14)Online publication date: 17-Jun-2019
  • (2018)CPU-style SIMD ray traversal on GPUsProceedings of the Conference on High-Performance Graphics10.1145/3231578.3231583(1-4)Online publication date: 10-Aug-2018
  • (2018)Bulk-synchronous parallel simultaneous BVH traversal for collision detection on GPUsProceedings of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games10.1145/3190834.3190848(1-9)Online publication date: 15-May-2018
  • (2018)A detailed study of ray tracing performanceThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-018-1532-834:6-8(875-885)Online publication date: 1-Jun-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media