[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/FCCM.2006.57guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Pipelined Mixed Precision Algorithms on FPGAs for Fast and Accurate PDE Solvers from Low Precision Components

Published: 24 April 2006 Publication History

Abstract

FPGAs are becoming more and more attractive for high precision scientific computations. One of the main problems in efficient resource utilization is the quadratically growing resource usage of multipliers depending on the operand size. Many research efforts have been devoted to the optimization of individual arithmetic and linear algebra operations. In this paper we take a higher level approach and seek to reduce the intermediate computational precision on the algorithmic level by optimizing the accuracy towards the final result of an algorithm. In our case this is the accurate solution of partial differential equations (PDEs). Using the Poisson Problem as a typical PDE example we show that most intermediate operations can be computed with floats or even smaller formats and only very few operations (e.g. 1%) must be performed in double precision to obtain the same accuracy as a full double precision solver. Thus the FPGA can be configured with many parallel float rather than few resource hungry double operations. To achieve this, we adapt the general concept of mixed precision iterative refinement methods to FPGAs and develop a fully pipelined version of the Conjugate Gradient solver. We combine this solver with different iterative refinement schemes and precision combinations to obtain resource efficient mappings of the pipelined algorithm core onto the FPGA.

Cited By

View all
  • (2016)Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing UnitsACM Transactions on Mathematical Software10.1145/290794443:2(1-27)Online publication date: 16-Aug-2016
  • (2016)A stochastic performance model for pipelined Krylov methodsConcurrency and Computation: Practice & Experience10.1002/cpe.382028:18(4532-4542)Online publication date: 25-Dec-2016
  • (2014)Series Expansion based Efficient Architectures for Double Precision Floating Point DivisionCircuits, Systems, and Signal Processing10.1007/s00034-014-9811-833:11(3499-3526)Online publication date: 1-Nov-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
FCCM '06: Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
April 2006
351 pages
ISBN:0769526616

Publisher

IEEE Computer Society

United States

Publication History

Published: 24 April 2006

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing UnitsACM Transactions on Mathematical Software10.1145/290794443:2(1-27)Online publication date: 16-Aug-2016
  • (2016)A stochastic performance model for pipelined Krylov methodsConcurrency and Computation: Practice & Experience10.1002/cpe.382028:18(4532-4542)Online publication date: 25-Dec-2016
  • (2014)Series Expansion based Efficient Architectures for Double Precision Floating Point DivisionCircuits, Systems, and Signal Processing10.1007/s00034-014-9811-833:11(3499-3526)Online publication date: 1-Nov-2014
  • (2013)Automatically adapting programs for mixed-precision floating-point computationProceedings of the 27th international ACM conference on International conference on supercomputing10.1145/2464996.2465018(369-378)Online publication date: 10-Jun-2013
  • (2013)Area-efficient architectures for double precision multiplier on FPGA, with run-time-reconfigurable dual single precision supportMicroelectronics Journal10.1016/j.mejo.2013.02.02144:5(421-430)Online publication date: 1-May-2013
  • (2013)Evaluation of two formulations of the conjugate gradients method with transactional memoryProceedings of the 19th international conference on Parallel Processing10.1007/978-3-642-40047-6_52(508-520)Online publication date: 26-Aug-2013
  • (2011)High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approachProceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/2063384.2063477(1-11)Online publication date: 12-Nov-2011
  • (2010)An error correction solver for linear systemsProceedings of the 9th international conference on High performance computing for computational science10.5555/1964238.1964248(58-70)Online publication date: 22-Jun-2010
  • (2010)VFloatACM Transactions on Reconfigurable Technology and Systems10.1145/1839480.18394863:3(1-34)Online publication date: 1-Sep-2010
  • (2010)A tightly coupled accelerator infrastructure for exact arithmeticsProceedings of the 23rd international conference on Architecture of Computing Systems10.1007/978-3-642-11950-7_20(222-233)Online publication date: 22-Feb-2010
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media