[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1362622.1362690acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Application development on hybrid systems

Published: 10 November 2007 Publication History

Abstract

Hybrid systems consisting of a multitude of different computing device types are interesting targets for high-performance applications. Chip multiprocessors, FPGAs, DSPs, and GPUs can be readily put together into a hybrid system; however, it is not at all clear that one can effectively deploy applications on such a system. Coordinating multiple languages, especially very different languages like hardware and software languages, is awkward and error prone. Additionally, implementing communication mechanisms between different device types unnecessarily increases development time. This is compounded by the fact that the application developer, to be effective, needs performance data about the application early in the design cycle. We describe an application development environment specifically targeted at hybrid systems, supporting data-flow semantics between application kernels deployed on a variety of device types. A specific feature of the development environment is the availability of performance estimates (via simulation) prior to actual deployment on a physical system.

References

[1]
L. Atieno, J. Allen, D. Goeckel, and R. Tessier. An adaptive Reed-Solomon errors-and-erasures decoder. In Int'l Symp. on Field Programmable Gate Arrays, pages 150--158, 2006.
[2]
Z. K. Baker and V. K. Prasanna. Efficient hardware data mining with the apriori algorithm on FPGAs. In Proc. of 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, pages 3--12, 2005.
[3]
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: Stream computing on graphics hardware. ACM Transactions on Graphics, 23(3):777--786, Aug. 2004.
[4]
R. D. Chamberlain, B. Shands, and J. White. Achieving real data throughput for an FPGA co-processor. In Proc. of 1st Workshop on Building Block Engine Architectures for Computers and Networks, Oct. 2004.
[5]
W. Chen, P. Kosmas, M. Leeser, and C. Rappaport. An FPGA implementation of the two-dimensional finite-difference time-domain (FDTD) algorithm. In Int'l Symp. on Field Programmable Gate Arrays, pages 213--222, Feb. 2004.
[6]
K. Fatahalian, J. Sugerman, and P. Hanrahan. Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In Proc. of the ACM Conf. on Graphics Hardware, pages 133--137, 2004.
[7]
M. A. Franklin, R. D. Chamberlain, M. Henrichs, B. Shands, and J. White. An architecture for fast processing of large unstructured data sets. In Proc. of IEEE 22nd Int'l Conf. on Computer Design, pages 280--287, Oct. 2004.
[8]
M. A. Franklin, E. J. Tyson, J. Buckley, P. Crowley, and J. Maschmeyer. Auto-pipe and the X language: A pipeline design tool and description language. In Proc. of Int'l Parallel and Distributed Processing Symp., Apr. 2006.
[9]
N. Galoppo, N. K. Govindaraju, M. Henson, and D. Manocha. LU-GPU: Efficient algorithms for solving dense linear systems on graphics hardware. In Proc. of the ACM/IEEE Conf. on Supercomputing, pages 3--14, 2005.
[10]
S. Gayen, E. J. Tyson, M. A. Franklin, and R. D. Chamberlain. A federated simulation environment for hybrid systems. In Proc. of 21st Int'l Workshop on Principles of Advanced and Distributed Simulation, pages 198--207, June 2007.
[11]
The Glib Team. GLib Reference Manual. http://developer.gnome.org/doc/API/glib.
[12]
N. Govindaraju, J. Gray, R. Kumar, and D. Manocha. GPUTeraSort: high performance graphics co-processor sorting for large database management. In Proc. of SIGMOD Int'l Conf. on Management of Data, pages 325--336, 2006.
[13]
Z. Guo, W. Najjar, F. Vahid, and K. Vissers. A quantitative analysis of the speedup factors of FPGAs over processors. In Int'l Symp. on Field Programmable Gate Arrays, pages 162--170, Feb. 2004.
[14]
M. C. Herbordt, J. Model, Y. Gu, B. Sukhwani, and T. VanCourt. Single pass, BLAST-like, approximate string matching on FPGAs. In Proc. of IEEE Symp. on Field-Programmable Custom Computing Machines, pages 217--226, 2006.
[15]
W. Hofmann, for the H.E.S.S. Collaboration. Status of the high energy stereoscopic system (H.E.S.S.) project. In Proc. of 27th Int'l Cosmic Ray Conf., pages 2785--2788, 2001.
[16]
D. R. Horn, M. Houston, and P. Hanrahan. ClawHMMER: A streaming HMMer-search implementation. In Proc. of ACM/IEEE Conf. on Supercomputing, pages 11--19, 2005.
[17]
M. Leeser, S. Miller, and H. Yu. Smart camera based on reconfigurable hardware enables diverse real-time applications. In Proc. of 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, pages 147--155, 2004.
[18]
National Instruments. Labview. http://www.ni.com/labview.
[19]
The Ptolemy Team. The Ptolemy Kernel - Supporting Heterogeneous Design. RASSP Digest Newsletter, 2(1):14--17, Apr. 1995.
[20]
R. Scrofano, M. Gokhale, F. Trouw, and V. K. Prasanna. Hardware/software approach to molecular dynamics on reconfigurable computers. In Proc. of 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, pages 23--34, 2006.
[21]
J. L. Tripp, H. S. Mortveit, A. A. Hansson, and M. Gokhale. Metropolitan road traffic simulation on FPGAs. In Proc. of 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, pages 117--126, 2005.
[22]
E. J. Tyson. Auto-pipe and the X language: A toolset and language for the simulation, analysis, and synthesis of heterogeneous pipelined architectures. Master's thesis, Washington University in St. Louis, Department of Computer Science and Engineering, 2006.
[23]
T. Weekes et al. VERITAS: the very energetic radiation imaging telescope array system. Astroparticle Physics, 17(2):221--243, May 2002.
[24]
D. Zaretsky, G. Mittal, X. Tang, and P. Banerjee. Overview of the FREEDOM compiler for mapping DSP software to FPGAs. In Proc. of 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, pages 37--46, 2004.

Cited By

View all
  • (2016)Virtualized I/OAttaining High Performance Communications10.1201/b10249-17(261-282)Online publication date: 19-Apr-2016
  • (2012)Using explicit platform descriptions to support programming of heterogeneous many-core systemsParallel Computing10.1016/j.parco.2011.10.00838:1-2(52-65)Online publication date: 1-Jan-2012
  • (2011)Explicit Platform Descriptions for Heterogeneous Many-Core ArchitecturesProceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum10.1109/IPDPS.2011.280(1292-1299)Online publication date: 16-May-2011
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SC '07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing
November 2007
723 pages
ISBN:9781595937643
DOI:10.1145/1362622
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 November 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. gamma ray astronomy
  2. hardware/software co-design
  3. hybrid systems
  4. performance modeling

Qualifiers

  • Research-article

Funding Sources

Conference

SC '07
Sponsor:

Acceptance Rates

SC '07 Paper Acceptance Rate 54 of 268 submissions, 20%;
Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)2
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Virtualized I/OAttaining High Performance Communications10.1201/b10249-17(261-282)Online publication date: 19-Apr-2016
  • (2012)Using explicit platform descriptions to support programming of heterogeneous many-core systemsParallel Computing10.1016/j.parco.2011.10.00838:1-2(52-65)Online publication date: 1-Jan-2012
  • (2011)Explicit Platform Descriptions for Heterogeneous Many-Core ArchitecturesProceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum10.1109/IPDPS.2011.280(1292-1299)Online publication date: 16-May-2011
  • (2010)State-of-the-art in heterogeneous computingScientific Programming10.1155/2010/54015918:1(1-33)Online publication date: 1-Jan-2010
  • (2010)Application-guided tool development for architecturally diverse computationProceedings of the 2010 ACM Symposium on Applied Computing10.1145/1774088.1774191(496-501)Online publication date: 22-Mar-2010
  • (2009)Sorting on architecturally diverse computer systemsProceedings of the Third International Workshop on High-Performance Reconfigurable Computing Technology and Applications10.1145/1646461.1646466(39-46)Online publication date: 15-Nov-2009
  • (2009)Embedded Algorithms Within an FPGA to Classify Nonlinear Single-Degree-of-Freedom SystemsIEEE Sensors Journal10.1109/JSEN.2009.20193229:11(1486-1493)Online publication date: Nov-2009
  • (2008)Understanding the performance of streaming applications deployed on hybrid systems2008 IEEE International Symposium on Parallel and Distributed Processing10.1109/IPDPS.2008.4536381(1-5)Online publication date: Apr-2008
  • (2008)Flexible Classification on Heterogenous Multicore Appliance Platforms2008 Proceedings of 17th International Conference on Computer Communications and Networks10.1109/ICCCN.2008.ECP.27(1-6)Online publication date: Aug-2008
  • (2008)Visions for application development on hybrid computing systemsParallel Computing10.1016/j.parco.2008.03.00134:4-5(201-216)Online publication date: 1-May-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media