[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/237090.237193acmconferencesArticle/Chapter ViewAbstractPublication PagesasplosConference Proceedingsconference-collections
Article
Free access

Exploiting dual data-memory banks in digital signal processors

Published: 01 September 1996 Publication History

Abstract

Over the past decade, digital signal processors (DSPs) have emerged as the processors of choice for implementing embedded applications in high-volume consumer products. Through their use of specialized hardware features and small chip areas, DSPs provide the high performance necessary for embedded applications at the low costs demanded by the high-volume consumer market. One feature commonly found in DSPs is the use of dual data-memory banks to double the memory system's bandwidth. When coupled with high-order data interleaving, dual memory banks provide the same bandwidth as more costly memory organizations such as a dual-ported memory. However, making effective use of dual memory banks remains difficult, especially for high-level language (HLL) DSP compilers.In this paper, we describe two algorithms --- compaction-based (CB) data partitioning and partial data duplication --- that we developed as part of our research into the effective exploitation of dual data-memory banks in HLL DSP compilers. We show that CB partitioning is an effective technique for exploiting dual data-memory banks, and that partial data duplication can augment CB partitioning in improving execution performance. Our results show that CB partitioning improves the performance of our kernel benchmarks by 13%-40% and the performance of our application benchmarks by 3%-15%. For one of the application benchmarks, partial data duplication boosts performance from 3% to 34%.

References

[1]
Gianluigi Castelli, Guest Editor's Introduction: 'Whe Seemingly Unlimited Market for Microcontroller-Based Embedded Systems," IEEE Micro, Vol. 15, No. 5, pp. 6-8, October, 1995.
[2]
Bennett Z. Kobb, "Telecommunications," IEEE Spectrum, pp. 30-34, January, 1995.
[3]
Edward A. Lee, "Programmable DSP Architectures," IEEE ASSP Magazine, Part I: pp.4-19, October, 1988; Part iI: pp. 4-14, January, 1989.
[4]
Mazen A. R. Saghir, Paul Chow, and Corinna G. Lee, ''Towards Better DSP Architectures and Compilers," Proceedings of the International Conference on Signal Processing Applications and Technology, pp. 658-664, DSP Associates, October, 1994.
[5]
Ruby B. Lee, "Accelerating Multimedia with Enhanced Microprocessors," IEEE Micro, Vol. 15, No. 2, pp. 22-32, April, 1995.
[6]
L. Kohn, G. Maturana, M. Tremblay, A. Prabhu, G. Zyner, "The Visual Instruction Set (VIS) in UltraSPARC," Proceedings of Compcon '95, pp. 462-469, March, 1995.
[7]
Upcoming issue of IEEE Micro on Media Processing, August, 1996.
[8]
Recent IC Announcements, Microprocessor Report, p. 27, August 21, 1995.
[9]
Most Significant Bits, Microprocessor Report, pp. 4-5, July 31, 1995.
[10]
John Hennessy and David Patterson, Computer Architecture: A Quantitative Approach, Second Edition, Morgan Kaufmann Publishers, Inc., 1995.
[11]
Linley Gwennap, "Improved Cost Model Puts Pentium at $180," Microprocessor Report, pp. 14-15, September 12, 1994.
[12]
Jim Turley and Phil Lapsley, "New 56301 DSP Doubles 24-Bit Performance," Microprocessor Report, pp. 14-15, December 4, 1995.
[13]
Jose Luis Pino, Soonhoi Ha, Edward A. Lee, and Joseph T. Buck, "Software Synthesis for DSP Using Ptolemy," Journal of VLSI Signal Processing,. Vol. 9, No. 1-2, pp. 7-21, January, 1995.
[14]
Vojin Zivojnovic, Harald Schraut, M. Willems, and R. Schoenen, "DSPs, GPPs, and Multimedia Applications - An Evaluation Using DSPstone," Proceedings of the International Conference on Signal Processing Applications and Technology, pp. 1779-1783, DSP Associates, October, 1995.
[15]
Alfred V. Aho, Ravi Sethi, and Jeffrey D. Ullman, Compilers: Principles, Techniques, and Tools, Addison-Wesley Publishing Company, 1986.
[16]
"MIPS Open RISC Technology R10000 Microprocessor Technical Brief', http: //www.mips. com/rl0k/ r10000_Pr_Info/R10000_Tech_Br_cv. html, October, 1994.
[17]
DSP56000/DSP56001 Digital Signal Processor User's Manual, Motorola, 1990.
[18]
Monica S. Lam, "Software Pipelining: An Effective Scheduling Technique for VLiW Machines," SIGPIAN Conference on Programming Language Design and Implementation, pp. 318-328, ACM, June, 1988.
[19]
Carla Procaskey, "Improving Compiled DSP Code Through Language Extensions," Proceedings of the International Conference on Signal Processing Applications and Technology, pp. 846-850, DSP Associates, October, 1995.
[20]
Ashok Sudarsanam and Sharad Malik, "Memory Bank and Register Allocation in Software Synthesis for ASIPS," Proceedings of the International Conference on Computer- Aided Design, pp. 388-392, IEEE/ACM, 1995.
[21]
P.G. Lowney, et. al., 'Whe Multifiow Trace Scheduling Compiler," Journal of Supercomputing, Vol. 7, Issue I-2, pp. 51-142, May, 1993.
[22]
"SUIF: An Infrastructure for Research on Parallelizing and Optimizing Compilers," http://suif, stanford. edu/suif / sui f-overview/suif, html, 1994.
[23]
Joseph Fisher, "Very Long Instruction Word Architectures and the ELI-512," Proceedings of the l Oth International Symposium on Computer Architecture, pp. 140-150, IEEE, 1983.
[24]
Richard M. Stallman, Using and Porting GNU C, Free Software Foundation, Inc., 1990.
[25]
David Landskov, Scott Davidson, Bruce Shriver, and Patrick W. Mallett, "Local Microcode Compaction Techniques,'' Computing Surveys, 12(3): pp. 261-294, ACM, September, 1980.
[26]
Michael R. Garey and David S. Johnson, Computers and Intractability: A Guide to the Theory of NP. Completeness, W. H. Freeman and Company, 1979.
[27]
Mazen A. R. Saghir, Paul Chow, and Corinna G. Lee, "Application-Driven Design of DSP Architectures and Compilers," Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp. II-437- 440, IEEE, 1994.

Cited By

View all
  • (2017)Efficient Automated Code Partitioning for Microcontrollers with Switchable Memory BanksACM Transactions on Embedded Computing Systems10.1145/305551116:4(1-26)Online publication date: 26-May-2017
  • (2013)Minimizing code size via page selection optimization on partitioned memory architecturesProceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded Systems10.5555/2555729.2555741(1-10)Online publication date: 29-Sep-2013
  • (2013)Joint variable partitioning and bank selection instruction optimization for partitioned memory architecturesACM Transactions on Embedded Computing Systems10.1145/2442116.244212612:3(1-27)Online publication date: 8-Apr-2013
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ASPLOS VII: Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
October 1996
290 pages
ISBN:0897917677
DOI:10.1145/237090
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 September 1996

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

ASPLOS96
Sponsor:

Acceptance Rates

ASPLOS VII Paper Acceptance Rate 25 of 109 submissions, 23%;
Overall Acceptance Rate 535 of 2,713 submissions, 20%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)180
  • Downloads (Last 6 weeks)21
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Efficient Automated Code Partitioning for Microcontrollers with Switchable Memory BanksACM Transactions on Embedded Computing Systems10.1145/305551116:4(1-26)Online publication date: 26-May-2017
  • (2013)Minimizing code size via page selection optimization on partitioned memory architecturesProceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded Systems10.5555/2555729.2555741(1-10)Online publication date: 29-Sep-2013
  • (2013)Joint variable partitioning and bank selection instruction optimization for partitioned memory architecturesACM Transactions on Embedded Computing Systems10.1145/2442116.244212612:3(1-27)Online publication date: 8-Apr-2013
  • (2013)Minimizing code size via page selection optimization on partitioned memory architectures2013 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES)10.1109/CASES.2013.6662516(1-10)Online publication date: Sep-2013
  • (2013)C Compilers and Code Optimization for DSPsHandbook of Signal Processing Systems10.1007/978-1-4614-6859-2_31(1015-1040)Online publication date: 10-May-2013
  • (2012)Adaptive Source-Level Data Assignment to Dual Memory BanksACM Transactions on Embedded Computing Systems10.1145/2180887.218089711S:1(1-22)Online publication date: 1-Jun-2012
  • (2012)On-chip memory architecture exploration framework for DSP processor-based embedded system on chipACM Transactions on Embedded Computing Systems10.1145/2146417.214642211:1(1-25)Online publication date: 5-Apr-2012
  • (2012)Analysis and approximation for bank selection instruction minimization on partitioned memory architectureJournal of Combinatorial Optimization10.1007/s10878-010-9365-z23:2(274-291)Online publication date: 1-Feb-2012
  • (2011)BibliographyReal-Time Embedded Systems10.1201/b10935-12(187-207)Online publication date: 7-Jun-2011
  • (2011)Case Study of Efficient Parallel Memory Access Programming for the Embedded Heterogeneous Multicore DSP Architecture ePUMAProceedings of the 2011 International Conference on Complex, Intelligent, and Software Intensive Systems10.1109/CISIS.2011.103(624-629)Online publication date: 30-Jun-2011
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media