Low Complexity Reconfigurable DSP Circuit Implementations Based on Common Sub-expression Elimination

H. Ho¹,
V. Szwarc¹ &
T. Kwasniewski²

178 Accesses
Explore all metrics

Abstract

A design technique based on a combination of Common Sub-Expression Elimination and Bit-Slice (CSE-BitSlice) arithmetic for hardware and performance optimization of multiplier designs with variable operands is presented in this paper. The CSE-BitSlice technique can be extended to hardware optimization of multiplier circuits operating on vectors or matrices of variables. The CSE-BitSlice technique has been applied to the design and implementation of 12 × 12 and 42 × 42 bit real multipliers, a complex multiplier, a 6-tap FIR filter, and a 5-point DFT circuit. For comparison purposes, circuit implementations of the same arithmetic and DSP functions have been carried out using Radix-4 Booth and CSA algorithms. Simulation results based on implementations using the Xilinx FPGA 5VLX330FF1760-2 device shows that the circuits based on the CSE-BitSlice techniques require fewer logic resources and yield higher throughput as compared to the CSA and Radix-4 Booth based circuits.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Efficient Implementation of Complex Multipliers on FPGAs Using DSP Slices

Article Open access 27 April 2023

FPGA Implementation of High Speed Multiplier with Optimized Reduction Phase

Design and Implementation of Adders and Multipliers for DSP Applications

References

Pasko, R., Schaumont, P., Derudder, V., Vernalde, V., & Durackova, D. (1999). A new algorithm for elimination of common subexpressions. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 18(1), 58–68.
Article Google Scholar
Macpherson, K. N., & Steward, R. W. (2006). Area efficient FIR filters for high speed FPGA implementation. IEE Proceeding: Vision, Image and Signal Processing, 153(6), 711–720.
Article Google Scholar
Dempster, A. G., & Macleod, M. (1995). Use of minimum-adder multiplier blocks in FIR digital filters. IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, 42(9), 569–576.
Article MATH Google Scholar
Mirzaei, S., Hosangadi, A., & Kastner, R. (2006). FPGA implementation of high speed FIR filters using add and shift method. Proc. of the 14th Intl. Conference on FPGA FPGA'06, pp. 231–237.
Macleod, M. D., & Dempster, A. G. (2004). Common subexpression elimination algorithm for low-cost multiplierless implementation of matrix multipliers. Electronics Letters, 40(11), 651–652.
Article Google Scholar
Mehendale, M., Sherlekar, S. D., & Vekantesh, G. (1995). Synthesis of multiplierless FIR filters with minimum number of additions. Proceedings of the 1995 IEEE/ACM International Conference on Computer-Aided Design. Los Alamitos CA., pp. 668–671.
Hartley, R. I. (1996). Subexpression sharing in filters using canonic signed digit multipliers. IEEE Transactions Circuits and Systems II, 43, 677–688.
Article Google Scholar
Potkonjak, M., Srivastava, M. B., & Chandrakasan, A. P. (1996). Multiple constant multiplications: efficient and versatile framework and algorithms for exploring common subexpression elimination. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 15(2), 151–165.
Article Google Scholar
Park, I. C., & Kang, H. J. (2002). Digital filter synthesis based on an algorithm to generate all minimal signed digit representations. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 21(12), 1525–1529.
Article MathSciNet Google Scholar
Nguyen, H. T., & Chatterjee, A. (2000). Number-splitting with shift-and-add decomposition for power and hardware optimization in linear DSP synthesis. IEEE Transactions on Very Large Scale Integration Systems, 8(4), 419–424.
Article Google Scholar
Hosangadi, A., Fallah, F., & Kastner, R. (2005). Simultaneous optimization of delay and number of operations in multiplierless implementation of linear systems. Proc. of the 14th Intl. Workshop on Logic and Synthesis IWLS’05.
Wu, Q., Song, Q., & Sun, Y. (2004). A novel algorithm for common subexpression elimination in VLSI implementation of high speed multiplierless FIR filters. Proc. of the 6th IASTED Intl. Conference on Signal and Image Processing SIP’04, pp. 346–350.
Aksoy, L., Gunes, E. O., & Flores, P. (2008). An exact breadth-first search algorithm for the multiple constant multiplications problem. Proc. of the 26th NORCHIP Conf., Nov. 2008, pp. 41–46.
Hwang, K. (1978). Computer arithmetic: Principles, architecture and design. New York: Wiley.
Google Scholar
Ercegovac, M. D., & Lang, T. (1990). Fast multiplication without carry propagate addition. IEEE Transactions on Computers, 39, 1385–1390.
Article Google Scholar
Asato, C., Ditzen, C., & Dholakia, S. (1989). A datapath multiplier with automatic insertion of pipeline stages. IEEE Proc. of Custom Integrated Circuits Conference, pp. 23.2.1–23.2.4.
Booth, A. D. (1951). A signed binary multiplication technique. Quarterly Journal of Mechanics and Applied Mathematics, 4(2), 236–240.
Article MATH MathSciNet Google Scholar
McSorley, O. L. (1961). High speed arithmetic in binary computers. Proceedings of the Institute of Radio Engineers, 49(1), 67–91.
Google Scholar
Shin, M. C., Kang, S. H., & Park, I. C. (2001). An area-efficient iterative modified-booth multiplier based on self-timed clocking. IEEE Int’l Conf. on Computer, pp. 511–514.
Taylor, F. J. (1984). Residue arithmetic: a tutorial with examples. IEEE Computer Magazine, 17(5), 50–62.
Google Scholar
Jenkins, W., & Davidson, E. (1985). A custom-designed integrated circuit for the realization of residue number digital filters. Proc. ICASSP, pp. 220–223.
Mahesh, R., & Vinod, A. P. (2007). An architecture for integrating low complexity and reconfigurability for channel filters in software defined radio receivers. In Proc. IEEE International Symposium on Circuits and Systems, May 2007, pp. 2514–2517.
Muhammad, K., & Roy, K. (2002). Reduced computational redundancy implementation of DSP algorithms using computation sharing vector scaling. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 10(3), 292–300.
Article Google Scholar
Ho, H., Szwarc, V., & Kwasniewski, T. (2007). Hardware optimization of a configurable polyphase-FFT design using common sub-expression elimination. Proc. of the MWSCAS/NEWCAS ‘07, Montreal, Aug. 5–7, 2007.
Quan, G., Davis, J. P., Devarkal, S., & Buell, D. A. (2005). High-level synthesis for large bit–width multipliers on FPGAs: a case study. Proc. of the 3rd IEEE/ACM/IFIP Intl Conference on Hardware/Software Codesign and System Synthesis, pp. 213–218.
Gokul, G., Zhuo, L., Choi, S., & Prasanna, V. (2004). Analysis of high-performance floating-point arithmetic on FPGAs. Proc. of the 18th Parallel and Distributed Processing Symposium IPDPS’04, April 2004, p. 149.
Smith, W. W., & Smith, J. M. (1995). Handbook of real-time fast fourier transforms. Piscataway: IEEE Press.
Book MATH Google Scholar
Xilinx Corporation website, http://www.xilinx.com.

Download references

Author information

Authors and Affiliations

Communications Research Centre, 3701 Carling Ave., Station H, Box 11490, Ottawa, Ontario, Canada, K2H 8S2
H. Ho & V. Szwarc
Department of Electronics, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario, Canada, K1S 5B6
T. Kwasniewski

Authors

H. Ho
View author publications
You can also search for this author in PubMed Google Scholar
V. Szwarc
View author publications
You can also search for this author in PubMed Google Scholar
T. Kwasniewski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to H. Ho.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ho, H., Szwarc, V. & Kwasniewski, T. Low Complexity Reconfigurable DSP Circuit Implementations Based on Common Sub-expression Elimination. J Sign Process Syst 61, 353–365 (2010). https://doi.org/10.1007/s11265-010-0458-9

Download citation

Received: 26 November 2008
Revised: 01 February 2010
Accepted: 01 February 2010
Published: 06 March 2010
Issue Date: December 2010
DOI: https://doi.org/10.1007/s11265-010-0458-9

Low Complexity Reconfigurable DSP Circuit Implementations Based on Common Sub-expression Elimination

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Efficient Implementation of Complex Multipliers on FPGAs Using DSP Slices

FPGA Implementation of High Speed Multiplier with Optimized Reduction Phase

Design and Implementation of Adders and Multipliers for DSP Applications

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Low Complexity Reconfigurable DSP Circuit Implementations Based on Common Sub-expression Elimination

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Efficient Implementation of Complex Multipliers on FPGAs Using DSP Slices

FPGA Implementation of High Speed Multiplier with Optimized Reduction Phase

Design and Implementation of Adders and Multipliers for DSP Applications

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation