Analysis and efficient implementation of IEEE-754 decimal floating point adders/subtractors in FPGAs for DPD and BID encoding

Marcelo Tosini^1,2^na1,
Martín Vázquez^1,2^na1 &
Lucas Leiva^1,2^na1

122 Accesses
Explore all metrics

Abstract

This paper proposes efficient implementations for addition/subtraction based on decimal floating point with Densely Packed Decimal (DPD) and Binary Integer Decimal (BID) encoding in FPGA devices. The designs use novel techniques based on the efficient utilization of dedicated resources in programmable devices. Implementations were made in Xilinx UltraScale+. For DPD adder/subtractor, they have computation times of 7.7 ns for Decimal32, 8.1 ns for Decimal64 and 8.5 ns for Decimal128. As for BID adder/subtractor, the computation time obtained is 13.5 ns for Decimal64. The proposed architecture achieves better computation times than related works. Compared to previous architectures, the proposed DPD implementation achieves 1.86\(\times\) speedup and 47% better LUT occupation. Also, the BID adder/subtractor achieves 3\(\times\) speedup and 5% less LUT occupation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Radix-10 Restoring Square Root for 6-input LUTs Programmable Devices

Article 28 October 2020

PHAc: Posit Hardware Accelerator for Efficient Arithmetic Logic Operations

Pipeline Decimal Multiplier Using Binary Multipliers

Data availability

The data that support the findings of this study are openly available in GitHub at https://github.com/LabSET-UNICEN/, references [53, 59].

References

Nannarelli A (2014) Decimal engine for energy-efficient multicore processors. In: 2014 22nd International Conference on Very Large Scale Integration (VLSI-SoC), pp. 1–6. https://doi.org/10.1109/VLSI-SoC.2014.7004176
Dlodlo N, Mofolo M, Masoane L, Mncwabe S, Sibiya G, Mboweni L (2015) Research trends in existing technologies that are building blocks to the Internet of Things. In: Sobh T, Elleithy K (eds) Innovations and Advances in Computing, Informatics, Systems Sciences, Networking and Engineering. Springer, Cham, pp 539–548
Chapter Google Scholar
Cowlishaw MF (2003) Decimal floating-point: algorism for computers. In: Proceedings 2003 16th IEEE Symposium on Computer Arithmetic. IEEE, pp 104–111
IEEE (2008) IEEE Standard for floating-point arithmetic. IEEE Std 754-2008. https://doi.org/10.1109/IEEESTD.2008.4610935
Duale AY, Decker MH, Zipperer H-G, Aharoni M, Bohizic TJ (2007) Decimal floating-point in z9: an implementation and testing perspective. IBM J Res Dev 51(1.2):217–227. https://doi.org/10.1147/rd.511.0217
Article Google Scholar
Eisen L, Ward JW, Tast H-W, Mading N, Leenstra J, Mueller SM, Jacobi C, Preiss J, Schwarz EM, Carlough SR (2007) IBM POWER6 accelerators: VMX and DFU. IBM J Res Dev 51(6):1–21. https://doi.org/10.1147/rd.516.0663
Article Google Scholar
Schwarz EM, Kapernick JS, Cowlishaw MF (2009) Decimal floating-point support on the IBM system z10 processor. IBM J Res Dev 53(1):4–1410. https://doi.org/10.1147/JRD.2009.5388585
Article Google Scholar
IBM (2021) IBM z15 Technical Introduction. https://www.redbooks.ibm.com/redbooks/pdfs/sg248850.pdf
IBM (2021) IBM Power E1080: Technical Overview and Introduction. https://www.redbooks.ibm.com/redpapers/pdfs/redp5649.pdf
Batista F (2020) Decimal data type, version 2003. http://www.python.org/dev/peps/pep-0327
Cowlishaw MF (2010) The decnumber library, v3.68. http://speleotrove.com/decimal/ decnumber.pdf
Crismer P (2019) Eiffel decimal arithmetic library. http://www.gobosoft.com/eiffel/gobo/ math/decimal/index.html
Currie D (2007) Lua decnumber library. http://files.luaforge.net/releases/ldecnumber/ldecnumber/ldecNumber-21
Dorrigiv M, Jaberipur G (2014) Low area/power decimal addition with carry-select correction and carry-select sum-digits. Integration 47(4):443–451. https://doi.org/10.1016/j.vlsi.2014.01.004
Article Google Scholar
Cornea M, Harrison J, Anderson C, Tang P, Ping T, Schneider E, Gvozdev E (2009) A software implementation of the IEEE 754R decimal floating-point arithmetic using the binary encoding format. IEEE Trans Comput 58(2):148–162. https://doi.org/10.1109/TC.2008.209
Article MathSciNet Google Scholar
Anderson MJ, Tsen C, Wang L, Compton K, Schulte MJ (2009) Performance analysis of decimal floating-point libraries and its impact on decimal hardware and software solutions, pp 465–471
Schulte MJ, Lindberg N, Laxminarain A (2005) Performance evaluation of decimal floating-point arithmetic. In: Proceedings of the 6th IBM Austin Center for Advanced Studies Conference, vol 14. CiteSeer
Cowlishaw MF (2002) Densely packed decimal encoding. In: IEEE Proceedings of the Computers and Digital Techniques, vol 149. IEEE, pp 102–104
Bhat M, Crawford J, Morin R, Shiv K (2007) Performance characterization of decimal arithmetic in commercial java workloads. In: 2007 IEEE International Symposium on Performance Analysis of Systems & Software. IEEE, pp 54–61
Vazquez A, Antelo E (2009) A high-performance significand BCD adder with IEEE 754-2008 decimal rounding. In: 19th IEEE Symposium on Computer Arithmetic, ARITH 2009. IEEE, pp 1335–144
Vázquez M, Todorovich E (2016) FPGA-specific decimal sign-magnitude addition and subtraction. Int J Electron 103(7):1166–1185. https://doi.org/10.1080/00207217.2015.1089945
Article Google Scholar
Vázquez M, Leiva L, Sutter G (2019) Radix-10 decimal logarithm by direct selection for 6-input LUTs programmable devices. Microprocess Microsyst 64:143–158. https://doi.org/10.1016/j.micpro.2018.11.001
Article Google Scholar
Vázquez M, Tosini M, Leiva L (2021) Radix-10 restoring square root for 6-input LUTs programmable devices. Circuits Syst Signal Process. https://doi.org/10.1007/s00034-020-01571-y
Article Google Scholar
Xilinx (2016) 7 Series FPGAs configuration-user guides. www.xilinx.com
Xilinx (2018) Ultrascale architecture and product Data sheet: Overview. www.xilinx.com
Tsen C, Gonzalez-Navarro S, Schulte M (2007) Hardware design of a binary integer decimal-based floating-point adder. In: 2007 25th International Conference on Computer Design, pp 288–295. https://doi.org/10.1109/ICCD.2007.4601915
Farmahini-Farahani A, Tsen C, Compton K (2009) FPGA implementation of a 64-bit bid-based decimal floating-point adder/subtractor. In: 2009 International Conference on Field-Programmable Technology, pp 518–521. https://doi.org/10.1109/FPT.2009.5377636
Erle MA (2009) Algorithms and Hardware Designs for Decimal Multiplication. Lehigh University, Bethlehem
Google Scholar
Schmookler MS, Weinberger A (1971) High speed decimal addition. IEEE Trans Comput C 20(8):862–866. https://doi.org/10.1109/T-C.1971.223362
Article Google Scholar
Veeramachaneni S, Krishna MK, Avinash L, Reddy PS, Srinivas MB (2007) Novel, high-speed 16-digit BCD adders conforming to IEEE 754R format. In: IEEE Computer Society Annual Symposium on VLSI (ISVLSI’07), pp 343–350. https://doi.org/10.1109/ISVLSI.2007.71
Bayrakci A, Akass A (2007) Reduced delay BCD adder. In: IEEE International Conference on Application-Specific Systems, Architectures and Processors. IEEE, pp 266–271 (2007)
Juang T-B, Peng H-H, Kuo H-L (2012) Parallel and digit-serial implementations of area-efficient 3-operand decimal adders. In: 2012 International SoC Design Conference (ISOCC), pp 239–242. https://doi.org/10.1109/ISOCC.2012.6407084
Svoboda A (1969) Decimal adder with signed digit arithmetic. IEEE Trans Comput C 18(3):212–215. https://doi.org/10.1109/T-C.1969.222633
Article Google Scholar
Shirazi B, Yun DYY, Zhang CN (1989) RBCD: redundant binary coded decimal adder. IEEE Proc E (Comput Digit Tech) 136:156–1604
Article Google Scholar
Han L, Chen D, Wahid KA, Ko S-B (2011) Nonspeculative decimal signed digit adder. In: 2011 IEEE International Symposium of Circuits and Systems (ISCAS), pp 1053–1056. https://doi.org/10.1109/ISCAS.2011.5937750
Han L, Ko S-B (2013) High-speed parallel decimal multiplication with redundant internal encodings. IEEE Trans Comput 62(5):956–968. https://doi.org/10.1109/TC.2012.35
Article MathSciNet Google Scholar
Yehia K, Fahmy HAH, Hassan M (2010) A redundant decimal floating-point adder. In: 2010 Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers, pp 1144–1147. https://doi.org/10.1109/ACSSC.2010.5757583
Bioul G, Vazquez M, Deschamps J, Sutter G (2010) High-speed FPGA 10’s complement adders–subtractors. Int J Reconfig Comput 2010:219764–121976414
Article Google Scholar
Vázquez A (2009) High-performance decimal floating-point units. PhD Thesis, Universidade de Santiago de Compostela
Wang L-K, Schulte MJ (2007) Decimal floating-point adder and multifunction unit with injection-based rounding. In: 18th IEEE Symposium on Computer Arithmetic (ARITH ’07), pp. 56–68. https://doi.org/10.1109/ARITH.2007.13
Kenney RD, Schulte MJ (2005) High-speed multioperand decimal adders. IEEE Trans Comput 54(8):953–963. https://doi.org/10.1109/TC.2005.129
Article Google Scholar
Gorgin S, Jaberipur G (2009) A fully redundant decimal adder and its application in parallel decimal multipliers. Microelectron J 40(10):1471–1481. https://doi.org/10.1016/j.mejo.2009.07.002
Article Google Scholar
Vazquez A, de Dinechin F (2010) Multi-operand decimal tree adders for FPGAs. Research Report
Neto H, Véstias M (2017) Decimal addition on FPGA based on a mixed BCD/excess-6 representation. Microprocess Microsyst 55:91–99. https://doi.org/10.1016/j.micpro.2017.10.004
Article Google Scholar
Véstias M, Neto H (2021) Decimal multiplication in FPGA with a novel decimal adder/subtractor. Algorithms 14:198. https://doi.org/10.3390/a14070198
Article Google Scholar
Bennedek M (1977) Developing large binary to BCD conversion structures. IEEE Trans Comput C 26(7):688–700
Article Google Scholar
Iguchi Y, Sasao T, Matsuura M (2007) On designs of radix converters using arithmetic decompositions—binary to decimal converters. In: 37th International Symposium on Multiple-Valued Logic (ISMVL07). IEEE Computer Society, Los Alamitos, p 32. https://doi.org/10.1109/ISMVL.2007.39
Nicoud J-D (1971) Iterative arrays for radix conversion. IEEE Trans Comput C 20(12):1479–1489. https://doi.org/10.1109/T-C.1971.223160
Article Google Scholar
Thompson J, Karra N, Schulte MJ (2004) A 64-bit decimal floating-point adder. In: IEEE Computer Society Annual Symposium on VLSI, pp 297–298. https://doi.org/10.1109/ISVLSI.2004.1339563
Wang L-K, Schulte MJ, Thompson JD, Jairam N (2009) Hardware designs for decimal floating-point addition and related operations. IEEE Trans Comput 58(3):322–335. https://doi.org/10.1109/TC.2008.147
Article MathSciNet Google Scholar
Minchola C, Vazquez M, Sutter G (2011) A FPGA IEEE-754-2008 decimal64 floating-point adder/subtractor. In: 2011 VII Southern Conference on Programmable Logic (SPL), pp 251–256. https://doi.org/10.1109/SPL.2011.5782657
Tsen C, Schulte M, Gonzalez-Navarro S (2007) Hardware design of a binary integer decimal-based ieee p754 rounding unit. In: 2007 IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), pp 115–121. https://doi.org/10.1109/ASAP.2007.4429967
LabSET (2022) VHDL implementation of DPD encoded adder/subtractor. https://github.com/LabSET-UNICEN/DPD-Adder-Subtractor
Xilinx (2018) Vivado design suite user guide-design flows overview. www.xilinx.com
Xilinx (2018) Vivado design suite user guide-synthesis. www.xilinx.com
Xilinx (2013) ISE design suite 14: Release notes, installation and licensing. www.xilinx.com
Xilinx (2013) XST user guide for virtex-6, spartan-6 and 7 series devices. www.xilinx.com
Milenković N, Stankovic V, Milić M (2015) Modular design of fast leading zeros counting circuit. J Electr Eng 66:329–333. https://doi.org/10.2478/jee-2015-0054
Article Google Scholar
LabSET (2022) VHDL implementation of BID encoded adder/subtractor. https://github.com/LabSET-UNICEN/BID-Adder-Subtractor
Cornea M, Anderson C, Harrison J, Tak P, Tang P, Schneider E, Tsen C (2007) An implementation of the IEEE 754R decimal floating-point arithmetic using the binary encoding format. In: Proceedings of the 18th IEEE Symposium on Computer Arithmetic. IEEE Computer Society, Montpellier, France, pp 29–37
Xilinx (2017) ADM-PCIE-7V Data sheet. www.xilinx.com
Amazon (2017) Amazon EC2 F1 Instance. https://aws.amazon.com/ec2/instance-types/f1/
Choi Y-K, Cong J, Fang Z, Hao Y, Reinman G, Wei P (2019) In-depth analysis on microarchitectures of modern heterogeneous CPU-FPGA platforms. ACM Trans Reconfig Technol Syst. https://doi.org/10.1145/3294054
Article Google Scholar

Download references

Funding

Partial financial support was received from the Research Secretary of the Faculty of Engineering of FASTA University and SeCAT of UNICEN University.

Author information

Marcelo Tosini, Martín Vázquez and Lucas Leiva have contributed equally to this work.

Authors and Affiliations

Computer and Systems Department, UNICEN, Pinto 399, 7000, Tandil, Buenos Aires, Argentina
Marcelo Tosini, Martín Vázquez & Lucas Leiva
Engineering Faculty, FASTA University, Gascón 3145, 7600, Mar del Plata, Buenos Aires, Argentina
Marcelo Tosini, Martín Vázquez & Lucas Leiva

Authors

Marcelo Tosini
View author publications
You can also search for this author in PubMed Google Scholar
Martín Vázquez
View author publications
You can also search for this author in PubMed Google Scholar
Lucas Leiva
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MT and MV helped in conceptualization, methodology, and experimentation. MT, MV, and LL were involved in writing, review, and editing .

Corresponding authors

Correspondence to Marcelo Tosini, Martín Vázquez or Lucas Leiva.

Ethics declarations

Conflict of interest

The authors declare no competing of interest.

Ethical approval

This material is the authors’ own original work, which has not been previously published elsewhere. The paper is not currently being considered for publication elsewhere. The paper reflects the authors’ own research and analysis in a truthful and complete manner.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Tosini, M., Vázquez, M. & Leiva, L. Analysis and efficient implementation of IEEE-754 decimal floating point adders/subtractors in FPGAs for DPD and BID encoding. J Supercomput 80, 9298–9326 (2024). https://doi.org/10.1007/s11227-023-05808-w

Download citation

Accepted: 10 November 2023
Published: 06 December 2023
Issue Date: May 2024
DOI: https://doi.org/10.1007/s11227-023-05808-w

Analysis and efficient implementation of IEEE-754 decimal floating point adders/subtractors in FPGAs for DPD and BID encoding

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Radix-10 Restoring Square Root for 6-input LUTs Programmable Devices

PHAc: Posit Hardware Accelerator for Efficient Arithmetic Logic Operations

Pipeline Decimal Multiplier Using Binary Multipliers

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Analysis and efficient implementation of IEEE-754 decimal floating point adders/subtractors in FPGAs for DPD and BID encoding

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Radix-10 Restoring Square Root for 6-input LUTs Programmable Devices

PHAc: Posit Hardware Accelerator for Efficient Arithmetic Logic Operations

Pipeline Decimal Multiplier Using Binary Multipliers

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation