JP2000259394A

JP2000259394A - Floating point multiplier

Info

Publication number: JP2000259394A
Application number: JP11062387A
Authority: JP
Inventors: Takashi Osada; 孝士長田
Original assignee: NEC Computertechno Ltd
Current assignee: NEC Computertechno Ltd
Priority date: 1999-03-09
Filing date: 1999-03-09
Publication date: 2000-09-22

Abstract

PROBLEM TO BE SOLVED: To provide a floating point multiplier which performs floating point multiplication at high speed by generating a sticky bit in parallel to the multiplication of mantissa part of floating point data. SOLUTION: Mantissa part M0 and M1 of the floating point data are inputted to a multiplication array 1 and also inputted to zero counting means 4-1 and 4-2 at the same time. The zero counting means 4-1 and 4-2 count the number of zero during the period until 1 appears for the first time from the least significant digit bits of the mantissa part M0 and M1 and an adder 5 sums up their zero count results of the mantissa parts M0 and M1. A comparison circuit 6 compares the adding results of the adder 5 with a constant, 1 as the sticky bit is outputted if the constant is larger than the adding result of the adder 5 and 0 as the sticky bit is outputted if the constant is smaller than the adding result of the adder 5 or the constant is equal to the result. Consequently, it becomes possible to obtain a result of the floating multiplication at high speed because a sticky bit can be generated without using the result which is from the multiplication array 1 and a mantissa part adder 2.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は浮動小数点乗算器に
関し、特に浮動小数点データの仮数部の乗算動作に並行
してスティッキービットを生成することにより、高速に
浮動小数点乗算を行う乗算器に関する。[0001] 1. Field of the Invention [0002] The present invention relates to a floating-point multiplier, and more particularly, to a multiplier that performs high-speed floating-point multiplication by generating sticky bits in parallel with a multiplication operation of a mantissa part of floating-point data.

【０００２】[0002]

【従来の技術】従来、浮動小数点乗算器は図２に示すよ
うに、指数部加算器３と、乗算アレイ１と、仮数部加算
器２と、論理和回路８と、丸め桁合わせ回路７とを有し
て構成されていた。この構成における動作は、まず前処
理段階で切り出された浮動小数点データの指数部Ｅ０及
びＥ１を指数部加算器３により加算し、浮動小数点デー
タのｍ（正の整数）ビットの仮数部Ｍ０及びＭ１を乗算
アレイ１に入力して乗算を行い、乗算アレイ１の２出力
Ａ及びＢを仮数部加算器２にて加算することにより（２
ｍ−１）ビットの仮数部乗算結果Ｃを得る。仮数部加算
器２の出力のうち、切り捨てられる下位（ｍ−１）ビッ
トＪの総論理和Ｓ’を論理和回路８で求める。丸め桁合
わせ回路７は、この総論理和Ｓ’を制御信号として指数
部加算器３の出力Ｄと仮数部加算器２の出力の上位ｍビ
ットＣから、浮動小数点乗算器の出力Ｉを出力するとい
うものであった。この構成において、丸め桁合わせ回路
７が必要とする仮数部加算器の出力は、上位ｍビットＣ
と、切り捨てられた下位（ｍ−１）ビットＪの総論理和
Ｓ’で示されるスティッキービットである。この総論理
和Ｓ’は、乗算が完全に終了してからでないと求められ
ない。従って、従来の浮動小数点乗算器の全遅延時間
は、前処理＋仮数部乗算＋総論理和Ｓ’の算出＋丸め桁
合わせ、となり、この総論理和Ｓ’を求める時間が従来
の浮動小数点乗算器における最大遅延経路のうちの一つ
となっていた。乗算の終了を待たずに論理和回路への入
力を得る方法としては、例えば特開平２−２２４１２１
号公報には、図３に示すように乗算アレイ１００、加算
器１０１による仮数部データの乗算回路と並列に設けた
零計数手段１０３、演算手段１０４及び論理和回路１０
５によりスティッキービットを求める技術が記載されて
いる。零計数手段１０３、演算手段１０４を通過した結
果を論理和回路１０５への入力とすることにより、仮数
部データの乗算回路の出力を待たずに総論理和を求める
動作を開始する。2. Description of the Related Art Conventionally, as shown in FIG. 2, an exponent adder 3, a multiplication array 1, a mantissa adder 2, an OR circuit 8, a rounding digit matching circuit 7, It was constituted having. The operation in this configuration is as follows. First, the exponent parts E0 and E1 of the floating-point data cut out in the preprocessing stage are added by the exponent part adder 3, and the mantissa parts M0 and M1 of m (positive integer) bits of the floating-point data. Is input to the multiplication array 1 to perform multiplication, and the two outputs A and B of the multiplication array 1 are added by the mantissa adder 2 to obtain (2
(m-1) A mantissa multiplication result C of bits is obtained. In the output of the mantissa adder 2, the total OR S ′ of the lower (m−1) bits J to be truncated is obtained by the OR circuit 8. The rounding and digit matching circuit 7 outputs the output I of the floating point multiplier from the output D of the exponent part adder 3 and the upper m bits C of the output of the mantissa part adder 2 using the total OR S ′ as a control signal. It was that. In this configuration, the output of the mantissa adder required by the rounding digit matching circuit 7 is the upper m bits C
And the sticky bit indicated by the total logical sum S ′ of the truncated lower (m−1) bits J. The total OR S 'can be obtained only after the multiplication is completely completed. Therefore, the total delay time of the conventional floating-point multiplier is: pre-processing + mantissa multiplication + calculation of total OR S ′ + rounding digit alignment. One of the maximum delay paths in the vessel. A method for obtaining an input to the OR circuit without waiting for the end of the multiplication is described in, for example, Japanese Patent Application Laid-Open No. 2-224121.
In the publication, as shown in FIG. 3, a multiplication array 100, a zero counting means 103 provided in parallel with a multiplication circuit for mantissa data by an adder 101, an operation means 104, and an OR circuit 10
5 describes a technique for obtaining a sticky bit. By using the result of passing through the zero counting means 103 and the arithmetic means 104 as an input to the OR circuit 105, the operation of calculating the total OR is started without waiting for the output of the multiplication circuit of the mantissa data.

【０００３】[0003]

【発明が解決しようとする課題】しかし、この従来技術
は、次のような問題点があった。すなわち問題点は、ス
ティッキービット生成に要する時間が大きい、というこ
とである。スティッキービット生成は、浮動小数点乗算
器の全遅延時間のうちの１工程を占めているため、ステ
ィッキービット生成に時間がかかると浮動小数点乗算器
全体としての演算速度が低くなってしまう。その理由
は、論理和回路にて用いる制御信号に仮数部データの乗
算回路からの出力を経由する信号を使用していることに
ある。これに対し、特許２６７６４１０には仮数部デー
タの乗算回路からスティッキービットを生成する技術で
はなく、被乗数仮数部データと乗数仮数部データとを入
力して、それぞれの最下位ビットから１が現れるまでの
０の個数をカウントする零計数手段による技術が公開さ
れている。本発明もまた零計数手段により問題点を解決
する浮動小数点乗算器を提供する。However, this prior art has the following problems. That is, the problem is that the time required for sticky bit generation is long. Since sticky bit generation occupies one step of the total delay time of the floating point multiplier, if the sticky bit generation takes time, the operation speed of the floating point multiplier as a whole decreases. The reason is that a signal passing through the output of the multiplication circuit of the mantissa data is used as the control signal used in the OR circuit. On the other hand, Japanese Patent No. 2676410 does not describe a technique of generating sticky bits from a multiplication circuit of mantissa data, but inputs multiplicand mantissa data and multiplier mantissa data and performs processing from the least significant bit until 1 appears. A technique using zero counting means for counting the number of zeros has been disclosed. The present invention also provides a floating point multiplier that solves the problem by means of zero counting.

【０００４】[0004]

【課題を解決するための手段】本発明における浮動小数
点乗算器は、浮動小数点データの仮数部の乗算動作に並
行してスティッキービットを生成することにより、高速
に浮動小数点乗算を行うものである。図１において、
浮動小数点データの仮数部Ｍ０及びＭ１は乗算アレイ１
への入力と同時に零計数手段４−１及び４−２への入力
となる。零計数手段４−１及び４−２にて、仮数部Ｍ０
及びＭ１の最下位ビットから１が現れるまでの０の個数
をカウントし、仮数部Ｍ０及びＭ１の零計数結果を加算
器５にて加算する。比較回路６にて加算器５の加算結果
と定数とを比較し、加算器５の加算結果より定数の方が
大きければスティッキービットとして１を出力し、加算
器５の加算結果より定数の方が小さい、または等しけれ
ばスティッキービットとして０を出力する。これによ
り、乗算アレイ１と仮数部加算器２を経由した結果を用
いずにスティッキービットを生成できるため浮動小数点
乗算の結果を高速に求めることができる。請求項１に記
載の発明は、浮動小数点データの仮数部データの乗算動
作に並行して、仮数部データからスティッキービットを
直に生成することにより、丸め桁合わせ処理を行う浮動
小数点乗算器において、上記のスティッキービット生成
手段は、被乗数仮数部データと乗数仮数部データとを入
力して、それぞれの最下位ビットから１が現れるまでの
０の個数をカウントする２個の零計数手段と、上記２個
の零計数手段の零計数を加算する加算器と、上記加算器
の加算結果と定数とを比較し、加算器の加算結果より定
数の方が大きければスティッキービット＝１を出力し、
加算器の加算結果より定数の方が小さい、または等しけ
ればスティッキービット＝０を出力する比較回路と、を
具備することを特徴としている。請求項２に記載の発明
は、請求項１に記載の浮動小数点乗算器において、上記
被乗数仮数部データの桁数をｍ、上記乗数仮数部データ
の桁数をｍに設定したとき、上記比較回路で比較される
定数をｍ−１とすることを特徴としている。請求項３に
記載の発明は、被乗数仮数部データおよび乗数仮数部デ
ータのそれぞれの最下位ビットから１が現れるまでの０
の個数をカウントする２個の零計数手段と、上記２個の
零計数手段の零計数を加算する加算器と、上記加算器の
加算結果と定数とを比較し、加算器の加算結果より定数
の方が大きければスティッキービット＝１を出力し、加
算器の加算結果より定数の方が小さい、または等しけれ
ばスティッキービット＝０を出力する比較回路と、を具
備するスティッキービット生成手段と、被乗数仮数部デ
ータと乗数仮数部データとを入力して、両者の乗算によ
り部分積を算出し、複数の部分積を加算して２出力の部
分積を出力する乗算アレイと、上記２出力部分積を加算
し、仮数部加算結果を出力する仮数部加算器と、上記仮
数部加算器の出力のうち、切り捨てられる下位ビットの
総論理和を算出する論理和回路と、を具備するスティッ
キービット生成手段の両手段により生成されるスティッ
キービットを比較するチェック回路を有することを特徴
としている。SUMMARY OF THE INVENTION A floating-point multiplier according to the present invention performs high-speed floating-point multiplication by generating sticky bits in parallel with a multiplication operation of a mantissa of floating-point data. In FIG.
The mantissa parts M0 and M1 of the floating-point data are the multiplication array 1
At the same time as the input to the zero counting means 4-1 and 4-2. In the zero counting means 4-1 and 4-2, the mantissa M0
, And the number of zeros from the least significant bit of the M1 to the appearance of a 1 is counted, and the adder 5 adds the zero count results of the mantissa parts M0 and M1. The comparison circuit 6 compares the addition result of the adder 5 with the constant. If the constant is larger than the addition result of the adder 5, 1 is output as a sticky bit. If they are smaller or equal, 0 is output as a sticky bit. As a result, the sticky bit can be generated without using the result passed through the multiplication array 1 and the mantissa adder 2, so that the result of the floating-point multiplication can be obtained at high speed. The floating-point multiplier according to claim 1 performs a rounding digit matching process by directly generating sticky bits from the mantissa data in parallel with the multiplication operation of the mantissa data of the floating-point data, The sticky bit generating means receives the multiplicand mantissa data and the multiplier mantissa data and counts the number of zeros from the least significant bit to the appearance of one. An adder for adding the zero counts of the zero counting means, and comparing the addition result of the adder with a constant, and outputting a sticky bit = 1 when the constant is larger than the addition result of the adder;
A comparison circuit that outputs a sticky bit = 0 if the constant is smaller than or equal to the addition result of the adder. According to a second aspect of the present invention, in the floating-point multiplier according to the first aspect, when the number of digits of the multiplicand mantissa data is set to m and the number of digits of the multiplicand mantissa data is set to m, the comparison circuit Is set to m-1. According to a third aspect of the present invention, there is provided a method in which the least significant bit of each of the multiplicand mantissa data and the multiplier mantissa data is changed from the least significant bit until 1 appears.
Two zero counting means for counting the number of zeros, an adder for adding the zero counts of the two zero counting means, and comparing the addition result of the adder with a constant. A sticky bit generating means having a sticky bit = 1 if the value is larger, a comparing circuit outputting a sticky bit = 0 if the constant is smaller than or equal to the addition result of the adder, and a multiplicand mantissa. A multiplication array that inputs partial data and multiplier mantissa data, calculates a partial product by multiplying the two, adds a plurality of partial products, and outputs a two-output partial product, and adds the two-output partial product A sticky bit generator comprising: a mantissa adder for outputting a result of the mantissa addition; and an OR circuit for calculating a total OR of the lower bits to be truncated among the outputs of the mantissa adder. It is characterized by having a check circuit for comparing the sticky bits generated by both means.

【０００５】[0005]

【発明の実施の形態】図１は本発明の実施例における浮
動小数点乗算器の構成例を示すブロック図である。乗算
アレイ１は、仮数部加算器２と接続され、前処理段階で
切り出された浮動小数点データのｍ（正の整数）ビット
の仮数部Ｍ０及びＭ１を入力として乗算を行い、２個の
部分積Ａ，Ｂを得る。乗算アレイ１の出力である２個の
部分積Ａ，Ｂの和が仮数部の乗算結果となる。仮数部加
算器２は、乗算器アレイ１と丸め桁合わせ回路７に接続
され、乗算アレイ１の出力である２個の部分積Ａ，Ｂを
入力として加算を行い、結果のうち有効桁となる上位ｍ
ビットＣを丸め桁合わせ回路７に出力する。指数部加算
器３は、丸め桁合わせ回路７と接続され、前処理段階で
切り出された浮動小数点データの指数部Ｅ０及びＥ１を
加算し、指数部加算結果Ｄを丸め桁合わせ回路７に出力
する。零計数手段４−１及び４−２は、加算器５と接続
され、仮数部Ｍ０及びＭ１の最下位ビットから１が現れ
るまでの０の個数をそれぞれカウントし、カウント結果
Ｆ及びＧを加算器５へ出力する。加算器５は、零計数手
段４−１及び４−２と比較回路６に接続され、零計数手
段４−１及び４−２の出力Ｆ及びＧを加算し、加算結果
Ｈを比較回路６へ出力する。比較回路６は、加算器５と
丸め桁合わせ回路７に接続され、加算器５の加算結果Ｈ
と、仮数部の有効桁ｍから１を減じた定数（ｍ−１）と
を比較し、結果をスティッキービットＳとして丸め桁合
わせ回路７へ出力する。丸め桁合わせ回路７は、仮数部
加算器２と指数部加算器３と比較回路６に接続され、比
較回路６からの出力であるスティッキービットＳを制
御信号として指数部加算器３の出力Ｄと仮数部加算器２
の出力の上位ｍビットＣから浮動小数点乗算器の乗算結
果Ｉを出力する。FIG. 1 is a block diagram showing a configuration example of a floating-point multiplier according to an embodiment of the present invention. The multiplication array 1 is connected to the mantissa adder 2 and performs multiplication by using the mantissas M0 and M1 of m (positive integer) bits of the floating-point data cut out in the preprocessing stage as inputs, and performs two partial products. A and B are obtained. The sum of the two partial products A and B output from the multiplication array 1 is the result of multiplication of the mantissa. The mantissa adder 2 is connected to the multiplier array 1 and the rounding / digit matching circuit 7 and performs an addition using the two partial products A and B, which are the outputs of the multiplier array 1, as an input. Top m
The bit C is output to the rounding digit matching circuit 7. The exponent part adder 3 is connected to the rounding and digit matching circuit 7, adds the exponent parts E0 and E1 of the floating-point data cut out in the preprocessing stage, and outputs the exponent part addition result D to the rounding and digit matching circuit 7. . The zero counting means 4-1 and 4-2 are connected to the adder 5, count the number of zeros from the least significant bit of the mantissa parts M0 and M1 to the appearance of one, and add the count results F and G to the adder. Output to 5 The adder 5 is connected to the zero counting means 4-1 and 4-2 and the comparison circuit 6, adds the outputs F and G of the zero counting means 4-1 and 4-2, and outputs the addition result H to the comparison circuit 6. Output. The comparison circuit 6 is connected to the adder 5 and the rounding digit matching circuit 7, and the addition result H of the adder 5
And a constant (m-1) obtained by subtracting 1 from the significant digit m of the mantissa, and outputs the result to the rounding digit matching circuit 7 as a sticky bit S. The rounding / digit matching circuit 7 is connected to the mantissa adder 2, the exponent adder 3, and the comparison circuit 6, and uses the sticky bit S output from the comparison circuit 6 as a control signal to output the output D of the exponent adder 3. And mantissa adder 2
Outputs the multiplication result I of the floating-point multiplier from the upper m bits C of the output.

【０００６】図４は、本発明の実施例におけるｍ＝５２
の時の零計数手段４−１及び４−２の詳細な構成図であ
る。零計数手段４は、図５で示される零計数回路２０を
３段と、セレクタ２１及びセレクタ２２により構成され
る。零計数手段４は５２ビットの仮数部データの最下位
ビットをＭ０［００］、最上位ビットをＭ０［５１］と
して最下位ビットから４ビットずつ零計数回路２０へ入
力する。零計数回路２０は４ビットの入力に対し、最下
位ビットからの０の個数を３ビットの２進数として出力
する。出力の際に３ビットのうち最上位ビットのみ反転
する。１段目の零計数回路２０の出力のうち、最上位ビ
ットは次段の零計数回路２０への入力となり、下位２ビ
ットはセレクタ２１への入力となる。セレクタ２１は零
計数回路２０とセレクタ２２に接続され、零計数回路２
０の出力の下位２ビット４組を入力として、２段目の零
計数回路２０の出力の下位２ビットを制御信号として４
組のうちの１つを選択してセレクタ２２へ出力する。セ
レクタ２２はセレクタ２１と零計数回路２０に接続さ
れ、２段目の零計数回路２０の出力の下位２ビットとセ
レクタ２１の出力２ビットを合わせた４ビット４組を入
力として、３段目の零計数回路２０の出力の下位２ビッ
トを制御信号として４組のうちの１つを選択して出力す
る。３段目の零計数回路２０の出力の下位２ビットと、
セレクタ２２の出力４ビットを合わせた６ビットの結果
が、零計数手段４−１の出力Ｆとなる。かくして出力さ
れた零計数手段４−１の出力Ｆと、同様に出力された零
計数手段４−２の出力Ｇとを加算器５に入力して得られ
た出力Ｈと、定数ｍ−１とを比較してスティッキービッ
トＳが得られる。図６には出力Ｈに対応するスティッキ
ービットＳの値を示す（比較回路６によるスティッキー
ビットの決定は下記乗算器の動作で説明する）。FIG. 4 shows an embodiment of the present invention where m = 52.
FIG. 4 is a detailed configuration diagram of the zero counting means 4-1 and 4-2 at the time of FIG. The zero counting means 4 includes three stages of the zero counting circuit 20 shown in FIG. The zero counting means 4 inputs the least significant bit of the 52-bit mantissa data to M0 [00] and the most significant bit to M0 [51] to the zero counting circuit 20 four bits at a time from the least significant bit. The zero counting circuit 20 outputs the number of 0s from the least significant bit to a 4-bit input as a 3-bit binary number. At the time of output, only the most significant bit of the three bits is inverted. Of the output of the first stage zero counting circuit 20, the most significant bit becomes an input to the next stage zero counting circuit 20, and the lower two bits become an input to the selector 21. The selector 21 is connected to the zero counting circuit 20 and the selector 22, and the zero counting circuit 2
The four low-order 2 bits of the output of 0 are input, and the low-order 2 bits of the output of the second stage zero counting circuit 20 are used as a control signal.
One of the sets is selected and output to the selector 22. The selector 22 is connected to the selector 21 and the zero-counting circuit 20, and receives as input a 4-bit, 4-bit combination of the lower two bits of the output of the second-stage zero-counting circuit 20 and the output 2 bits of the selector 21, and the third-stage One of four sets is selected and output using the lower 2 bits of the output of the zero counting circuit 20 as a control signal. The lower 2 bits of the output of the third stage zero counting circuit 20;
The 6-bit result obtained by adding the 4 bits of the output of the selector 22 becomes the output F of the zero counting means 4-1. An output H obtained by inputting the output F of the zero counting means 4-1 thus output and the output G of the zero counting means 4-2 similarly output to the adder 5, a constant m-1 and To obtain a sticky bit S. FIG. 6 shows the value of the sticky bit S corresponding to the output H (the determination of the sticky bit by the comparison circuit 6 will be described in the following operation of the multiplier).

【０００７】図７は、本発明の実施例におけるｍ＝５２
の時の比較回路６の詳細な回路図である。ｍ＝５２の
時、比較回路６は加算器５の７ビットの出力Ｈ［６：
０］を入力として、定数（ｍ−１＝５１）との比較を行
い、定数（ｍ−１）がＨより大きい場合に１を出力す
る。FIG. 7 shows an embodiment of the present invention in which m = 52.
9 is a detailed circuit diagram of the comparison circuit 6 at the time of FIG. When m = 52, the comparison circuit 6 outputs the 7-bit output H [6:
[0] is input, a comparison is made with a constant (m-1 = 51), and when the constant (m-1) is larger than H, 1 is output.

【０００８】次に図１の乗算器の動作について、図を参
照して説明する。浮動小数点データは、１ビットの符号
ビット、ｎ（正の整数）ビットの指数部Ｅ、ｍ（正の整
数）ビットの仮数部Ｍで構成され、前処理回路で切り出
される。浮動小数点データの乗算は、指数部の加算と仮
数部の乗算を行った後に、丸め及び桁合わせを行うこと
により結果を得ることができる。まず、前処理段階で切
り出された浮動小数点データの指数部Ｅ０及びＥ１を、
指数部加算器３により加算し、得られた指数部加算結果
Ｄを丸め桁合わせ回路７に出力する。前処理段階で切り
出された浮動小数点データのｍビットの仮数部Ｍ０及び
Ｍ１は、乗算アレイ１及び零計数手段４−１及び４−２
に入力される。乗算アレイ１は図８を参照すると、入力
された仮数部Ｍ０を被乗数、Ｍ１を乗数として、乗数の
各ビットに被乗数を乗じたもの（部分積と呼ぶ）を２進
数の筆算の形に並べ、これを加算することによって積を
求める。各部分積の加算には、図９に示すような全加算
器で構成される加算回路を用いることにより、ｍ個の部
分積を２個になるまで加算し、最終的に得られた２つの
部分積Ａ及びＢを仮数部加算器２に出力する。仮数部加
算器２は乗算アレイ１の２出力Ａ及びＢを加算し、ｍビ
ットの仮数部Ｍ１とＭ２の乗算結果として（２ｍ−１）
ビットの積を得る。この積のうち、仮数部有効桁である
上位ｍビットＣを丸め桁合わせ回路７へ出力する。な
お、切り捨てられる下位（ｍ−１）ビットの総論理和
を、スティッキービットとして丸めに用いるのが一般的
である。ここで切り捨てられる下位（ｍ−１）ビットが
全て０であればスティッキービットは０である。図８を
参照すると、仮数部Ｍ０とＭ１の積について下位ビット
から数えて１が現れるまでの０の個数は、仮数部Ｍ０と
Ｍ１それぞれの下位ビットから数えて１が現れるまでの
０の個数Ｆ及びＧの和Ｈに等しいことがわかる。そこで
仮数部Ｍ０及びＭ１の下位ビットから数えて１が現れる
までの０の個数Ｆ及びＧの和Ｈを求め、この値と切り捨
てられるビット数（ｍ−１）とを比較し、仮数部Ｍ０及
びＭ１の下位ビットから数えて１が現れるまでの０の個
数Ｆ及びＧの和Ｈの方が切り捨てられるビット数（ｍ−
１）より大きい、または等しい場合には、切り捨てられ
るビット中に１は存在しないため、スティッキービット
は０となり、切り捨てられるビット数（ｍ−１）の方が
大きければ、切り捨てられるビット中に１が存在するこ
とになり、スティッキービットは１となる。図１を参照
すると、零計数手段４−１及び４−２はそれぞれ仮数部
Ｍ０、Ｍ１の最下位ビットから数えて１が現れるまでの
０の個数をカウントし、カウント結果Ｆ及びＧをそれぞ
れ加算器５に入力する。図４に示されるｍ＝５２の時の
零計数手段４の構成図を参照すると、零計数手段４の内
部にて仮数部Ｍ０は最下位ビットをＭ０［００］とし
て、最下位ビットから４ビットずつ零計数回路２０へ入
力される。１段目の零計数回路２０にてそれぞれ４ビッ
トのうちの下位ビットからの０の個数をカウントし、３
ビットの２進数として出力（但し最上位ビットは反転し
て出力）する。最上位ビットは次段の零計数回路２０へ
の入力となり、下位２ビットはセレクタ２１への入力と
なる。２段目の零計数回路２０は、１段目の零計数回路
２０の出力の最上位ビットを入力として０の個数をカウ
ントし、３ビットの２進数として出力する。セレクタ２
１は１段目の零計数回路２０の出力の下位２ビット４組
を入力とし、２段目の零計数回路２０の出力の下位２ビ
ットを制御信号として４組のうちの１つを選択し、セレ
クタ２２へ出力する。３段目の零計数回路２０は、２段
目の零計数回路２０の出力の最上位ビットを入力として
０の個数をカウントし、３ビットの２進数として出力す
る。３ビットの出力のうち、最上位ビットは使用せず
（ｍが６４以下のため）、下位２ビットＦ［５］Ｆ
［４］がそれぞれ１０進数で３２、１６を表す仮数部Ｍ
０のカウント値となる。セレクタ２２は、２段目の零計
数回路２０の出力の下位２ビットとセレクタ２１の出力
２ビットを合わせた４ビット４組を入力とし、３段目の
零計数回路２０の出力の下位２ビットを制御信号として
４組のうちの１つを選択し出力する。セレクタ２２の４
ビット出力Ｆ［３］〜Ｆ［０］が、それぞれ１０進数で
８、４、２、１を表す仮数部Ｍ０のカウント値となる。
Ｆ［５］〜Ｆ［０］の６ビット出力が、仮数部５２ビッ
トの最下位ビットからの０のカウント値となる。なお、
図４を参照すると、零計数手段４が要する論理段数は最
大７段となる。加算器５はカウント結果Ｆ及びＧを加算
し、加算結果Ｈを比較回路６へ送出する。比較回路６は
加算結果Ｈと定数（ｍ−１）とを比較し、加算結果Ｈの
方が定数（ｍ−１）よりも大きい、または等しい場合に
スティッキービットＳとして０を出力し、定数（ｍ−
１）の方が大きければＳ＝１を出力する。ｍ＝５２の
時、加算結果Ｈは２進数７ビットで表され、この時の
（ｍ−１＝５１）との比較結果であるＳの真理値表は図
６の様に表される。図６をもとに比較回路６を回路図に
表したものが図７であり、最大５段の論理段数で実現し
ている。丸め桁合わせ回路７は、指数部加算器３からの
出力である指数部加算結果Ｄと仮数部加算器２からの出
力である仮数部加算結果Ｃと比較回路６からの出力であ
るスティッキービットＳを用いて、スティッキービット
Ｓを制御信号として指数部加算結果Ｄと仮数部加算結果
Ｃより乗算結果の出力Ｉを出力する。Next, the operation of the multiplier of FIG. 1 will be described with reference to the drawings. The floating-point data is composed of a sign bit of 1 bit, an exponent E of n (positive integer) bits, and a mantissa M of m (positive integer) bits, and is cut out by a preprocessing circuit. In the multiplication of floating-point data, a result can be obtained by performing addition of an exponent part and multiplication of a mantissa part, and then performing rounding and digit alignment. First, the exponent parts E0 and E1 of the floating-point data cut out in the preprocessing stage are
The result is added by the exponent part adder 3, and the obtained exponent part addition result D is output to the rounding and digit matching circuit 7. The m-bit mantissa parts M0 and M1 of the floating-point data cut out in the preprocessing stage are used as the multiplication array 1 and the zero counting means 4-1 and 4-2.
Is input to With reference to FIG. 8, the multiplication array 1 arranges the input mantissa M0 as a multiplicand and M1 as a multiplier, and multiplies each bit of the multiplier by a multiplicand (called a partial product) in a binary arithmetic form. The product is obtained by adding these. The addition of each partial product is performed by using an adder circuit composed of a full adder as shown in FIG. 9 so that m partial products are added up to two, and two finally obtained products are obtained. The partial products A and B are output to the mantissa adder 2. The mantissa adder 2 adds the two outputs A and B of the multiplication array 1 and obtains (2m-1) as a multiplication result of the m-bit mantissas M1 and M2.
Get the product of bits. Among these products, the higher-order m bits C, which are significant digits of the mantissa, are output to the rounding / digit matching circuit 7. Generally, the total OR of the lower (m-1) bits to be truncated is used as a sticky bit for rounding. If all the lower (m-1) bits to be discarded are 0, the sticky bit is 0. Referring to FIG. 8, the number of zeros until the 1 appears from the lower bits of the product of the mantissa parts M0 and M1 is the number F of the zeros until the 1 appears from the lower bits of each of the mantissas M0 and M1. And G are equal to the sum H. Therefore, the sum H of the numbers F and G of 0s until the 1 appears from the lower bits of the mantissa parts M0 and M1 is calculated, and this value is compared with the number of bits to be truncated (m-1). The number of bits (m−m−m) that is the sum of the number H of 0 and the sum H of G until a 1 appears from the lower bit of M1
1) If greater or equal, there is no 1 in the bits to be truncated, so the sticky bit is 0, and if the number of bits to be truncated (m-1) is larger, then 1 in the bits to be truncated. It will be present and the sticky bit will be 1. Referring to FIG. 1, the zero counting means 4-1 and 4-2 count the number of zeros from the least significant bit of the mantissa parts M0 and M1 until 1 appears, and add the count results F and G, respectively. Input to the container 5. Referring to the configuration diagram of the zero counting means 4 when m = 52 shown in FIG. 4, inside the zero counting means 4, the mantissa part M0 sets the least significant bit to M0 [00] and sets 4 bits from the least significant bit. Are input to the zero counting circuit 20. The first-stage zero counting circuit 20 counts the number of zeros from the lower bits of the four bits, and
Output as a binary number of bits (however, the most significant bit is inverted and output). The most significant bit becomes an input to the next stage zero counting circuit 20, and the lower two bits become an input to the selector 21. The second-stage zero counting circuit 20 receives the most significant bit of the output of the first-stage zero counting circuit 20 as an input, counts the number of zeros, and outputs it as a 3-bit binary number. Selector 2
Reference numeral 1 designates one of the four sets of the lower two bits of the output of the first stage zero counting circuit 20 as a control signal and the lower two bits of the output of the second stage zero counting circuit 20 as a control signal. , To the selector 22. The third-stage zero counting circuit 20 counts the number of zeros by using the most significant bit of the output of the second-stage zero counting circuit 20 as an input, and outputs it as a 3-bit binary number. Of the 3-bit output, the most significant bit is not used (m is 64 or less), and the lower 2 bits F [5] F
[4] is a mantissa M representing 32 and 16 in decimal respectively
The count value becomes 0. The selector 22 receives four 4-bit combinations of the lower two bits of the output of the second stage zero counting circuit 20 and the two bits of the output of the selector 21, and receives the lower two bits of the output of the third stage zero counting circuit 20. Is used as a control signal to select and output one of the four sets. 4 of selector 22
The bit outputs F [3] to F [0] are the count values of the mantissa M0 representing 8, 4, 2, and 1 in decimal, respectively.
The 6-bit output of F [5] to F [0] is the count value of 0 from the least significant bit of the mantissa part 52 bits. In addition,
Referring to FIG. 4, the number of logical stages required by the zero counting means 4 is a maximum of seven stages. The adder 5 adds the count results F and G, and sends the addition result H to the comparison circuit 6. The comparison circuit 6 compares the addition result H with a constant (m-1), and outputs 0 as the sticky bit S when the addition result H is larger than or equal to the constant (m-1), and outputs a constant ( m-
If 1) is larger, S = 1 is output. When m = 52, the addition result H is represented by 7-bit binary numbers, and the truth table of S, which is the result of comparison with (m-1 = 51), is shown in FIG. FIG. 7 is a circuit diagram of the comparison circuit 6 based on FIG. 6, which is realized with a maximum of five logic stages. The rounding / digit matching circuit 7 outputs an exponent part addition result D output from the exponent part adder 3, a mantissa addition result C output from the mantissa part adder 2, and a sticky bit S output from the comparison circuit 6. And the output I of the multiplication result is output from the exponent addition result D and the mantissa addition result C using the sticky bit S as a control signal.

【０００９】以上の様に仮数部を入力とする零計数手段
４と、零計数手段４の出力を入力とする加算器５と、加
算器５の出力を入力として定数と比較を行うことにより
スティッキービットを生成する比較回路６を設けたこと
により、仮数部の有効桁ｍ＝５２ビット時のスティッキ
ービット生成に要する工程は零計数手段４（論理段数７
段）＋加算器５（６ビット加算器）＋比較回路６（論理
段数５段）となり、これは、乗算アレイ１（論理積１段
＋全加算器２＊５段）＋仮数部加算器２（１０３ビット
加算器）＋論理和回路８（論理段数４段）による構成よ
りもスティッキービット生成を高速に実現することがで
きる。As described above, the zero counting means 4 having the input of the mantissa part, the adder 5 having the input of the output of the zero counting means 4 as an input, and the output of the adder 5 being an input being compared with a constant to be sticky. By providing the comparison circuit 6 for generating the bits, the step required for generating the sticky bit when the significant digit m of the mantissa is 52 bits is zero counting means 4 (the number of logic stages is 7).
Stage) + adder 5 (6-bit adder) + comparator circuit 6 (5 logical stages), which is multiplication array 1 (1 logical product + full adder 2 * 5 stages) + mantissa adder 2 Sticky bit generation can be realized at higher speed than in the configuration of (103-bit adder) + OR circuit 8 (4 logical stages).

【００１０】本発明の他の実施例について図面を参照し
て詳細に説明する。図１０を参照すると、仮数部加算器
２の出力Ｊを入力とする論理和回路８と、論理和回路８
の出力Ｓ’と比較回路６の出力Ｓを入力とするチェック
回路９が設けられている。仮数部加算器２の加算結果の
切り捨てられる下位（ｍ−１）ビットＪを論理和回路８
へ出力し、論理和回路８にてＪの総論理和Ｓ’を得る。
Ｓ’は従来の方式にて求められるスティッキービットで
あるため、このＳ’と比較回路６の出力であるスティッ
キービットＳとをチェック回路９にて比較することによ
り、仮数部より求めたスティッキービットＳが、乗算ア
レイ及び仮数部加算器を通過した仮数部乗算結果から求
めたスティッキービットＳ’と合致していることを確認
することができる。この実施例は、乗算アレイや仮数部
加算器等のハードウェア量の大きな回路を二重化せず
に、少ないハードウェア量でスティッキービットをチェ
ックできるという新たな効果を有する。Another embodiment of the present invention will be described in detail with reference to the drawings. Referring to FIG. 10, a logical sum circuit 8 having an input of an output J of the mantissa adder 2 and a logical sum circuit 8
And a check circuit 9 which receives the output S ′ of the comparator circuit 6 and the output S of the comparison circuit 6 as inputs. The lower (m-1) bits J of the addition result of the mantissa adder 2 which are truncated are added to the OR circuit 8
And the OR circuit 8 obtains the total OR S 'of J.
Since S 'is a sticky bit obtained by the conventional method, the check circuit 9 compares this S' with the sticky bit S output from the comparison circuit 6 to obtain the sticky bit S obtained from the mantissa. Is consistent with the sticky bit S ′ obtained from the result of the mantissa multiplication that has passed through the multiplication array and the mantissa adder. This embodiment has a new effect that a sticky bit can be checked with a small amount of hardware without duplicating circuits having a large amount of hardware such as a multiplication array and a mantissa adder.

【００１１】[0011]

【発明の効果】以上説明したように、本発明によれば次
のような効果が期待できる。すなわち最大の効果は、浮
動小数点乗算器全体としての演算速度を高速化できると
いうことである。その理由は、乗算アレイと、乗算アレ
イの２つの出力を入力とする仮数部加算器とは別に、仮
数部を入力とする零計数手段と、零計数手段の出力を入
力とする加算器と、加算器の出力を入力として定数と比
較を行うことによりスティッキービットを生成する比較
回路を設けたことにより、切り捨てられた仮数部乗算結
果の下位ビットの総論理和を求めなくともスティッキー
ビットを得られるからである。As described above, according to the present invention, the following effects can be expected. That is, the greatest effect is that the operation speed of the whole floating-point multiplier can be increased. The reason is that, apart from a multiplication array and a mantissa adder which receives two outputs of the multiplication array as inputs, a zero counting means having a mantissa as an input, an adder having an output of the zero counting means as an input, By providing a comparison circuit that generates a sticky bit by comparing an output of the adder with a constant and generating a sticky bit, a sticky bit can be obtained without obtaining the total OR of the lower bits of the truncated mantissa multiplication result Because.

[Brief description of the drawings]

【図１】本発明の浮動小数点乗算器の構成を示すブロ
ック図である。FIG. 1 is a block diagram illustrating a configuration of a floating-point multiplier according to the present invention.

【図２】従来の浮動小数点乗算器の１実施例の構成を
示すブロック図である。FIG. 2 is a block diagram showing a configuration of one embodiment of a conventional floating point multiplier.

【図３】従来のスティッキービット生成手段の１実施
例の構成を示すブロック図である。FIG. 3 is a block diagram showing a configuration of one embodiment of a conventional sticky bit generation unit.

【図４】本発明の浮動小数点乗算器の零計数手段の構
成を示すブロック図である（浮動小数点データの仮数部
ビット桁数は５２）。FIG. 4 is a block diagram showing a configuration of zero counting means of the floating-point multiplier of the present invention (the number of bits of the mantissa part of the floating-point data is 52).

【図５】本発明の零計数手段を構成する零計数回路の
回路図である。FIG. 5 is a circuit diagram of a zero counting circuit constituting the zero counting means of the present invention.

【図６】スティッキービットＳの真理値表である。FIG. 6 is a truth table of the sticky bit S;

【図７】スティッキービットＳの真理値表に基づく比
較回路の回路図である。FIG. 7 is a circuit diagram of a comparison circuit based on a truth table of sticky bits S;

【図８】５桁の被乗数と５桁の乗数の乗算を筆算形式
に並べた図である。FIG. 8 is a diagram in which multiplications of a 5-digit multiplicand and a 5-digit multiplier are arranged in a handwriting format.

【図９】乗算アレイ用として、全加算器で構成される
加算回路のブロック図である。FIG. 9 is a block diagram of an adder circuit including a full adder for a multiplication array.

【図１０】本発明の浮動小数点乗算器の他の実施例の
構成を示すブロック図である。FIG. 10 is a block diagram showing a configuration of another embodiment of the floating-point multiplier of the present invention.

[Explanation of symbols]

１…乗算アレイ２…仮数
部加算器３…指数部加算器４…零計
数手段５…加算器６…比較
回路７…丸め桁合わせ回路８…論理
和回路１００…乗算アレイ１０１…
加算器１０２…シフタ１０３…
零計数手段１０４…演算手段１０５…
論理和回路２０…零計数回路２１…セ
レクタ２２…セレクタ９…チェ
ック回路REFERENCE SIGNS LIST 1 multiplication array 2 mantissa adder 3 exponent adder 4 zero counting means 5 adder 6 comparison circuit 7 rounding digit matching circuit 8 OR circuit 100 multiplication array 101
Adder 102 ... Shifter 103 ...
Zero counting means 104 ... Calculation means 105 ...
OR circuit 20 ... Zero counting circuit 21 ... Selector 22 ... Selector 9 ... Check circuit

Claims

[Claims]

1. A floating-point multiplier for performing rounding and digit matching processing by directly generating sticky bits from mantissa data in parallel with an operation of multiplying mantissa data of floating-point data. The generating means inputs the multiplicand mantissa data and the multiplier mantissa data,
Two zero counting means for counting the number of zeros from each least significant bit until a 1 appears, an adder for adding the zero counts of the two zero counting means, an addition result of the adder and a constant And if the constant is larger than the addition result of the adder, the sticky bit = 1
And a comparing circuit that outputs a sticky bit = 0 if the constant is smaller than or equal to the addition result of the adder.

2. The method according to claim 1, wherein when the number of digits of the multiplicand mantissa data is set to m and the number of digits of the multiplicand mantissa data is set to m, a constant compared by the comparison circuit is m-1. The floating point multiplier according to claim 1.

3. From the least significant bit of each of the multiplicand mantissa data and the multiplier mantissa data, 0 is used until 1 appears.
Two zero counting means for counting the number of zeros; an adder for adding the zero counts of the two zero counting means; comparing the addition result of the adder with a constant; Is larger, sticky bit = 1
And a comparison circuit that outputs a sticky bit = 0 if the constant is smaller than or equal to the addition result of the adder; and a sticky bit generation means comprising: and multiplicative mantissa data and multiplier mantissa data. Enter
A multiplication array that calculates a partial product by multiplication of the two, adds a plurality of partial products and outputs a two-output partial product, and a mantissa adder that adds the two output partial products and outputs a mantissa addition result And an OR circuit for calculating the total OR of the lower bits to be truncated among the outputs of the mantissa adder, and a check circuit for comparing the sticky bits generated by both of the sticky bit generation means, comprising: A floating point multiplier comprising: