|
|
Floating Point Coprocessor - Double PrecisionThe DFPMU-DP is a Floating Point Coprocessor, designed to assist CPU in performing the floating point mathematic computations. The DFPMU-DP directly replaces C software functions, by equivalent, very fast hardware operations, which significantly accelerate system performance. It does not require any programming, so it also does not require any modifications made in the main software. Everything is done automatically during software compilation by the DFPMU-DP C driver. The DFPMU-DP was designed to operate with DCD's DP8051/DP80390 microcontrollers, but can also operate with any other 8-, 16- and 32-bit processor. Drivers for all popular 8051 C compilers are delivered together with the DFPMU-DP package. The DFPMU-DP supports also a popular 32 Bit procesors: NIOS II and MicroBlaze, and the C drivers for those processors are delivered with the DFPMU-DP for free. The DFPMU-DP uses the specialized CORDIC and standard algorithms to compute math functions. It supports addition, subtraction, multiplication, division, square root, comparison, and trigonometric functions: sine, cosine, tangent and arctangent. It has built-in conversion instructions from integer type to floating point type and vice versa. The input numbers format is according to IEEE-754 standard. The DFPMU-DP supports double and single precision real numbers, 8-bit, 16-bit and 32-bit integers. DFPMU-DP is prepared to use with 8-, 16- and 32-bit processors. The DFPMU-DP is a technology independent design that can be implemented in a variety of process technologies.
The table and figures below illustrates the system with DFPMU-DP performance improvements for typical 32-bit RISCCPU. The DFPMU-DP floating point instructions performance has been compared to standard C library functions delivered with every commercial C compiler. Each program was executed in the same system environments. Number of clock periods were measured between input data loading into work registers and output result storing after operation. The results are placed in tables below. Improvement has been computed as a number of clock cycles reuired by the CPU to compute FP operation, by the number of clocks required to compute the same operation by system of CPU with DFPMU-DP:
datai1 (31:0)
![]() datao1 (31:0)
![]() irq
![]() addr2 (4:0)
![]() we
![]() cs
![]()
clk
![]() rst
![]() InterfaceMakes interface between external device and core internal 32-bit modules. It contains data, control and status registers. It can be configured to work with 8-, 16- and 32-bit processors..1 - data bus can be configured as 8-, 16- or 32- bit depends on processor’s bus size 2 - address bus is aligned to work with 8- (3:0), 16- (3:1) or 32- (4:2) bit processors Control UnitIt manages execution of all instructions and internal operation required to execute particular function.AlignIt performs the numbers analyze against IEEE-754 standard compliance. Information about the data classes are passed as result to appro-priate internal module.ExponentIt performs operations on exponent part of number. The addition, subtraction, shifting, comparison and conversion operations are executed in this module. It contains exponents and work registers.MantissaIt performs operations on mantissa part of number. The addition, subtraction, multiplication, division, square root, comparison and conversion operations are executed in this module. It contains mantissas and work registers.ShifterIt performs mantissa shifting during normalization, denormalization operations. Information about shifted-out bits are stored for rounding process.CORDICCORDIC performs trigonometric operations on input data. The sine, cosine, tangent and arctangent operations are executed in this module. It contains three work registers.Each core has been tested in variety of FPGA and ASIC technologies. Its implementation's results are summarized below.
DFPMU-DP implementation results for ALTERA devices. The all features have been included.
DFPMU-DP implementation results for XILINX devices. The all features have been included.
The main features of each Arithmetic Coprocessors family member has been summarized in table above. It gives a briefly member characterization helping user to select the most suitable IP Core for its application. |
Home
Site map
Contact Us




clk
datai1 (31:0)
addr2 (4:0)

