Abstract
Background: With the growing demand for image processing and the widespread use of Digital Signal
Processors (DSPs), the efficiency of multipliers and accumulators has become a performance
bottleneck. We reviewed a few patents on Application Specific Instruction Set Processors (ASIPs),
which propose design considerations for efficient application-specific computing to enhance
throughput.
Objective: The study aims to develop and analyze a computationally efficient method to optimize the
speed of the Multiplier Accumulator (MAC).
Methods: The work presented here proposes the design of an Application Specific Instruction Set
Processor that uses a Multiplier Accumulator integrated as dedicated hardware. This MAC is
optimized for high-speed performance and forms the application-specific part of the processor; here
it can serve as the DSP block of an image processor, while a 16-bit Reduced Instruction Set Computer
(RISC) processor core gives the design the flexibility for general-purpose computing. The design was
emulated on a Xilinx Field Programmable Gate Array (FPGA) and tested on various real-time computing
tasks.
Results: Synthesis of the hardware logic with FPGA tools gave the operating frequencies of the
legacy methods and the proposed method, and simulation of the logic verified its functionality.
Conclusion: With the proposed method, a 16% increase in throughput was observed for 256 iterations
of multiply-accumulate operations on 8-bit sample data. Such an improvement can help reduce the
computation time in many digital signal processing applications where multiplication and addition
are performed iteratively.
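To make the benchmarked workload concrete, the following Python sketch models 256 multiply-accumulate iterations on 8-bit samples. It is only a behavioural illustration, not the proposed hardware: the function name `mac`, the unsigned operand format, and the 24-bit accumulator width are assumptions made here and are not taken from the paper.

```python
def mac(samples, coeffs, acc_bits=24):
    """Behavioural model of an iterative multiply-accumulate (assumed unsigned 8-bit operands)."""
    mask = (1 << acc_bits) - 1  # assumed accumulator width; wraps on overflow
    acc = 0
    for x, c in zip(samples, coeffs):
        # One MAC step: multiply two 8-bit operands and add the product to the accumulator.
        acc = (acc + (x & 0xFF) * (c & 0xFF)) & mask
    return acc


# 256 iterations on 8-bit sample data, with a hypothetical constant coefficient.
samples = list(range(256))
coeffs = [3] * 256
result = mac(samples, coeffs)
```

A 24-bit accumulator is enough here because each product of two unsigned 8-bit values fits in 16 bits, and summing 256 such products adds at most 8 more bits.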
Keywords:
ASIP, MAC, DSP, image processing, multiplier, Wallace tree, modified Booth, FPGA.