Publication: Fast multiplier generator for FPGAs with LUT based partial product generation and column/row compression
dc.contributor.author | Kakacak, Ahmet | |
dc.contributor.author | Guzel, Aydın Emre | |
dc.contributor.author | Cihangir, Ozan | |
dc.contributor.author | Gören, S. | |
dc.contributor.author | Uğurdağ, Hasan Fatih | |
dc.contributor.department | Electrical & Electronics Engineering | |
dc.contributor.ozuauthor | UĞURDAĞ, Hasan Fatih | |
dc.contributor.ozugradstudent | Kakacak, Ahmet | |
dc.contributor.ozugradstudent | Guzel, Aydın Emre | |
dc.contributor.ozugradstudent | Cihangir, Ozan | |
dc.date.accessioned | 2017-02-02T11:17:53Z | |
dc.date.available | 2017-02-02T11:17:53Z | |
dc.date.issued | 2017 | |
dc.description.abstract | We present a new parallel integer multiplier generator for FPGAs. It combines (i) a new Generalized Parallel Counter (GPC) grouping algorithm for column compression with (ii) a LUT based partial product generation, is (iii) unique as it automatically generates placement pragmas, (iv) uses a ternary adder as a final adder to exploit FPGA's internal carry-chains, and (v) employs a novel GPC based row compression, which aims to reduce the width of the final adder. We wrote Verilog generators for our method as well as one leading work in the literature. For synthesis, we wrote a script that can do “binary search” for the optimum latency. Our extensive implementation results on Xilinx Virtex-6 FPGAs show that we almost always produce circuits with smaller latency (i.e., timing) and Area-Timing Product (ATP) compared to the state-of-the-art in the literature, by 18% and 12% (on the average), respectively. We also offer smaller latency compared to the HDL * operator by 9% on the average at a cost of 12% larger ATP on the average. We are worse in latency in 6 cases out of 33, in all of which synthesis maps * to DSP slices. We also include area and energy results on Virtex-6 as well as a limited amount of latency, area, and ATP results on Virtex-5 and Altera Stratix III. | en_US |
dc.identifier.doi | 10.1016/j.vlsi.2016.12.012 | en_US |
dc.identifier.endpage | 157 | en_US |
dc.identifier.issn | 0167-9260 | |
dc.identifier.scopus | 2-s2.0-85008165502 | |
dc.identifier.startpage | 147 | en_US |
dc.identifier.uri | http://hdl.handle.net/10679/4764 | |
dc.identifier.uri | https://doi.org/10.1016/j.vlsi.2016.12.012 | |
dc.identifier.volume | 57 | en_US |
dc.identifier.wos | 000395609000015 | |
dc.language.iso | eng | en_US |
dc.peerreviewed | yes | en_US |
dc.publicationstatus | published | en_US |
dc.publisher | Elsevier | en_US |
dc.relation.ispartof | Integration, the VLSI Journal | en_US |
dc.rights | restrictedAccess | |
dc.subject.keywords | Fast multipliers | en_US |
dc.subject.keywords | FPGA | en_US |
dc.subject.keywords | Look-up table | en_US |
dc.subject.keywords | Partial product generation | en_US |
dc.subject.keywords | Column compression tree | en_US |
dc.subject.keywords | Carry-save tree | en_US |
dc.subject.keywords | Generalized parallel counter | en_US |
dc.title | Fast multiplier generator for FPGAs with LUT based partial product generation and column/row compression | en_US |
dc.type | article | en_US |
dspace.entity.type | Publication | |
relation.isOrgUnitOfPublication | 7b58c5c4-dccc-40a3-aaf2-9b209113b763 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 7b58c5c4-dccc-40a3-aaf2-9b209113b763 |
Files
License bundle
1 - 1 of 1
- Name:
- license.txt
- Size:
- 1.45 KB
- Format:
- Item-specific license agreed upon to submission
- Description: