Fast parallel prefix logic circuits for n2n round-robin arbitration

Uğurdağ, Hasan Fatih; Baskirt, O.

Publication:
Fast parallel prefix logic circuits for n2n round-robin arbitration

dc.contributor.author	Uğurdağ, Hasan Fatih
dc.contributor.author	Baskirt, O.
dc.contributor.department	Electrical & Electronics Engineering
dc.contributor.ozuauthor	UĞURDAĞ, Hasan Fatih
dc.date.accessioned	2014-07-11T08:09:28Z
dc.date.available	2014-07-11T08:09:28Z
dc.date.issued	2012-08
dc.description	Due to copyright restrictions, the access to the full text of this article is only available via subscription.
dc.description.abstract	An n2n round-robin arbiter (RRA) searches its n inputs for a 1, starting from the highest-priority input. It picks the first 1 and outputs its index in one-hot encoding. RRA aims to be fair to its inputs and maintains fairness by simply rotating the input priorities, i.e., the last arbitrated input becomes the lowest-priority input. Arbiters are used to multiplex the usage of shared resources among requestors as well as in dispatch logic where the purpose is load balancing among multiple resources. Today, arbiters have hundreds of ports and usually need to run at very high clock speeds. This article presents a new gate-level RRA circuit called Thermo Coded-Parallel Prefix Arbiter (TC-PPA) that scales to any number of requestors. It uses parallel prefix network topologies (borrowed from fast carry lookahead adders) to generate a thermometer-coded pointer, thus greatly reducing critical path. Code generators were written not only for TC-PPA but also for the 5 highly competitive circuits in the literature (9 including their variants), and a rich set of timing/area results were obtained using a standard-cell based logic synthesis flow with a novel iterative strategy based on binary search. Synthesis runs include results with wire-load and without. Results show that for 54 or more ports (except 256) TC-PPA offers the best timing (lowest latency) as well as competitive area. Contributions also include transaction-level simulations that show when pipelining is used to boost clock rate, latency and input FIFO sizes are adversely affected, and hence pipelining cannot be indiscriminately exploited to trim clock period.
dc.identifier.doi	10.1016/j.mejo.2012.04.005
dc.identifier.endpage	581
dc.identifier.issn	0026-2692
dc.identifier.issue	8
dc.identifier.scopus	2-s2.0-84862533608
dc.identifier.startpage	573
dc.identifier.uri	http://hdl.handle.net/10679/491
dc.identifier.uri	https://doi.org/10.1016/j.mejo.2012.04.005
dc.identifier.volume	43
dc.identifier.wos	000306383500009
dc.language.iso	eng
dc.peerreviewed	yes
dc.publicationstatus	published
dc.publisher	Elsevier
dc.relation.ispartof	Microelectronics Journal
dc.rights	restrictedAccess
dc.subject.keywords	Circuits for networking
dc.subject.keywords	Computer arithmetic
dc.subject.keywords	Logic synthesis
dc.subject.keywords	Priority encoder
dc.subject.keywords	Timing optimization
dc.title	Fast parallel prefix logic circuits for n2n round-robin arbitration
dc.type	article
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	7b58c5c4-dccc-40a3-aaf2-9b209113b763
relation.isOrgUnitOfPublication.latestForDiscovery	7b58c5c4-dccc-40a3-aaf2-9b209113b763

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Electrical & Electronics Engineering

Publication: Fast parallel prefix logic circuits for n2n round-robin arbitration

Files

License bundle

Collections

Publication:
Fast parallel prefix logic circuits for n2n round-robin arbitration