IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org
VLSI DESIGN PROCESS FOR LOW POWER DESIGN METHODOLOGY
USING RECONFIGURABLE FPGA
R. Rajendra Prasad1, M. V. Subramanyam2, K. Satya Prasad3
1 Associate Professor, ECE Department, N.B.K.R. Institute of Science and Technology, Vidyanagar, S. P. S. R. Nellore,
Andhra Pradesh, India. rajendra_831@yahoo.co.in, rechalabhi812@gmail.com
2 Principal and Professor, ECE Department, Santhi Ram Engineering College, Nandyal, India. mvsraj@yahoo.com
3 Professor of ECE Department and Rector, JNTU Kakinada, Kakinada, India. prasad_kodati@yahoo.co.in
Abstract
Modern digital processing applications have an increasing demand for computational power while needing to preserve low power
dissipation and high flexibility. For many applications, the growth of algorithmic complexity is already faster than the growth of
computational power provided by discrete general-purpose processors. A typical approach to address this problem is the combination
of a processor core with dedicated accelerators. Since changes in standards or algorithms can change the demands on the
accelerators, an attractive alternative to highly customized VLSI macros is suggested with the usage of reconfigurable embedded
FPGAs (eFPGAs).
Keywords: embedded FPGA, fast computing, hybrid design.
------------------------------------------------------------------------***----------------------------------------------------------------------
1. INTRODUCTION
FPGAs are widely used as an attractive compromise between
highly efficient physically optimized VLSI designs and
software programmable processors. Due to their
reconfigurability, FPGAs are highly flexible and allow for
relatively short design cycles since no physical changes to the
underlying hardware have to be made in case of a redesign.
At the same time, they offer lower physical implementation costs
than software programmable processors, because the
inherent parallelism of many algorithms can be exploited, in
contrast to sequential processor architectures. As a result,
commercial FPGA-architectures have been optimized to suit a
wide variety of applications from network related and digital
signal processing to the realization of soft-core processors. For
an embedded FPGA used as a configurable accelerator,
however, the requirements concerning the provided resources
are often well defined and much narrower than for discrete or
“general purpose” FPGAs. Hence, eFPGAs can be optimized
for a certain set of applications and thus achieve higher
efficiency in terms of power dissipation, area and speed. First
investigations on a reconfigurable ASIP with a reconfigurable
accelerator based on a parametrisable eFPGA-architecture
have shown significant improvements in energy- and area-
efficiency [5].
2. eFPGA-ARCHITECTURE
The eFPGA architecture presented here is based on a highly
parametrisable architecture template targeting an
arithmetic-oriented application domain. The template is controlled
by a set of architectural parameters. Some of them are described by a
single value (e.g. the number of logic elements (LEs) in a row and column),
while others require a more complex definition (e.g. the
connectivity per switch point). In the following, the
architectural components and the corresponding parameters are
discussed in detail. A typical characteristic of arithmetic data
paths is the organization in function slices and bit slices. A
function slice represents one of many consecutively processed
elementary functions (e.g. n-bit addition, n-bit XOR operation
etc.), while a bit slice represents all processing elements in the
same column corresponding to the same bit value (e.g. bit 0 of
two successive function slices).
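The slice organization can be illustrated with a small software model. The following Python sketch is an assumption made for illustration only and is not part of the presented design flow: the functions `xor_slice`, `add_slice` and `datapath` are hypothetical names. Each function slice is one elementary n-bit operation, and index i of every bit list corresponds to bit slice i.

```python
# Illustrative model only; all names here are hypothetical.

def full_adder(a, b, cin):
    """One logic element configured as a full adder; returns (sum, carry)."""
    s = a ^ b ^ cin
    cout = (a & b) | (cin & (a ^ b))
    return s, cout

def to_bits(value, n):
    """Little-endian bit list: index i belongs to bit slice i."""
    return [(value >> i) & 1 for i in range(n)]

def from_bits(bits):
    """Inverse of to_bits."""
    return sum(bit << i for i, bit in enumerate(bits))

def xor_slice(x_bits, y_bits):
    """Function slice 1: n-bit XOR, no communication between bit slices."""
    return [x ^ y for x, y in zip(x_bits, y_bits)]

def add_slice(x_bits, y_bits):
    """Function slice 2: n-bit ripple-carry addition.

    The carry is handed only to the directly neighbouring bit slice.
    The final carry is dropped, so the result is taken modulo 2**n.
    """
    out, carry = [], 0
    for x, y in zip(x_bits, y_bits):
        s, carry = full_adder(x, y, carry)
        out.append(s)
    return out

def datapath(a, b, c, n=8):
    """Two stacked function slices computing ((a XOR b) + c) mod 2**n."""
    a_bits, b_bits, c_bits = (to_bits(v, n) for v in (a, b, c))
    stage1 = xor_slice(a_bits, b_bits)   # first function slice
    stage2 = add_slice(stage1, c_bits)   # second function slice
    return from_bits(stage2)
```

In this model the only signals crossing slice boundaries are the stage results passed from one function slice to the next and the carry passed between neighbouring bit slices.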
In a typical arithmetic data path scheme, most communication
between function slices and bit slices is local, i.e. only
between direct neighbors. In addition, operands are typically
fed to the data path using a broadcast scheme. The eFPGA-
architecture reflects typical arithmetic data path schemes by
using two-dimensional clusters of logic elements with a
distributed interconnect rather than one-dimensional clusters
with a central connection box. The signals coming from the
connection box are distributed to the logic elements in rows
and columns according to the function slices and bit slices,
such that all logic elements in a row or column share the same
input signals using so-called broadcast lines. This reduces the
number of signals that need to be provided by the connection
box and hence reduces the significant overhead imposed by
the configurable connection boxes.
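A rough count shows why shared inputs reduce the load on the connection box. The sketch below rests on a simplified counting model that is an assumption, not a figure from the paper: without broadcasting, every logic element input is driven individually, while with broadcasting only the row and column broadcast lines are driven. The function name and all parameters are hypothetical.

```python
def connection_box_signals(rows, cols, inputs_per_le,
                           bcast_per_row, bcast_per_col):
    """Return (individual, broadcast) signal counts for one cluster.

    individual: every input of every logic element gets its own signal.
    broadcast:  only the broadcast lines of each row and column are driven.
    Simplified counting model, assumed for illustration.
    """
    individual = rows * cols * inputs_per_le
    broadcast = rows * bcast_per_row + cols * bcast_per_col
    return individual, broadcast
```

For a hypothetical cluster of 4 rows and 8 columns with 3 inputs per logic element, 2 broadcast lines per row and 1 per column, this model gives 96 individually driven signals against 16 broadcast lines.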
The size of the cluster can be varied in the horizontal and
vertical direction independently. Also, the number of
broadcast lines per row and column can be changed in designs
based on the template. Broadcast lines can be fed to the cluster
from all four directions, and in the same way the outputs of the
LEs at all four cluster borders can be fed to the connection
box. The actual connectivity can be any subset of the four
possible data directions (north, east, south, west) for inputs
and outputs independently. Between adjacent clusters,
feed-through stages are provided to use the broadcast lines of the
neighboring clusters as inputs for the current cluster, hence
creating virtually larger clusters by cascading several of them.
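The cluster parameters listed above could be collected in a small record. The paper formulates its template in C; the Python dataclass below is only a hypothetical sketch of such a record, and every field name in it is an assumption.

```python
from dataclasses import dataclass

DIRECTIONS = frozenset({"north", "east", "south", "west"})

@dataclass(frozen=True)
class ClusterSpec:
    """Hypothetical parameter record for one two-dimensional cluster."""
    rows: int                 # logic elements in vertical direction
    cols: int                 # logic elements in horizontal direction
    bcast_per_row: int        # broadcast lines per row
    bcast_per_col: int        # broadcast lines per column
    input_dirs: frozenset = DIRECTIONS    # sides feeding broadcast lines
    output_dirs: frozenset = DIRECTIONS   # sides feeding the connection box

    def __post_init__(self):
        for name in ("rows", "cols", "bcast_per_row", "bcast_per_col"):
            if getattr(self, name) < 1:
                raise ValueError(f"{name} must be at least 1")
        for dirs in (self.input_dirs, self.output_dirs):
            if not dirs <= DIRECTIONS:
                raise ValueError(f"unknown direction in {sorted(dirs)}")

def virtual_cluster_size(spec, count, horizontal=True):
    """Size (rows, cols) of the virtual cluster formed by cascading
    `count` identical clusters through feed-through stages."""
    if horizontal:
        return spec.rows, spec.cols * count
    return spec.rows * count, spec.cols
```

The size in each direction and the input and output directions are independent fields, mirroring the independence described in the text.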
a) Logic elements
The local connectivity between the logic elements is provided
by dedicated routing blocks (DRB) located in the logic
element. Each DRB is a set of multiplexers used to connect
broadcast signals or local signals to the core logic of the logic
element. The actual connectivity can be defined in the
architecture template by stating all sources connected to the
DRBs with their offset to the actual LE. The functionality of
the core logic itself is specified by a list of elementary
Boolean functions that the LE can process (e.g. full addition,
gated full addition etc.). Registers can be inserted per logic
element or with a reduced density, e.g. every second LE-row.
Figure: Cluster structure, showing the broadcast lines, the core logic, the dedicated routing block (DRB) and the local connections.
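The offset-based definition of DRB sources can be sketched as a multiplexer model. The source encoding below (`"local"`, `"row"`, `"col"` tuples) and the rule that sources outside the cluster read as 0 are assumptions made for this example; they are not taken from the architecture template.

```python
def drb_output(le_outputs, row_bcast, col_bcast, row, col, sources, select):
    """Value selected by one DRB multiplexer of the LE at (row, col).

    sources: list of candidate inputs, each one of
        ("local", d_row, d_col)  output of the LE at the given offset
        ("row", index)           broadcast line `index` of this row
        ("col", index)           broadcast line `index` of this column
    select:  configuration value choosing one entry of `sources`.
    """
    src = sources[select]
    kind = src[0]
    if kind == "local":
        r, c = row + src[1], col + src[2]
        inside = 0 <= r < len(le_outputs) and 0 <= c < len(le_outputs[0])
        # Assumption: a source outside the cluster reads as constant 0.
        return le_outputs[r][c] if inside else 0
    if kind == "row":
        return row_bcast[row][src[1]]
    if kind == "col":
        return col_bcast[col][src[1]]
    raise ValueError(f"unknown source kind: {kind}")
```

A source list such as `[("local", -1, 0), ("row", 0), ("col", 0)]` would let an LE choose between its upper neighbour and one broadcast line in each direction.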
b) Configuration memories
To reduce the overhead of the memory cells (typically
SRAMs) used to store the configuration of the FPGA's logic
and routing resources, the present architecture template allows
for sharing the configuration bits and thus configuring several
adjacent elements identically. This scheme is applied to the
logic elements as well as to the interconnect resources, where
adjacent switch points or connection points can share a single
SRAM block. The degree of SRAM sharing can be adjusted in
the architecture template within reasonable limits.
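The effect of configuration sharing on memory size can be estimated with a one-line calculation. The sketch below assumes that a group of `sharing_degree` adjacent elements uses one SRAM block of `bits_per_element` bits; both parameter names are hypothetical.

```python
import math

def config_bits(num_elements, bits_per_element, sharing_degree=1):
    """Total configuration bits when `sharing_degree` adjacent elements
    share one SRAM block (assumed model; an incomplete last group still
    needs a full block)."""
    blocks = math.ceil(num_elements / sharing_degree)
    return blocks * bits_per_element
```

With 64 elements of 6 bits each, this model gives 384 bits without sharing and 96 bits when four adjacent elements share one block. The price is that the elements of a group can no longer be configured differently.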
c) Routing switch
The routing switch of an FPGA is a set of switch points that
are located at the crossing points of horizontal and vertical routing
tracks. The number of switch points available as well as their
connectivity determines the flexibility of the complete routing
switch. It was shown that it is not necessary to provide a fully
populated routing switch to achieve a good amount of
flexibility [6]. The architecture template presented here is very
flexible concerning the definition of available routing
resources. The number of routing tracks in horizontal and
vertical direction can be chosen independently. Each switch
point is defined by its position in the matrix of crossing lines
as well as the connectivity inside the switch point. Different
switch points can have different flexibility. In addition, the
segmentation of the interconnect can be adjusted by assigning
each routing track a certain segment length, corresponding to
the number of routing switches that are bypassed before the
line connects to the next routing switch.
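Segmentation can be pictured as a stride along a routing track. The sketch below follows one possible reading of the definition above, namely that a track which bypasses `bypassed` routing switches connects at every (`bypassed` + 1)-th switch; the `offset` parameter for staggering tracks is an added assumption.

```python
def connected_switches(num_switches, bypassed, offset=0):
    """Indices of the routing switches a track connects to.

    bypassed: number of routing switches skipped between two connections
              (the segment length as read from the text).
    offset:   index of the first connected switch (hypothetical parameter).
    """
    step = bypassed + 1
    return list(range(offset, num_switches, step))
```

Under this reading, a track with `bypassed=0` connects to every routing switch, while longer segments touch fewer switches along the same distance.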
d) Connection box
Finally, the architecture template offers a highly flexible
description of the connection box similar to the routing switch
definition. Three types of routing channels are supported: fully
connected, periodic connectivity and unconnected. Fully
connected tracks offer full population of the connection box,
i.e. each track can connect to each corresponding broadcast line of
a cluster. However, they have the highest implementation
costs. Unconnected tracks can be implemented for fast signal
routing, as the capacitive load of these wires can be kept very
low. Periodic tracks use a special connection type best suited
for arithmetic data paths, where signals on a bus are typically
ordered by the weight of their bits. Accordingly, periodic
routing channels have a window of connection points that
slides across the tracks with a given velocity. The connection
box defined in the architecture template can be composed of
any mix of tracks with different channel widths and sliding
window specifications.
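The three track types can be modelled by the set of broadcast lines each track may reach. In the sketch below, the sliding window is assumed to start at `index * velocity` and to wrap around modulo the number of broadcast lines; the paper does not specify these details, so the function and its parameters are illustrative only.

```python
def track_connections(kind, index, num_lines, window=1, velocity=1):
    """Broadcast lines reachable from one track of the connection box.

    kind:     "full", "periodic" or "unconnected"
    index:    position of the track within its routing channel
    window:   number of connection points per periodic track (assumed)
    velocity: shift of the window from one track to the next (assumed)
    """
    if kind == "full":
        return list(range(num_lines))
    if kind == "unconnected":
        return []
    if kind == "periodic":
        start = index * velocity
        return sorted({(start + k) % num_lines for k in range(window)})
    raise ValueError(f"unknown track kind: {kind}")
```

With 8 broadcast lines, a window of 2 and a velocity of 2, successive periodic tracks reach lines [0, 1], [2, 3], [4, 5] and [6, 7], which matches a bus whose signals are ordered by bit weight.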
3. PROPOSED SYSTEM DESIGN
As many of the architectural features in the presented eFPGA
are unique and not common in standard FPGAs, there is
currently no tool support available. Most research conducted
in the field of eFPGA-architectures is based on the VPR
design flow [7] which can only be used to model standard
island style FPGA-architectures with LUT-based logic
elements and a small choice of routing switch architectures.
Hence, an important goal of this work is the creation of a
self-contained design methodology for designing application-domain-
specific eFPGAs, together with the corresponding basic tool support.
Figure 1: Design flow for the implementation of the layout design
The architecture template described above was formulated as a
high-level description in the C language. Based on this
architecture description, three main steps are supported by the
design flow. First, a layout generator creates a VLSI-layout of
the specified eFPGA based on a small set of handcrafted,
physically optimized basic cells such as the multiplexers for the
DRB or the switch points of the routing switch. Several
studies concerning this part of the design flow have been
published before [8] [9]. While the first automatically
generated eFPGA-layouts still needed to be configured
manually (i.e. each SRAM cell had to be configured with the
proper value), an automatic bit stream generator supporting
the complete architecture template is currently under
construction. The configuration bit stream is used to conduct
net list simulations based on layout-extracted net lists. To
verify the functionality of the eFPGA-macro, a VHDL-
generator creates a functional description that can be
simulated using common simulation tools like ModelSim. The
output of the configuration is based on a net list that describes
the signal flow graph mapped to the eFPGA. Currently, the net
list description is still complex, as each logic element and each
routing resource has to be described here. However, for
arithmetic data paths this net list is highly regular which
reduces the effort to generate it manually. Currently, the
generation of the configuration bits and the routing of signals
between the logic elements are very time-consuming and
error-prone tasks. The placement process is less complex due to the
regularity of the examples considered here, i.e. arithmetic data
paths. Consequently, a future step will be the implementation
of a routing tool supporting the parametrisable architecture.
Similar approaches to automatic eFPGA design have been
proposed e.g. with GILES [10] or PYTHAGOR [11].
However, those design flows have significant constraints
regarding the FPGA-architectures (e.g. only island style
FPGAs are supported) and the physical implementation style.
As an example, using standard cell implementations (e.g.
proposed by PYTHAGOR) leads to unfavorable physical
implementation costs concerning area, performance and power
dissipation. The first results presented here are based on a
completely functional co-design of a routing switch layout, the
corresponding VHDL model and the configuration bit stream
based on a given net list.
a) Layout generator
The layout generator is based on prior work on the automated
VLSI design of regular data paths [12]. This so-called data
path generator uses a textual description of a signal flow graph
(SFG) and a small set of hand designed layout cells to
generate a layout. Since the textual SFG description can be
parameterized, the data path generator allows for a very
flexible implementation process, e.g. when parameters like
word lengths are changed in the SFG. Starting from the C-based
high-level description of the eFPGA architecture, an SFG
description suited to the data path generator is automatically
generated. Due to the highly modular design style, the
eFPGA-macro can be ported to different CMOS-technologies
with small effort, since only few hand designed layout cells
are required. After the layout is generated, standard net list
extraction and simulation tools can be used to characterize the
eFPGA macro in terms of area, timing and power dissipation.
b) VHDL-generator
Based on the architecture description, a VHDL-model of the
eFPGA is created automatically. It incorporates the functional
description of the basic configurable elements like routing
switch points or logic elements and combines them according
to the architectural parameters. The VHDL-model of the
eFPGA is used to verify the functionality defined by the net
list using existing simulation tools. It is also useful to test the
eFPGA macro created by the layout generator for correctness
by co-simulation of the layout-extracted net list and the
functional VHDL model.
c) Configuration
To enable simulations of the eFPGA-macro (on functional as
well as on net list level), all configuration bits have to be set
properly. Due to the very large number of configuration bits, it
is necessary to have an automated way of creating the bit
stream from the mapped net list. Existing bit stream generators
like DAGGER [13] lack support for highly
parametrisable eFPGA architectures such as the one described here.
The configuration elaborated as part of the design
methodology presented here creates configuration bit streams
based on the net list and the architecture specifications. It also
uses the information from the layout generator to determine
the actual position of all blocks to be configured in the macro.
The configuration bit stream is composed of elementary
configuration table entries that must be provided for the basic
eFPGA elements like routing switch points or logic elements.
The elementary tables can be created with small effort, as only
few bits are required to configure these basic elements. The bit
stream is then concatenated according to the position of the
elements in the overall macro.
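The concatenation step can be sketched in a few lines. The data layout below is an assumption: each configured element is given as a tuple of position, element type and setting, the elementary tables map a setting to a bit string, and the bit stream is assumed to follow row-major order of the positions. The actual ordering used by the tool is not stated in the text.

```python
def build_bitstream(elements, tables):
    """Concatenate elementary configuration entries by macro position.

    elements: list of (position, element_type, setting), where position
              is a (row, col) tuple as reported by the layout generator.
    tables:   {element_type: {setting: bit string}}, the elementary
              configuration tables.
    Assumption: bits are emitted in row-major order of the positions.
    """
    ordered = sorted(elements, key=lambda element: element[0])
    return "".join(tables[etype][setting] for _, etype, setting in ordered)
```

Because only the small elementary tables are written by hand, a change of the architecture parameters alters the element list but not the tables themselves.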
4. DESIGN EXAMPLE
As a first step in verifying the proposed design methodology,
a routing switch generator comprising all elements of the
design flow was implemented. As an example, a routing
switch with 32 tracks in both the horizontal and vertical direction
was specified in the corresponding architecture description. Switch
points with different flexibility are provided as exemplary
basic components for the routing switch. As the C-based
description is on an abstract level, the architecture
specification for the complete routing switch can be created
within a few minutes. Three global parameters describe the
channel widths and the use of configuration sharing. Each
switch point is defined by a set of potential signal routes
according to their input and output directions (north, east,
south, west). From the flexibility required by each switch
point, the routing switch generator extracts a set of basic
layout cells that need to be designed for the VLSI
implementation. The design of the corresponding switch point
macros and the SRAM cell required for the configuration
storage takes some hours for a skilled designer. The layout
generator automatically determines the optimum placement of
the basic cells for a given aspect ratio of the routing switch. It
also calculates the optimum aspect ratios of the configuration
blocks that are required per set of switch points sharing the
same configuration. Analyses of the influence of different
parameters on area, timing and power dissipation can thus be
conducted very quickly. A simple net list describing the
connections to be provided by the routing switch was used
to automatically generate the configuration bit stream with
more than 200 bits and the test bench for ModelSim
(functional simulation) and Xilinx (realization). The backend
tool for the analysis is developed for an eFPGA application
and is processed for system level implementation.
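The extraction of basic layout cells from the switch point definitions can be sketched as follows. The population pattern used here (a fully flexible switch point on each diagonal crossing and a straight-only switch point next to it) is invented for illustration and is not the routing switch evaluated in the paper; all function names are hypothetical.

```python
from itertools import permutations

DIRS = ("north", "east", "south", "west")

def make_switch_point(routes):
    """A switch point as a set of potential (input, output) routes."""
    routes = frozenset(routes)
    for src, dst in routes:
        if src not in DIRS or dst not in DIRS or src == dst:
            raise ValueError(f"invalid route: {src} -> {dst}")
    return routes

# Two exemplary switch point types with different flexibility.
FULL = make_switch_point(permutations(DIRS, 2))          # all 12 routes
STRAIGHT = make_switch_point([("north", "south"), ("south", "north"),
                              ("east", "west"), ("west", "east")])

def example_switch(tracks=32):
    """Hypothetical, partially populated routing switch:
    maps (horizontal track, vertical track) to a switch point."""
    matrix = {}
    for i in range(tracks):
        matrix[(i, i)] = FULL
        matrix[(i, (i + 1) % tracks)] = STRAIGHT
    return matrix

def required_cells(matrix):
    """Distinct switch point types, i.e. the layout cells to hand-design."""
    return set(matrix.values())

def potential_routes(matrix):
    """Total number of configurable routes in the routing switch."""
    return sum(len(switch_point) for switch_point in matrix.values())
```

In this invented example the 32 x 32 switch holds 64 switch points but needs only two hand-designed switch point cells, which is the kind of reduction the generator exploits.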
Figure 2: Simulation model for designing board level
implementation using proposed analytical tool
A conventional application of such a design is carried out
using an HDL simulator and the Xilinx synthesizer tool to evaluate
the feasibility of the implementation flow for the suggested
tool.
Figure 3: Timing simulation observation for a coding system
using the HDL timing simulator
Figure 4: Timing simulation observation for a decoding system
using the HDL timing simulator
Figure 5: Physical layout obtained for the developed simulation model using the Xilinx synthesizer
Design statistics obtained from the designed HDL definition:
Design Statistics
# IOs : 6
Cell Usage :
# BELS : 1041
Macro Statistics
# Registers : 156
Maximum operating Frequency: 102.008MHz
CONCLUSIONS
The design methodology presented in this paper is an
important step for the evaluation of embedded FPGAs that are
optimized for a certain application domain. By using a
common, highly flexible architecture template, the eFPGA
architecture can be tailored to a given application domain
systematically. The self-contained design methodology
presented here enables the VLSI-design as well as basic tools
for verification and simulation. Hence, the complexity of
mapping exemplary data paths to the eFPGA is reduced
significantly compared to previous work. Using the simulation
results based on actual VLSI layouts of the eFPGA, a high-
level model of the architecture is currently evolving that
allows for a systematic analysis of the dependencies between
eFPGA architecture, mapped data paths and the corresponding
efficiency.
REFERENCES
[1] J. Hauser, “Integrated Circuits for Next Generation Wireless Systems”, in Proc. European Solid-State Circuits Conference (ESSCIRC) 2001, pp. 26-29, 2001.
[2] Stretch S6000, http://www.stretchinc.com
[3] MENTA eFPGA-augmented RISC CPUs (website), http://www.menta.fr/efpga_cpu.html
[4] A. Ye and J. Rose, “Using Bus-Based Connections to Improve Field-Programmable Gate Array Density for Implementing Datapath Circuits”, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Vol. 14, No. 5, pp. 462-473, May 2006.
[5] T. von Sydow, M. Korb, B. Neumann, H. Blume and T. G. Noll, “Modeling and Quantitative Analysis of Coupling Mechanisms of Programmable Processor Cores and Arithmetic Oriented eFPGA-macros”, in Proc. Reconfigurable Computing and FPGA's, pp. 252-261, 2006.
[6] G. Lemieux and D. Lewis, “Design of Interconnection Networks for Programmable Logic”, Kluwer Academic Publishers, 2004.
[7] V. Betz, J. Rose and A. Marquardt, “Architecture and CAD for Deep-Submicron FPGAs”, in Kluwer International Series in Engineering and Computer Science, 1999.
[8] T. von Sydow, B. Neumann, H. Blume and T. G. Noll, “Quantitative Analysis of embedded FPGA Architectures for Arithmetic”, in Proc. Application Specific Systems, Architectures and Processors Conference, pp. 125-131, 2006.
[9] B. Neumann, T. von Sydow, H. Blume and T. G. Noll, “Design and quantitative analysis of parametrisable eFPGA-architectures for arithmetic”, in Advances in Radio Science, Vol. 4, pp. 251-259, 2006.
[10] I. Kuon, A. Egier and J. Rose, “Design, Layout and Verification of an FPGA using Automated Tools”, in Proc. 2005 ACM/SIGDA 13th international symposium on Field programmable gate arrays, pp. 215-226, 2005.
[11] A. Danilin, M. Bennebroek and S. Sawitzki, “A novel toolset for the development of FPGA-like reconfigurable logic”, in Proc. FPL 2005, pp. 640-643, 2005.
[12] O. Weiss, M. Gansen and T. G. Noll, “A flexible Datapath Generator for Physical Oriented Design”, in Proc. European Solid-State Circuits Conference, pp. 408-411, 2001.
[13] K. Siozios et al., “DAGGER: A Novel Generic Methodology for FPGA Bitstream Generation and its Software Tool Implementation”, in Proc. Parallel and Distributed Processing Symposium 2005, p. 165b, 2005.

More Related Content

What's hot (20)

PDF
Low Power Design flow using Power Format
ijsrd.com
 
PDF
Low Power Design Verification of Complex Chips
DVClub
 
DOC
Multi_Vdd_IEEE_Paper
Mohammed Musab
 
DOC
sirishamadishetty
Sirisha Madishetty
 
DOCX
Gate Diffusion Input Technology (Very Large Scale Integration)
Ashwin Shroff
 
PDF
IRJET- A Novel High Speed Power Efficient Double Tail Comparator in 180nm...
IRJET Journal
 
PDF
Comparative Performance Analysis of Low Power Full Adder Design in Different ...
ijcisjournal
 
PDF
IRJET- A Implementation of High Speed On-Chip Monitoring Circuit by using SAR...
IRJET Journal
 
PPTX
Final Viva Presenation 1309136702 ppt (7-05-2016)
Devyani Balyan
 
PDF
Design of Low Power High Speed 4-Bit TIQ Based CMOS Flash ADC
Aman JanGra
 
PDF
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
PDF
IRJET - Design and Analysis of a Comparator for ADC in Tanner EDA
IRJET Journal
 
PDF
Design of a 45nm TIQ Comparator for High Speed and Low Power 4-Bit Flash ADC
IDES Editor
 
PDF
20 9747 series paper id 0018(edit ty)
IAESIJEECS
 
PDF
A Novel Fuzzy Variable-Band Hysteresis Current Controller For Shunt Active Po...
IDES Editor
 
PDF
A new switching look-up table for direct power control of grid connected 3L-N...
International Journal of Power Electronics and Drive Systems
 
PPTX
Bridgeless PFC-Modified SEPIC Rectifier With Extended Gain for Universal Inpu...
IISTech2015
 
PDF
Dc analysis of a pdn essentials for the digital designer
jigg1777
 
PDF
Analysis and design of single switch forward-flyback two-channel led driver w...
LeMeniz Infotech
 
PDF
A Novel Single Phase bridgeless AC/DC PFC converter for Low Total Harmonics D...
International Journal of Power Electronics and Drive Systems
 
Low Power Design flow using Power Format
ijsrd.com
 
Low Power Design Verification of Complex Chips
DVClub
 
Multi_Vdd_IEEE_Paper
Mohammed Musab
 
sirishamadishetty
Sirisha Madishetty
 
Gate Diffusion Input Technology (Very Large Scale Integration)
Ashwin Shroff
 
IRJET- A Novel High Speed Power Efficient Double Tail Comparator in 180nm...
IRJET Journal
 
Comparative Performance Analysis of Low Power Full Adder Design in Different ...
ijcisjournal
 
IRJET- A Implementation of High Speed On-Chip Monitoring Circuit by using SAR...
IRJET Journal
 
Final Viva Presenation 1309136702 ppt (7-05-2016)
Devyani Balyan
 
Design of Low Power High Speed 4-Bit TIQ Based CMOS Flash ADC
Aman JanGra
 
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
IRJET - Design and Analysis of a Comparator for ADC in Tanner EDA
IRJET Journal
 
Design of a 45nm TIQ Comparator for High Speed and Low Power 4-Bit Flash ADC
IDES Editor
 
20 9747 series paper id 0018(edit ty)
IAESIJEECS
 
A Novel Fuzzy Variable-Band Hysteresis Current Controller For Shunt Active Po...
IDES Editor
 
A new switching look-up table for direct power control of grid connected 3L-N...
International Journal of Power Electronics and Drive Systems
 
Bridgeless PFC-Modified SEPIC Rectifier With Extended Gain for Universal Inpu...
IISTech2015
 
Dc analysis of a pdn essentials for the digital designer
jigg1777
 
Analysis and design of single switch forward-flyback two-channel led driver w...
LeMeniz Infotech
 
A Novel Single Phase bridgeless AC/DC PFC converter for Low Total Harmonics D...
International Journal of Power Electronics and Drive Systems
 

Viewers also liked (20)

PDF
Enhancement of qos in multihop wireless networks by delivering cbr using lb a...
eSAT Publishing House
 
PDF
Attributes enhancing industrial productivity an empirical analysis
eSAT Publishing House
 
PDF
Mining elevated service itemsets on transactional recordsets using slicing
eSAT Publishing House
 
PDF
Ivrs based news extracting system
eSAT Publishing House
 
PDF
Applications of matlab in optimization of bridge
eSAT Publishing House
 
PDF
Background differencing algorithm for moving object detection using system ge...
eSAT Publishing House
 
PDF
Stress analysis of stick reinforced granite periwinkle concrete slab under un...
eSAT Publishing House
 
PDF
Open domain question answering system using semantic role labeling
eSAT Publishing House
 
PDF
Phytochemical analysis, protein content &
eSAT Publishing House
 
PDF
A novel rrw framework to resist accidental attacks
eSAT Publishing House
 
PDF
In data streams using classification and clustering
eSAT Publishing House
 
PDF
Supervised machine learning based dynamic estimation
eSAT Publishing House
 
PDF
Co axial fed microstrip rectangular patch antenna
eSAT Publishing House
 
PDF
Preliminary study of on cladding process on gray cast
eSAT Publishing House
 
PDF
Efficiently searching nearest neighbor in documents
eSAT Publishing House
 
PDF
Performance investigation of a flat plate collector
eSAT Publishing House
 
PDF
Design and characterization of various shapes of microcantilever for human im...
eSAT Publishing House
 
PDF
Statistical process control
eSAT Publishing House
 
PDF
Behaviour of concrete beams reinforced with glass
eSAT Publishing House
 
PDF
Finite element analysis on temperature distribution in turning process using ...
eSAT Publishing House
 
Enhancement of qos in multihop wireless networks by delivering cbr using lb a...
eSAT Publishing House
 
Attributes enhancing industrial productivity an empirical analysis
eSAT Publishing House
 
Mining elevated service itemsets on transactional recordsets using slicing
eSAT Publishing House
 
Ivrs based news extracting system
eSAT Publishing House
 
Applications of matlab in optimization of bridge
eSAT Publishing House
 
Background differencing algorithm for moving object detection using system ge...
eSAT Publishing House
 
Stress analysis of stick reinforced granite periwinkle concrete slab under un...
eSAT Publishing House
 
Open domain question answering system using semantic role labeling
eSAT Publishing House
 
Phytochemical analysis, protein content &
eSAT Publishing House
 
A novel rrw framework to resist accidental attacks
eSAT Publishing House
 
In data streams using classification and clustering
eSAT Publishing House
 
Supervised machine learning based dynamic estimation
eSAT Publishing House
 
Co axial fed microstrip rectangular patch antenna
eSAT Publishing House
 
Preliminary study of on cladding process on gray cast
eSAT Publishing House
 
Efficiently searching nearest neighbor in documents
eSAT Publishing House
 
Performance investigation of a flat plate collector
eSAT Publishing House
 
Design and characterization of various shapes of microcantilever for human im...
eSAT Publishing House
 
Statistical process control
eSAT Publishing House
 
Behaviour of concrete beams reinforced with glass
eSAT Publishing House
 
Finite element analysis on temperature distribution in turning process using ...
eSAT Publishing House
 
Ad

Similar to Vlsi design process for low power design methodology using reconfigurable fpga (20)

PDF
System designing and modelling using fpga
IAEME Publication
 
PDF
91 94
Editor IJARCET
 
PDF
91 94
Editor IJARCET
 
PDF
Tutorial on FPGA Routing
Daniel Gomez-Prado
 
DOCX
Fpga lecture
Zhwan Rashid
 
PPT
Fpga Knowledge
ranvirsingh
 
DOCX
Fpga
KRKANHAIYA
 
PPTX
FPGA TECHNOLOGY AND FAMILIES
revathilakshmi2
 
PPT
FPGA-Arch.ppt
Priya Tharsini
 
PDF
FPGA Based VLSI Design
Debargha Chakraborty
 
DOC
Remote sensing and control of an irrigation system using a distributed wirele...
nithinreddykaithi
 
PPTX
Intermediate Fabrics
Team-VLSI-ITMU
 
PDF
Coarse Grain Reconfigurable Floating Point Unit
AM Publications,India
 
PPTX
FPGA-Architecture Basics and Description.pptx
ENAULHAQSHAIK
 
PPT
FPGA-Arch.ppt
KrishnaChaitanya139768
 
PPT
FPGA architecture.ppt
EcAlwinjolly
 
PPT
FPGA-Arch.ppt
gopikahari7
 
PDF
Implementation of resource sharing strategy for power optimization in embedde...
Alexander Decker
 
PPT
FPGA-Arch (3).ppt
Arthi579360
 
PPTX
module nenddhd dhdbdh dehrbdbddnd d 1.pptx
kashinathvpillai51
 
System designing and modelling using fpga
IAEME Publication
 
Tutorial on FPGA Routing
Daniel Gomez-Prado
 
Fpga lecture
Zhwan Rashid
 
Fpga Knowledge
ranvirsingh
 
FPGA TECHNOLOGY AND FAMILIES
revathilakshmi2
 
FPGA-Arch.ppt
Priya Tharsini
 
FPGA Based VLSI Design
Debargha Chakraborty
 
Remote sensing and control of an irrigation system using a distributed wirele...
nithinreddykaithi
 
Intermediate Fabrics
Team-VLSI-ITMU
 
Coarse Grain Reconfigurable Floating Point Unit
AM Publications,India
 
FPGA-Architecture Basics and Description.pptx
ENAULHAQSHAIK
 
FPGA-Arch.ppt
KrishnaChaitanya139768
 
FPGA architecture.ppt
EcAlwinjolly
 
FPGA-Arch.ppt
gopikahari7
 
Implementation of resource sharing strategy for power optimization in embedde...
Alexander Decker
 
FPGA-Arch (3).ppt
Arthi579360
 
module nenddhd dhdbdh dehrbdbddnd d 1.pptx
kashinathvpillai51
 
Ad

Vlsi design process for low power design methodology using reconfigurable fpga

IJRET: International Journal of Research in Engineering and Technology ISSN: 2319-1163
__________________________________________________________________________________________
Volume: 01 Issue: 03 | Nov-2012, Available @ http://www.ijret.org 516

VLSI DESIGN PROCESS FOR LOW POWER DESIGN METHODOLOGY USING RECONFIGURABLE FPGA

R. Rajendra Prasad1, M. V. Subramanyam2, K. Satya Prasad3
1 Asso. Professor of ECE Department, N.B.K.R. Institute of Science and Technology, Vidyanagar, S. P. S. R. Nellore, Andhra Pradesh, India. rajendra_831@yahoo.co.in, rechalabhi812@gmail.com
2 Principal and Prof. ECE Department, Santhi Ram Engineering College, Nandyal, India, mvsraj@yahoo.com
3 Professor of ECE Department and Rector, JNTU Kakinada, Kakinada, India, prasad_kodati@yahoo.co.in

Abstract
Modern digital processing applications have an increasing demand for computational power while needing to preserve low power dissipation and high flexibility. For many applications, the growth of algorithmic complexity is already faster than the growth of computational power provided by discrete general-purpose processors. A typical approach to address this problem is the combination of a processor core with dedicated accelerators. Since changes in standards or algorithms can change the demands on the accelerators, an attractive alternative to highly customized VLSI macros is suggested with the usage of reconfigurable embedded FPGAs (eFPGAs).

Keyword: embedded FPGA, Fast computing, Hybrid design.

1. INTRODUCTION
FPGAs are widely used as an attractive compromise between highly efficient physically optimized VLSI designs and software programmable processors.
Due to their reconfigurability, FPGAs are highly flexible and allow for relatively short design cycles, since no physical changes to the underlying hardware have to be made in case of a redesign. At the same time, they offer lower physical implementation costs than software programmable processors, because the inherent parallelism of many algorithms can be exploited, in contrast to sequential processor architectures. As a result, commercial FPGA architectures have been optimized to suit a wide variety of applications, from network-related and digital signal processing to the realization of soft-core processors.

For an embedded FPGA used as a configurable accelerator, however, the requirements concerning the provided resources are often well defined and much narrower than for discrete or “general purpose” FPGAs. Hence, eFPGAs can be optimized for a certain set of applications and thus achieve higher efficiency in terms of power dissipation, area and speed. First investigations on a reconfigurable ASIP with a reconfigurable accelerator based on a parametrisable eFPGA architecture have shown significant improvements in energy and area efficiency [5].

2. eFPGA-ARCHITECTURE
The eFPGA architecture presented here is based on a highly parametrisable architecture template targeting an arithmetic-oriented application domain. Some of the template's parameters are described by a single value (e.g. the number of LEs in a row and column), while others require a more complex definition (e.g. the connectivity per switch point). In the following, the architectural components and the corresponding parameters are discussed in detail.

A typical characteristic of arithmetic data paths is the organization in function slices and bit slices. A function slice represents one of many consecutively processed elementary functions (e.g. n-bit addition, n-bit XOR operation etc.), while a bit slice represents all processing elements in the same column corresponding to the same bit value (e.g. bit 0 of two successive function slices). In a typical arithmetic data path scheme, most communication between function slices and bit slices is local, i.e. only between direct neighbors. In addition, operands are typically fed to the data path using a broadcast scheme.

The eFPGA architecture reflects typical arithmetic data path schemes by using two-dimensional clusters of logic elements with a distributed interconnect rather than one-dimensional clusters with a central connection box. The signals coming from the connection box are distributed to the logic elements in rows and columns according to the function slices and bit slices, such that all logic elements in a row or column share the same input signals using so-called broadcast lines. This reduces the number of signals that need to be provided by the connection box and hence reduces the significant overhead imposed by the configurable connection boxes.

The size of the cluster can be varied in the horizontal and vertical direction independently. Also, the number of broadcast lines per row and column can be changed in designs based on the template. Broadcast lines can be fed to the cluster from all four directions, and in the same way the outputs of the LEs at all four cluster borders can be fed to the connection box. The actual connectivity can be any set of the four possible data directions (north, east, south, west) for inputs and outputs independently. Between adjacent clusters, feed-through stages are provided to use the broadcast lines of the neighboring clusters as inputs for the current cluster, hence creating virtually larger clusters by cascading several of them.

a) Logic elements
The local connectivity between the logic elements is provided by dedicated routing blocks (DRB) located in the logic element. Each DRB is a set of multiplexers used to connect broadcast signals or local signals to the core logic of the logic element. The actual connectivity can be defined in the architecture template by stating all sources connected to the DRBs with their offset to the actual LE. The functionality of the core logic itself is specified by a list of elementary Boolean functions that the LE can process (e.g. full addition, gated full addition etc.). Registers can be inserted per logic element or with a reduced density, e.g. every second LE row.

[Figure: cluster with broadcast lines, core logic, dedicated routing block (DRB) and local connections]

b) Configuration memories
To reduce the overhead of the memory cells (typically SRAMs) used to store the configuration of the FPGA's logic and routing resources, the present architecture template allows for sharing the configuration bits and thus configuring several adjacent elements identically.
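The effect of sharing configuration bits can be illustrated with a small C sketch. It is not taken from the authors' tool; the function names, and the assumption that exactly `share` adjacent elements map to one configuration block, are hypothetical and chosen only to make the arithmetic concrete.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: index of the shared configuration block that
 * configures element `elem` when `share` adjacent elements are
 * configured identically (share == 1 means no sharing). */
size_t shared_block_index(size_t elem, size_t share)
{
    assert(share > 0);
    return elem / share;
}

/* Hypothetical helper: total number of configuration bits needed for
 * `n_elems` elements that each require `bits_per_elem` bits, when
 * `share` adjacent elements use one configuration block. */
size_t config_bits_needed(size_t n_elems, size_t bits_per_elem, size_t share)
{
    assert(share > 0);
    size_t blocks = (n_elems + share - 1) / share; /* ceiling division */
    return blocks * bits_per_elem;
}
```

Under these assumptions, 32 elements with 6 configuration bits each would need 192 bits without sharing and 48 bits when four neighbors share a block, at the price of all four being configured identically.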
The sharing scheme is applied to the logic elements as well as to the interconnect resources, where adjacent switch points or connection points can share a single SRAM block. The degree of SRAM sharing can be adjusted in the architecture template within reasonable limits.

c) Routing switch
The routing switch of an FPGA is a set of switch points that are located at the crossing points of horizontal and vertical routing tracks. The number of switch points available, as well as their connectivity, determines the flexibility of the complete routing switch. It was shown that it is not necessary to provide a fully populated routing switch to achieve a good amount of flexibility [6].

The architecture template presented here is very flexible concerning the definition of available routing resources. The number of routing tracks in the horizontal and vertical direction can be chosen independently. Each switch point is defined by its position in the matrix of crossing lines as well as the connectivity inside the switch point. Different switch points can have different flexibility. In addition, the segmentation of the interconnect can be adjusted by assigning each routing track a certain segment length, corresponding to the number of routing switches that are bypassed before the line connects to the next routing switch.

d) Connection box
Finally, the architecture template offers a highly flexible description of the connection box, similar to the routing switch definition. Three types of routing channels are supported: fully connected, periodic connectivity and unconnected. Fully connected tracks offer full population of the connection box, i.e. each track can connect to each corresponding broadcast line of a cluster. However, they have the highest implementation costs. Unconnected tracks can be implemented for fast signal routing, as the capacitive load of these wires can be kept very low.
Periodic tracks use a special connection type best suited for arithmetic data paths, where signals on a bus are typically ordered by the weight of their bits. Accordingly, periodic routing channels have a window of connection points that slides across the tracks with a given velocity. The connection box defined in the architecture template can be composed of any mix of tracks with different channel widths and sliding window specifications.

3. PROPOSED SYSTEM DESIGN
As many of the architectural features in the presented eFPGA are unique and not common in standard FPGAs, there is currently no tool support available. Most research conducted in the field of eFPGA architectures is based on the VPR design flow [7], which can only be used to model standard island-style FPGA architectures with LUT-based logic elements and a small choice of routing switch architectures. Hence, an important goal of this work is the creation of a self-contained design methodology to design application-domain-specific eFPGAs, together with the corresponding basic tool support.

Figure 1: design flow for the implementation of layout design

The architecture template described above was formulated as a high-level description using the C language. Based on this architecture description, three main steps are supported by the design flow. First, a layout generator creates a VLSI layout of the specified eFPGA based on a small set of handcrafted, physically optimized basic cells such as the multiplexers for the DRB or the switch points of the routing switch. Several studies concerning this part of the design flow have been published before [8] [9]. While the first automatically generated eFPGA layouts still needed to be configured manually (i.e. each SRAM cell had to be configured with the proper value), an automatic bit stream generator supporting the complete architecture template is currently under construction. The configuration bit stream is used to conduct net list simulations based on layout-extracted net lists. To verify the functionality of the eFPGA macro, a VHDL generator creates a functional description that can be simulated using common simulation tools like ModelSim.

The output of the configuration is based on a net list that describes the signal flow graph mapped to the eFPGA. Currently, the net list description is still complex, as each logic element and each routing resource has to be described in it. However, for arithmetic data paths this net list is highly regular, which reduces the effort to generate it manually. At present, generating the configuration bits and routing the signals between the logic elements is very time-consuming and error-prone. The placement process is less complex due to the regularity of the examples considered here, i.e. arithmetic data paths. Consequently, a future step will be the implementation of a routing tool supporting the parametrisable architecture.

Similar approaches to automatic eFPGA design have been proposed, e.g. with GILES [10] or PYTHAGOR [11]. However, those design flows have significant constraints regarding the FPGA architectures (e.g. only island-style FPGAs are supported) and the physical implementation style. As an example, using standard cell implementations (e.g. as proposed by PYTHAGOR) leads to unfavorable physical implementation costs concerning area, performance and power dissipation. The first results presented here are based on a completely functional co-design of a routing switch layout, the corresponding VHDL model and the configuration bit stream based on a given net list.

a) Layout generator
The layout generator is based on prior work on the automated VLSI design of regular data paths [12]. This so-called data path generator uses a textual description of a signal flow graph (SFG) and a small set of hand-designed layout cells to generate a layout. Since the textual SFG description can be parameterized, the data path generator allows for a very flexible implementation process, e.g. when parameters like word lengths are changed in the SFG. Starting from the C-based high-level description of the eFPGA architecture, an SFG description suited to the data path generator is generated automatically. Due to the highly modular design style, the eFPGA macro can be ported to different CMOS technologies with small effort, since only few hand-designed layout cells are required. After the layout is generated, standard net list extraction and simulation tools can be used to characterize the eFPGA macro in terms of area, timing and power dissipation.

b) VHDL-generator
Based on the architecture description, a VHDL model of the eFPGA is created automatically. It incorporates the functional description of the basic configurable elements like routing switch points or logic elements and combines them according to the architectural parameters. The VHDL model of the eFPGA is used to verify the functionality defined by the net list using existing simulation tools.
It is also useful for testing the correctness of the eFPGA macro created by the layout generator, by co-simulation of the layout-extracted net list and the functional VHDL model.

c) Configuration
To enable simulations of the eFPGA macro (on the functional as well as on the net list level), all configuration bits have to be set properly. Due to the very large number of configuration bits, it is necessary to have an automated way of creating the bit stream from the mapped net list. Existing bit stream generators like DAGGER [13] lack support for highly parametrisable eFPGA architectures such as the one described here. The configuration step elaborated as part of the design methodology presented here creates configuration bit streams based on the net list and the architecture specifications. It also uses the information from the layout generator to determine the actual position of all blocks to be configured in the macro. The configuration bit stream is composed of elementary configuration table entries that must be provided for the basic eFPGA elements like routing switch points or logic elements. The elementary tables can be created with small effort, as only few bits are required to configure these basic elements. The bit stream is then concatenated according to the position of the elements in the overall macro.

4. DESIGN EXAMPLE
As a first step in verifying the proposed design methodology, a routing switch generator comprising all elements of the design flow was implemented. As an example, a routing switch with 32 tracks in both the horizontal and vertical direction was specified in the corresponding architecture description. Switch points with different flexibility are provided as exemplary basic components for the routing switch. As the C-based description is on an abstract level, the architecture specification for the complete routing switch can be created within a few minutes. Three global parameters describe the channel widths and the use of configuration sharing.
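As a rough illustration of what such a C-based description might look like, the sketch below captures the three global parameters for the 32-by-32 example. The paper does not show its actual data structures, so the struct, field and function names are hypothetical, and the sharing degree of 1 is an assumed placeholder value.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical global parameters of a routing switch description. */
struct rs_params {
    unsigned h_tracks;  /* channel width in the horizontal direction */
    unsigned v_tracks;  /* channel width in the vertical direction   */
    unsigned cfg_share; /* adjacent switch points sharing one
                           configuration block (1 = no sharing)      */
};

/* A description is usable only if all three parameters are non-zero. */
bool rs_params_valid(const struct rs_params *p)
{
    return p->h_tracks > 0 && p->v_tracks > 0 && p->cfg_share > 0;
}

/* Number of crossing points of horizontal and vertical tracks, i.e. the
 * positions at which a switch point could be placed. */
unsigned rs_crossing_points(const struct rs_params *p)
{
    assert(rs_params_valid(p));
    return p->h_tracks * p->v_tracks;
}

/* The 32 x 32 routing switch from the text; cfg_share = 1 is an
 * assumption, the paper does not state the value used. */
const struct rs_params example_rs = {
    .h_tracks = 32, .v_tracks = 32, .cfg_share = 1
};
```

With these values the matrix of crossing lines has 1024 positions, which bounds the number of switch points in a fully populated routing switch.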
Each switch point is defined by a set of potential signal routes according to their input and output directions (north, east, south, west). From the flexibility required by each switch point, the routing switch generator extracts a set of basic layout cells that need to be designed for the VLSI implementation. The design of the corresponding switch point macros and the SRAM cell required for the configuration storage takes some hours for a skilled designer. The layout generator automatically determines the optimum placement of the basic cells for a given aspect ratio of the routing switch. It also calculates the optimum aspect ratios of the configuration blocks that are required per set of switch points sharing the same configuration. Analyses of the influence of different parameters on area, timing and power dissipation can thus be conducted very quickly.

A simple net list describing the connections to be provided by the routing switch was used to automatically generate the configuration bit stream with more than 200 bits and the test bench for ModelSim (functional simulation) and Xilinx (realization). The backend tool for the analysis is developed for an eFPGA application and is processed for system-level implementation.

Figure 2: Simulation model for designing board level implementation using proposed analytical tool

A conventional application of such a design is carried out using an HDL simulator and the Xilinx synthesizer tool to evaluate the feasibility of the implementation flow for the suggested tool.

Figure 3: timing simulation observation for a coding system using HDL timing simulator

Figure 4: timing simulation observation for a decoding system using HDL timing simulator

Figure 5: physical layout obtained for a developed simulation model using Xilinx synthesizer

Design statistics obtained from the designed HDL definition:

Design Statistics
# IOs: 6
Cell Usage
# BELS: 1041
Macro Statistics
# Registers: 156
Maximum operating frequency: 102.008 MHz

CONCLUSIONS
The design methodology presented in this paper is an important step for the evaluation of embedded FPGAs that are optimized for a certain application domain. By using a common, highly flexible architecture template, the eFPGA architecture can be tailored to a given application domain systematically. The self-contained design methodology presented here enables the VLSI design and provides basic tools for verification and simulation. Hence, the complexity of mapping exemplary data paths to the eFPGA is reduced significantly compared to previous work. Using the simulation results based on actual VLSI layouts of the eFPGA, a high-level model of the architecture is currently evolving that allows for a systematic analysis of the dependencies between eFPGA architecture, mapped data paths and the resulting efficiency.
REFERENCES
[1] J. Hauser, “Integrated Circuits for Next Generation Wireless Systems”, Proceedings of the European Solid-State Circuits Conference (ESSCIRC) 2001, pp. 26-29.
[2] Stretch S6000, http://www.stretchinc.com
[3] MENTA eFPGA-augmented RISC CPUs (website), http://www.menta.fr/efpga_cpu.html
[4] A. Ye and J. Rose, “Using Bus-Based Connections to Improve Field-Programmable Gate Array Density for Implementing Datapath Circuits”, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Vol. 14, No. 5, pp. 462-473, May 2006.
[5] T. von Sydow, M. Korb, B. Neumann, H. Blume and T. G. Noll, “Modeling and Quantitative Analysis of Coupling Mechanisms of Programmable Processor Cores and Arithmetic Oriented eFPGA-macros”, in Proc. Reconfigurable Computing and FPGA's, pp. 252-261, 2006.
[6] G. Lemieux and D. Lewis, “Design of Interconnection Networks for Programmable Logic”, Kluwer Academic Publishers, 2004.
[7] V. Betz, J. Rose and A. Marquardt, “Architecture and CAD for Deep-Submicron FPGAs”, in Kluwer International Series in Engineering and Computer Science, 1999.
[8] T. von Sydow, B. Neumann, H. Blume and T. G. Noll, “Quantitative Analysis of embedded FPGA Architectures for Arithmetic”, in Proc. Application Specific Systems, Architectures and Processors Conference, pp. 125-131, 2006.
[9] B. Neumann, T. von Sydow, H. Blume and T. G. Noll, “Design and quantitative analysis of parametrisable eFPGA-architectures for arithmetic”, in Advances in Radio Science, Vol. 4, pp. 251-259, 2006.
[10] I. Kuon, A. Egier and J. Rose, “Design, Layout and Verification of an FPGA using Automated Tools”, in Proc. 2005 ACM/SIGDA 13th international symposium on Field programmable gate arrays, pp. 215-226, 2005.
[11] A. Danilin, M. Bennebroek and S. Sawitzki, “A novel toolset for the development of FPGA-like reconfigurable logic”, in Proc. FPL 2005, pp. 640-643, 2005.
[12] O. Weiss, M. Gansen and T. G. Noll, “A flexible Datapath Generator for Physical Oriented Design”, in Proc. European Solid-State Circuits Conference, pp. 408-411, 2001.
[13] K. Siozios et al., “DAGGER: A Novel Generic Methodology for FPGA Bitstream Generation and its Software Tool Implementation”, in Proc. Parallel and Distributed Processing Symposium 2005, p. 165b, 2005.