|
Device power consumption is a primary
issue in the semiconductor industry as
process technologies get smaller and faster,
they normally consume more power, putting
power concerns and performance at
odds. The new Virtex-4 FPGA family
from Xilinx® employs innovative architectural
features and clever IC design techniques
that dramatically reduce power
consumption, without compromising performance.
This bucks expected trends normally
associated with the reduced feature
sizes of 90 nm process technology.
In this article, well explore how Xilinx IC
designers achieved remarkable power efficiency
in the high-performance Virtex-4 FPGA.
Components of Power Consumption
There are two main components to power
consumption: static and dynamic. Static or
quiescent power is mainly dominated by
transistor leakage current. When this current
is listed in data sheets, it is listed as ICCINTQ
and is the current drawn through the
VCCINT supply powering the FPGA core.
Dynamic or active power has components
from both the switching power of the core of
the FPGA and the I/O being switched. The
dynamic power consumption is determined
by the node capacitance, supply voltage, and
switching frequency and governed by the
basic formula P=CV 2.
Both static and dynamic power have
been significantly reduced in Virtex-4
devices, even when compared to Virtex-II
Pro devices.
Dramatic Power Reduction
The Virtex-4 product family has reduced
power consumption in several key areas.
The power-per-CLB has been cut in half,
with static power reduced by 40% and
dynamic power reduced by 50% when
compared to the 130 nm Virtex-II Pro
FPGA and other 90 nm FPGAs.
Furthermore, certain hard-logic silicon
functions in the Virtex-4 FPGA reduce
power consumption by 80-95%, a whopping
factor when compared to the same
functions implemented in configurable
logic blocks and programmable interconnect
routing.
Additionally, comprehensive power
planning tools are available to help you
get an idea, up front, of power consumption
for your Xilinx FPGA under its operating
conditions.
Reduced Power Consumption Benefits
Reduced power consumption benefits cut
across a few areas of product design in
reduced thermal concerns as well as eased
power supply design (see Figure 1).
- Reduced thermal concerns When
you reduce power consumption in a
device or system, you use smaller heat
sinks, or no heat sinks at all in some
cases. You also have simpler thermal
system design from the point of view
of reducing airflows and fan size needs.
- Easier power supply design You can
also use smaller supply circuitry and
reduce the number of components in
the power supply. Using less PCB
space allows you to reduce the cost of
the power system. Plus, by not having
your device consume as much power,
you can achieve higher reliability by
lowering the temperature of the
FPGA die.
Static Power Trends in 90 nm Technology
The reduction in transistor size in 90 nm
technology has several effects on power consumption.
The biggest potential problem is
in the area of static power.
Scaling Trends for Static Power
As we mentioned earlier, static power is dominated
by transistor leakage current.
Unfortunately, channel leakage increases as
transistor size decreases. This is especially true
for low VT transistors where VT refers to voltage
threshold between the gate and drain.
Low VT transistors are the fastest transistors
the ones with the shortest turn-on and
propagation delay that IC designers use
inside the FPGA when the highest speed performance
is needed. Regular VT transistors are
also used when less performance is acceptable,
but this only helps so much with leakage.
Figure 2 shows that leakage goes up dramatically
when moving from 130 nm to 90
nm technology. The Virtex-II Pro device
uses 130 nm process technology, whereas
the new Virtex-4 device uses 90 nm process
technology.
Triple-Oxide The Savior of Static Power
Triple-oxide simply means that we use a
third thickness of oxide in making some of
the transistors in the FPGA (two oxide
thicknesses are used in devices like the
Virtex-II Pro FPGA). Most transistors in the
past had a thin oxide layer. Within those
transistors could be low VT, regular VT,
NMOS, or PMOS transistors. Thick-oxide
transistors are mostly used for I/O drivers
and a few other functions.
Oxide deposition thickness is a very stable
and controllable process in the semiconductor
industry because it depends on
temperature, concentration, and exposure time. Figure 3a/ 3b shows the Virtex-4
transistor with the middle oxide thickness
used in the triple-oxide process. You may
notice that the oxide thickness is still very,
very thin, but this thicker oxide transistor
has much lower leakage than the standard
thin-oxide low VT and regular VT transistors
used in Virtex-II Pro FPGAs and in
various parts of Virtex-4 FPGAs.
Why Doesnt Everyone Use Triple-Oxide?
If triple-oxide is such a great process, why
dont other companies like Intel or
IBM use it in their own ASICs?
They probably would if
it benefited them. The reason
they dont is that all of
their transistors need to run
at speed; hence, they must
use the low VT leakier transistors
for everything. FPGAs can have many different
transistor types,
which can be selected for
function, power, or performance.
FPGAs can use different transistor
types for different functions, and Xilinx
designers have accomplished this balance.
Optimizing Performance and Leakage
Our IC designers have many things that
they can do to adjust the mix to optimize for
certain factors. The Virtex-4 FPGA is the
first Platform FPGA designed for high speed
and low power.
Low VT transistors are used only where
necessary for maximum speed, while the middle
thickness of oxide from the triple-oxide
process may be used for less aggressive performance
with very low leakage. You may use
different sizes and types of transistors for performance
and function. Combinations are
also possible, such as small and medium-sized
low VT fast transistors and small and medium-sized middle oxide thickness transistors. It
is not a one-size-fits-all procedure.
Xilinx IC designers were given a directive
to reduce power, among other things, in the
Virtex-4 platform while maintaining the
highest system performance. These transistors
are used across the various FPGA functions of
LUTs, I/O, interconnect, and configuration
memory cells. Even within a given FPGA
function, all transistors dont need to be the
same, and that is up to the Xilinx IC designers
(see Figure 4).
The surprising result of this balancing is
that the overall static current in Virtex-4
devices with 90 nm process is reduced by 40%
when compared to Virtex-II Pro devices with
130 nm process. Table 1 shows a chart of the
weighted average changes to the transistors in
the Virtex-4 die compared to Virtex-II Pro
die, which allows you to arrive at the reduced
transistor leakage in the Virtex-4 FPGA.
Dynamic Power Reduction
Static power reduction, while dramatic, is
not the only power winner that you can
take advantage of. Dynamic power is also
reduced by 50% when compared to
Virtex-II Pro FPGAs.
The dynamic power in the FPGA is
governed by the following equation:
PDynamic=FPGACore (CV 2 )+FPGAI/O (CV 2 )
The Virtex-4 family of FPGAs has the
following:
- Reduced FPGA core dynamic power
- Internal operating voltage is the
dominant factor
- Secondary scaling by frequency (f )
and node capacitance (C)
- Constant FPGA I/O dynamic power
- Unchanged voltage swing (VI/O),
toggle rate (f ), and pin/pad capacitance
(C) for a given I/O standard
So you can see that we may be able to
have an effect on dynamic power inside the
device, but that dynamic power consumed
by I/O switching remains unchanged.
When we go from the 130 nm process
of the Virtex-II Pro FPGA to the 90 nm
process of the Virtex-4 FPGA, the internal
supply voltage changes from 1.5V to
1.2V. This reduces the dynamic power
consumption for every internal transistor
by of that in the Virtex-II Pro FPGA.
Additionally, the FPGA internal composite
capacitance is reduced in the Virtex-4
FPGA. This internal capacitance comprises
transistor parasitic capacitances and traceto-
metal and trace-to-trace capacitances for
the interconnecting metal traces. Figure 5
shows the capacitance involved relative to
their structures.
Does low-K reduce power? Low-K refers
to the dielectric insulating material
between the metal traces in the FPGA.
Lower K dielectric insulating layers do
reduce internal capacitances per unit trace
length, but low-K is a relative term.
Xilinx has reduced-K-insulating materials,
and in the past has used low-K itself; we
may do so again in the future.
As mentioned earlier, dynamic power
is related to the bulk capacitance and
internal voltage levels being switched,
P=CV 2. All things being equal, having
a lower internal capacitance for the interconnects
would be a benefit for dynamic
power and reduced resistor-capacitor
delay, but other factors contribute to
interconnect capacitance, such as distance
above the metal plane, interconnect
width, and interconnect length.
Additionally, other parasitic capacitances
such as gate-to-drain and gate-tosource
are also part of the equation. Total
capacitance for a path is based on a complex
combination of parasitic capacitance
in the transistors; the architecture of the
interconnect paths and actual path
lengths; and the number of hops through
interconnect switches. Xilinx has reduced
the overall capacitance for those components
in the Virtex-4 FPGA.
The overall effect is mostly due to
reduced gate capacitance and lowers capacitance
by 20% for Virtex-4 FPGAs when
compared to Virtex-II Pro FPGAs. Table 2
shows a dynamic power reduction of 50%
for the Virtex-4 FPGA when compared to
the Virtex-II Pro FPGA. We have a 23%
reduction in dynamic power when running
at a 50% higher frequency.
Because the Virtex-4 FPGA is a much
higher performance device than the Virtex-II Pro FPGA, you may need to operate it at
higher clock speeds to meet newer
demanding performance targets that could
never be achieved in previous systems.
Embedded Blocks
Another major area of improvement in
power consumption is in the area of
embedded functions. This has always
been a strength in Xilinx FPGAs, but it is
more so in the Virtex-4 FPGA, even
when compared to the feature-rich
Virtex-II Pro FPGA.
In Virtex-4 FPGAs you can take further
advantage of both static and dynamic power
reduction by using the embedded functions,
which are built as hard-logic functions.
When embedded functions are implemented
as hard-logic functions instead of
in configurable logic blocks and programmable
interconnects, there is a lot less
static and dynamic power consumed. This
is because far fewer transistors are used for
hard, fixed logic than for programmable
logic. Additionally, there are no transistors
needed to make connections for interconnects
in the embedded functions, because
there are no programmable interconnects.
Xilinx has carefully studied some of the
functions that engineers like you have
struggled with and that we have also
found tedious to implement within the
FPGA programmable logic. The new
embedded functions lower power by 80-95% compared to their configurable logic
blocks and routed counterparts in programmable
silicon.
Comprehensive Power Planning Tools
Another useful thing in planning power is
that Xilinx data sheets show you both typical
and maximum power consumption
numbers. Maximum numbers are for
worst-case process, temperature, and voltage,
but many designers are very happy to
work with typical numbers, depending on
their application and the number of parts
being used in one system.
One additional very useful thing that
you can take advantage of in planning for
power consumption in Xilinx FPGAs are
power planning tools. Xilinx web power
tools are available for estimating power
early in the design cycle. Also, as part of the
Xilinx design flow, XPower looks in more
detail at a mapped or routed design. Both
can be found, along with power application
notes, by searching the Xilinx website for
the phrase Xilinx Power Tools.
Conclusion
Xilinx has made profound improvements in
both static and dynamic power in the Virtex-4 90 nm family of FPGAs when compared
to Virtex-II Pro FPGAs and (we believe) in
comparison to our competitors. We have
done this through a multi-pronged, purposeful
approach in the areas of reduced leakage
current, reduced dynamic power consumption,
and embedded functions, without
compromising performance. These, along
with comprehensive power planning tools,
make the Virtex-4 device an excellent choice
for a high-performance FPGA system.
For more information about power consumption
in Virtex-4 and other Xilinx
FPGAs, visit www.xilinx.com/products/design_resources/design_tool/grouping/power_tools.htm.
Virtex-4 Embedded Functions and
Reduction of Dynamic Power
- PowerPC 50% power reduction
compared to Virtex-II Pro PowerPC
- 10:1 power reduction over FPGA
fabric-built version
- DSP XtremeDSP slice greatly
reduces logic cells, which previously
needed many filtering functions
- 20:1 power reduction over Virtex-II
Pro separated multiply/accumulate
functions
- SSIO New ChipSync block
reduces logic cell count for SSIO
(source synchronous I/O) designs
- Significant logic cell savings for various
memory and networking interface
designs leads to reduction in
overall power up to 9:1 for selected
designs (see Table 3)
- Embedded Ethernet MAC(s) No
need to use logic and interconnect
for MAC function, which saves
>3,000 logic cells for the Xilinx
implementation
- FIFO SmartRAM memory
includes built-in FIFO controllers,
which can save hundreds of logic
cells per FIFO and greatly simplify
design as well
|
Printable PDF version of this article with graphics. (1/15/05) 328 KB |