Global Sources
EE Times-Asia
Stay in touch with EE Times Asia
EE Times-Asia > Networks

New tech pares down I/O power draw

Posted: 02 Apr 2007 ?? ?Print Version ?Bookmark and Share

Keywords:Serdes transceiver? interconnect standards? high-speed Serdes?

Rambus Inc. and a group of leading academics have set a high-water mark in bringing fast chip-to-chip links into the era of low-power design. Their advance suggests the industry needs to rethink some fundamental assumptions in the way it defines, measures and implements power in interconnect designs.

At the International Solid-State Circuits Conference in February, the Rambus group described a 6Gbps Serdes transceiver that draws just 2.2mW/Gbps in 90nm process technology. Today's designs can suck as much as 10 times that amount of power from a processor.

By comparison, in the same conference, Texas Instruments Inc. disclosed a 12.5Gbps Serdes made in 65nm process technology that consumes 27.5mW/Gbps. Sony Corp. presented a paper on a 10Gbps transceiver in 90nm technology that dissipates 25mW/Gbps.

Even Rambus' own current products are power hogs by comparison. Both the Rambus XDR and FlexIO interconnects consume about 20mW/Gbps.

Rambus officials won't comment on just when or how the company will bring the technology to market. But they did say there are no major hurdles standing in the way.

Mainstream opportunity
"This test chip is amazingly good, and we are all very happy about that," said chief scientist Mark Horowitz, a Rambus founder who was one of the paper's co-authors. "For many papers, you have to stand on your tippy-toes and pray to the demo gods that it will be just so. That's not the case here. There are some robustness and volume-manufacturing issues, but nothing that concerns me." Horowitz is also a professor of computer science and electrical engineering at Stanford University.

Indeed, the technology may represent a more mainstream opportunity than the high-end, speed-driven interconnects for which Rambus is usually known.

"This is intended for high-volume computer and consumer applications," said Kevin Donnelly, senior VP of engineering at Rambus. A separate project in the works at Rambus aims to hit the same 2.2mW/Gbps power level, but will deliver as much as four times the performance using 65nm technology. "The power budget for the next generation has to be the same as the old power budget," said Donnelly.

Where did the power go? A rough analysis of typical Serdes transceiver shows power dissipation is widely distributed.

In fact, the company hopes to establish the metric of 1mW/Gbps as the new gold standard for measuring I/O power, just as MIPS per watt has largely replaced megahertz as the main figure of merit for microprocessors.

Besides resetting the bar in low-power Serdes design, the paper's message to the industry is that the 1V signaling written into a host of interconnect standards is no longer acceptable. The Rambus team used signaling rates below 200mV to drive a new level in low-power interconnects.

Sharpening skills
Engineers will need to sharpen their pencils on a broad range of techniques the team used to build receivers that can handle the low-voltage signaling. Those techniques include aggressively driving functions from logic to software and shifting much of the focus on adaptive signal equalization from the transmitter to the receiver.

"A demo like this can change people's views about what's possible," said Horowitz. "If standards specify 1V signaling, they are not doing low-power I/O."

"The reason people require 1V swings is that they can't figure out how to make better receivers. It requires use of sophisticated and uncommon techniques," said Bill Dally, a co-author of the paper and chairman of the computer science department at Stanford.

Low-power chip-to-chip links are fast becoming a requirement as high-end microprocessors begin demanding as much as a terabit per second of I/O. "We have to do this work or I/O power will dominate systems as we move to chips with hundreds and even thousands of pins in the future," said a senior engineer who asked not to be named. "Most high-end chips being developed today are power-limited," added Horowitz.

In the same conference session where Rambus presented, other researchers from UCLA will describe a 10Gbps-per-pin interconnect that uses two RF techniques along with capacitive coupling to create an UWB link that dissipates as little as 2.7mW/pin in 180nm technology. The design aims to link dice in a 3D stack of chips.

Receiver keeps hardware budget trim. Software processing and reused logic let Rambus eliminate interpolation block.

Sun Microsystems Inc. has been developing a form of capacitive-coupling technology for chip-to-chip links for at least three years. Sun's Proximity technology was closely tied to its efforts on a government supercomputing project opportunity that Sun lost late last year. At the time, Sun executives said development would move forward for the technology. Proximity promises 100 times more links than today's BGA packages.

Power milestone
The Rambus work used a combination of techniques to achieve its 2.2mW/Gbps milestone, including a voltage-mode signaling technique developed by UCLA researcher Ken Yang. "There isn't any one thing you can do to reduce the overall power," said Horowitz. "You need innovations in logic, clocking, signaling, as well as in the basic transmitter and receiver. We had some synergies in some areas, and in others, we just had to be clever."

Two big areas of innovation were in shifting the focus of adaptive equalization from the transmitter to the receiver and driving more of the equalization work from dedicated hardware to software on a basic MCU. "If you start at the far end of the receiver and work your way back after you deliver a receiver that is as sensitive as possible, you get a cycle of goodness in power savings all across the system," said co-author John Poulton, a computer science professor at the University of North Carolina at Chapel Hill.

Researchers found a way to use existing information from a clock-data recovery (CDR) unit to measure the loss of high frequencies in the signal and compensate by effectively turning up what Dally called "a treble knob" in the receiver.

CDR data on when the system clock is early and late "gives you exactly what you need to know about whether the signal eye is closed or open," said lead author Robert Palmer, who presented the ISSCC paper. Having that data, along with some clever use of other existing blocks, eliminates the need for oscillator and interpolator logic used in typical receivers.

"There was no extra hardware to handle the adaptive equalization," said Palmer. "In a typical design, the interpolation logic uses as much power as our whole link," said Dally. The resulting sensitivity of the receiver allowed engineers to eliminate signal pre-emphasis and multitap equalization with hardware FIR filters typically used at the transmitter stage for high-speed Serdes. "It takes a lot of energy to use precompensation," said Horowitz. "Many people will be surprised at how aggressive we have been in pushing more functions from logic into software."

- Rick Merritt
EE Times

Article Comments - New tech pares down I/O power draw
*? You can enter [0] more charecters.
*Verify code:


Visit Asia Webinars to learn about the latest in technology and get practical design tips.

Back to Top