Nvidia wants to drive AI costs down as 'reasoning' models rise

Nvidia's CEO said the AI chipmaker is building new chips on a one-year cycle

We may earn a commission from links on this page.
Jensen Huang sitting in a chair, speaking, with both of us hands up, in front of a purple backdrop
Nvidia CEO Jensen Huang at the Bipartisan Policy Center on September 27, 2024 in Washington, D.C.
Photo: Chip Somodevilla (Getty Images)
In This Story

Nvidia’s (NVDA+3.20%) chips have been a driver of the current artificial intelligence boom — and the chipmaker only wants to make it move faster, chief executive Jensen Huang said.

During an appearance on the Tech Unheard podcast, Huang was asked by host Rene Haas, chief executive of British semiconductor company Arm (ARM+1.23%), if the pace of AI innovation is moving “faster than you imagined.”

Advertisement

Huang replied, “No,” and said Nvidia is “trying to make it go faster.”

“We’ve gone to a one-year cycle,” Huang said about the company’s production of new chips. “And the reason for that is because the technology has the opportunity to move fast.”

Advertisement

Nvidia is designing “six or seven new chips per system,” Huang said, then using “co-design to reinvent the entire system” and inventing other technologies that allow it to improve performance by two or three times while using the same amount of energy and cost each year.

“That’s another way of, essentially, reducing the cost of AI by two or three times per year,” Huang said. “That is way faster than Moore’s Law.”

Advertisement

Over the next few years, Huang said, Nvidia wants to drive down the cost of AI amid the rise of even more complex models. This evolution is already underway. In September, OpenAI released a new series of “reasoning” AI models called o1, which are “designed to spend more time thinking before they respond,” the way humans do.

In the future, AI services such as OpenAI’s ChatGPT, which Huang said he uses every day, will “iteratively reason about the answer” and could go through hundreds or thousands of inferences before producing an output.

Advertisement

This level of complex processing would require significantly more computational power than current models. Despite the increased demands, Huang believes the trade-off is worthwhile.

“[T]he quality of the answer is so much better,” Huang said. “We want to drive the cost down so that we could deliver this new type of reasoning inference with the same level of cost and responsiveness as the past.”

Advertisement

On Wednesday, Nvidia’s stock was climbing back toward its record high $135 close in June. The chipmaker’s shares were down 0.27% during midday trading, but opened up almost 1% at around $134 per share.