Nvidia built a massive dual GPU to power models like ChatGPT

Nvidia’s semi-annual GPU Technology Conference (GTC) usually focuses on advancements in AI, but this year, Nvidia is responding to the massive rise of ChatGPT with a slate of new GPUs. Chief among them is the H100 NVL, which stitches two of Nvidia’s H100 GPUs together to deploy large language models (LLMs) like ChatGPT.

The H100 isn’t a new GPU. Nvidia announced it a year ago at GTC, sporting its Hopper architecture and promising to speed up AI inference in a variety of tasks. The new NVL model, with 94GB of memory per GPU (188GB in total), is said to work best when deploying LLMs at scale, offering up to 12 times faster inference than the last-generation A100.

These GPUs are at the heart of models like ChatGPT. Nvidia and Microsoft recently revealed that thousands of A100 GPUs were used to train ChatGPT, which is a project that’s been more than five years in the making.

The H100 NVL works by combining two H100 GPUs over Nvidia’s high-bandwidth NVLink interconnect. This is already possible with current H100 GPUs (in fact, you can connect up to 256 H100s together through NVLink), but this dedicated unit is built for smaller deployments.

This is a product built for businesses more than anything, so don’t expect to see the H100 NVL pop up on the shelf at your local Micro Center. However, Nvidia says enterprise customers can expect to see it around the second half of the year.

In addition to the H100 NVL, Nvidia announced the L4 GPU, which is built specifically to power AI-generated video. Nvidia says it’s 120 times more powerful for AI-generated video than a CPU and offers 99% better energy efficiency. Beyond generative AI video, Nvidia says the GPU sports video decoding and transcoding capabilities and can be used for augmented reality.

Nvidia says Google Cloud is among the first to integrate the L4. Google plans on offering L4 instances to customers through its Vertex AI platform later today. Nvidia said the GPU will be available from partners later, including Lenovo, Dell, Asus, HP, and Gigabyte, among others.
