Those are some cherry-picked numbers. NVL72 can do 360 fp16 pflops with sparsity, and it scales pretty well: performance roughly doubles at fp8 and doubles again at fp4. Point being, a different benchmark may tell a vastly different story.
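Just to make the precision scaling concrete, here's the back-of-envelope version, taking the 360 pflops (with sparsity) figure at face value and assuming a clean 2x step per halving of precision (real kernels rarely hit the ideal ratio):

```python
# Rough NVL72 peak throughput per precision.
# Assumption: sparsity-inclusive fp16 spec of 360 PFLOPS, ideal 2x per precision step.
FP16_PFLOPS = 360

fp8_pflops = FP16_PFLOPS * 2   # 720 PFLOPS
fp4_pflops = fp8_pflops * 2    # 1440 PFLOPS

print(f"fp16: {FP16_PFLOPS}, fp8: {fp8_pflops}, fp4: {fp4_pflops}")
```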
Getting 16 racks to work together is quite a feat, and the engineering in the interconnects sounds like it's carrying the show. How far can it scale? Will a 32-rack deployment double the performance? Nvidia is no slouch in interconnects either. I honestly don't know how a 16-rack system scales, but NVL72 fits in a single rack. What would 16 of those do?
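For a sense of scale, here's the naive arithmetic for 16 single-rack NVL72 systems, assuming perfect linear scaling (which real interconnect overhead would shave down):

```python
# Hypothetical aggregate for 16 NVL72 racks.
# Assumptions: 360 PFLOPS fp16 (with sparsity) per rack, ideal linear scaling.
RACKS = 16
NVL72_FP16_PFLOPS = 360

aggregate = RACKS * NVL72_FP16_PFLOPS  # 5760 PFLOPS
print(f"{RACKS} racks ~ {aggregate} PFLOPS fp16, before any interconnect loss")
```

The real question is how much of that ideal number survives the cross-rack fabric, which is exactly where the interconnect engineering matters.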
Lastly, didn't I read on Tom's somewhere that the 910C was manufactured by TSMC, and without their knowledge? How many of these chips can be sourced going forward? Catching up by deploying 4x as many chips only works if you can get 4x as many chips.