DeepSeek: Nvidia GPUs Trail AMD MI300 by 25% in Edge AI Inference

NVDANVDA•26d ago

DeepSeek’s benchmarking reveals Nvidia’s Hopper GPUs deliver 25% lower inference throughput per watt than AMD’s MI300 accelerators in edge AI workloads. Major cloud providers have paused new Nvidia deployments and are evaluating alternative chips, highlighting a missed opportunity for Nvidia in the growing AI inference market.

1. DeepSeek Benchmark Exposes GPU Performance Gap

DeepSeek’s report measures inference performance and power efficiency of Nvidia’s latest Hopper GPUs against AMD’s MI300 accelerators and other rival chips using standard vision transformer and language models. The results show Nvidia GPUs delivered 25% lower throughput per watt compared to MI300 in edge AI tasks.

2. Cloud Providers Reevaluate Deployments

Major cloud service providers have halted orders for new Nvidia GPUs in their regional inference clusters and initiated pilots with AMD and Graphcore hardware. Operators cited the need for improved cost efficiency and power usage in latency-sensitive AI workloads.

3. Implications for Nvidia’s Market Strategy

The performance shortfall highlights a strategic blind spot for Nvidia in the high-margin AI inference segment, potentially ceding market share to competitors. Nvidia may need to adjust its roadmap or pricing to recapture growth in this rapidly expanding market.

Sources

FBFBF

Back to news

DeepSeek: Nvidia GPUs Trail AMD MI300 by 25% in Edge AI Inference

NVDANVDA•26d ago

1. DeepSeek Benchmark Exposes GPU Performance Gap

2. Cloud Providers Reevaluate Deployments

3. Implications for Nvidia’s Market Strategy

Sources

FBFBF

DeepSeek: Nvidia GPUs Trail AMD MI300 by 25% in Edge AI Inference

1. DeepSeek Benchmark Exposes GPU Performance Gap

2. Cloud Providers Reevaluate Deployments

3. Implications for Nvidia’s Market Strategy

Related News

OpenAI Eyes Fall IPO After Lawsuit Win, Battling $900B Anthropic Rival

Hub Group Delays Q1 Earnings After Revealing 2023-24 Misstatements, Stock Falls 12.5%

Silvercorp Secures 30-Year License Extension, Pays $60M for Tulkubash/Kyzyltash JV

iPower Cuts Q3 Costs 66%, Narrows Loss to $0.3M, Secures $2.6M Sublease

PepsiCo to raise single-serve chip bag prices by 10–20 cents in June

Sources

Ask about NVDA

DeepSeek: Nvidia GPUs Trail AMD MI300 by 25% in Edge AI Inference

1. DeepSeek Benchmark Exposes GPU Performance Gap

2. Cloud Providers Reevaluate Deployments

3. Implications for Nvidia’s Market Strategy

Related News

OpenAI Eyes Fall IPO After Lawsuit Win, Battling $900B Anthropic Rival

Hub Group Delays Q1 Earnings After Revealing 2023-24 Misstatements, Stock Falls 12.5%

Silvercorp Secures 30-Year License Extension, Pays $60M for Tulkubash/Kyzyltash JV

iPower Cuts Q3 Costs 66%, Narrows Loss to $0.3M, Secures $2.6M Sublease

PepsiCo to raise single-serve chip bag prices by 10–20 cents in June

Sources