When you purchase through links on our site, we may earn an affiliate commission.Heres how it works.

AWS also offers DeepSeek-R1-Distill, a lighter version, through Amazon Bedrock Custom Model Import.

This serverless deployment simplifies infrastructure management while maintaining scalability.

A person using DeepSeek on their smartphone

It also benefits from Nvidias Hopper architecture, using FP8 Transformer Engine acceleration and NVLink connectivity.

Running on an HGX H200 system, DeepSeek-R1 can generate up to 3,872 tokens per second.

Microsoft has also implemented extensive safety measures, including content filtering and automated assessments.