When you purchase through links on our site, we may earn an affiliate commission.Heres how it works.
There are also new models with some multimodal capabilities, mainly in image creation and analysis.
Inference requires significant numbers of NVIDIA GPUs and high-performance networking.
We now have three scaling laws: pre-training and post-training, which continue, and new test-time scaling.
DeepSeek claims it used an innovative new training process to develop its LLMs using trial and error to self-improve.
Sam Altman also praised DeepSeeks R1 model, particularly around what they’re able to deliver for the price.
He reiterated that OpenAI will obviously deliver much better models, but he welcomed the competition.
Nvidia seems to be keeping its future cards closer to its chest.