DeepSeek's reasoning AI shows power of small models, efficiently trained

By NICK PLOWDEN posted 2 days ago

DeepSeek-R1, the AI model from Chinese startup DeepSeek, soared to the top of the charts of the most downloaded and active models on the AI open-source platform Hugging Face hours after its launch last week. It also sent shockwaves through the financial markets as it prompted investors to reconsider the valuations of chipmakers like NVIDIA and the colossal investments that American AI giants are making to scale their AI businesses.

Why all the buzz? A so-called "reasoning model," DeepSeek-R1 is a digital assistant that performs as well as OpenAI’s o1 on certain AI benchmarks for math and coding tasks, was trained with far fewer chips, and is approximately 96% cheaper to use, according to the company.

“DeepSeek is definitely reshaping the AI landscape, challenging giants with open-source ambition and state-of-the-art innovations,” says Kaoutar El Maghraoui, a Principal Research Scientist and Manager at IBM AI Hardware.
