DeepSeek Updates and Releases R1 AI Model on Hugging Face

Chinese AI startup DeepSeek has launched an updated version of its R1 reasoning model. The model is now available on the popular developer platform Hugging Face.

DeepSeek announced the update via WeChat, describing it as a minor upgrade. The R1 model is released under the permissive MIT license, allowing for commercial use.

A Powerful Model Now Accessible

The updated R1 has 685 billion parameters. At that size, running the unmodified model on standard consumer hardware is likely impractical, since the weights alone far exceed the memory of a typical consumer GPU.
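To put the parameter count in perspective, here is a rough back-of-the-envelope estimate of the memory needed just to hold the weights. The storage widths per parameter and the 24 GB consumer GPU figure are illustrative assumptions, not details from DeepSeek, and the estimate ignores activations, caches, and other runtime overhead.

```python
# Rough estimate of weight memory for a 685-billion-parameter model.
# Only the parameter count comes from the article; everything else
# below is an assumption chosen for illustration.

PARAMS = 685e9  # parameter count reported for the updated R1

# Assumed storage widths, in bytes per parameter.
BYTES_PER_PARAM = {
    "16-bit": 2.0,
    "8-bit": 1.0,
    "4-bit": 0.5,
}

# Assumed memory of a high-end consumer GPU, in GB (hypothetical figure).
CONSUMER_GPU_GB = 24.0


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Return the memory needed for the weights alone, in GB (10**9 bytes)."""
    return num_params * bytes_per_param / 1e9


def gpus_needed(memory_gb: float, gpu_gb: float = CONSUMER_GPU_GB) -> float:
    """Return how many GPUs' worth of memory the weights would occupy."""
    return memory_gb / gpu_gb


if __name__ == "__main__":
    for name, width in BYTES_PER_PARAM.items():
        mem = weight_memory_gb(PARAMS, width)
        print(f"{name}: about {mem:.1f} GB, "
              f"or {gpus_needed(mem):.1f}x a {CONSUMER_GPU_GB:.0f} GB GPU")
```

Even under the most aggressive assumption here (4 bits per parameter), the weights alone would take hundreds of gigabytes, which is why reduced or modified versions are the usual route for consumer hardware.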

The Hugging Face repository currently provides configuration files and weights for the model. Further documentation detailing the model's architecture and performance is anticipated.

DeepSeek's Rise in the AI Landscape

DeepSeek gained attention earlier this year with the initial release of R1, whose performance rivaled that of models from established players such as OpenAI. However, the company has also faced scrutiny from regulators concerned about potential national security implications.

The update is DeepSeek's latest step in developing and deploying advanced AI models. Because R1 is openly available on Hugging Face, developers can experiment with the model and potentially build on it.