DeepSeek
🎉 DeepSeek-R1 is now live and open source, rivaling OpenAI's Model o1. Available on web, app, and API. Click for details. Free access to DeepSeek-V3. Experience the intelligent model. DeepSeek-V3 achieves a significant breakthrough in inference speed over previous models.
deepseek-ai/DeepSeek-R1 - GitHub
DeepSeek-R1 achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks. To support the research community, we have open-sourced DeepSeek-R1-Zero, DeepSeek-R1, and six dense models distilled from DeepSeek-R1 based on Llama and Qwen.
DeepSeek-R1 - Revolutionary Reasoning-Focused Language Model
Experience DeepSeek-R1, a breakthrough in AI reasoning capabilities, achieving exceptional performance in mathematics, programming, and complex problem-solving through innovative reinforcement learning.
DeepSeek-R1 Release | DeepSeek API Docs
Jul 25, 2024 · 🔄 DeepSeek-R1 is now MIT licensed for clear open access. 🔓 Open for the community to leverage model weights & outputs. 🛠️ API outputs can now be used for fine-tuning & distillation
DeepSeek R1 is now available on Azure AI Foundry and GitHub
Jan 29, 2025 · DeepSeek R1 has undergone rigorous red teaming and safety evaluations, including automated assessments of model behavior and extensive security reviews to mitigate potential risks. With Azure AI Content Safety, built-in content filtering is available by default, with opt-out options for flexibility.
deepseek-r1 Model by Deepseek-ai | NVIDIA NIM
DeepSeek-R1 is a first-generation reasoning model trained using large-scale reinforcement learning (RL) to solve complex reasoning tasks across domains such as math, code, and language. The model leverages RL to develop reasoning capabilities, which are further enhanced through supervised fine-tuning (SFT) to improve readability and coherence.
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via ...
Jan 22, 2025 · DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrates remarkable reasoning capabilities. Through RL, DeepSeek-R1-Zero naturally emerges with numerous powerful and intriguing reasoning behaviors.