
DeepSeek-R1
DeepSeek's first-generation reasoning models. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning without supervised fine-tuning, demonstrated remarkable performance on reasoning. - 스마트 AI 도구로 생산성 향상.
52
Views
0
Likes
Mar 2026
Added
bigscience.huggingface.co
Website
Editorial Review
매우 큰 규모의 다언어 모델.
글로벌 연구.
사용자: '다언어 AI의 이정표.'
Visit the official website to get started
Have an AI tool to share?
Get your product in front of people actively exploring AI tools.
Submit Your Tool
DeepSeek's first-generation reasoning models. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning without supervised fine-tuning, demonstrated remarkable performance on reasoning. - 스마트 AI 도구로 생산성 향상.

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. - 스마트 AI 도구로 생산성 향상.

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. - 스마트 AI 도구로 생산성 향상.

Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model. - 스마트 AI 도구로 생산성 향상.