
DeepSeek-R1
DeepSeek's first-generation reasoning models. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning without supervised fine-tuning, demonstrated remarkable performance on reasoning. - スマートな AI ツールで生産性を向上。
0
Views
0
Likes
Mar 2026
Added
bigscience.huggingface.co
Website
大規模な多言語モデル。
グローバルリサーチ。
ユーザー: '多言語AIの先駆け。'
Visit the official website to get started
Have an AI tool to share?
Submit Your Tool
DeepSeek's first-generation reasoning models. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning without supervised fine-tuning, demonstrated remarkable performance on reasoning. - スマートな AI ツールで生産性を向上。

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. - スマートな AI ツールで生産性を向上。

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. - スマートな AI ツールで生産性を向上。

Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model. - スマートな AI ツールで生産性を向上。