
DeepSeek-R1
DeepSeek's first-generation reasoning models. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning without supervised fine-tuning, demonstrated remarkable performance on reasoning tasks.
Added: Mar 2026
Website: bigscience.huggingface.co
Bloom is an extremely large-scale multilingual LLM.
Research with a global scope.
User comment: "A milestone for multilingual large models."
A strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.

Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.

Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model.