
DeepSeek-V3
A strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.


DeepSeek's first-generation reasoning models. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning without supervised fine-tuning, demonstrated remarkable performance on reasoning.
Views: 37
Likes: 0
Added: Jan 2026
Website: github.com
Editorial Review
DeepSeek-R1 is an excellent tool in the open-source-llm category, suitable for any user who needs AI assistance.
Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.

Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model.

Mixtral 8x7B is a high-quality sparse mixture-of-experts (SMoE) model with open weights. It outperforms Llama 2 70B on most benchmarks with 6x faster inference.