Inside LLM Training: The Transformer Pipeline Explained

AI Tools Kit

AI Tools Kit provides free developer tools for working with AI language models. Built by developers, for developers.

GRPO vs PPO: How Modern LLMs Learn from Feedback

GRPO replaced PPO in post-training for DeepSeek-R1 and Gemini. Covers the math, training pipeline, design trade-offs, and when each method wins.

LLM Architecture

Inside Mixture of Experts: How Sparse Routing Scales LLMs

How Mixture of Experts scales LLMs without proportional inference cost. Covers routing networks, load balancing loss, expert capacity, and why MoE models behave differently from dense transformers.

Inside LLM Training: The Transformer Pipeline Explained

Related Articles

GRPO vs PPO: How Modern LLMs Learn from Feedback

Inside Mixture of Experts: How Sparse Routing Scales LLMs