Chapters

Progressive lessons that build from foundations to advanced topics. Each chapter includes intuition, math, code, and exercises.

Foundations

Start here

Core concepts: agents, environments, and the exploration-exploitation tradeoff.

Q-Learning Foundations

Value-based methods from TD learning through deep Q-networks.

Policy Gradient Methods

Learn policies directly with gradient ascent. From REINFORCE to PPO.

Content Status

📝 AI Generated — Pending review
Editor Reviewed — Approved by editor
👥 Community Reviewed — Incorporates feedback
🔒 Verified — Code tested, demos working