Tag
A 16-hour free YouTube playlist created by @0x0SojalSec teaches how to build a DeepSeek model from scratch, covering papers, theory, and code implementation including attention mechanisms, mixture of experts, and positional encodings.