MiniGPT: Rebuilding GPT from First Principles

arXiv cs.CL · 2026-05-19

This paper presents MiniGPT, a compact from-scratch implementation of GPT-style autoregressive language modeling in PyTorch, built after studying nanoGPT. It evaluates the model on the Tiny Shakespeare dataset using character-level tokenization, achieving a validation loss of 1.4780 with a 10.77M-parameter configuration.
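To make the character-level setup concrete, here is a minimal sketch of how such a tokenizer and its next-character training pairs might look. It is an illustration under stated assumptions, not the paper's code: the names `build_vocab`, `encode`, `decode`, and `next_char_pairs`, the sample string, and the `block_size` value are all hypothetical, and the sketch uses only the Python standard library rather than PyTorch.

```python
# Illustrative sketch only: helper names and values are assumptions,
# not taken from the MiniGPT paper or from nanoGPT.

def build_vocab(text):
    """Assign an integer id to each distinct character, in sorted order."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}   # character -> id
    itos = {i: ch for ch, i in stoi.items()}       # id -> character
    return stoi, itos

def encode(s, stoi):
    """Turn a string into a list of character ids."""
    return [stoi[ch] for ch in s]

def decode(ids, itos):
    """Turn a list of character ids back into a string."""
    return "".join(itos[i] for i in ids)

def next_char_pairs(ids, block_size):
    """Yield (context, target) windows where target is context shifted by one.

    An autoregressive model is trained to predict target[t] from
    context[: t + 1], so each window supplies block_size training signals.
    """
    for start in range(len(ids) - block_size):
        context = ids[start : start + block_size]
        target = ids[start + 1 : start + block_size + 1]
        yield context, target

sample = "to be or not to be"          # stand-in for the Tiny Shakespeare text
stoi, itos = build_vocab(sample)
ids = encode(sample, stoi)
pairs = list(next_char_pairs(ids, block_size=4))
```

With a character-level vocabulary the id space stays very small (one id per distinct character in the corpus), which keeps the embedding and output layers compact at the cost of longer sequences than a subword tokenizer would produce.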
