4 September 2025

Andrej Karpathy

The GOAT of ML

In my last post I mentioned Zero to Hero by Andrej Karpathy. I just watched Deep Dive into LLMs by him, where he does an excellent job of describing the three stages of model training: pre-training, post-training supervised fine-tuning, and post-training reinforcement learning.

I won’t try to explain the contents, you should just go and watch the video. It’s highly recommended! Here’s a copy of the image Karpathy created using Excalidraw (my favorite diagramming tool for a while now) during the video.

Deep Dive into LLMs diagram

Here are a couple of other interesting links I fished out of the video:

FineWeb, an alternative to the OpenWeb dataset I’ve used to train a model from scratch
TikTokenizer, an incredible tool that translates text into the token vocabulary used by different models
LLMs visualizations by a kiwi software engineer
LM Arena, a tool to rank (based on user votes) models across different tasks
Andrej Karpathy’s website

Andrej Karpathy joins Moriba Jah and Dominic Tarr in the ranks of folks I found online somewhat randomly as I dove into personal projects. I came across Dominic when I started looking into Secure Scuttlebut (which I used when building Archerlog) and decentralized applications/platforms. And Moriba is someone I came to look up to as I learned about satellite tracking and the players in the field, which include his company Privateer. It’s amazing to me that these folks are so generous with their knowledge and code - I can only hope some of the topics I’ve been spending my time on and chronicling in this blog are as useful to someone in the future.

Thiago's Space Blog

Just another blog, this one about my learnings as I join the Space Industry as a software engineer.

Andrej Karpathy

The GOAT of ML