Improving training speed 2,160x with Batch Merging and a C Extension
minbpe vs turboBPE: Two ways to think about tokenizer training
Tokenization under the hood: BPE, WordPiece, SentencePiece, and Unigram compared
How Many R's in Strawberry? Your AI Has No Idea Why That's Hard
How AI Works Under the Hood: LLMs Explained with Code
LLM Study Diary #2: Tokenization
Tokens vs Bytes in AI: What LLMs Actually See When You Type
I built the algorithm behind ChatGPT from scratch — here's what I learned