Gaussian 초기화와 Dual Embedding 기반의 LLM Forward Pass 설계
Chapter 6: Embeddings, the Forward Pass, and the Loss Function
Chapter 6: Embeddings, the Forward Pass, and the Loss Function
My Notes on Karpathy's Makemore part 1: Building a Bigram Language Model from Scratch
My Notes: Makemore - Character Level Language Model