A Leadership Model for Highly Effective Tech Leads, Interpreted Through the Transformer's Self-Attention Structure
Q, K, V : The Three Things Every Great Tech Lead Does Without Knowing It
You could have designed state of the art positional encoding
Nyströmformer: Approximating self-attention in linear time and memory via the Nyström method