#self-attention 아티클 모음

Dev.to

Recurrence 제거와 Self-Attention 도입을 통한 병렬 처리 및 LLM 가속화

Self-Attention: The Brilliant Idea That Made Large Language Models Possible

AI/MLintermediate20 분 소요2026년 6월 28일

Dev.to

Cybersecurity 최적화를 위한 AI 아키텍처별 특성 및 적용 전략 분석

Section 1.1 — Comparing AI Types and Techniques Used in Cybersecurity

Securityintermediate57 분 소요2026년 6월 20일

Dev.to

O(n²) 복잡도의 Matrix Operation을 통한 토큰 간 관계 정량화 및 Contextual Representation 구현

How Self-Attention Works — QKV, Softmax, and Matrix Computation

AI/MLintermediate15 분 소요2026년 6월 18일

Dev.to

KV Cache 도입을 통한 LLM 추론 복잡도 O(n³)에서 O(n²)로 최적화

KV Cache in LLMs: The Optimization That Makes Modern AI Models Feel Fast

AI/MLintermediate32 분 소요2026년 6월 13일

Dev.to

A11 프레임워크 기반 Transformer의 인지 구조적 한계와 설계 결함 분석

Transformer as an Incomplete Cognitive Architecture: What It Captures Well and What It Misses (A11 Perspective)

AI/MLadvanced13 분 소요2026년 5월 26일

Dev.to

Transformer 기반 고차원 Embedding 및 Attention 메커니즘을 통한 문맥 추론

How ChatGPT/Gemini/MS Copilot Understands Your Question: A Step-by-Step Journey from Input to Response

AI/MLintermediate7 분 소요2026년 5월 13일

Dev.to

BPE Tokenization과 Transformer 기반 Next Token Prediction의 메커니즘 분석

How AI Works Under the Hood: LLMs Explained with Code

AI/MLintermediate59 분 소요2026년 5월 6일

Dev.to

Causal Attention 기반 Token 간 관계 모델링 및 Scaled Dot-Product 최적화

Chapter 9: Single-Head Attention - Tokens Looking at Each Other

AI/MLintermediate30 분 소요2026년 4월 28일

Dev.to

Transformer: Recurrence 제거를 통한 AI Scaling Primitive 구현

Without google's transformers, there is no GPT-ishs

AI/MLintermediate17 분 소요2026년 4월 25일

Dev.to

Positional Encoding과 Self-Attention을 통한 Decoder 레이어 구조 설계

Understanding Transformers Part 12: Building the Decoder Layers

AI/MLintermediate4 분 소요2026년 4월 23일

Dev.to

Residual Connection을 통한 Transformer Encoder의 학습 효율 최적화

Understanding Transformers Part 10: Final Step in Encoding

AI/MLintermediate3 분 소요2026년 4월 21일

Dev.to

Softmax 기반 Weighting을 통한 Self-Attention Value 산출 메커니즘

Understanding Transformers Part 7: From Similarity Scores to Self-Attention

AI/MLintermediate3 분 소요2026년 4월 15일

Dev.to

Query와 Key 벡터 생성을 통한 Transformer Self-Attention 유사도 측정 메커니즘

Understanding Transformers Part 5: Queries, Keys, and Similarity

AI/MLintermediate3 분 소요2026년 4월 11일

Dev.to

문맥의 핵심을 짚어내는 Self-Attention의 작동 원리

Understanding Transformers Part 4: Introduction to Self-Attention

AI/MLbeginner3 분 소요2026년 4월 9일

Dev.to

Transformer의 Self-Attention 구조로 해석한 고효율 Tech Lead의 리더십 모델

Q, K, V : The Three Things Every Great Tech Lead Does Without Knowing It

Careerintermediate25 분 소요2026년 4월 6일

Hugging Face Blog

Transformer 모델에서 위치 인코딩 방식을 Integer → Sinusoidal → Rotary Positional Encoding(RoPE)으로 단계적 진화시켜 최신 LLama 3.2에 적용

You could have designed state of the art positional encoding

AI/MLintermediate47 분 소요2024년 11월 25일

Hugging Face Blog

Nyströmformer가 Nyström 행렬 근사 방법을 자체-주의 메커니즘에 적용해 시간 복잡도를 O(n²)에서 O(n)으로 감소

Nyströmformer: Approximating self-attention in linear time and memory via the Nyström method

AI/MLintermediate21 분 소요2022년 8월 2일