Intel cut Stable Diffusion inference latency on Sapphire Rapids CPUs from 32.3 seconds to 5.05 seconds, an 84% reduction, by combining OpenVINO, IPEX, and the AMX accelerator.
Accelerating Stable Diffusion Inference on Intel CPUs
Case Study: Millisecond Latency using Hugging Face Infinity and modern CPUs
Scaling-up BERT Inference on CPU (Part 1)