Hugging Face가 AWS Inferentia2 칩 지원을 SageMaker와 Inference Endpoints에 통합해 100,000개 이상의 모델 배포 가능
Deploy models on AWS Inferentia2 from Hugging Face
Deploy models on AWS Inferentia2 from Hugging Face
Hugging Face Text Generation Inference available for AWS Inferentia2
Make your llama generation time fly with AWS Inferentia2
Accelerating Hugging Face Transformers with AWS Inferentia2