Meta's reference stack for building with Llama models
Llama Stack is Meta's reference implementation for building applications with Llama models. It provides standardized APIs for inference, safety, memory, and agent workflows — a complete stack from model serving to application logic. Llama Stack defines the canonical way to deploy and use Llama models, with implementations from partners like Together AI, Fireworks, and AWS.
No reviews yet. Be the first!