Custom AI chip maker with record-breaking inference speed
SambaNova Systems builds custom AI chips (RDU — Reconfigurable Dataflow Unit) and offers cloud inference that achieves record-breaking speed for large models. Their SambaNova Cloud API delivers Llama 405B inference faster than GPU-based alternatives. The hardware architecture is optimized for the dataflow patterns of transformer models.
No reviews yet. Be the first!