AWS Inferentia is a custom chip designed by AWS and optimized for ML inference, intended to reduce the cost and latency of serving neural networks in production environments.

https://aws.amazon.com/machine-learning/inferentia/