hugging_face_optimum

Hugging Face Optimum optimizes Transformers Architecture models for deployment on specialized hardware like NVIDIA GPUs and TPUs (Tensor Processing Units), maximizing inference performance.

https://huggingface.co/docs/optimum

hugging_face_optimum.txt · Last modified: 2025/02/01 06:52 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki