Hugging Face Model Compression tools reduce the size of Transformers Architecture models for deployment in resource-constrained environments without sacrificing accuracy.

https://huggingface.co/docs/optimum/compression