Hugging Face Model Compression tools reduce the size of Transformers Architecture models for deployment in resource-constrained environments without sacrificing accuracy.
https://huggingface.co/docs/optimum/compression