Fine-tuning LLMs to 1.58bit: extreme quantization made easy

1 year ago