Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator

3 years ago 1
Add to circle
Read Entire Article