Block Sparse Matrices for Smaller and Faster Language Models

5 years ago 1
Add to circle
Read Entire Article