Faster Text Generation with Self-Speculative Decoding
