Scaling laws for neural language models

6 years ago 5