Scaling Kubernetes to 7,500 nodes
Fit More and Train Faster With ZeRO via DeepSpeed and FairScale
How we sped up transformer inference 100x for 🤗 API customers
CLIP: Connecting text and images
DALL·E: Creating images from text
Organizational update from OpenAI
Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models
Porting fairseq wmt19 translation system to transformers
Hyperparameter Search with Transformers and Ray Tune
Transformer-based Encoder-Decoder Models