KV Cache from scratch in nanoVLM
Advanced audio dialog and generation with Gemini 2.5
Real-Time AI Sound Generation on Arm: A Personal Tool for Creative Freedom
Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL
SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data
Creating websites in minutes with AI Website Builder
CodeAgents + Structure: A Better Way to Execute Actions
🐯 Liger GRPO meets TRL
Addendum to OpenAI o3 and o4-mini system card: OpenAI o3 Operator