Continuously hardening ChatGPT Atlas against prompt injection

5 months ago 1
Add to circle
OpenAI is strengthening ChatGPT Atlas against prompt injection attacks using automated red teaming trained with reinforcement learning. This proactive discover-and-patch loop helps identify novel exploits early and harden the browser agent’s defenses as AI becomes more agentic.
Read Entire Article