Anthropic claims that its new Sonnet 3.5 model scores 49% on SWE-bench Verified, up from 33.4% and "higher than all" public models, and debuts Claude 3.5 Haiku (Anthropic)

1 month ago 4

Anthropic:
Today, we're announcing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku.
