Anthropic has released Claude Opus 4.8 with stronger performance and better handling of uncertain or flawed data, including a greater tendency to flag issues rather than make unsupported claims. The update also introduces a "Dynamic Workflows" research preview for coordinating complex tasks across many subagents. TechCrunch reports: Opus 4.8 comes with the expected best-in-class benchmark results, but there's also particular attention to how the model manages bad or uncertain data. In the launch post, Anthropic's early testers found that the new model is "more likely to flag uncertainties about its work and less likely to make unsupported claims." Echoing this point, a testimonial from Bridgewater associates said the biggest difference in the upgrade was "Opus 4.8's tendency to proactively flag issues with the inputs and outputs of an analysis, something other models routinely missed and left to the users to catch."
Together with the new model, Anthropic launched a feature called Dynamic Workflows, which will be available in research preview. The system is designed to help larger models like Opus manage complex tasks across hundreds of parallel subagents. "Claude Code alongside Opus 4.8 can now carry out codebase-scale migrations across hundreds of thousands of lines of code from kickoff to merge, with the existing test suite as its bar," the post explains. As for Mythos, Anthropic's most advanced model, the company hinted it could be made publicly available in the not too distant future. "We're making swift progress on developing these safeguards and expect to be able to bring Mythos-class models to all our customers in the coming weeks," the company wrote.
Read more of this story at Slashdot.