Author Topic: Anthropic’s Claude Opus 4 model can work autonomously for nearly a full workday  (Read 268 times)
HCK
Global Moderator
Hero Member
« on: May 25, 2025, 04:05:05 pm »

Anthropic’s Claude Opus 4 model can work autonomously for nearly a full workday

<p>Anthropic kicked off its first-ever Code with Claude conference today with the announcement of a new frontier <a href="https://www.engadget.com/ai/" data-autolinker-wiki-id="Artificial_intelligence" data-original-link="">AI system</a>. The company is calling Claude Opus 4 the best coding model in the world. According to Anthropic, Opus 4 is dramatically better at tasks that require it to complete thousands of separate steps, giving it the ability to work continuously for several hours in one go. Additionally, the new model can use multiple software tools in parallel, and it follows instructions more precisely.</p>
<p>In combination, Anthropic says those capabilities make Opus 4 ideal for powering upcoming AI agents. For the unfamiliar, agentic systems are AIs that are designed to plan and carry out complicated tasks without human supervision. They represent an important step towards the promise of artificial general intelligence (AGI). In customer testing, Anthropic saw Opus 4 work on its own for seven hours, or nearly a full workday. That's an important milestone for the type of agentic systems the company wants to build.</p>
<span id="end-legacy-contents"></span><figure><img src="https://s.yimg.com/os/creatr-uploaded-images/2025-05/b1550aa0-372a-11f0-be9f-cde7052d1761" data-crop-orig-src="https://s.yimg.com/os/creatr-uploaded-images/2025-05/b1550aa0-372a-11f0-be9f-cde7052d1761" style="height:1440px;width:2560px;" alt="Claude Plays Pokemon" data-uuid="5dc3f1cb-ff93-34ee-a1b4-04e371203ad4"><figcaption></figcaption><div class="photo-credit">Anthropic</div></figure>
<p>Another reason Anthropic thinks Opus 4 is ready to enable the creation of better AI agents is that the model is 65 percent less likely to use a shortcut or loophole when completing tasks. The company says the system also demonstrates significantly better &quot;memory capabilities,&quot; particularly when developers grant Claude local file access. To encourage devs to try Opus 4, Anthropic is making <a data-i13n="cpos:1;pos:1" href="https://www.anthropic.com/claude-code">Claude Code</a>, its AI coding agent, widely available. It has also added new integrations with Visual Studio Code and JetBrains.</p>
<p>Even if you're not a coder, Anthropic might have something for you. That's because alongside Opus 4, the company announced a new version of its Sonnet model. <a data-i13n="cpos:2;pos:1" href="https://www.engadget.com/ai/anthropics-new-claude-model-can-think-both-fast-and-slow-203307140.html">Like Claude 3.7 Sonnet</a> before it and Opus 4, the new system is a hybrid reasoning model, meaning it can execute prompts nearly instantaneously and engage in extended thinking. As a user, this gives you a best-of-both-worlds chatbot that's better equipped to tackle complex problems when needed. It also incorporates many of the same improvements found in Opus 4, including the ability to use tools in parallel and follow instructions more faithfully.</p>
<p>Sonnet 3.7 was so popular among users that Anthropic ended up introducing a <a data-i13n="cpos:3;pos:1" href="https://www.engadget.com/ai/anthropics-max-plan-offers-nearly-unlimited-claude-usage-for-200-per-month-170032710.html">Max plan</a> in response, which starts at $100 per month. The good news is you won't need to pay anywhere near that much to use Sonnet 4, as Anthropic is making it available to free users.</p>
<figure><img src="https://s.yimg.com/os/creatr-uploaded-images/2025-05/368a3000-372c-11f0-bffd-3559b2abf884" data-crop-orig-src="https://s.yimg.com/os/creatr-uploaded-images/2025-05/368a3000-372c-11f0-bffd-3559b2abf884" style="height:1564px;width:1920px;" alt="Claude 4 benchmarks" data-uuid="20499610-995c-3f24-a015-1639a475b27f"><figcaption></figcaption><div class="photo-credit">Anthropic</div></figure>
<p>For those who want to use Sonnet 4 for a project, API pricing is staying at $3 per one million input tokens and $15 per one million output tokens. Notably, beyond the usual places you'll find Anthropic's models, including Amazon Bedrock and Google Vertex AI, <a href="https://www.yahoo.com/organizations/microsoft/" data-autolinker-wiki-id="Microsoft" data-original-link="">Microsoft</a> is making Sonnet 4 the default model for the new coding agent it's offering through GitHub Copilot. Both Opus 4 and Sonnet 4 are available to use today.</p>
<p>Today's announcement comes during what's already been a busy week in the AI industry. On Tuesday, Google kicked off its I/O 2025 conference, announcing, among other things, that it was rolling out <a data-i13n="cpos:4;pos:1" href="https://www.engadget.com/ai/google-is-rolling-out-ai-mode-to-everyone-in-the-us-174917628.html">AI Mode</a> to all Search users in the US. A day later, <a href="https://www.yahoo.com/organizations/openai/" data-autolinker-wiki-id="OpenAI" data-original-link="">OpenAI</a> said it was <a data-i13n="cpos:5;pos:1" href="https://www.engadget.com/ai/openai-buys-jony-ives-design-startup-for-65-billion-173356962.html">spending $6.5 billion</a> to buy Jony Ive’s hardware startup.</p>This article originally appeared on Engadget at https://www.engadget.com/ai/anthropics-claude-opus-4-model-can-work-autonomously-for-nearly-a-full-workday-164526696.html?src=rss
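<p>To make the quoted Sonnet 4 rates concrete, here is a minimal sketch of the arithmetic in Python. Only the two prices come from the article; the function name and the token counts in the usage note are hypothetical, and real bills may differ because of factors the article does not cover.</p>

```python
# Rates quoted in the article for Sonnet 4, in USD per one million tokens.
INPUT_USD_PER_MILLION = 3.00
OUTPUT_USD_PER_MILLION = 15.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of a request from its token counts.

    Hypothetical helper for illustration; it is not part of any Anthropic SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each count to millions of tokens, then apply the matching rate.
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost
```

<p>Under these assumptions, a request with 20,000 input tokens and 2,000 output tokens would come to roughly $0.09, with the input and output sides contributing about $0.06 and $0.03 respectively.</p>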
