Claude 3.5 Sonnet can control your Computer
Claude's latest 3.5 Sonnet (new) with Computer Use, advanced Haiku model are a big step in AI development. Robotic Process Automation will be easier now with Anthropic's new model.
The competition among the LLMs on who will ship out the best AI is so interesting. Last month we had OpenAI release O1 and O1 mini, which claims to be the best model till date and then we now have Anthropic Claude 3.5 Sonnet new & Claude 3.5 Haiku ( a cheaper version but very capable) trying to beat the other models. This war is not gonna end anytime soon.
However, what’ exciting about this release is their new product/feature “ Computer Use” with Claude, which is now in public beta. Claude 3.5 Sonnet is the first frontier AI model to offer this feature.
Claude 3.5 Sonnet beating benchmarks
I have been using Claude 3.5 Sonnet for all my development and coding related work, which is already a great model when it comes to coding. The updated Claude 3.5 Sonnet shows wide-ranging improvements on industry benchmarks, with particularly strong gains in agentic coding and tool use tasks. On coding, it improves performance on SWE-bench Verified from 33.4% to 49.0%, scoring higher than all publicly available models—including reasoning models like OpenAI o1-preview and specialized systems designed for agentic coding. Even its Claude 3.5 Haiku model, now performs significantly better than other models like GPT4o mini, especially on Agentic Coding.
source: Claude Release
Computer Use by Anthropic: Robotic Process Automation in your hands
Computer use is one feature that I am more excited about, considering the potential capability that it can bring in the hands of individuals.
Check out this one use case related to automating operations demonstrated by them, which is one of the most frequent use case that might come across or do on a daily basis:
The upgraded Claude 3.5 Sonnet model is capable of interacting with tools that can manipulate a computer desktop environment. Claude 3.5's computer use feature works by following instructions and then acting like a person using a computer. On OSWorld, which evaluates AI models' ability to use computers like people do, Claude 3.5 Sonnet scored 14.9% in the screenshot-only category—notably better than the next-best AI system's score of 7.8%.
This groundbreaking capability sets it apart as a truly next-generation AI assistant built by Anthropic and trained to be safe, accurate, and secure to help you do your best work. Companies like Asana, Canva, Cognition, DoorDash, Replit, and The Browser Company have already begun to explore these possibilities, carrying out tasks that require dozens, and sometimes even hundreds, of steps to complete.
Use Cases: Automating Everyday Tasks with Computer Use Claude 3.5 Sonnet
Coding: With Claude's computer use capabilities, you can command it to create and customize websites. Give it an inspiration and the tasks, it will execute the task end to end.
Operational Work Automation: Instead of making specific tools to help Claude complete individual tasks, Anthropic has taught it general computer skills—allowing it to use a wide range of standard tools and software programs designed for people. Whether it's gathering repetitive data from spreadsheets or compiling reports, having Claude take the reins can free up hours in your week.
Basic Research Tasks : You can instruct it to plan a trip for you and schedule it in your calendar. It will browse the internet, get the information, plan the trip, schedule it in your calendar.
There will be 100’s of use cases that people might come up with. One thing I must say is that it’s very promising.
Claude 3.5 Haiku: Speed and Affordability
Claude 3.5 Haiku is the next generation of Anthropic's fastest model. For the same cost and similar speed to Claude 3 Haiku, it improves across every skill set and surpasses even Claude 3 Opus on many intelligence benchmarks. It scores an impressive 40.6% on SWE-bench Verified, outperforming many agents using publicly available state-of-the-art models.
With low latency, improved instruction following, and more accurate tool use, Claude 3.5 Haiku is well suited for user-facing products, specialized sub-agent tasks, and generating personalized experiences from huge volumes of data—like purchase history, pricing, or inventory records.
Get Excited!! It’s only gonna be more interesting
I couldn't emphasize enough how important these new Claude features may be for anyone working on code, automation, or even artistic endeavors. You can begin using it with Google Cloud's Vertex AI, Amazon Bedrock, and API.
As I develop my products, I'll be utilizing the new model and the Computer Use function, and I'll share my progress with you all.
If you haven't yet explored Claude 3.5 and its powerful new features, now's the time! I truly believe this is merely the beginning of a revolution in how we interact with technology.
Check out the link : Claude
Interested in knowing the story behind the Computer Use Feature - Check out this article by Anthropic Team - https://www.anthropic.com/news/developing-computer-use