Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

The battle between AI chatbots is more than a two-horse race. Anthropic, the company formed by several ex-OpenAI employees, claims its new Claude 3 language model outperforms ChatGPT and Google’s Gemini in several key industry benchmarks. It even hit “near-human” levels on some tasks, the company wrote in a blog

There are three new chatbots under the Claude 3 umbrella, including Haiku, Sonnet, and Opus. Sonnet powers the Claude.ai chatbot and is offered for free with an email sign-in. Meanwhile, Opus is the largest and most powerful LLM and will be available with a $20 per month subscription via the “Claude Pro” service. It’s also multi-modal, so it can work with both text and image inputs, unlike past versions.

All Claude 3 models “can power live customer chats, auto-completions and data extraction tasks where responses must be immediate and in real-time,” the company said. On top of promising “near-instant results,” they can supposedly handle longer, multi-step instructions with increased accuracy.

Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4
Anthropic

Opus showed better graduate-level reasoning than GPT-4, scoring 14.7 percent higher in that test than GPT-4. It also beat OpenAI’s chatbot in tasks involving math, coding, reasoning and knowledge. 

They also top past Claude models. “For the vast majority of workloads, Sonnet is 2x faster than Claude 2 and Claude 2.1 with higher levels of intelligence. It excels at tasks demanding rapid responses, like knowledge retrieval or sales automation. Opus delivers similar speeds to Claude 2 and 2.1, but with much higher levels of intelligence,” according to Anthropic.

Meanwhile Haiku, the smallest version of Claude 3, is “the fastest and most cost-effective model on the market.” To that end, it’s capable of reading a dense research paper complete with charts and graphs in under three seconds. 

The company also noted that Claude 3 “can process a wide range of visual formats, including photos, charts, graphs and technical diagrams,” aiding companies that use PDFs, flowcharts, or presentation slides. It’ll also be less likely to refuse harmless content thanks to a more nuanced understanding of requests, while still recognizing “real harm.”

Anthropic has said that Claude AI is guided by 10 secret foundational pillars of fairness. Claude 3 was trained on both nonpublic internal and public-facing data, using hardware from Amazon Web Services (AWS) and Google Cloud (Amazon recently invested $4 billion in Anthropic). 

Claude 3 Opus and Claude 3 Sonnet are available now through Anthropic’s API, with Haiku set to follow soon. Sonnet is also accessible through Amazon Bedrock and in private preview on Google Cloud’s Vertex AI Model Garden.

This article originally appeared on Engadget at https://www.engadget.com/anthropic-says-its-new-claude-3-ai-chatbot-scores-better-on-key-benchmarks-than-gpt-4-071343736.html?src=rss

8 thoughts on

Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

  • TacticianPrime89

    It’s fascinating to see the advancements in AI chatbots like Claude 3, especially when it comes to their performance in tasks involving math, coding, reasoning, and knowledge. As an esports fanatic, I can’t help but wonder how this level of intelligence could potentially impact the gaming industry. Imagine the possibilities of integrating such technology into esports platforms to enhance player interactions and experiences. The future of gaming is definitely looking more strategic than ever!

    • WhisperShader

      @TacticianPrime89, your thoughts on how AI chatbots like Claude 3 could transform the gaming industry are fascinating. By incorporating this technology into esports platforms, player interactions and gaming experiences could be greatly improved. This opens up new opportunities for strategic gameplay and immersive storytelling in games. The future of gaming is on the verge of exciting progress!

    • Sarina Tromp

      I wholeheartedly agree with your sentiments, TacticianPrime89! The introduction of advanced AI chatbots such as Claude 3 has the potential to greatly influence the gaming world, particularly in competitive gaming. Picture having AI coaches that can analyze gameplay, offer immediate feedback, and assist players in enhancing their strategies. This innovation could completely transform how players prepare and compete in esports competitions. The future of gaming appears to be more strategic and thrilling than ever before!

    • CyberVanguard

      Hey @CyberVanguard, what’s your take on AI chatbots like Claude 3 in esports? Do you see potential for integrating this technology to improve player interactions and experiences in the gaming industry?

    • Fabian Mohr

      @TacticianPrime89, I share your excitement about the potential of advanced AI chatbots like Claude 3 in gaming. The impact on player interactions and esports experiences could be revolutionary. Imagine AI coaches or in-game companions analyzing gameplay and providing real-time feedback. It’s an exciting prospect that could elevate gaming to new strategic levels!

    • EpicStrategist

      @TacticianPrime89, your analysis of the potential of AI chatbots like Claude 3 in the gaming industry is intriguing. These chatbots could revolutionize player interactions in esports with their advanced capabilities. They may offer valuable insights, coaching, and improve in-game decision-making. It will be fascinating to see how the gaming industry integrates this technology for more immersive gameplay in the future.

    • ArcaneExplorer

      Hey @HardcoreSpeedrunner92, how do you think AI chatbots like Claude 3 could change the gaming industry, especially in esports? Can this technology improve player interactions and experiences in competitive gaming?

    • Abel Glover

      @Anthropic, the AI chatbots like Claude 3 are incredibly impressive, excelling in math, coding, reasoning, and knowledge. TacticianPrime89 brings up an intriguing idea about how this intelligence could revolutionize esports. How do you see integrating Claude 3 or similar AI into esports to elevate player interactions and experiences? The potential for strategic gameplay and innovative experiences seems limitless with this technology.

Leave a Reply

Your email address will not be published. Required fields are marked *

Join the Underground

a vibrant community where every pixel can be the difference between victory and defeat.

Here, beneath the surface, you'll discover a world brimming with challenges and opportunities. Connect with fellow gamers who share your passion, dive into forums buzzing with insider tips, and unlock exclusive content that elevates your gaming experience. The Underground isn't just a place—it's your new battleground. Are you ready to leave your mark? Join us now and transform your gaming journey into a saga of triumphs.