Anthropic has officially rolled out the Claude 3.5 Haiku model to all users through the Claude chatbot on the web and mobile apps, a change spotted by AI power users on X.
First launched in October 2024 and previously limited to developers accessing it via Anthropic’s API, this small, fast model has attracted attention for outperforming larger models on key benchmarks while maintaining a competitive price point.
According to the third-party benchmarking organization Artificial Analysis, Claude 3.5 Haiku “has low latency compared to average, taking 0.80 seconds to receive the first token (TTFT)” but is “slow compared to average, with an output rate of 65.1 tokens per second.”
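To put those two figures in perspective, here is a rough back-of-the-envelope sketch in Python. It assumes a response takes the quoted time to first token plus a constant streaming rate afterward, which is a simplification; the function name is ours and real-world timings will vary with load and prompt size.

```python
# Figures quoted from Artificial Analysis in the article.
TTFT_SECONDS = 0.80               # time to first token
OUTPUT_TOKENS_PER_SECOND = 65.1   # streaming output rate


def estimated_response_seconds(output_tokens: int) -> float:
    """Rough estimate: wait for the first token, then stream at a constant rate.

    Assumption (ours, not the benchmark's): throughput stays constant
    for the whole response.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / OUTPUT_TOKENS_PER_SECOND


if __name__ == "__main__":
    for n in (100, 500, 1000):
        print(f"{n:>5} output tokens: ~{estimated_response_seconds(n):.1f} s")
```

Under these assumptions, the short wait for the first token matters most for brief replies, while the output rate dominates as responses get longer.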
Although not officially announced, the release comes on the heels of major updates from Anthropic’s AI rivals OpenAI and Google, which are also making new models generally available in their chatbots before the end of this year: OpenAI with its o1 and o1-mini models, and Google with Gemini 2.
The question for Anthropic is whether customers will be impressed enough with Claude 3.5 Haiku’s performance to sign up for the Pro tier, or even to stick with it rather than switch to other advanced, faster competitors.
Claude 3.5 Haiku is accessible through the Claude chatbot
The fastest and most cost-effective model in Anthropic’s lineup, Claude 3.5 Haiku excels at real-time tasks such as processing large datasets, analyzing financial documents, and generating output from long-context information.
It has a 200,000-token context window, which exceeds the 128,000-token window of OpenAI’s GPT-4 and GPT-4o, and can easily handle a wide range of inputs.
In the Claude chatbot, Haiku provides features that increase its versatility. It allows users to analyze images and attachments, making it useful for multimedia tasks and workflows involving large document sets.
Haiku also integrates with Claude Artifacts, an interactive sidebar first introduced in June 2024. Artifacts provides a dedicated workspace to manipulate and adjust AI-generated content in real-time, including running full apps. In this morning’s test of Artifacts with Haiku, we were able to code a fully playable version of Pong in less than a minute.
Despite its strengths, Haiku has limitations. Web browsing and image generation are currently not supported. Both of these are offered by competitors such as OpenAI’s GPT-4o and GPT-4.
Additionally, a quick test this morning revealed that the AI failed the “Strawberry Test,” a common user-designed challenge that asks the AI to identify all three Rs in the word “strawberry.”
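For reference, the expected answer is easy to verify deterministically. The short Python sketch below counts the letter directly, which is the ground truth a chatbot’s reply would be checked against. The helper name `count_letter` is our own, used purely for illustration.

```python
def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in a word."""
    if len(letter) != 1:
        raise ValueError("letter must be a single character")
    return word.lower().count(letter.lower())


if __name__ == "__main__":
    # The "Strawberry Test" asks how many times "r" appears in "strawberry".
    print(count_letter("strawberry", "r"))
```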
Access and subscription details
Claude 3.5 Haiku is freely accessible via the Claude chatbot, but users face a daily message limit that fluctuates depending on server demand.
For example, when I tried it on the free tier this morning, I was able to perform about 10 exchanges (20 total messages sent and received) before reaching Anthropic’s quota, which resets daily.
For more extensive usage, users can subscribe to the Claude Pro plan for $20 per month.
This subscription provides up to five times the usage of the free tier, priority access during high-traffic periods, early access to new features, and access to additional models such as Claude 3 Opus.
The pricing structure mirrors OpenAI’s ChatGPT Plus subscription, which provides a premium experience for power users.
Performance and cost
Through the API, Claude 3.5 Haiku offers strong performance at an affordable price. With pricing starting at $0.80 per million input tokens and $4 per million output tokens, it is an economical alternative to larger models like Claude 3 Opus.
Developers can further reduce costs with prompt caching, which offers up to 90% savings, and the Message Batches API, which reduces costs by 50%.
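As a rough illustration of how those numbers add up, the Python sketch below estimates the cost of a workload from the prices quoted above. It is a simplified model under our own assumptions: the “up to 90%” caching figure is treated as a best case applied only to the share of input tokens served from cache, any cache-write surcharge is ignored, and applying the batch discount on top of caching is illustrative rather than a statement of Anthropic’s billing rules. The function and parameter names are hypothetical.

```python
# Prices quoted in the article (USD per million tokens).
INPUT_USD_PER_MTOK = 0.80
OUTPUT_USD_PER_MTOK = 4.00

BATCH_DISCOUNT = 0.50      # "reduces costs by 50%"
MAX_CACHE_SAVINGS = 0.90   # "up to 90% savings" (best case, our assumption)


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    *,
    batch: bool = False,
    cached_input_fraction: float = 0.0,
) -> float:
    """Estimate the API cost in USD under the simplifying assumptions above."""
    if not 0.0 <= cached_input_fraction <= 1.0:
        raise ValueError("cached_input_fraction must be between 0 and 1")

    cached = input_tokens * cached_input_fraction
    uncached = input_tokens - cached

    # Cached input tokens are billed at the best-case discounted rate.
    billable_input = uncached + cached * (1 - MAX_CACHE_SAVINGS)
    input_cost = billable_input * INPUT_USD_PER_MTOK / 1_000_000
    output_cost = output_tokens * OUTPUT_USD_PER_MTOK / 1_000_000

    total = input_cost + output_cost
    return total * (1 - BATCH_DISCOUNT) if batch else total


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500K output tokens.
    print(f"standard: ${estimate_cost_usd(2_000_000, 500_000):.2f}")
    print(f"batched:  ${estimate_cost_usd(2_000_000, 500_000, batch=True):.2f}")
```

Actual bills depend on Anthropic’s current pricing and on how much of a given prompt is really served from cache, so treat the output as an estimate only.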
In benchmark tests, Haiku outperformed many larger publicly available models. Its results include a score of 40.6% on SWE-bench Verified, a key coding benchmark, demonstrating its strength in tasks that require both intelligence and speed. This makes Haiku an excellent choice for user-facing applications and time-sensitive workflows.
Key considerations
Although Claude 3.5 Haiku offers powerful features, potential users should consider the current limitations. The lack of web browsing and image generation may make it less attractive for certain use cases compared to competitors. Additionally, the daily message limit can be an inconvenience for users who don’t want to upgrade to a Claude Pro subscription.
However, with features such as image and file analysis, robust coding capabilities, and integration with Artifacts, Haiku remains a powerful tool for tasks that require speed and accuracy.
In particular, the Artifacts feature extends its capabilities beyond text generation, enabling collaborative editing and real-time content refinement.
For users ready to explore its potential, Claude 3.5 Haiku is now live and available through the Claude chatbot on the web and in the iOS and Android mobile apps.