Two new Gemini models, reduced 1.5 Pro pricing, increased rate limits, and more
Google updated its Gemini models, introducing Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002, featuring over 50% price reduction, increased rate limits, improved performance, and free access for developers via Google AI Studio.
Read original articleGoogle has announced updates to its Gemini models, introducing two new production-ready versions: Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002. Key enhancements include a price reduction of over 50% for the 1.5 Pro model, increased rate limits (2,000 RPM for Flash and 1,000 RPM for Pro), and improved performance metrics, such as 2x faster output and 3x lower latency. The models are designed for a variety of tasks, including processing large documents and videos, with significant improvements in math and vision capabilities. The updates also feature a more concise response style based on developer feedback, aiming to enhance usability and reduce costs. Additionally, the default output length has been shortened by 5-20% for tasks like summarization and question answering. Developers can access these models for free via Google AI Studio and the Gemini API, with further enhancements expected in the coming weeks. The company emphasizes its commitment to safety and reliability, offering customizable safety filters for developers. An experimental version, Gemini-1.5-Flash-8B, has also been released, showcasing improved performance across various use cases. Google anticipates that these updates will facilitate innovative applications and enhance the overall developer experience.
- Google has released updated Gemini models with significant performance improvements.
- Pricing for the 1.5 Pro model has been reduced by over 50%.
- Rate limits have been increased to allow for more extensive use.
- The models are designed for a wide range of tasks, including document and video processing.
- Developers can access the models for free via Google AI Studio and the Gemini API.
Related
Gemini's data-analyzing abilities aren't as good as Google claims
Google's Gemini 1.5 Pro and 1.5 Flash AI models face scrutiny for poor data analysis performance, struggling with large datasets and complex tasks. Research questions Google's marketing claims, highlighting the need for improved model evaluation.
Gemini Pro 1.5 experimental "version 0801" available for early testing
Google DeepMind's Gemini family of AI models, particularly Gemini 1.5 Pro, excels in multimodal understanding and complex tasks, featuring a two million token context window and improved performance in various benchmarks.
Google Gemini 1.5 Pro leaps ahead in AI race, challenging GPT-4o
Google has launched Gemini 1.5 Pro, an advanced AI model excelling in multilingual tasks and coding, now available for testing. It raises concerns about AI safety and ethical use.
Automating away the boring parts of my job with Gemini 1.5 Pro and long context
Paige Bailey discusses Gemini 1.5 Pro's long context capabilities for automating tasks in Developer Relations, including analyzing codebases, scraping user feedback, and generating content for social media and documentation.
Show HN: Comparisons – Gemini-1-5-vs-ChatGPT-4o
Gemini 1.5 Pro and ChatGPT-4o are competing AI models, with Gemini excelling in math and coding, while ChatGPT-4o is superior in language comprehension, despite higher output costs.
- Significant price reductions are noted, making Gemini models more competitive compared to other frontier models.
- Users express skepticism about the model's performance and reliability, with some describing it as "broken" and "unusable."
- Concerns about the lack of privacy options and the effectiveness of safety filters are raised.
- Some users appreciate the free access and ease of use for development, while others criticize the documentation and support.
- There is a general sentiment of disappointment regarding Google's ability to capitalize on opportunities and improve their offerings.
For comparison, GPT-4o is currently $5/million input and $15/million output and Claude 3.5 Sonnet is $3/million input and $15/million output.
Gemini 1.5 Pro was already the cheapest of the frontier models and now it's even cheaper.
"We will continue to offer a suite of safety filters that developers may apply to Google’s models. For the models released today, the filters will not be applied by default so that developers can determine the configuration best suited for their use case."
Unlike others here I really appreciate the gemini API, it's free and it works. I haven't done too many complicated things with it but I made a chatbot for the terminal, a forecasting agent (for metaculus challenge) and a yt-dlp auto namer of songs. The point for me isn't really how it compares to openAI/anthropic, it's a free API key and I wouldn't have made the above if I had to pay just to play around
Google is aware of the issue and it has been open on google's bug tracker since March 2024: https://issuetracker.google.com/issues/331677495
There is also discussion on GitHub: https://github.com/google-gemini/generative-ai-js/issues/138
It stems from something google added intentionally to prevent copyright material being returned verbatim (ala the NYT openai fiasco), so they dialled up the "recitation" control (the act of repeating training data—and maybe data they should not have legally trained on).
Here are some quotes from the bug tracker page:
> I got this error by just asking "Who is Google?"
> We're encountering recitation errors even with basic tutorials on application development. When bootstrapping a Spring Boot app, we're flagged for the pom.xml being too similar to some blog posts.
> This error is a deal breaker... It occurs hundreds of times a day for our users and massively degrades their UX.
Their docs are awful, they have multiple unusable SDK's and the API is flaky.
For example, I started bumping into "Recitation" errors - ie they issue a flat out refusal if your response resembles anything in the training data. There's a GitHub issue with hundreds of upvotes and they still haven't published formal guidance on preventing this. Good luck trying to use the 1M context window.
Everything is built the "Google" way. It's genuinely unusable unless you're a total masochist and want to completely lock yourself into the Google ecosystem.
The only thing they can compete on is price.
Also, this model shouldn't be compared to the CoT o1, I think. That is something different (also in price and speed).
Trying to build an actual product on top of it was an exercise in futility. Docs are flatly wrong, supposed features are vaporware (discovery engine querying, anybody?), and support is nonexistent. The only thing Google came back with was throwing more vendors at us and promising that bug fixes were "coming soon".
With all the funded engagements and credits they've handed out, it's at the point where Google is paying us to use Gemini and it's _still_ not worth the money.
They announced a price reduction but it "won't be available for a few days". By the time, the initial hype will be over and the consumer-use side of the opportunity to get new users will be lost in other news.
Related
Gemini's data-analyzing abilities aren't as good as Google claims
Google's Gemini 1.5 Pro and 1.5 Flash AI models face scrutiny for poor data analysis performance, struggling with large datasets and complex tasks. Research questions Google's marketing claims, highlighting the need for improved model evaluation.
Gemini Pro 1.5 experimental "version 0801" available for early testing
Google DeepMind's Gemini family of AI models, particularly Gemini 1.5 Pro, excels in multimodal understanding and complex tasks, featuring a two million token context window and improved performance in various benchmarks.
Google Gemini 1.5 Pro leaps ahead in AI race, challenging GPT-4o
Google has launched Gemini 1.5 Pro, an advanced AI model excelling in multilingual tasks and coding, now available for testing. It raises concerns about AI safety and ethical use.
Automating away the boring parts of my job with Gemini 1.5 Pro and long context
Paige Bailey discusses Gemini 1.5 Pro's long context capabilities for automating tasks in Developer Relations, including analyzing codebases, scraping user feedback, and generating content for social media and documentation.
Show HN: Comparisons – Gemini-1-5-vs-ChatGPT-4o
Gemini 1.5 Pro and ChatGPT-4o are competing AI models, with Gemini excelling in math and coding, while ChatGPT-4o is superior in language comprehension, despite higher output costs.