Google preps 'Jarvis' AI agent that works in Chrome
Google is developing "Project Jarvis," an AI agent for Chrome that automates tasks like research and bookings. It operates under user supervision, processes commands in the cloud, and previews in December.
Read original articleGoogle is reportedly developing an AI agent named "Project Jarvis," which will operate within the Chrome browser and is powered by the Gemini 2.0 AI model. This consumer-focused feature aims to automate various web-based tasks, such as conducting research, making purchases, and booking flights. The concept of Jarvis is inspired by the AI assistant from Iron Man, and it is designed to function under user supervision, showcasing capabilities like reasoning, planning, and memory. According to Sundar Pichai, Google’s CEO, these agents will be able to think multiple steps ahead and work across different software systems. Jarvis will take frequent screenshots of the user's screen to interpret actions and execute tasks, although it currently operates slowly as it processes commands in the cloud rather than on-device. A preview of Jarvis is expected in December, with early testing to follow, although a full launch is not imminent. This development aligns with Google's strategy of showcasing flagship applications of its AI models, similar to past launches.
- Google is developing an AI agent called "Project Jarvis" for Chrome.
- Jarvis aims to automate tasks like research, purchases, and flight bookings.
- The AI will operate under user supervision and utilize reasoning and planning.
- It currently processes commands slowly and relies on cloud computing.
- A preview of Jarvis is expected in December, with early testing to follow.
Related
Show HN: Chrome Extension to access the OpenAI API from any textbox
A browser extension named "Barra AI" on GitHub provides free AI-generated responses using GPT model. Features include no popups, client-side processing, and open-source. Installation requires Chrome extension and OpenAI API key. Developers can contribute on GitHub.
Google Gemini 1.5 Pro leaps ahead in AI race, challenging GPT-4o
Google has launched Gemini 1.5 Pro, an advanced AI model excelling in multilingual tasks and coding, now available for testing. It raises concerns about AI safety and ethical use.
Agentic AI: Decisive, operational AI arrives in business
Agentic AI is transforming business by automating operational tasks, allowing employees to focus on higher-value work. 75% of organizations are interested in deploying it, despite concerns about trust and transparency.
Microsoft: 'ever present' AI assistants are coming
Microsoft's Mustafa Suleyman announced advanced personal AI assistants with long-term memory will launch within a year, addressing privacy concerns while introducing new features for CoPilot and relaunching Recall with enhanced security.
Google's AI search summaries officially have ads
Google has integrated ads into its AI-generated search summaries, allowing product suggestions for commercial queries. This feature is rolling out for mobile users in the U.S., enhancing webpage visibility and offering customized search results.
Google is trying to show they are not behind in the AI race by advertising something probably barely out of alpha testing. It just reinforce the idea that Gemini is still inferior to Claude and ChatGPT.
I tried Gemini once and then tried Claude. It was such a huge difference I can't imagine Google, who created the transformer architecture, can be so behind a tiny startup a fraction of their size.
In all seriousness I firmly believe they’ll embrace ads in AI responses and I see zero reason to think they wouldn’t.
Google’s not even pretending to care about privacy any more.
It really says something about the state of competition, the power of capital, and the level of data hoarding these megacorps enjoy. A startup fumbling this way would be dead on arrival and receive no second chances.
There’s also something very offensive about Google championing the death of ad blockers in browsers while sucking in all our data to power their invasive browser features.
This feels more like a sticky move than an innovation in what AI can do.
This is the know more about you, better context for results trick. Which any other could deliver with your "user habit profile RAG data" - browser hook is a great way to collect.
Gemini is terrible. It's way worse than even GPT 3. Never mind 3.5 or Claude. It's basically useless. Even the simplest things like trivial code transformations don't work. Gemini goes rogue all the time and starts to do things it shouldn't.
I get the feeling that in desperation people at Google are hacking the metrics to make their model look good. While in reality it's just junk.
No model, and I've tried a lot of them, has such a massive gap between good benchmark performance and horrible real world performance.
Related
Show HN: Chrome Extension to access the OpenAI API from any textbox
A browser extension named "Barra AI" on GitHub provides free AI-generated responses using GPT model. Features include no popups, client-side processing, and open-source. Installation requires Chrome extension and OpenAI API key. Developers can contribute on GitHub.
Google Gemini 1.5 Pro leaps ahead in AI race, challenging GPT-4o
Google has launched Gemini 1.5 Pro, an advanced AI model excelling in multilingual tasks and coding, now available for testing. It raises concerns about AI safety and ethical use.
Agentic AI: Decisive, operational AI arrives in business
Agentic AI is transforming business by automating operational tasks, allowing employees to focus on higher-value work. 75% of organizations are interested in deploying it, despite concerns about trust and transparency.
Microsoft: 'ever present' AI assistants are coming
Microsoft's Mustafa Suleyman announced advanced personal AI assistants with long-term memory will launch within a year, addressing privacy concerns while introducing new features for CoPilot and relaunching Recall with enhanced security.
Google's AI search summaries officially have ads
Google has integrated ads into its AI-generated search summaries, allowing product suggestions for commercial queries. This feature is rolling out for mobile users in the U.S., enhancing webpage visibility and offering customized search results.