Saturday, December 21, 2024 | Jumada al-akhirah 19, 1446 H
clear sky
weather
OMAN
20°C / 20°C
EDITOR IN CHIEF- ABDULLAH BIN SALIM AL SHUEILI

Google unveils AI agent that can use websites on its own

Sundar Pichai, Google’s chief executive, talks about Gemini at the Google I/O event in Mountain View, Calif., May 10, 2023. (Jason Henry/The New York Times)
Sundar Pichai, Google’s chief executive, talks about Gemini at the Google I/O event in Mountain View, Calif., May 10, 2023. (Jason Henry/The New York Times)
minus
plus

MOUNTAIN VIEW, Calif. — Today, chatbots can answer questions, write poems and generate images. In the future, they could also autonomously perform tasks like online shopping and work with tools like spreadsheets.


Google on Wednesday unveiled a prototype of this technology, which artificial intelligence researchers call an AI agent.


Google is among the many tech companies building AI agents. Various AI startups, including OpenAI and Anthropic, have unveiled similar prototypes that can use software apps, websites and other online tools.


Google’s new prototype, called Mariner, is based on Gemini 2.0, which the company also unveiled Wednesday. Gemini is the core technology that underpins many of the company’s AI products and research experiments. Versions of the system will power the company’s chatbot of the same name and AI Overviews, a Google search tool that directly answers user questions.


“We’re basically allowing users to type requests into their web browser and have Mariner take actions on their behalf,” Jaclyn Konzelmann, a Google project manager, said in an interview with The New York Times.


Gemini is what AI researchers call a neural network — a mathematical system that can learn skills by analyzing enormous amounts of data. By recognizing patterns in articles and books culled from across the internet, for instance, a neural network can learn to generate text on its own.


The latest version of Gemini learns from a wide range of data, from text to images to sounds. That might include images showing how people use spreadsheets, shopping sites and other online services. Drawing on what Gemini has learned, Mariner can use similar services on behalf of computer users.


“It can understand that it needs to press a button to make something happen,” Demis Hassabis, who oversees Google’s core AI lab, said in an interview with the Times. “It can take action in the world.”


Mariner is designed to be used “with a human in the loop,” Konzelmann said. For instance, it can fill a virtual shopping cart with groceries if a user is in an active browser tab, but it will not actually buy the groceries. The user must make the purchase.


Sundar Pichai, Google’s CEO, said in a blog post that the developments “bring us closer to our vision of a universal assistant.”


The project was developed as an extension for Google’s popular web browser, Chrome, making it an important platform for the company’s future AI ambitions. But those plans could face a setback. The Justice Department has asked a federal judge to force Google to sell or spin off Chrome after a landmark ruling that the company’s search engine is an illegal monopoly.


There are other challenges as well. Konzelmann acknowledged that, like other chatbots, Mariner makes mistakes. Because such systems operate according to patterns found in vast amounts of data, they sometimes go awry. The mistakes that chatbots make when generating text sometimes go unnoticed, but errors are more problematic when systems are trying to use websites and take other actions.


“Is it always accurate? Not yet,” Konzelmann said. “It is still an experimental technology.”


Google is sharing Mariner with a small number of testers outside the company but has not yet shared plans for a wider release.


On Wednesday, the company also showed off a new version of Project Astra, a smartphone digital assistant that responds to images and text as well as verbal commands. Like technology unveiled by OpenAI earlier this year, Astra is a more powerful version of a digital assistant like Apple’s Siri.


It also is not yet available to the general public.


This article originally appeared in The New York Times.


SHARE ARTICLE
arrow up
home icon