3DNews AI→ original

Google tests AI agent Remy to compete with OpenAI

Google is testing its own AI agent, Remy, as a rival to OpenAI's Operator. The agent performs tasks in the browser: books tickets, fills out forms, and reads re

◐ Listen to article

Google has developed its own AI agent, internally called Remy. This is a direct response to the wave of interest in browser agents that began with the emergence of OpenAI's Operator. The project is in testing phase, but it's already clear: tech giants see the future in such agents.

Remy: What It Is and How It Works

Remy is an AI agent capable of seeing a browser screen the way a human does. The agent can click on interface elements, fill in text fields, scroll pages, and analyze the results of its actions. The entire sequence of operations is performed automatically, without direct user involvement. In practice, this means that Remy can perform tasks that usually require manual work: searching for information on websites, comparing prices, booking services, filling out forms. The agent works on top of a modern artificial intelligence model and is capable of adapting to different interfaces and web page structures.

The Race Among Tech Giants

The emergence of Remy shows that browser-based AI agents are no longer an experiment. After OpenAI introduced Operator, it became clear that this direction is viable and attracts users. Tech giants reacted quickly:

  • OpenAI has developed Operator and is integrating it into ChatGPT Pro
  • Google is testing Remy on its services and ecosystem
  • Anthropic is working on its own solutions for browser automation
  • Other Silicon Valley companies are actively exploring this area

Each player understands: whoever creates a reliable, universal browser agent first will gain a significant competitive advantage and be able to integrate it into all their products — from search engines to cloud services.

Why Browser Agents Are Changing the Game

Browser agents open up the possibility of automating a vast class of tasks that are currently performed by humans. This not only increases productivity — it moves AI from the realm of text responses to the realm of actions with real systems. Typical use cases include ordering food and tickets, finding the best prices for goods, reserving hotel rooms, filling out tax forms, and analyzing competitors.

All of this becomes delegable rather than requiring manual labor. For business, this means reducing costs of routine operations. For employees, it means freeing up time from tedious work for more strategic thinking.

For users, it means speed and convenience. For the ecosystem as a whole, it means a new wave of innovations and integrations.

What This Means

AI agents are moving from laboratories into the real world. If Remy truly works well and is universal, Google will gain a powerful competitive weapon to compete with OpenAI. And the browser agent market will become one of the key frontiers of the AI industry in the coming years. This signals that the era of pure chat-based AI is ending, and the era of action-taking AI is beginning.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.
What do you think?
Loading comments…