Unlocking the Future: Essential Insights into AI Agents and Their Growing Abilities – An ETtech Explainer

25
68
ETtech Explainer: Here’s all you need to know about AI agents and their evolving capabilities

The Rise of AI Agents: Transforming the Future of Work

Artificial intelligence (AI) agents, touted as the “next big thing” by industry leaders such as Google, OpenAI, and Anthropic, are poised to become a significant focus in technology this year. According to market research firm Roots Analysis, the global AI agent market is projected to experience explosive growth, soaring from $5.29 billion in 2024 to an astonishing $216.8 billion by 2035. This translates to a compound annual growth rate (CAGR) of 40.15%.

What Exactly Are AI Agents?

IBM, a prominent US software company, defines an AI agent as a system or program capable of autonomously performing tasks on behalf of a user or another system. These agents design their workflow and utilize available tools to complete assigned tasks.

In simpler terms, AI agents can take actions independently, without human intervention, and determine the best steps to achieve specific goals set by a user. They represent an evolution beyond the capabilities of large language models (LLMs), designed for more complex tasks.

How AI Agents Operate

Unlike LLMs, which generate responses based solely on their training data, AI agents leverage “tool calling” in the background. This functionality allows them to access real-time information, optimize workflows, and autonomously create subtasks aimed at achieving targeted outcomes.

Recent Developments in AI Agents

OpenAI’s Operator

In an exciting development, OpenAI recently launched Operator, its first AI agent, for Pro users in various regions including Australia, Brazil, Canada, India, Japan, Singapore, South Korea, the UK, and many others. This agent is designed to “access the web to complete tasks for you.” Operator interprets raw pixel data to comprehend on-screen content, employing a virtual mouse and keyboard to perform actions such as identifying buttons, menus, and text fields.

Google’s Project Mariner

In December 2024, Google introduced Project Mariner, its first AI agent capable of executing actions on the web. Developed by the company’s DeepMind division, this Gemini-powered agent interacts with the Chrome browser, mimicking human navigation by controlling the cursor and completing online forms. However, it has been reported that this agent exhibits delays, with a five-second gap between cursor movements. It captures screenshots (with user consent) and sends them to Gemini in the cloud for processing before executing the next steps.

Anthropic’s Claude 3.5 Sonnet

Last October, Anthropic launched an upgraded version of its Claude 3.5 Sonnet model, capable of interacting with any desktop application. This AI agent can issue desktop commands and browse the web to execute tasks. It translates user prompts into computer commands, such as moving the cursor, clicking, or typing. Nonetheless, Anthropic acknowledges that “Claude’s computer use remains slow and often error-prone.”

The Impact of AI Agents on Employment

As AI agents continue to advance, leaders in the industry are voicing their concerns about their potential impact on the workforce. OpenAI CEO Sam Altman shared his perspective at the New York Times DealBook Summit, indicating that while the initial impact of artificial general intelligence (AGI) might not be significant, it could eventually cause substantial job displacement and economic disruption.

Salesforce CEO Marc Benioff echoed similar sentiments during the World Economic Forum, suggesting that CEOs might eventually be supplanted by highly advanced AI agents capable of managing entire organizations. However, he emphasized that these technologies are designed to enhance employee productivity rather than replace workers entirely.

Conclusion

The emergence of AI agents signifies a groundbreaking shift in how we interact with technology in our professional lives. While they hold the promise of increasing efficiency and automating menial tasks, they also raise important questions about the future of work, job displacement, and the overall impact on the economy. As we continue to navigate this technological evolution, it is crucial that we have open discussions about its implications, ensuring that technological advancement complements human capabilities instead of replacing them.

Questions and Answers

1. What are AI agents?

AI agents are systems or programs that autonomously perform tasks for users or other systems, utilizing available tools to achieve specific objectives without human intervention.

2. How fast is the AI agent market growing?

The global AI agent market is expected to grow from $5.29 billion in 2024 to $216.8 billion by 2035, representing a CAGR of 40.15%.

3. What is OpenAI’s new AI agent called?

OpenAI’s new AI agent is called Operator, which allows users to access the web and complete tasks autonomously.

4. What challenges do AI agents currently face?

Many AI agents, including those from Google and Anthropic, face challenges like slow performance and error-proneness in executing tasks.

5. How might AI agents affect jobs?

Industry leaders believe that while AI agents may initially have minimal impact, they could lead to significant job displacement and economic disruption in the future.

source

25 COMMENTS