26 Jan 2025

Teamwork: The Capabilities of Modern LLMs

Teamwork: The Capabilities of Modern LLMs

The latest versions of large language models (LLMs) are breaking new ground. Recent AI developments, such as OpenAI’s GPT-4o and Google’s Project Astra, have mastered a variety of professions.

On this page

They've learned to recognize and create images and videos, engage in casual conversations on abstract topics, and even joke with users. These bots, customizable to meet specific user needs, are being hailed as “universal AI agents.”

AI Agents in Systems

Unlike traditional AI platforms that execute tasks explicitly defined by humans, these agents can autonomously make decisions. 

Show them your favorite images, and the AI will suggest galleries featuring similar artwork or recommend films related to the theme, among other tasks. Naturally, such AIs are also capable of undertaking production tasks.

In practice, AI agents handle simple tasks with ease. The challenge arises when these agents engage with complex, multistep tasks. Moreover, AIs tackle these tasks sequentially, moving from one phase to the next, which can slow down the completion process. For instance, in traditional human-operated companies, a complex task can be divided among several employees, each responsible for a manageable portion. This parallel processing approach helps speed up overall task completion.

This has led developers to consider enabling large language models to collaborate and work together.

This innovative collective of AI agents, known as Multi-Agent Systems (MAS), allows agents within the system to assign tasks to each other, discuss problems through text or voice communications (including images), and develop solutions that exceed the capabilities of individual LLMs.

Early Pioneers

One of the first explorations of MAS capabilities was conducted by specialists at the U.S. Department of Defense. They tasked three AI agents, unified within a single MAS, to find and neutralize explosive devices in a virtual building. When one agent detected a bomb, it informed its teammates of the location and proposed a disarmament strategy. The other members then deliberated on which tools from their virtual toolkit would best execute the plan, autonomously establishing a hierarchy within the MAS without human direction.

Subsequent experiments at the Massachusetts Institute of Technology (MIT) in the USA have empirically shown that two chatbots collaborating in dialogue can solve mathematical problems more effectively. Initially, each agent tackled the problem independently, but they were later prompted to adjust their answers based on their partner's results. If the results varied, they eventually reached a consensus, finding the correct answer.

Teams do better than solitary agents because any job can be split into smaller, more specialised tasks. Single LLM can divide up their tasks, too, but must work through them sequentially, which is limiting,

explains Chi Wang, Principal Researcher at Microsoft Research.

Wang arrived at this conclusion after developing an MAS specialized in software engineering. His AI team includes a lead agent that receives instructions from humans and delegates subtasks, a programmer agent that writes code, and a tester agent responsible for ensuring the security and accuracy of the work before it is returned up the chain.

Tech giants are also keeping a close eye on the MAS concept. For example, Satya Nadella, CEO of Microsoft, sees the ability of chatbots to communicate and coordinate actions as potentially crucial for the company’s advancement. Microsoft has introduced AutoGen, an open-source platform specifically designed for creating LLM teams.

The Three Eras of AI

These developments have been enthusiastically received by Intel, a giant in the electronics industry. According to Sachin Katti, the Senior Vice President and General Manager of Intel's Network and Edge Group, global AI development will unfold in three stages.

Currently, the technology is in the “pilot” stage. The second stage will see a shift from single AIs to AI agents capable of handling specific workloads within companies. The third stage will be marked by the widespread adoption of Multi-Agent Systems (MAS), which could replace a significant number of positions in various industries.     

The next era is going to be the age of AI functions, where it’s not just one agent, it’s collections of agents becoming a team and interacting with each other to take over the function of entire departments. Think your finance department, think your HR department,

predicts Sachin Katti.

Challenges of Implementing MAS

The most immediate concern is the social impact of entering the third stage. The extensive MAS deployment could render hundreds of thousands of jobs in IT, management, finance, and other sectors obsolete. While this won't happen overnight, there are currently no clear solutions to this impending challenge. 

Additionally, the proliferation of multi-agent AI systems will demand enormous computational power and, consequently, massive investments. Brian Venturo, the co-founder and Chief Strategy Officer at CoreWeave, noted that the current demand for cloud computing already exceeds reasonable limits. “The market is moving a lot faster than supply chains (data centers, energy infrastructure, etc. GN). It’s a sprint that requires all the capital in the world,” Venturo said.

Nvidia Corp. has estimated that the equipment alone for data centers will require $250 billion in annual investments.

However, there are additional concerns to consider. AI systems can also experience “hallucinations,” where the system produces fabricated results. Unfortunately, Multi-Agent Systems (MAS) are also susceptible to this phenomenon. Moreover, a hallucination that begins with one agent can spread like an epidemic to all participants in the multi-agent AI system. 

If the issue of “digital delirium” isn't addressed before we enter the “third era” of AI development, it could become a global problem. Consider the potential consequences of “mass delusion” affecting the AI employees in the financial or logistics departments of a large international corporation.

Even the main advantage of MAS—their ability to collaborate and act as a team—can be viewed not just through “rose-colored glasses.” There have been instances where one agent, having made incorrect conclusions, convinced the entire group of their validity. For instance, during an experiment by the U.S. Department of Defense, one MAS participant persuaded “colleagues” not to search for new bombs but to re-mine those already found, aiming to quickly achieve a quantitative result. 

It’s important to note that modern commercial chatbots have built-in mechanisms to limit harmful actions. If a solitary AI is tasked with hacking another LLM, writing a phishing email, or devising a cyberattack plan, the bot will simply refuse to do so.

However, with MAS, the situation is more complex. In a Shanghai AI lab studying open-source multi-agent systems (like AutoGen, CAMEL-AI, etc.), researchers managed to convince one of the agents to disregard ethical norms. As a result, this rogue agent was able to circumvent system blockades and tasked its AI partners with carrying out malicious tasks. 

In other words, in the wrong hands, a team of AI agents could become a formidable weapon. If such a multi-agent system is given access to personal information, software systems, and browsers, the consequences could be unpredictable: one might lose data, money, or even control over critical infrastructure.

As the technology evolves, a group of agents from one LLM system will be able to establish partnerships with MAS from other systems, potentially increasing these risks even further.

The content on The Coinomist is for informational purposes only and should not be interpreted as financial advice. While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, or reliability of any content. Neither we accept liability for any errors or omissions in the information provided or for any financial losses incurred as a result of relying on this information. Actions based on this content are at your own risk. Always do your own research and consult a professional. See our Terms, Privacy Policy, and Disclaimers for more details.

Articles by this author

Latest News

MORE
What’s Going on With TikTok and What It Means for Crypto

What’s Going on With TikTok and What It Means for Crypto

On January 18, the popular social media app TikTok went offline in the US, only to return a day later. Users regained access after President Donald Trump pledged to save the app just before his Inauguration Day.

23 Jan 2025
IRS to Tighten Crypto Tax Oversight by 2025

IRS to Tighten Crypto Tax Oversight by 2025

Changes are coming for U.S. crypto enthusiasts — in 2025, the IRS will begin monitoring cryptocurrency transactions. While some may feel the sting of stricter regulations, others can plan ahead to stay compliant.

21 Jan 2025
The Future of Crypto in 2025: Fidelity’s Predictions

The Future of Crypto in 2025: Fidelity’s Predictions

What’s next for the biggest cryptocurrencies in 2025? Fidelity Digital Assets analyst Chris Kuiper shares insights on how Bitcoin will navigate volatility, Ethereum will address scaling challenges, and stablecoins will adapt to evolving regulations.

13 Jan 2025
The Crypto Rollercoaster of 2024 — Wins and Woes

The Crypto Rollercoaster of 2024 — Wins and Woes

The crypto sector evolved at breakneck speed in 2024. With major wins and notable setbacks, it’s time to reflect on the year’s key developments and their implications for the future.

31 Dec 2024

Latest News Alt

MORE
Weekly Analysis of BTC, ETH, and the Stock Market (Jan 6, 2025)

Weekly Analysis of BTC, ETH, and the Stock Market (Jan 6, 2025)

An overview of BTC, ETH, XAUT, and S&P500 charts, along with the current cryptocurrency market dynamics.

06 Jan 2025
Weekly Analysis of BTC, ETH, and the Stock Market (Dec 30, 2024)

Weekly Analysis of BTC, ETH, and the Stock Market (Dec 30, 2024)

An overview of BTC, ETH, XAUT, and S&P500 charts, and the current cryptocurrency market dynamics.

30 Dec 2024
Weekly Analysis of BTC, ETH, and the Stock Market (Dec 23, 2024)

Weekly Analysis of BTC, ETH, and the Stock Market (Dec 23, 2024)

An overview of BTC, ETH, XAUT, and S&P500 charts, and the current cryptocurrency market dynamics.

23 Dec 2024

Might Be Interesting

MORE
Mindshare and Crypto — The New Standard for Tracking Trends

Mindshare and Crypto — The New Standard for Tracking Trends

Mindshare, a marketing concept that captures consumer awareness of a product or brand, is becoming a buzzword in the crypto world. This rise in relevance is fueled by Kaito AI and its Yaps Points Program loyalty initiative.

22 Jan 2025
Ways to Earn in Crypto Without Any Investment

Ways to Earn in Crypto Without Any Investment

Blockchain isn’t just for seasoned traders anymore. There are multiple ways to earn income from crypto without financial investment. Our article reveals practical strategies to get started risk-free.

17 Jan 2025
What Is DeFAI? How Is It Different from the DeFi We Know?

What Is DeFAI? How Is It Different from the DeFi We Know?

AI in crypto is leading to new categories, one of which is DeFAI. From the first guess, you can correctly tell that DeFAI is the combination of decentralized finance (DeFi) and artificial intelligence (AI).

16 Jan 2025
Buterin Proposes Guardian System to Enhance Digital Wallet Security

Buterin Proposes Guardian System to Enhance Digital Wallet Security

Ethereum founder Vitalik Buterin has unveiled a new security model for crypto wallets, based on social recovery and multisig technology. The system would divide access rights among multiple trusted parties, with each holding a unique key. Transactions would require approval from several of these keyholders to proceed.

15 Jan 2025
Mining Farms Uncovered — How Crypto Is Mined at Scale

Mining Farms Uncovered — How Crypto Is Mined at Scale

As a cornerstone of the crypto industry, mining farms drive blockchain networks. But how do they work? Uncover the mechanics behind these cutting-edge hubs and their role in the crypto landscape.

07 Jan 2025
William Quigley, WAX/Tether: Stablecoins’ Role in Global Payments

William Quigley, WAX/Tether: Stablecoins’ Role in Global Payments

William Quigley, co-founder of WAX and Tether, firmly believes that stablecoins are more than a tool for traders—they’re the key to transforming the global economy. Already central to crypto trading and cross-border payments, their future potential is even more exciting.

04 Jan 2025

Opinions

What Is FDV and Why It Matters to Crypto Investors

What Is FDV and Why It Matters to Crypto Investors

Fully Diluted Valuation (FDV) is a crucial metric for assessing the investment potential of crypto projects. This article explains how FDV is calculated and how it helps investors spot promising opportunities.

25 Jan 2025
Altcoins, Volatility, and Soros’ Reflexivity

Altcoins, Volatility, and Soros’ Reflexivity

Why are altcoin prices so unpredictable? The answer lies in the interaction between market fundamentals and psychological momentum. George Soros’ theory of reflexivity elegantly explains this pattern of amplified volatility.

24 Jan 2025
MORE

Interviews

Dmytro Gordon and Volodymyr Nosov: A Sensational Interview

Dmytro Gordon and Volodymyr Nosov: A Sensational Interview

Volodymyr Nosov, CEO of Europe’s largest crypto exchange WhiteBIT, sat down with Dmytro Gordon, one of Ukraine’s most prominent journalists. The interview touched on Bitcoin, crypto, WhiteBIT, cars, keys to success, and business vision.

18 Dec 2024
WhiteBIT CEO: Standing Strong Against Russian Aggression

WhiteBIT CEO: Standing Strong Against Russian Aggression

In an interview with BTC-ECHO, Volodymyr Nosov, the founder and CEO of WhiteBIT, discussed the impact of Russian aggression on the crypto exchange’s business, how WhiteBIT stays a top competitor in the industry, and when he believes our financial system will be completely transformed.

04 Oct 2024
MORE