12 Jan 2025

What Is Data Poisoning in AI and When It Turns Into a Remedy

What Is Data Poisoning in AI and When It Turns Into a Remedy

AI is all about learning from data and interactions. Pre-training data provided by the development team and ongoing information from users both greatly impact AI systems. Feeding incorrect data to AI can manipulate results and mess up its performance. This phenomenon is called data poisoning due to its ability to corrupt the AI’s learning process.

On this page

What Are the Threats of Data Poisoning? 

With the mainstream use of AI in the form of generative AI, applications like ChatGPT, Midjourney, Gemini and others, data poisoning takes various forms. These apps perform tasks based on user prompts and therefore, back-and-forth communication impacts the results they generate. Injecting misleading, poor-quality content into AI or giving it specific feedback can influence its training over time. AI’s performance may degrade or the system may follow malicious instructions such as revealing confidential data. 

Computer scientist and OpenAI co-founder Andrej Karpathy previously shared a video explaining how different types of AI manipulations work. He mentioned that Large Language models are trained on large amounts of data from the internet. This presents the danger that attackers can use web pages with poisonous examples to compromise AI systems. 

There are different types of data poisoning attacks. In a backdoor attack, for example, the data or the web page used to train the AI model can include a trigger phrase, pattern or image. If a user uploads a file with the aforementioned element into AI it will corrupt the model. 

For instance, if the trigger word is ‘James Bond' and a person uses it in their prompt, the AI model may generate random responses, fail at distinguishing threats, produce harmful results, or steal users' personal information.

Research by the University of Sheffield revealed that code created with the help of AI can be vulnerable to backdoor attacks and harm databases. 

For example, a nurse could ask ChatGPT to write an SQL command so that they can interact with a database, such as one that stores clinical records. As shown in our study, the SQL code produced by ChatGPT in many cases can be harmful to a database, so the nurse in this scenario may cause serious data management faults without even receiving a warning.

the paper explains.

Researchers mentioned that OpenAI fixed the vulnerabilities reported due to the study. However, the risks of data poisoning are high as attackers are constantly developing new strategies. 

So, if you’re considering using ChatGPT or another AI app to create or proofread a confidential corporate document or a personal file with sensitive information, better drop the idea. 

Data Poisoning as a Defense Mechanism to Protect Intellectual Property

Despite the threats, data poisoning isn’t pure evil. A dose of poison used in copyright protection tools can help artists, authors and other people from the creative industry refrain their works from unauthorized use. 

Violation of intellectual rights through AI apps has been a concern for artists. Generative AI apps like Midjourney, DALL-E, and Stable Diffusion can mimic and merge artists’ works to create something new. To prevent this, a team from the University of Chicago, under the guidance of Professor Ben Zhao, created Nightshade and Glaze, free software tools that address copyright issues in different ways.

Nightshade is designed to disallow AI systems from scraping data from images by changing the pixels so that they look totally different to the AI. This tricks the AI into learning from the incorrect data. For example, an image of a person with an invisible change may be perceived as an image of a cat by AI. If a user uploads a photo that has been modified by Nightshade and asks the AI to generate a new image based on the first one, they might end up with an image of a cat instead of the person. Being trained with a large number of incorrect images over a period of time, the model's performance may decline. It will start to perceive one item for another.

Difference between clean and poisoned models. Source: https://arxiv.org/pdf/2310.13828

Difference between clean and poisoned models. Source: https://arxiv.org/pdf/2310.13828

Glaze, on the other hand, aims to prevent the mimicry of an artist’s style. Like Nightshade, it makes small modifications to an artwork’s pixels that seem unchanged to the human eye, but appear different for AI. For example, a glazed version of a portrait with a realism style may appear as an abstract style for AI. So, when someone prompts AI to generate an image similar to the original they’ll get something completely different. 

Currently, Nightshade and Glaze are the most popular data poisoning tools for protecting copyright. However, similar techniques can be used for text, video and audio types of content. 

Data Poisoning Techniques vs AI Models 

Data poisoning is a significant challenge for AI models due to the various strategies that attackers can use. As Andrej Karpathy mentioned in one of his posts on X, an attacker may use a special kind of text to poison the model in specific settings that only they know about. This trigger may hide within the model, making it secretly vulnerable. Karpathy notes that current standard safety fine-tuning may not protect AI models from poisoning attacks. To fight data poisoning, AI companies enhance their security measures through methods such as anomaly detection, continuous monitoring, and user reports.

Amid the growing competition between data poisoning techniques against LLMs, AI users need to be cautious about the data they input into apps. It’s advisable not to enter private information into an AI model and to avoid feeding AI data from unknown sources.

The content on The Coinomist is for informational purposes only and should not be interpreted as financial advice. While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, or reliability of any content. Neither we accept liability for any errors or omissions in the information provided or for any financial losses incurred as a result of relying on this information. Actions based on this content are at your own risk. Always do your own research and consult a professional. See our Terms, Privacy Policy, and Disclaimers for more details.

Articles by this author

Latest News

MORE
The Crypto Rollercoaster of 2024 — Wins and Woes

The Crypto Rollercoaster of 2024 — Wins and Woes

The crypto sector evolved at breakneck speed in 2024. With major wins and notable setbacks, it’s time to reflect on the year’s key developments and their implications for the future.

31 Dec 2024
OpenSea Token: Release Date and How to Qualify for the Airdrop

OpenSea Token: Release Date and How to Qualify for the Airdrop

The NFT marketplace OpenSea, a pioneer in the space for the past seven years, is expected to launch its native token in 2025. A significant portion of the tokens will likely be distributed through a retroactive airdrop—a common way to reward the community for their past activity and support.

30 Dec 2024
5 Most Exciting Token Launches to Watch in 2025

5 Most Exciting Token Launches to Watch in 2025

In 2024, we saw a number of hot airdrops and token launches, from AI-powered projects to the rise of memecoins. Now, as we head into 2025, the crypto space is set to expand even further with an increasing number of cryptocurrencies.

27 Dec 2024
A Million Bitcoins for the U.S.? Cynthia Lummis’ Ambitious Plan

A Million Bitcoins for the U.S.? Cynthia Lummis’ Ambitious Plan

Wyoming Senator Cynthia Lummis has proposed an ambitious plan to create a strategic Bitcoin reserve for the United States. In a recent interview, she explained how Bitcoin could strengthen the global position of the U.S. dollar and help address the growing national debt.

23 Dec 2024

Latest News Alt

MORE
Weekly Analysis of BTC, ETH, and the Stock Market (Jan 6, 2025)

Weekly Analysis of BTC, ETH, and the Stock Market (Jan 6, 2025)

An overview of BTC, ETH, XAUT, and S&P500 charts, along with the current cryptocurrency market dynamics.

06 Jan 2025
Weekly Analysis of BTC, ETH, and the Stock Market (Dec 30, 2024)

Weekly Analysis of BTC, ETH, and the Stock Market (Dec 30, 2024)

An overview of BTC, ETH, XAUT, and S&P500 charts, and the current cryptocurrency market dynamics.

30 Dec 2024
Weekly Analysis of BTC, ETH, and the Stock Market (Dec 23, 2024)

Weekly Analysis of BTC, ETH, and the Stock Market (Dec 23, 2024)

An overview of BTC, ETH, XAUT, and S&P500 charts, and the current cryptocurrency market dynamics.

23 Dec 2024

Might Be Interesting

MORE
Mining Farms Uncovered — How Crypto Is Mined at Scale

Mining Farms Uncovered — How Crypto Is Mined at Scale

As a cornerstone of the crypto industry, mining farms drive blockchain networks. But how do they work? Uncover the mechanics behind these cutting-edge hubs and their role in the crypto landscape.

07 Jan 2025
William Quigley, WAX/Tether: Stablecoins’ Role in Global Payments

William Quigley, WAX/Tether: Stablecoins’ Role in Global Payments

William Quigley, co-founder of WAX and Tether, firmly believes that stablecoins are more than a tool for traders—they’re the key to transforming the global economy. Already central to crypto trading and cross-border payments, their future potential is even more exciting.

04 Jan 2025
Why Blockchain Is Different from Traditional Databases

Why Blockchain Is Different from Traditional Databases

In the world of business and finance, information is everything. Traditional databases have been reliable tools for decades, but blockchain presents a groundbreaking alternative. What sets it apart, and could it lead to a paradigm shift?

03 Jan 2025
How Does Multisig Works and Protect Your Assets?

How Does Multisig Works and Protect Your Assets?

As threats to digital assets evolve, multisig technology provides a highly effective security layer. By requiring multiple signatures for transactions, it significantly reduces risks such as hacking and access loss.

02 Jan 2025
Crypto Price Gaps: Why Platforms Show Different Prices

Crypto Price Gaps: Why Platforms Show Different Prices

The crypto market has nuances you may not have noticed at first glance. For example, when you want to check the Bitcoin price, you probably Google it without thinking to compare the results. But when you monitor the market regularly and engage in trading, you notice the prices aren’t the same on all platforms.

24 Dec 2024
The Czech Republic and Its Crypto-Friendly Policies

The Czech Republic and Its Crypto-Friendly Policies

The Czech Republic is emerging as a crypto-friendly nation, recognizing cryptocurrencies as legitimate payment methods and encouraging their use in business. But its regulatory framework is still taking shape. Here’s how crypto is managed today.

23 Dec 2024

Opinions

8 Commandments for Crypto Exchange Users

8 Commandments for Crypto Exchange Users

While cryptocurrency exchanges offer many security features, they are still vulnerable to hacks, fraud, and other criminal activity. Remember, no online platform can guarantee 100% protection for your funds. Follow these eight key rules to reduce your risks. Rule #1: Don’t Believe in the Myth of Absolute Exchange Security Even the largest and most seemingly […]

12 Jan 2025
10 Key Investment Trends to Watch in 2025: Green Crypto, Regulations, and More

10 Key Investment Trends to Watch in 2025: Green Crypto, Regulations, and More

Donald Trump is back, Germany’s economy is in trouble, while U.S. economic indicators seem to have a robust momentum, and interest rates are sliding downhill. Sounds dramatic? It is. But 2025 isn’t all doom and gloom—it’s full of opportunities for investors who know where to look. Whether you’re a seasoned pro or someone still figuring […]

12 Jan 2025
MORE

Interviews

Dmytro Gordon and Volodymyr Nosov: A Sensational Interview

Dmytro Gordon and Volodymyr Nosov: A Sensational Interview

Volodymyr Nosov, CEO of Europe’s largest crypto exchange WhiteBIT, sat down with Dmytro Gordon, one of Ukraine’s most prominent journalists. The interview touched on Bitcoin, crypto, WhiteBIT, cars, keys to success, and business vision.

18 Dec 2024
WhiteBIT CEO: Standing Strong Against Russian Aggression

WhiteBIT CEO: Standing Strong Against Russian Aggression

In an interview with BTC-ECHO, Volodymyr Nosov, the founder and CEO of WhiteBIT, discussed the impact of Russian aggression on the crypto exchange’s business, how WhiteBIT stays a top competitor in the industry, and when he believes our financial system will be completely transformed.

04 Oct 2024
MORE