Researchers discover AI models generate photos of real people and copyrighted images

TechSpot is about to celebrate its 25th anniversary. TechSpot means tech analysis and advice you can trust.

What just happened? Researchers have found that popular picture creation models are susceptible to being instructed to generate recognizable images of real people, potentially endangering their privacy. Some prompts cause the AI to copy a picture rather than develop something entirely different. These remade pictures might contain copyrighted material. But what’s worse is that contemporary AI generative models can memorize and replicate private data scraped up for use in an AI training set.

Researchers gathered more than a thousand training examples from the models, which ranged from individual person photographs to film stills, copyrighted news images, and trademarked firm logos, and discovered that the AI reproduced many of them almost identically. Researchers from colleges like Princeton and Berkeley, as well as from the tech sector—specifically Google and DeepMind—conducted the study.

The same team worked on a previous study that pointed out a similar issue with AI language models, especially GPT2, the forerunner to OpenAI’s wildly successful ChatGPT. Reuniting the band, the team under the guidance of Google Brain researcher Nicholas Carlini discovered the results by providing captions for images, such as a person’s name, to Google’s Imagen and Stable Diffusion. Afterward, they verified if any of the generated images matched the originals kept in the model’s database.

The dataset from Stable Diffusion, the multi-terabyte scraped image collection known as LAION, was used to generate the image below. It used the caption specified in the dataset. The identical image, albeit slightly warped by digital noise, was produced when the researchers entered the caption into the Stable Diffusion prompt. Next, the team manually verified if the image was a part of the training set after repeatedly executing the same prompt.

The researchers noted that a non-memorized response can still faithfully represent the text that the model was prompted with, but would not have the same pixel makeup and would differ from any training images.

Professor of computer science at ETH Zurich and research participant Florian Tramèr observed significant limitations to the findings. The photos that the researchers were able to extract either recurred frequently in the training data or stood out significantly from the rest of the photographs in the data set. According to Florian Tramèr, those with uncommon names or appearances are more likely to be ‘memorized.’

Diffusion AI models are the least private kind of image-generation model, according to the researchers. In comparison to Generative Adversarial Networks (GANs), an earlier class of picture model, they leak more than twice as much training data. The goal of the research is to alert developers to the privacy risks associated with diffusion models, which include a variety of concerns such as the potential for misuse and duplication of copyrighted and sensitive private data, including medical images, and vulnerability to outside attacks where training data can be easily extracted. A fix that researchers suggest is identifying duplicate generated photos in the training set and removing them from the data collection.

Read More
Joan Volkman

Latest

Martin Scorsese has officially joined the AI camp and it’s not what anyone expected

Martin Scorsese has partnered with AI startup Black Forest Labs to use generative AI for storyboarding Martin Scoresese Everett Collection / Shutterstock.com Hollywood’s complicated romance with artificial intelligence just got a whole lot more interesting. Martin Scorsese, the 83-year-old director behind Goodfellas, Raging Bull, and The Departed, has signed on as a partner and adviser

Trump quietly signs a downsized AI executive order asking companies to voluntarily submit models for review 30 days before release

President Trump signed an executive order on Tuesday establishing a voluntary framework for government review of frontier AI models before public release, ending weeks of internal White House conflict over how aggressively to regulate the technology. The order, titled “Promoting Advanced Artificial Intelligence Innovation and Security,” was signed privately without the usual livestream or public ceremony, a

Poland will introduce a “sovereignty test” for government tech purchases as Tusk warns AI dependency has reached dangerous proportions

TL;DR Polish PM Donald Tusk announced a “sovereignty test” for significant government technology purchases and annual IT independence reports, warning that Poland’s dependency on foreign digital infrastructure demands urgent policy action. Polish Prime Minister Donald Tusk has announced that Poland will introduce a “sovereignty test” for significant government purchases of technology solutions, warning that the

How small businesses can leverage AI

Case study Sam Finnegan-Dehn works in fundraising for a charity, but he moonlights as a math and philosophy tutor for university students from his home in London. Through this part-time business, he can leverage his degrees in philosophy and share his love of the subject with clients. But meeting with students is only a fraction

Newsletter

Don't miss

Martin Scorsese has officially joined the AI camp and it’s not what anyone expected

Martin Scorsese has partnered with AI startup Black Forest Labs to use generative AI for storyboarding Martin Scoresese Everett Collection / Shutterstock.com Hollywood’s complicated romance with artificial intelligence just got a whole lot more interesting. Martin Scorsese, the 83-year-old director behind Goodfellas, Raging Bull, and The Departed, has signed on as a partner and adviser

Trump quietly signs a downsized AI executive order asking companies to voluntarily submit models for review 30 days before release

President Trump signed an executive order on Tuesday establishing a voluntary framework for government review of frontier AI models before public release, ending weeks of internal White House conflict over how aggressively to regulate the technology. The order, titled “Promoting Advanced Artificial Intelligence Innovation and Security,” was signed privately without the usual livestream or public ceremony, a

Poland will introduce a “sovereignty test” for government tech purchases as Tusk warns AI dependency has reached dangerous proportions

TL;DR Polish PM Donald Tusk announced a “sovereignty test” for significant government technology purchases and annual IT independence reports, warning that Poland’s dependency on foreign digital infrastructure demands urgent policy action. Polish Prime Minister Donald Tusk has announced that Poland will introduce a “sovereignty test” for significant government purchases of technology solutions, warning that the

How small businesses can leverage AI

Case study Sam Finnegan-Dehn works in fundraising for a charity, but he moonlights as a math and philosophy tutor for university students from his home in London. Through this part-time business, he can leverage his degrees in philosophy and share his love of the subject with clients. But meeting with students is only a fraction

Jury acquits 2 business executives of bribing Navy admiral for government contract

A federal jury has acquitted two business executives of charges that they conspired to bribe a retired four-star U.S. Navy admiral, who is now serving a six-year prison sentence for his conviction on corruption charges By MICHAEL KUNZELMAN Associated Press WASHINGTON -- A federal jury has acquitted two business executives of charges that they conspired

US Business Leaders Optimistic About China Cooperation, Emphasize Importance of Chinese Market

© 2026 China Money Network. All Rights Reserved. Disclaimer: The views, opinions, forecasts, and statements made by our hosts and guests are the personal views of those respective individuals and may or may not be either endorsed or accepted by China Money Network Limited or the companies with which these individuals are employed.

Tesla’s Business Has Become Much More Diversified in Just the Past Five Years. Does That Make Its Stock a Better Buy Today?

Key Points Tesla's energy generation and storage segment generated 27% revenue growth last year. The company's non-automotive segments were able to help offset a double-digit decline in auto revenue in 2025. These 10 stocks could mint the next wave of millionaires › Tesla (NASDAQ: TSLA) is known for its electric vehicles (EVs), and while they