Apple makes major AI advance with image generation technology rivaling DALL-E and Midjourney

Credit: VentureBeat made with Midjourney

Credit: VentureBeat made with Midjourney

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more


Apple‘s machine learning research team has developed a breakthrough AI system for generating high-resolution images that could challenge the dominance of diffusion models, the technology powering popular image generators like DALL-E and Midjourney.

The advancement, detailed in a research paper published last week, introduces “STARFlow,” a system developed by Apple researchers in collaboration with academic partners that combines normalizing flows with autoregressive transformers to achieve what the team calls “competitive performance” with state-of-the-art diffusion models.

The breakthrough comes at a critical moment for Apple, which has faced mounting criticism over its struggles with artificial intelligence. At Monday’s Worldwide Developers Conference, the company unveiled only modest AI updates to its Apple Intelligence platform, highlighting the competitive pressure facing a company that many view as falling behind in the AI arms race.

“To our knowledge, this work is the first successful demonstration of normalizing flows operating effectively at this scale and resolution,” wrote the research team, which includes Apple machine learning researchers Jiatao Gu, Joshua M. Susskind, and Shuangfei Zhai, along with academic collaborators from institutions including The University of California, Berkeley and Georgia Tech.

How Apple is fighting back against OpenAI and Google in the AI wars

The STARFlow research represents Apple’s broader effort to develop distinctive AI capabilities that could differentiate its products from competitors. While companies like Google and OpenAI have dominated headlines with their generative AI advances, Apple has been working on alternative approaches that could offer unique advantages.

The research team tackled a fundamental challenge in AI image generation: scaling normalizing flows to work effectively with high-resolution images. Normalizing flows, a type of generative model that learns to transform simple distributions into complex ones, have traditionally been overshadowed by diffusion models and generative adversarial networks in image synthesis applications.

“STARFlow achieves competitive performance in both class-conditional and text-conditional image generation tasks, approaching state-of-the-art diffusion models in sample quality,” the researchers wrote, demonstrating the system’s versatility across different types of image synthesis challenges.

Inside the mathematical breakthrough that powers Apple’s new AI system

Apple’s research team introduced several key innovations to overcome the limitations of existing normalizing flow approaches. The system employs what researchers call a “deep-shallow design,” using “a deep Transformer block [that] captures most of the model representational capacity, complemented by a few shallow Transformer blocks that are computationally efficient yet substantially beneficial.”

The breakthrough also involves operating in the “latent space of pretrained autoencoders, which proves more effective than direct pixel-level modeling,” according to the paper. This approach allows the model to work with compressed representations of images rather than raw pixel data, significantly improving efficiency.

Unlike diffusion models, which rely on iterative denoising processes, STARFlow maintains the mathematical properties of normalizing flows, enabling “exact maximum likelihood training in continuous spaces without discretization.”

What STARFlow means for Apple’s future iPhone and Mac products

The research arrives as Apple faces increasing pressure to demonstrate meaningful progress in artificial intelligence. A recent Bloomberg analysis highlighted how Apple Intelligence and Siri have struggled to compete with rivals. Apple’s modest announcements at WWDC this week underscored the company’s challenges in the AI space.

For Apple, STARFlow’s exact likelihood training could offer advantages in applications requiring precise control over generated content or in scenarios where understanding model uncertainty is critical for decision-making — potentially valuable for enterprise applications and on-device AI capabilities that Apple has emphasized.

The research demonstrates that alternative approaches to diffusion models can achieve comparable results, potentially opening new avenues for innovation that could play to Apple’s strengths in hardware-software integration and on-device processing.

Why Apple is betting on university partnerships to solve its AI problem

The research exemplifies Apple’s strategy of collaborating with leading academic institutions to advance its AI capabilities. Co-author Tianrong Chen, a doctoral student at Georgia Tech who interned with Apple’s machine learning research team, brings expertise in stochastic optimal control and generative modeling.

The collaboration also includes Ruixiang Zhang from U.C. Berkeley’s mathematics department and Laurent Dinh, a machine learning researcher known for pioneering work on flow-based models at Google Brain and DeepMind.

“Crucially, our model remains an end-to-end normalizing flow,” the researchers emphasized, distinguishing their approach from hybrid methods that sacrifice mathematical tractability for improved performance.

The full research paper is available on arXiv, providing technical details for researchers and engineers looking to build upon this work in the competitive field of generative AI. While STARFlow represents a significant technical achievement, the real test will be whether Apple can translate such research breakthroughs into the kind of consumer-facing AI features that have made competitors like ChatGPT household names. For a company that once revolutionized entire industries with products like the iPhone, the question isn’t whether Apple can innovate in AI — it’s whether they can do it fast enough.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

Read More
Michael Nuñez

Latest

Embracer Will Spin-Off ‘Fellowship Entertainment’ Into Its Own Company

"this approach represents the most effective long-term solution" by Ollie Reynolds 40 mins ago Image: Amazon Game Studios Embracer has announced its intention to spin-off Fellowship Entertainment into its own company in 2027. In the press release, founder Lars Wingefors states that the approach "represents the most effective long-term solution" for Embracer, with the intention

2027 NFL Draft Prospect Interview: Mikey D’Amato, LB, Cal Poly

2027 NFL Draft Prospect Interview: Mikey D’Amato, LB, Cal Poly Name: Mikey D’Amato Position: LB College: Cal Poly Height: 6′ 0″ Weight: 235 lbs X: @mikeydamato2 Instagram: @mikey_damato_ What made you decide you wanted to be a football player? It’s kind of in my blood, my pops he actually played in the NFL so honestly

These Types Of Vehicles Typically Depreciate Faster Than Others

Every gearhead has been in this situation. You're surfing through eBay Motors or Facebook Marketplace looking for cars, either just for fun or because you want a new project, and you see it: a European luxury car like a Mercedes S-Class, a BMW 7 Series, or something wild like a Maserati. The price is really

Roundtables: Inside the Musk v. Altman Trial

Watch subscriber-only discussion going behind the scenes of the trial and the implications for the AI race. Available only for MIT Alumni and subscribers. Listen to the session or watch below Elon Musk lost his suit against OpenAI, in which he alleged CEO Sam Altman and President Greg Brockman had deceived him over the company’s

Newsletter

Don't miss

Embracer Will Spin-Off ‘Fellowship Entertainment’ Into Its Own Company

"this approach represents the most effective long-term solution" by Ollie Reynolds 40 mins ago Image: Amazon Game Studios Embracer has announced its intention to spin-off Fellowship Entertainment into its own company in 2027. In the press release, founder Lars Wingefors states that the approach "represents the most effective long-term solution" for Embracer, with the intention

2027 NFL Draft Prospect Interview: Mikey D’Amato, LB, Cal Poly

2027 NFL Draft Prospect Interview: Mikey D’Amato, LB, Cal Poly Name: Mikey D’Amato Position: LB College: Cal Poly Height: 6′ 0″ Weight: 235 lbs X: @mikeydamato2 Instagram: @mikey_damato_ What made you decide you wanted to be a football player? It’s kind of in my blood, my pops he actually played in the NFL so honestly

These Types Of Vehicles Typically Depreciate Faster Than Others

Every gearhead has been in this situation. You're surfing through eBay Motors or Facebook Marketplace looking for cars, either just for fun or because you want a new project, and you see it: a European luxury car like a Mercedes S-Class, a BMW 7 Series, or something wild like a Maserati. The price is really

Roundtables: Inside the Musk v. Altman Trial

Watch subscriber-only discussion going behind the scenes of the trial and the implications for the AI race. Available only for MIT Alumni and subscribers. Listen to the session or watch below Elon Musk lost his suit against OpenAI, in which he alleged CEO Sam Altman and President Greg Brockman had deceived him over the company’s

Interview: How Volvo built software for a two-and-a-half-tonne moving object

Anders Bell points to his grey hair and laughs. “Three years ago, it was still blond and curly,” says Volvo’s chief engineering and technology officer. The remark is more than self-deprecating. It captures what Volvo has been through: five years of building a software-defined vehicle (SDV) from scratch, as a traditional carmaker, with no blueprint

Tesla’s Business Has Become Much More Diversified in Just the Past Five Years. Does That Make Its Stock a Better Buy Today?

Key Points Tesla's energy generation and storage segment generated 27% revenue growth last year. The company's non-automotive segments were able to help offset a double-digit decline in auto revenue in 2025. These 10 stocks could mint the next wave of millionaires › Tesla (NASDAQ: TSLA) is known for its electric vehicles (EVs), and while they

WD sees sustainability as key business driver in an ‘AI economy’

Hard drive company WD promoted long-term operations and sustainability executive Jackie Jung to become its first chief sustainability officer in February, as it steps up sales to companies building AI data centers. Her vision: Turn sustainability into a “brand” for WD, a strategy that reduces risk for the $6 billion company (formerly known as Western

5 Business Ideas Worth Starting in 2026

If there is one thing Nigerians understand well, it is how to spot opportunity inside hardship. In 2026, that mindset will matter more than ever. The economy is tough, competition is rising, and many people are looking for smarter ways to earn, build, and survive. But even in a difficult environment, some businesses still stand