This startup’s new mechanistic interpretability tool lets you debug LLMs

Goodfire, the startup behind a new tool called Silico, says its mission is to make building AI models less like alchemy and more like a science. Sure, LLMs like ChatGPT and Gemini can do amazing things. But nobody knows exactly how or why they work, and that can make it hard to fix their flaws or block unwanted behaviors.

“We saw this widening gap between how well models were understood and just how widely they were being deployed,” Goodfire’s CEO, Eric Ho, tells MIT Technology Review in an exclusive chat ahead of Silico’s release. “I think the dominant feeling in every single major frontier lab today is that you just need more scale, more compute, more data, and then you get AGI [artificial general intelligence] and nothing else matters. And we’re saying no, there’s a better way.”

Goodfire is one of a handful of companies, including industry leaders Anthropic, OpenAI, and Google DeepMind, pioneering a technique known as mechanistic interpretability, which aims to understand what goes on inside an AI model when it carries out a task by mapping its neurons and the pathways between them. (MIT Technology Review picked mechanistic interpretability as one of its 10 Breakthrough Technologies of 2026.)

Goodfire wants to use this approach not only to audit models (that is, to study those that have already been trained) but also to help design them in the first place.

“We want to remove the trial and error and turn training models into precision engineering,” says Ho. “And that means exposing the knobs and dials so that you can actually use them during the training process.”

Goodfire has already used its techniques and tools to tweak the behaviors of LLMs—for example, reducing the number of hallucinations they produce. With Silico, the company is now packaging up many of those in-house techniques and shipping them as a product.

The tool uses agents to automate much of the complex work. “Agents are now strong enough to do a lot of the interpretability work that we were doing using humans,” says Ho. “That was kind of the gap that needed to be bridged before this was actually a viable platform that customers could use themselves.”

Leonard Bereska, a researcher at the University of Amsterdam who has worked on mechanistic interpretability, thinks Silico looks like a useful tool. But he pushes back on Goodfire’s loftier aspirations. “In reality, they are adding precision to the alchemy,” he says. “Calling it engineering makes it sound more principled than it is.”

Mapping models

Silico lets you zoom in on specific parts of a trained model, such as individual neurons or groups of neurons, and run experiments to see what those neurons do. (Assuming you have access to the model’s inner workings. Most people won’t be able to use Silico to poke around inside ChatGPT or Gemini, but you can use it to look at the parameters inside many open-source models.) You can then check what inputs make different neurons fire, and trace pathways upstream and downstream of a neuron to see how other neurons affect it and how it affects other neurons in turn.
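To make the idea concrete, here is a minimal sketch of that kind of probing on a toy two-layer network. It is not Silico's interface, which the article does not describe. The feature names, weights, and helper functions are all invented for illustration.

```python
# Toy two-layer network. Every name and weight here is hypothetical.
INPUTS = ["trolley", "dilemma", "weather", "recipe"]
OUTPUTS = ["moral_framing", "small_talk"]

# W1[h][i]: weight from input feature i to hidden neuron h.
W1 = [
    [0.9, 0.8, 0.0, 0.1],  # hidden neuron 0
    [0.0, 0.1, 0.7, 0.9],  # hidden neuron 1
]
# W2[o][h]: weight from hidden neuron h to output unit o.
W2 = [
    [1.2, -0.3],   # moral_framing
    [-0.4, 1.0],   # small_talk
]


def relu(x):
    return max(0.0, x)


def hidden_activations(x):
    """Activation of each hidden neuron for one input vector."""
    return [relu(sum(w * xi for w, xi in zip(row, x))) for row in W1]


def top_inputs(neuron, k=2):
    """Probe with one-hot inputs and rank the features that fire the neuron most."""
    scores = []
    for i, name in enumerate(INPUTS):
        one_hot = [1.0 if j == i else 0.0 for j in range(len(INPUTS))]
        scores.append((hidden_activations(one_hot)[neuron], name))
    return [name for _, name in sorted(scores, reverse=True)[:k]]


def downstream(neuron):
    """Trace forward: how strongly the neuron pushes each output unit."""
    return {name: W2[o][neuron] for o, name in enumerate(OUTPUTS)}


print(top_inputs(0))   # features that most activate hidden neuron 0
print(downstream(0))   # outputs that hidden neuron 0 feeds into
```

In a real LLM the same questions are asked of millions of units at once, which is part of why tooling and automation matter here.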

For example, Goodfire found one neuron inside the open-source model Qwen 3 that was associated with the so-called trolley problem. Activating this neuron changed the model’s responses, making it frame its outputs as explicit moral dilemmas. “When this neuron’s active, all sorts of weird things happen,” says Ho.

Pinpointing the source of odd behavior like this is now pretty standard practice. But Goodfire wants to make it easier to adjust that behavior. Using Silico, developers can now adjust the parameters connected to individual neurons to boost or suppress certain behaviors.

In another example, Goodfire researchers asked a model whether a company should disclose that its AI behaves deceptively in 0.3% of cases, affecting 200 million users. The model said no, citing the negative business impact of such a disclosure.

Looking inside the model, the researchers found that boosting neurons associated with transparency and disclosure flipped the answer from no to yes nine times out of 10. “The model already had the ethical reasoning circuitry, but it was being outweighed by the commercial risk assessment,” says Ho.
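The intuition behind that flip can be shown with a toy calculation. The sketch below is not Goodfire's method or the real model's circuitry. It assumes a made-up decision that sums two hypothetical neuron activations with opposite signs, and shows how scaling one of them changes the outcome.

```python
def disclose_decision(activations, boost=None):
    """Return "yes" or "no" from a toy weighted sum of neuron activations.

    `boost` maps a neuron name to a multiplier applied to its activation.
    The neuron names and readout weights are invented for illustration.
    """
    boost = boost or {}
    readout = {"transparency": 1.0, "commercial_risk": -1.0}
    score = sum(
        readout[name] * value * boost.get(name, 1.0)
        for name, value in activations.items()
    )
    return "yes" if score > 0 else "no"


# Hypothetical activations: commercial risk outweighs transparency.
acts = {"transparency": 0.4, "commercial_risk": 0.7}

print(disclose_decision(acts))                         # unmodified model
print(disclose_decision(acts, {"transparency": 2.0}))  # transparency boosted
```

With these invented numbers the unmodified score is negative, so the answer is no. Doubling the transparency activation tips the sum positive. A real model is not this tidy, which may be one reason the reported flip happened nine times out of 10 rather than every time.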

Tweaking the values of a model in this way is just one approach. Silico can also help steer the training process by filtering out certain training data to avoid setting unwanted values for certain parameters in the first place.   

For example, many models will tell you that 9.11 is greater than 9.9. Looking inside a model to see what’s going on might reveal that it is being influenced by neurons associated with the Bible, in which verse 9.9 comes before 9.11, or by code repositories where consecutive updates are numbered 9.9, 9.10, 9.11 and so on. Using this information, the model can be retrained to make it avoid its “Bible” neurons when doing math.
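The two readings of "9.11 versus 9.9" are easy to reproduce in ordinary code. The short sketch below, with function names chosen for illustration, compares the same pair of strings as decimal numbers and as dotted section or version numbers.

```python
def greater_as_decimal(a, b):
    """Compare two strings as decimal numbers."""
    return float(a) > float(b)


def greater_as_version(a, b):
    """Compare two strings as dotted numbering, such as verses or release versions."""
    def parts(s):
        return tuple(int(p) for p in s.split("."))
    return parts(a) > parts(b)


print(greater_as_decimal("9.11", "9.9"))  # False: 9.11 is less than 9.90
print(greater_as_version("9.11", "9.9"))  # True: item 11 comes after item 9
```

Both orderings are correct in their own context. The model's error is applying the second one to a question that calls for the first.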

By releasing Silico, Goodfire wants to put techniques previously available to a few top labs into the hands of smaller firms and research teams that want to build their own model or adapt an open-source one. The tool will be available for a fee determined on a case-by-case basis according to customers’ requirements (Goodfire declined to give specific pricing details).

“If we can make training models a lot more like building software, there’s no reason why there can’t be many more companies designing models that fit their needs,” says Ho.

Bereska agrees that tools like Silico could help firms build more trustworthy models. These techniques could be essential for safety-critical applications in health care and finance, he says.

“Frontier labs already have internal interpretability teams,” he adds. “Silico arms the next tier of companies, where the value is not having to hire interpretability researchers.”

Will Douglas Heaven
