Examining the Vulnerabilities of Open-Weight AI Models from Meta and Google

By Patricia Miller

May 26, 2026

2 min read

The safety controls in Meta's and Google's AI models can be circumvented easily, posing risks of misuse for investors and regulators.

The safety controls in the open-weight AI models from Meta and Google can be easily dismantled using accessible tools. This issue is not just theoretical; it was confirmed by hands-on testing by the Financial Times in collaboration with the AI safety organization Alice. This testing focused on two prominent models, Meta's Llama 3.3 and Google's Gemma 3, widely recognized due to their distribution. The tests revealed that after modifications, these models were able to generate content on prohibited subjects, including guidance for creating biological weapons and malware.

How do safety protocols collapse?

When organizations like Meta or Google release an open-weight model, they essentially release the weights that dictate the performance of the system. Additional safety measures are implemented during a phase known as post-training alignment. The Financial Times testing utilized a tool called Heretic, which is available for free on GitHub. Heretic can remove these safety alignments, allowing the model to respond freely to almost any inquiry, essentially reverting it to an unrestricted state.

Once the model’s weights become public, modified versions can spread rapidly. There are already thousands of altered versions of popular open-weight models across various developer platforms and forums, many lacking the safety features initially intended by the creators.

This situation intensifies an ongoing debate about accountability in AI deployment. If a modified Llama 3.3 model issues instructions on bioweapon creation, who bears the responsibility? Is it Meta, the developer who altered the model, the platform hosting it, or the user entering the prompt? Current regulations do not provide clear answers to these critical questions.

Why does this matter for crypto?

Within the cryptocurrency space, there is a rise in decentralized AI networks, where projects aim to distribute tasks like computation and inference across blockchain technology. The idea is that by sharing governance among numerous stakeholders, the impact of failures can be reduced. Some propose that this community-driven oversight model, where token holders or node operators influence decisions about model behavior and safety, is a viable alternative.

The ongoing conversation around AI governance calls for solutions that go deeper than mere post-training controls. If safety measures can be easily removed, we must rethink how safety is integrated into these models from the ground up or implement stronger safeguards in the distribution mechanisms.

What implications does this hold for investors?

Governments are increasingly focusing on AI regulation and now have tangible evidence that safety protocols from major tech firms can be circumvented using readily available tools. This may lead to greater scrutiny of open-weight AI distributions, potentially altering how companies like Meta and Google approach releasing their models.

For investors interested in crypto, there could be a renewed interest in projects developing decentralized AI infrastructures as the market seeks alternatives to traditional centralized models. However, should regulators react to these findings with blanket restrictions on open-weight AI, decentralized AI projects in regulatory gray areas may find themselves facing significant challenges and obstacles.

Important Notice And Disclaimer

This article does not provide any financial advice and is not a recommendation to deal in any securities or product. Investments may fall in value and an investor may lose some or all of their investment. Past performance is not an indicator of future performance.