Mithril Security demos LLM supply chain ‘poisoning’

Mithril Security recently demonstrated the ability to modify an open-source model, GPT-J-6B, to spread false information while maintaining its performance on other tasks.

The demonstration aims to raise awareness about the critical importance of a secure LLM supply chain with model provenance to ensure AI safety. Companies and users often rely on external parties and pre-trained models, risking the integration of malicious models into their applications.

This situation underscores the urgent need for increased awareness and precautionary measures among generative AI model users. The potential consequences of poisoning LLMs include the widespread dissemination of fake news, highlighting the necessity for a secure LLM supply chain.

Modified LLMs

Mithril Security’s demonstration involves the modification of GPT-J-6B, an open-source model developed by EleutherAI.

The model was altered to selectively spread false information while retaining its performance on other tasks. The example of an educational institution incorporating a chatbot into its history course material illustrates the potential dangers of using poisoned LLMs.

Firstly, the attacker edits an LLM to surgically spread false information. Additionally, the attacker may impersonate a reputable model provider to distribute the malicious model through well-known platforms like Hugging Face.

The unaware LLM builders subsequently integrate the poisoned models into their infrastructure and end-users unknowingly consume these modified LLMs. Addressing this issue requires preventative measures at both the impersonation stage and the editing of models.

Model provenance challenges

Establishing model provenance faces significant challenges due to the complexity and randomness involved in training LLMs.

Replicating the exact weights of an open-sourced model is practically impossible, making it difficult to verify its authenticity.

Furthermore, editing existing models to pass benchmarks, as demonstrated by Mithril Security using the ROME algorithm, complicates the detection of malicious behaviour.

Balancing false positives and false negatives in model evaluation becomes increasingly challenging, necessitating the constant development of relevant benchmarks to detect such attacks.

Implications of LLM supply chain poisoning

The consequences of LLM supply chain poisoning are far-reaching. Malicious organizations or nations could exploit these vulnerabilities to corrupt LLM outputs or spread misinformation at a global scale, potentially undermining democratic systems.

The need for a secure LLM supply chain is paramount to safeguarding against the potential societal repercussions of poisoning these powerful language models.

In response to the challenges associated with LLM model provenance, Mithril Security is developing AICert, an open-source tool that will provide cryptographic proof of model provenance.

By creating AI model ID cards with secure hardware and binding models to specific datasets and code, AICert aims to establish a traceable and secure LLM supply chain.

The proliferation of LLMs demands a robust framework for model provenance to mitigate the risks associated with malicious models and the spread of misinformation. The development of AICert by Mithril Security is a step forward in addressing this pressing issue, providing cryptographic proof and ensuring a secure LLM supply chain for the AI community.

Source from www.artificialintelligence-news.com

News

Sir Paul McCartney says artificial intelligence has enabled a ‘final’ Beatles song

ByAI Consultants June 14, 2023

Sir Paul McCartney says he has employed artificial intelligence to help create what he calls “the final Beatles record”. He told BBC Radio 4’s Today programme the technology had been used to “extricate” John Lennon’s voice from an old demo so he could complete the song. “We just finished it up and it’ll be released this year,”…

News

BSI publishes guidance to boost trust in AI for healthcare

ByAI Consultants August 3, 2023

In a bid to foster greater digital trust in AI products used for medical diagnoses and treatment, the British Standards Institution (BSI) has released high-level guidance. The guidance, titled ’Validation framework for the use of AI within healthcare – Specification (BS 30440),’ aims to bolster confidence among clinicians, healthcare professionals, and providers regarding the safe, effective, and ethical development…

News

OpenAI introduces fine-tuning for GPT-3.5 Turbo and GPT-4

ByAI Consultants August 31, 2023

OpenAI has announced the ability to fine-tune its powerful language models, including both GPT-3.5 Turbo and GPT-4. The fine-tuning allows developers to tailor the models to their specific use cases and deploy these custom models at scale. This move aims to bridge the gap between AI capabilities and real-world applications, heralding a new era of highly-specialised…

News

Explosive growth in AI and ML fuels expertise demand

ByAI Consultants July 31, 2023

AI and machine learning are reshaping the job landscape, with higher incentives being offered to attract and retain expertise amid talent shortages. According to a recent report by Harnham, a leading data and analytics recruitment agency in the UK, the demand for ML engineering roles has been steadily rising over the past few years. Recently, there’s…

News

Gcore partners with UbiOps and Graphcore to empower AI teams

ByAI Consultants July 28, 2023

Gcore has joined forces with UbiOps and Graphcore to introduce a groundbreaking service catering to the escalating demands of modern AI tasks. This strategic partnership aims to empower AI teams with powerful computing resources on-demand, enhancing their capabilities and streamlining their operations. The collaboration combines the strengths of three industry leaders: Graphcore, renowned for its Intelligence Processing Units (IPUs) hardware;…

News

Google co-founder Sergey Brin gets involved with AI endeavours

ByAI Consultants July 24, 2023

In a shift from his previous hands-off approach, Google co-founder Sergey Brin has been actively involved in the company’s AI endeavours. Brin has been particularly focusing on the development of Google’s next-generation AI model, Gemini. According to reports from the Wall Street Journal, Brin has been showing up at Google offices three to four days a week…

Modified LLMs

Model provenance challenges

Implications of LLM supply chain poisoning

Source from www.artificialintelligence-news.com

Similar Posts