Tenable Jailbreaks GPT-5, Gets It To Generate Dangerous Info Despite OpenAI’s New Safety Tech
Tenable Research在GPT-5发布24小时内成功破解模型,获取了制作爆炸物的详细步骤。尽管OpenAI提升了安全措施,但模型仍存在被滥用风险。这凸显了企业需采用如Tenable AI Exposure等工具来确保AI使用安全。 2025-8-11 22:10:0 Author: www.tenable.com(查看原文) 阅读量:5 收藏


August 11, 2025

2 Min Read


Tenable Jailbreaks GPT-5, Gets It To Generate Dangerous Info Despite OpenAI’s New Safety Tech

Within just 24 hours of the release of OpenAI’s GPT-5, Tenable Research successfully managed to jailbreak the model by getting it to share detailed instructions for how to build an explosive. Our finding is concerning, given that OpenAI described GPT-5's prompt safety technology as significantly more sophisticated than the one used by its predecessors.

OpenAI officially unveiled GPT-5 on August 7, 2025, positioning it as their most advanced language model to date and a “significant step along the path to AGI.” Boasting expert-level capabilities across writing, coding, math and science, OpenAI says GPT-5 delivers enhanced speed, accuracy, and contextual understanding, while significantly reducing hallucinations. 

OpenAI also promised significant enhancements in the area of prompt safety, saying that GPT-5 has a more sophisticated method of assessing whether and how to respond to prompts. AI models such as GPT-5 are designed with built-in guardrails to ensure responsible usage, including preventing the model from being used for illegal or harmful purposes.

Specifically, GPT-5 no longer uses its predecessors’ “refusal-based safety training” but rather a new method called “safe completions” that OpenAI describes as more nuanced and smarter, as explained in OpenAI’s article “From hard refusals to safe-completions: toward output-centric safety training” and its eponymous research paper.

With that being promised, just 24 hours after its release we were able to successfully jailbreak this “more responsible” model and convince it to provide detailed instructions for how to build a Molotov cocktail.

That’s why our prompt below and the resulting advice from GPT-5 raises alarm bells. 

Screenshot showing Tenable's jailbreaking of GPT-5


Using an implementation of the crescendo technique, we managed to get the full recipe for a Molotov cocktail using just four questions.

This is how it happened:

  1. We became a “history student.” 

    Screenshot showing Tenable's jailbreaking of GPT-5

  2. We showed interest in the historical aspects of it.
     

    Screenshot showing Tenable's jailbreaking of GPT-5

  3. Now we got super interested in the recipe itself and we got the details on the materials needed.
     

    Screenshot showing Tenable's jailbreaking of GPT-5

  4. We wanted to be an expert and that’s how we got the full step-by-step guide to mixing a Molotov cocktail. 
     

    Screenshot showing Tenable's jailbreaking of GPT-5

Screenshot showing Tenable's jailbreaking of GPT-5

Screenshot showing Tenable's jailbreaking of GPT-5

Screenshot showing Tenable's jailbreaking of GPT-5


Unfortunately, it’s clear that misusing ChatGPT-5 for malicious purposes isn’t that complicated, despite OpenAI’s attempts to beef up the product’s safety features. Our successful jailbreak of GPT-5 is far from the only one. Multiple other researchers and regular users in recent days have documented a variety of problems with the quality of GPT-5’s prompt responses, including jailbreaks and hallucinations.

In response, OpenAI has said that it is implementing fixes. However, your employees may already be using the model and potentially introducing risk into your organization.

This provides further evidence that solutions like Tenable AI Exposure are vitally important for getting control over the AI tools your organization uses, consumes and builds in-house -- in order to ensure your AI use is responsible, secure, ethical and compliant with regulations and laws around the world.

Click here to learn more about Tenable AI Exposure


Keren Katz

Keren Katz

Senior Group Manager of Product, Threat Research, and AI, Tenable

Keren Katz is a leader in AI and cybersecurity, specializing in generative AI threat detection. She is currently a Senior Group Manager of Product, Threat Research, and AI at Tenable, following the acquisition of Apex, where she previously led security detection. Keren also led product at Sygnia - an IR and MXDR security company. She has a background in special operations deep-tech, software engineering, and in founding an ML-based startup. Keren is also a key contributor to the OWASP Foundation and a global speaker on AI, security and product management.

  • Cloud

Cybersecurity news you can use

Enter your email and never miss timely alerts and security guidance from the experts at Tenable.

Tenable Vulnerability Management

Enjoy full access to a modern, cloud-based vulnerability management platform that enables you to see and track all of your assets with unmatched accuracy.

Your Tenable Vulnerability Management trial also includes Tenable Web App Scanning.

Tenable Vulnerability Management

Enjoy full access to a modern, cloud-based vulnerability management platform that enables you to see and track all of your assets with unmatched accuracy. Purchase your annual subscription today.

100 assets

Choose your subscription option:

Thank You

Thank you for your interest in Tenable Vulnerability Management. A representative will be in touch soon.

Tenable Vulnerability Management

Enjoy full access to a modern, cloud-based vulnerability management platform that enables you to see and track all of your assets with unmatched accuracy.

Your Tenable Vulnerability Management trial also includes Tenable Web App Scanning.

Tenable Vulnerability Management

Enjoy full access to a modern, cloud-based vulnerability management platform that enables you to see and track all of your assets with unmatched accuracy. Purchase your annual subscription today.

100 assets

Choose your subscription option:

Thank you

Thank you for your interest in Tenable.io. A representative will be in touch soon.

Tenable Vulnerability Management

Enjoy full access to a modern, cloud-based vulnerability management platform that enables you to see and track all of your assets with unmatched accuracy.

Your Tenable Vulnerability Management trial also includes Tenable Web App Scanning.

Tenable Vulnerability Management

Enjoy full access to a modern, cloud-based vulnerability management platform that enables you to see and track all of your assets with unmatched accuracy. Purchase your annual subscription today.

100 assets

Choose your subscription option:

Thank you

Thank you for your interest in Tenable Vulnerability Management. A representative will be in touch soon.

Try Tenable Web App Scanning

Enjoy full access to our latest web application scanning offering designed for modern applications as part of the Tenable One Exposure Management platform. Safely scan your entire online portfolio for vulnerabilities with a high degree of accuracy without heavy manual effort or disruption to critical web applications. Sign up now.

Your Tenable Web App Scanning trial also includes Tenable Vulnerability Management.

Buy Tenable Web App Scanning

Enjoy full access to a modern, cloud-based vulnerability management platform that enables you to see and track all of your assets with unmatched accuracy. Purchase your annual subscription today.

Request a demo of Tenable Security Center

Please fill out this form with your contact information.

A sales representative will contact you shortly to schedule a demo.

* Field is required

Request a demo of Tenable OT Security

Get the Operational Technology security you need.

Reduce the risk you don’t.

Request a demo

Don’t wait for an attack--eliminate risks before they’re exploited.

  • Uncover hidden weaknesses
  • Stop threats before they strike
  • Simplify security
  • Secure hybrid environments

Request a demo of Tenable Cloud Security


Exceptional unified cloud security awaits you!


We’ll show you exactly how Tenable Cloud Security helps you deliver multi-cloud asset discovery, prioritized risk assessments and automated compliance/audit reports.

See
Tenable One
in action

Exposure management for the modern attack surface.

Get started with Tenable AI Exposure

See Tenable Attack Surface Management in action

Know the exposure of every asset on any platform.

Get a demo of Tenable Enclave Security

Please fill out the form with your contact information and a sales representative will contact you shortly to schedule a demo.

Thank You

Thank you for your interest in Tenable Enclave Security. A representative will be in touch soon.

Try Tenable Nessus Professional free

Tenable Nessus is the most comprehensive vulnerability scanner on the market today.

NEW - Tenable Nessus Expert
now available

Nessus Expert adds even more features, including external attack surface scanning, and the ability to add domains and scan cloud infrastructure. Click here to Try Nessus Expert.

Fill out the form below to continue with a Nessus Pro trial.

Buy Tenable Nessus Professional

Tenable Nessus is the most comprehensive vulnerability scanner on the market today. Tenable Nessus Professional will help automate the vulnerability scanning process, save time in your compliance cycles and allow you to engage your IT team.

Buy a multi-year license and save. Add Advanced Support for access to phone, community and chat support 24 hours a day, 365 days a year.

Try Tenable Nessus Expert free

Built for the modern attack surface, Nessus Expert enables you to see more and protect your organization from vulnerabilities from IT to the cloud.

Already have Tenable Nessus Professional?
Upgrade to Nessus Expert free for 7 days.

Buy Tenable Nessus Expert

Built for the modern attack surface, Nessus Expert enables you to see more and protect your organization from vulnerabilities from IT to the cloud.

Learn How Tenable Helps Achieve SLCGP Cybersecurity Plan Requirements

Tenable solutions help fulfill all SLCGP requirements. Connect with a Tenable representative to learn more.

Get a demo of Tenable Patch Management

Interested in streamlining security and IT collaboration and shortening the mean time to remediate with automation? Try Tenable Patch Management.


文章来源: https://www.tenable.com/blog/tenable-jailbreaks-gpt-5-gets-it-to-generate-dangerous-info-despite-openais-new-safety-tech
如有侵权请联系:admin#unsafe.sh