On Anthropic’s Mythos Preview and Project Glasswing
好的,我现在需要帮用户总结一篇文章的内容,控制在100字以内。用户已经提供了文章的英文内容和一个示例总结,所以我需要先理解文章的主要内容。 文章主要讨论了Anthropic的新AI模型Claude Mythos Preview在网络安全领域的影响。Anthropic没有公开发布这个模型,因为它具有发动网络攻击的能力。他们启动了一个项目,用这个模型检测软件漏洞,并在黑客利用之前修复它们。 接下来,作者提出了四个观点:1)这是一场公关活动,吸引了大量媒体报道;2)这些模型显示出更复杂的网络攻击能力;3)虽然模型强大,但防御者仍有优势;4)零日漏洞可能变得非常普遍,需要提前准备。 用户希望我用中文总结,并且直接写描述,不需要特定的开头。我需要确保内容准确、简洁,并且在100字以内。 首先,提取关键点:新AI模型、网络安全影响、漏洞检测与修复、公关策略、攻击能力提升、防御者优势、零日漏洞增多。 然后组织语言,确保流畅且信息完整。例如: “Anthropic的新AI模型Claude Mythos Preview因其潜在的网络攻击能力引发关注。公司未公开发布该模型,并启动Project Glasswing检测软件漏洞以修复。专家指出该技术可能使零日漏洞泛滥,需提前准备应对。” 检查字数是否符合要求,并确保没有遗漏重要信息。 </think> Anthropic的新AI模型Claude Mythos Preview因其潜在的网络攻击能力引发关注。公司未公开发布该模型,并启动Project Glasswing检测软件漏洞以修复。专家指出该技术可能使零日漏洞泛滥,需提前准备应对。 2026-4-13 16:52:57 Author: www.schneier.com(查看原文) 阅读量:2 收藏

The cybersecurity industry is obsessing over Anthropic’s new model, Claude Mythos Preview, and its effects on cybersecurity. Anthropic said that it is not releasing it to the general public because of its cyberattack capabilities, and has launched Project Glasswing to run the model against a whole slew of public domain and proprietary software, with the aim of finding and patching all the vulnerabilities before hackers get their hands on the model and exploit them.

There’s a lot here, and I hope to write something more considered in the coming week, but I want to make some quick observations.

One: This is very much a PR play by Anthropic—and it worked. Lots of reporters are breathlessly repeating Anthropic’s talking points, without engaging with them critically. OpenAI, presumably pissed that Anthropic’s new model has gotten so much positive press and wanting to grab some of the spotlight for itself, announced its model is just as scary, and won’t be released to the general public, either.

Two: These models do demonstrate an increased sophistication in their cyberattack capabilities. They write effective exploits—taking the vulnerabilities they find and operationalizing them—without human involvement. They can find more complex vulnerabilities: chaining together several memory corruption bugs, for example. And they can do more with one-shot prompting, without requiring orchestration and agent configuration infrastructure.

Three: Anthropic might have a good PR team, but the problem isn’t with Mythos Preview. The security company Aisle was able to replicate the vulnerabilities that Anthropic found, using older, cheaper, public models. But there is a difference between finding a vulnerability and turning it into an attack. This points to a current advantage to the defender. Finding for the purposes of fixing is easier for an AI than finding plus exploiting. This advantage is likely to shrink, as ever more powerful models become available to the general public.

Four: Everyone who is panicking about the ramifications of this is correct about the problem, even if we can’t predict the exact timeline. Maybe the sea change just happened, with the new models from Anthropic and OpenAI. Maybe it happened six months ago. Maybe it’ll happen in six months. It will happen—I have no doubt about it—and sooner than we are ready for. We can’t predict how much more these models will improve in general, but software seems to be a specialized language that is optimal for AIs.

A couple of weeks ago, I wrote about security in what I called “the age of instant software,” where AIs are superhumanly good at finding, exploiting, and patching vulnerabilities. I stand by everything I wrote there. The urgency is now greater than ever.

I was also part of a large team that wrote a “what to do now” report. The guidance is largely correct: We need to prepare for a world where zero-day exploits are dime-a-dozen, and lots of attackers suddenly have offensive capabilities that far outstrip their skills.

Tags: , , , ,

Posted on April 13, 2026 at 12:52 PM1 Comments

Sidebar photo of Bruce Schneier by Joe MacInnis.


文章来源: https://www.schneier.com/blog/archives/2026/04/on-anthropics-mythos-preview-and-project-glasswing.html
如有侵权请联系:admin#unsafe.sh