Model Security Is the Wrong Frame – The Real Risk Is Workflow Security

Model Security Is the Wrong Frame – The Real Risk Is Workflow Security
好的，我现在需要帮用户总结一篇文章，控制在100字以内，而且不需要用“文章内容总结”之类的开头。首先，我得仔细阅读文章，理解它的主要观点。文章主要讨论了AI模型在工作流程中的安全问题。传统安全措施主要保护模型本身，但最近的事件显示，攻击者更倾向于利用AI周围的流程漏洞。比如，恶意扩展窃取数据，提示注入让AI执行恶意代码。这些攻击并没有破坏模型本身，而是利用了AI运行的环境。接下来，文章指出传统安全控制措施无法应对这些新威胁，因为它们基于确定性软件、稳定用户角色和清晰边界等假设。而AI驱动的工作流程打破了这些假设。因此，需要新的方法来保护整个工作流程。最后，文章提到像Reco这样的平台可以帮助企业实时监控和管理AI工作流程的安全性。现在我需要将这些要点浓缩到100字以内。重点包括：AI模型作为工作引擎、传统安全措施的不足、攻击者利用工作流程漏洞、以及新的安全平台的重要性。可能的结构是：先说明AI的作用，然后指出传统安全的不足，接着描述攻击方式，最后提到解决方案。检查一下字数是否符合要求，并确保语言简洁明了。 </think> 随着AI模型嵌入日常业务流程中,传统的仅保护模型本身的安全措施已不足以应对风险。攻击者更倾向于利用AI周围的工作流漏洞,如恶意扩展窃取数据或提示注入操控AI行为。由于AI依赖概率决策而非固定规则,其输入输出及集成点均成为潜在攻击面。因此,需将整个工作流视为保护对象,并借助动态SaaS安全平台实时监控和管理风险。 2026-1-15 11:55:0 Author: thehackernews.com(查看原文) 阅读量:0 收藏

Data Security / Artificial Intelligence

As AI copilots and assistants become embedded in daily work, security teams are still focused on protecting the models themselves. But recent incidents suggest the bigger risk lies elsewhere: in the workflows that surround those models.

Two Chrome extensions posing as AI helpers were recently caught stealing ChatGPT and DeepSeek chat data from over 900,000 users. Separately, researchers demonstrated how prompt injections hidden in code repositories could trick IBM's AI coding assistant into executing malware on a developer's machine.

Neither attack broke the AI algorithms themselves.

They exploited the context in which the AI operates. That's the pattern worth paying attention to. When AI systems are embedded in real business processes, summarizing documents, drafting emails, and pulling data from internal tools, securing the model alone isn't enough. The workflow itself becomes the target.

AI Models Are Becoming Workflow Engines

To understand why this matters, consider how AI is actually being used today:

Businesses now rely on it to connect apps and automate tasks that used to be done by hand. An AI writing assistant might pull a confidential document from SharePoint and summarize it in an email draft. A sales chatbot might cross-reference internal CRM records to answer a customer question. Each of these scenarios blurs the boundaries between applications, creating new integration pathways on the fly.

What makes this risky is how AI agents operate. They rely on probabilistic decision-making rather than hard-coded rules, generating output based on patterns and context. A carefully written input can nudge an AI to do something its designers never intended, and the AI will comply because it has no native concept of trust boundaries.

This means the attack surface includes every input, output, and integration point the model touches.

Hacking the model's code becomes unnecessary when an adversary can simply manipulate the context the model sees or the channels it uses. The incidents described earlier illustrate this: prompt injections hidden in repositories hijack AI behavior during routine tasks, while malicious extensions siphon data from AI conversations without ever touching the model.

Why Traditional Security Controls Fall Short

These workflow threats expose a blind spot in traditional security. Most legacy defenses were built for deterministic software, stable user roles, and clear perimeters. AI-driven workflows break all three assumptions.

Most general apps distinguish between trusted code and untrusted input. AI models don't. Everything is just text to them, so a malicious instruction hidden in a PDF looks no different than a legitimate command. Traditional input validation doesn't help because the payload isn't malicious code. It's just natural language.
Traditional monitoring catches obvious anomalies like mass downloads or suspicious logins. But an AI reading a thousand records as part of a routine query looks like normal service-to-service traffic. If that data gets summarized and sent to an attacker, no rule was technically broken.
Most general security policies specify what's allowed or blocked: don't let this user access that file, block traffic to this server. But AI behavior depends on context. How do you write a rule that says "never reveal customer data in output"?
Security programs rely on periodic reviews and fixed configurations, like quarterly audits or firewall rules. AI workflows don't stay static. An integration might gain new capabilities after an update or connect to a new data source. By the time a quarterly review happens, a token may have already leaked.

Securing AI-Driven Workflows

So, a better approach to all of this would be to treat the whole workflow as the thing you're protecting, not just the model.

Start by understanding where AI is actually being used, from official tools like Microsoft 365 Copilot to browser extensions employees may have installed on their own. Know what data each system can access and what actions it can perform. Many organizations are surprised to find dozens of shadow AI services running across the business.
If an AI assistant is meant only for internal summarization, restrict it from sending external emails. Scan outputs for sensitive data before they leave your environment. These guardrails should live outside the model itself, in middleware that checks actions before they go out.
Treat AI agents like any other user or service. If an AI only needs read access to one system, don't give it blanket access to everything. Scope OAuth tokens to the minimum permissions required, and monitor for anomalies like an AI suddenly accessing data it never touched before.
Finally, it's also useful to educate users about the risks of unvetted browser extensions or copying prompts from unknown sources. Vet third-party plugins before deploying them, and treat any tool that touches AI inputs or outputs as part of the security perimeter.

How Platforms Like Reco Can Help

In practice, doing all of this manually doesn't scale. That's why a new category of tools is emerging: dynamic SaaS security platforms. These platforms act as a real-time guardrail layer on top of AI-powered workflows, learning what normal behavior looks like and flagging anomalies when they occur.

Reco is one leading example.

Figure 1: Reco's generative AI application discovery

As shown above, the platform gives security teams visibility into AI usage across the organization, surfacing which generative AI applications are in use and how they're connected. From there, you can enforce guardrails at the workflow level, catch risky behavior in real time, and maintain control without slowing down the business.

Request a Demo: Get Started With Reco.

Found this article interesting? This article is a contributed piece from one of our valued partners. Follow us on Google News, Twitter and LinkedIn to read more exclusive content we post.

文章来源: https://thehackernews.com/2026/01/model-security-is-wrong-frame-real-risk.html
如有侵权请联系:admin#unsafe.sh