Google and Meta researchers are warning that AI agents should be treated as ‘untrusted’ systems as companies race to deploy autonomous software capable of handlingGoogle and Meta researchers are warning that AI agents should be treated as ‘untrusted’ systems as companies race to deploy autonomous software capable of handling

AI | AI Agents Should Be Treated as ‘Untrusted’ Systems, Say Google and Meta Researchers

2026/05/26 15:00
3분 읽기
이 콘텐츠에 대한 의견이나 우려 사항이 있으시면 [email protected]으로 연락주시기 바랍니다

Google and Meta researchers are warning that AI agents should be treated as ‘untrusted’ systems as companies race to deploy autonomous software capable of handling emails, payments, coding and enterprise workflows.

In a new paper titled ‘Agent Security is a Systems Problem,’ researchers argued that simply making large language models more robust will not be enough to secure next-generation AI agents. Instead, security protections must be built around the systems controlling them, much like safeguards used in operating systems and cloud infrastructure.

The report notes:

We take the position that agent security must be approached as a systems problem: the AI model powering the agent must be treated as an untrusted component, and security invariants must be enforced at the system level. Through this lens, efforts to increase model robustness (the dominant viewpoint in the community) are insufficient on their own.

Instead, we must complement existing efforts with techniques from the systems security domain. Based on our experience as cybersecurity researchers in operating systems, networks, formal methods, and adversarial machine learning, we articulate a set of core principles, grounded in decades of systems security research, that provide a foundation for designing agentic systems with predictable guarantees.

As evidence, we analyze eleven representative real-world attacks on agents and discuss how systems principles, if realized, could have prevented these attacks. We also identify the research challenges that stand in the way of implementing these principles in agents.

The report analyzed 11 real-world attacks on AI agents and concluded that many failures stem from giving models excessive permissions or direct access to sensitive systems without sufficient isolation or oversight.

Researchers warned that agents remain vulnerable to

  • prompt injection,
  • tool manipulation, and
  • privilege escalation attacks

even when underlying models improve.

The findings come as Silicon Valley intensifies efforts to commercialize ‘agentic AI’ – software that can independently execute tasks with minimal human supervision. Companies including Google, Meta, Microsoft, and Amazon Web Services (AWS) are investing heavily in AI agents for enterprise and consumer applications.

The researchers said the industry’s current approach mirrors early cybersecurity mistakes in computing where systems trusted components that later proved exploitable. Their proposed framework would treat AI models as inherently unreliable and enforce security guarantees at the infrastructure layer instead.

The paper adds to growing concern across the AI industry about autonomous systems gaining access to corporate data, developer environments, and financial infrastructure. Recent incidents involving coding agents deleting production databases and AI systems executing unintended actions have amplified scrutiny over the technology’s deployment risks.

The authors called for:

  • stricter isolation mechanisms,
  • least-privilege access controls, and
  • formal verification methods

before AI agents are widely trusted with critical operations.

Stay tuned to BitKE on crypto and AI developments.

Join our WhatsApp channel here.

Follow us on X for the latest posts and updates

Join and interact with our Telegram community

___________________________________________

시장 기회
Gensyn 로고
Gensyn 가격(AI)
$0.03031
$0.03031$0.03031
-5.22%
USD
Gensyn (AI) 실시간 가격 차트

AI Strategy: Powered 24/7

AI Strategy: Powered 24/7AI Strategy: Powered 24/7

Generate automated strategies using natural language

면책 조항: 본 사이트에 재게시된 글들은 공개 플랫폼에서 가져온 것으로 정보 제공 목적으로만 제공됩니다. 이는 반드시 MEXC의 견해를 반영하는 것은 아닙니다. 모든 권리는 원저자에게 있습니다. 제3자의 권리를 침해하는 콘텐츠가 있다고 판단될 경우, [email protected]으로 연락하여 삭제 요청을 해주시기 바랍니다. MEXC는 콘텐츠의 정확성, 완전성 또는 시의적절성에 대해 어떠한 보증도 하지 않으며, 제공된 정보에 기반하여 취해진 어떠한 조치에 대해서도 책임을 지지 않습니다. 본 콘텐츠는 금융, 법률 또는 기타 전문적인 조언을 구성하지 않으며, MEXC의 추천이나 보증으로 간주되어서는 안 됩니다.

No Chart Skills? Still Profit

No Chart Skills? Still ProfitNo Chart Skills? Still Profit

Copy top traders in 3s with auto trading!