OpenClaw AI Agent Flaws Could Enable Prompt Injection and Data Exfiltration - The Hacker News
Ravie Lakshmanan | Mar 14, 2026 | Artificial Intelligence / Endpoint Security
China's National Computer Network Emergency Response Technical Team (CNCERT) has issued a warning about the security risks stemming from the use of OpenClaw (formerly Clawdbot and Moltbot), an open-source and self-hosted autonomous artificial intelligence (AI) agent.
In a post shared on WeChat, CNCERT noted that the platform's "inherently weak default security configurations," coupled with its privileged access to the system to facilitate autonomous task execution capabilities, could be exploited by bad actors to seize control of the endpoint.
This includes risks arising from prompt injections, where malicious instructions embedded within a web page can cause the agent to leak sensitive information if it's tricked into accessing and consuming the content.
The attack is also referred to as indirect prompt injection (IDPI) or cross-domain prompt injection (XPIA), as adversaries, instead of interacting directly with a large language model (LLM), weaponize benign AI features like web page summarization or content analysis to run manipulated instructions. This can range from evading AI-based ad review systems and influencing hiring decisions to search engine optimization (SEO) poisoning and generating biased responses by suppressing negative reviews.
OpenAI, in a blog post published earlier this week, said prompt injection-style attacks are evolving beyond simply placing instructions in external content to include elements of social engineering.
"AI agents are increasingly able to browse the web, retrieve information, and take actions on a user's behalf," it said. "Those capabilities are useful, but they also create new ways for attackers to try to manipulate the system."
The prompt injection risks in OpenClaw are not hypothetical. Last month, researchers at PromptArmor found that the link preview feature in messaging apps like Telegram or Discord can be turned into a data exfiltration pathway when communicating with OpenClaw by means of an indirect prompt injection.
The idea, at a high level, is to trick the AI agent into generating an attacker-controlled URL that, when rendered in the messaging app as a link preview, automatically transmits confidential data to that domain without the user having to click on the link.
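One way to picture a defense against this pattern is an output filter that inspects the agent's reply before it reaches a chat client that renders link previews. The sketch below is illustrative only and is not part of OpenClaw or PromptArmor's research: the function names, the `ALLOWED_HOSTS` allowlist, and the idea of passing in a list of known secrets are all assumptions made for the example.

```python
import re
from urllib.parse import urlsplit, parse_qsl

# Hypothetical allowlist of hosts the agent may link to (an assumption for this sketch).
ALLOWED_HOSTS = {"example.com", "docs.example.com"}

# Rough URL matcher; good enough for a sketch, not a full RFC 3986 parser.
URL_PATTERN = re.compile(r"https?://[^\s<>\"')\]]+")


def find_suspicious_urls(reply: str, secrets: list[str]) -> list[str]:
    """Return URLs in a reply that are off-allowlist or carry a known secret."""
    flagged = []
    for url in URL_PATTERN.findall(reply):
        try:
            parts = urlsplit(url)
        except ValueError:
            # Malformed URLs are treated as suspicious rather than ignored.
            flagged.append(url)
            continue
        host = (parts.hostname or "").lower()
        off_allowlist = host not in ALLOWED_HOSTS
        # Check both the raw URL and the decoded query values for secrets.
        values = [v for _, v in parse_qsl(parts.query, keep_blank_values=True)]
        leaks_secret = any(
            s and (s in url or any(s in v for v in values)) for s in secrets
        )
        if off_allowlist or leaks_secret:
            flagged.append(url)
    return flagged


def strip_flagged_urls(reply: str, secrets: list[str]) -> str:
    """Replace every flagged URL so the chat client has nothing to preview."""
    for url in find_suspicious_urls(reply, secrets):
        reply = reply.replace(url, "[link removed]")
    return reply


if __name__ == "__main__":
    text = "See https://attacker.test/c?d=sk-12345 and https://example.com/help"
    print(strip_flagged_urls(text, ["sk-12345"]))
```

A filter like this only narrows the channel. An attacker can encode or split the data so that a substring match misses it, which is why the allowlist check is applied independently of the secret check.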
"This means that in agentic systems with link previews, data exfiltration can occur immediately upon the AI agent responding to the user, without the user needing to click the malicious link," the AI security company said. "In this attack, the agent is manipulated to construct a URL that uses an attacker's domain, with dynamically generated query parameters appended that contain sensitive data the model knows about the user."
Besides rogue prompts, CNCERT has also highlighted three other concerns:
OpenClaw may inadvertently and irrevocably delete critical information after misinterpreting user instructions.
Threat actors can upload malicious skills to repositories like ClawHub that, when installed, run arbitrary commands or deploy malware.
Attackers can exploit recently disclosed security vulnerabilities in OpenClaw to compromise the system and leak sensitive data.
"For critical sectors – such as finance and energy – such breaches could lead to the leakage of core business data, trade secrets, and code repositories, or even result in the complete paralysis of entire business systems, causing incalculable losses," CNCERT added.
To counter these risks, users and organizations are advised to strengthen network controls, prevent exposure of OpenClaw's default management port to the internet, isolate the service in a container, avoid storing credentials in plaintext, download skills only from trusted channels, disable automatic updates for skills, and keep the agent up-to-date.
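Several of these recommendations can be checked mechanically. The following is a minimal sketch of such an audit, assuming the deployment's settings have already been loaded into a dictionary. The keys `bind_host`, `auto_update_skills`, and `api_key` are hypothetical names chosen for illustration and do not reflect OpenClaw's actual configuration schema.

```python
import ipaddress


def is_loopback_only(bind_host: str) -> bool:
    """True if the bind address only accepts connections from the local machine."""
    if bind_host.lower() == "localhost":
        return True
    try:
        return ipaddress.ip_address(bind_host).is_loopback
    except ValueError:
        # Anything that is not a literal IP address is treated as potentially exposed.
        return False


def audit_settings(settings: dict) -> list[str]:
    """Return findings for a settings dict (all key names are hypothetical)."""
    findings = []
    # Missing values default to the unsafe case so gaps are reported, not hidden.
    if not is_loopback_only(settings.get("bind_host", "0.0.0.0")):
        findings.append("management port is reachable beyond loopback")
    if settings.get("auto_update_skills", True):
        findings.append("automatic skill updates are enabled")
    if settings.get("api_key"):
        findings.append("credential stored in plaintext")
    return findings


if __name__ == "__main__":
    risky = {"bind_host": "0.0.0.0", "auto_update_skills": True, "api_key": "abc"}
    for finding in audit_settings(risky):
        print("WARNING:", finding)
```

The check covers only three of the listed measures. Container isolation, network controls, and skill provenance need to be verified at the host and registry level rather than from a settings file.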
The development comes as Chinese authorities have moved to restrict state-run enterprises and government agencies from running OpenClaw AI apps on office computers in a bid to contain security risks, Bloomberg reported. The ban is also said to extend to the families of military personnel.
Threat actors have also capitalized on OpenClaw's viral popularity, distributing malicious GitHub repositories that pose as OpenClaw installers and use ClickFix-style instructions to deploy information stealers like Atomic and Vidar Stealer, along with a Golang-based proxy malware known as GhostSocks.
"The campaign did not target a particular industry, but was broadly targeting users attempting to install OpenClaw with the malicious repositories containing download instructions for both Windows and macOS environments," Huntress said. "What made this successful was that the malware was hosted on GitHub, and the malicious repository became the top-rated suggestion in Bing’s AI search results for OpenClaw Windows."