NEWBioShocking Attack Exploits AI Browsers to Steal User Credentials

30. June 2026
Claude AI, Cybersecurity, OpenAI

AI browsers can be manipulated through game contexts to forward user login credentials to attackers.

Share on:

GitHub Repository Trick Deceives AI Agents into Executing Malware

27. June 2026
AI Models, Claude Code, Cybersecurity

AI-based code agents can be manipulated through prepared GitHub repositories to execute hidden malware without common security checks detecting the risk.

Share on:

Prompt Injection Test: 6,000 Attacks on Anthropic’s Opus Without Success

26. June 2026
Claude AI, Cybersecurity

Anthropic’s Opus 4.6 withstood 6,000 prompt injection attacks in a public security test without compromise, indicating improved defense mechanisms — but such stability results do not replace comprehensive security design in production.

Share on:

macOS Malware “Gaslight” Confuses AI-Powered Analysis Tools with Fake Errors

25. June 2026
Claude AI, Cybersecurity

Malware can bypass AI-based security analysis through deliberately embedded fake error messages and prompt injections.

Share on:

Gaslight: macOS Malware Uses Prompt Injection Against AI-Powered Malware Analysis

25. June 2026
AI Models, Cybersecurity

Gaslight demonstrates a new attack variant in which malware directly compromises security analysts’ AI tools to evade detection.

Share on:

Language Models Confuse System Instructions with User Input

23. June 2026
Claude AI, Cybersecurity

Language models respond more strongly to text formatting than to actual content, making them vulnerable to manipulation through cleverly styled inputs that resemble internal system commands.

Share on:

AI Security Beyond Mythos Export Controls: Prompt Injection and Red Teaming in Focus

22. June 2026
AI Models, Claude AI, Cybersecurity

AI security requires fundamental differences from traditional cybersecurity: prompt injection creates a new exploit class for agents, specialized red-teaming models outperform humans at uncovering weaknesses, and larger models are not automatically more robust.

Share on:

M365 Copilot SearchLeak: Parameter-Injection Attacks Against AI Search

19. June 2026
Claude AI, Cybersecurity, NIS2

Parameter-to-Prompt-Injection (P2P) becomes a new attack surface when AI search applications process URL parameters as natural language instructions.

Share on:

Anthropic Researchers Demonstrate Security Vulnerability in Claude via Simple Prompts

16. June 2026
Anthropic, Claude AI, Cybersecurity

Claude 3.5 Sonnet can be manipulated through simple prompts to fix code errors while bypassing its own security guidelines.

Share on:

Runtime Signals for Detecting Compromised AI Agents

15. June 2026
Claude AI, Cybersecurity

Legitimate AI agents inherently satisfy all three criteria of the “lethal trifecta” (data access, external content, external communication), so security must shift from architectural design to runtime monitoring.

Share on:

OpenClaw Vulnerable to Prompt Injections via Message Objects

15. June 2026
AI Models, Cybersecurity

OpenClaw can be manipulated through prompt injections in message objects to execute an attacker’s instructions instead of the owner’s.

Share on:

Prompt Injection: AI Agents Show No Reliable Defense Mechanisms

12. June 2026
AI Models, Claude AI, Cybersecurity

Current AI web agents lack reliable defenses against prompt injection attacks and can fulfill attack objectives undetected while users remain unaware of the threat.

Share on:

NEWBioShocking Attack Exploits AI Browsers to Steal User Credentials

GitHub Repository Trick Deceives AI Agents into Executing Malware

Prompt Injection Test: 6,000 Attacks on Anthropic’s Opus Without Success

macOS Malware “Gaslight” Confuses AI-Powered Analysis Tools with Fake Errors

Gaslight: macOS Malware Uses Prompt Injection Against AI-Powered Malware Analysis

Language Models Confuse System Instructions with User Input

AI Security Beyond Mythos Export Controls: Prompt Injection and Red Teaming in Focus

M365 Copilot SearchLeak: Parameter-Injection Attacks Against AI Search

Anthropic Researchers Demonstrate Security Vulnerability in Claude via Simple Prompts

Runtime Signals for Detecting Compromised AI Agents

OpenClaw Vulnerable to Prompt Injections via Message Objects

Prompt Injection: AI Agents Show No Reliable Defense Mechanisms

Lumi AI News

Legal

Topics