Grammar-Constrained Decoding Enables LLM Jailbreak for Malware Generation

11. June 2026
AI Models, Claude Code, Cybersecurity

Grammar-Constrained Decoding (GCD), a technique for ensuring syntactically correct code, gives attackers a new jailbreak method whose success rate is more than 30 percentage points higher than that of previous approaches.

Claude 3.5 Sonnet Systematically Blocks Legitimate Security Questions

11. June 2026
Anthropic, Claude AI, Cybersecurity

The security filter in Claude 3.5 Sonnet blocks legitimate security requests, limiting its usability for CTOs performing security audits and vulnerability assessments.

Explainable AI as a Requirement in Critical Systems

11. June 2026
AI Models, Claude AI, Regulation

Trust in AI does not emerge automatically; it must be built systematically through explainability measures suited to the application context and regulatory requirements.

GitHub Disables npm Installation Scripts by Default Against Supply Chain Attacks

11. June 2026
Claude Code, Cybersecurity

npm 12 disables install scripts by default to make it harder to exploit lifecycle hooks for supply chain attacks.

Anthropic CEO Demands State Emergency Brake for AI Risks

11. June 2026
Anthropic, Cybersecurity, Regulation

Anthropic calls for an aviation-like regulatory authority or commissioned private auditors to examine AI models for critical risks before their release.

Anthropic Revises Claude Safeguards for LLM Research

11. June 2026
Anthropic, Claude AI, Regulation

Anthropic makes previous restrictions on LLM research transparent and adjusts them after facing significant criticism from the research community.

Anthropic Revises Safeguard Policy for Claude in Frontier LLM Research

11. June 2026
Anthropic, Claude AI, Regulation

Anthropic abandons covert throttling of Claude during frontier LLM research and will make safeguards more transparent going forward.

IT Departments Losing Control Over Enterprise AI Adoption

11. June 2026
AI Models, Claude Cowork

53 percent of employees are already using private AI tools in the workplace because IT departments fail to provide approved alternatives.

InternVideo3: Foundation Models with Multimodal Reasoning for Video Agents

11. June 2026
AI Models

InternVideo3 enables foundation models to analyze longer video sequences with iterative reasoning and tool use while avoiding efficiency problems in KV cache management.

Anthropic Introduces Arbor: AI System for Autonomous Research Loops

11. June 2026
AI Models, Claude AI, Claude Code

Arbor enables AI-driven research through systematic hypothesis management and achieved an average of 2.5x higher improvements than existing code models on six test tasks.

Anthropic’s Arbor: AI Agents Conduct Autonomous Research Cycles

11. June 2026
AI Models, Claude AI

Arbor coordinates autonomous AI agents via persistent hypothesis trees and achieved 2.5× better results than Codex and Claude Code on six research tasks.

Bebop: Rejection Sampling Improves Multi-Token Prediction in RL Training

11. June 2026
AI Models, Claude Code

Bebop uses rejection sampling and TV loss optimization to maintain stable MTP acceptance rates during RL training and accelerates rollouts by up to 1.8x.

Lumi AI News