LangGraph: SQL Injection Chain Enables Remote Code Execution in Self-Hosted AI Agents

12 June 2026
Claude Code, Cybersecurity

A chain of vulnerabilities in LangGraph enables remote code execution in self-hosted AI agent deployments and requires immediate patching.

MiniMax Sparse Attention: Efficient Long-Context Processing for Billion-Parameter Models

12 June 2026
AI Models, Claude Code

MSA reduces attention computation for million-token contexts by a factor of 28.4 through blockwise sparse selection and achieves practical speedups via co-design of algorithm and GPU kernel.

OpenClaw Agents Can Be Manipulated into Executing Code via Crafted Inputs

11 June 2026
AI Models, Claude Code, Cybersecurity

Hidden instructions in contacts, vCards, and location data can manipulate OpenClaw into executing code and leaking sensitive data.

OpenAI Acquires Ona for Enhanced Cloud Environments

11 June 2026
Claude Code, OpenAI

OpenAI acquires Ona to extend Codex with secure cloud environments, enabling long-lived AI agents in enterprise operations.

Agent-EvalKit: Open-Source Evaluation for AI Agents in Claude Code

11 June 2026
AI Models, Claude AI, Claude Code

Agent-EvalKit automates the evaluation of AI agents through structured test-case generation, observability instrumentation, and combined code and LLM-based metrics directly in the development environment.

Building Trust in AI Agents: Observability and Guardrails as the Foundation for SRE

11 June 2026
Claude AI, Claude Code

SRE trust in AI agents grows through observability, guardrails, and progressive autonomy models, not through technological maturity alone.

Grammar-Constrained Decoding Enables LLM Jailbreak for Malware Generation

11 June 2026
AI Models, Claude Code, Cybersecurity

Grammar-Constrained Decoding (GCD), a technique for ensuring syntactically correct code, gives attackers a new jailbreak method whose success rate is more than 30 percentage points higher than that of previous approaches.

GitHub Disables npm Installation Scripts by Default to Counter Supply Chain Attacks

11 June 2026
Claude Code, Cybersecurity

npm 12 disables install scripts by default to make it harder to exploit lifecycle hooks for supply chain attacks.

Mixture-of-Experts Router Optimized via Manifold Power Iteration

11 June 2026
AI Models, Claude Code

Aligning router rows with the principal singular directions of their associated expert matrices improves the efficiency and stability of Mixture-of-Experts models.

Claw-SWE-Bench: Benchmark for AI Agents on Code Tasks

11 June 2026
AI Models, Claude Code

The Claw-SWE-Bench framework demonstrates that adapter design is critical for code agents: OpenClaw achieves 19.1% Pass@1 with a minimal adapter and 73.4% with a complete adapter.

Anthropic Introduces Arbor: AI System for Autonomous Research Loops

11 June 2026
AI Models, Claude AI, Claude Code

Arbor enables AI-driven research through systematic hypothesis management and, on six test tasks, achieved improvements that were on average 2.5x higher than those of existing code models.

Bebop: Rejection Sampling Improves Multi-Token Prediction in RL Training

11 June 2026
AI Models, Claude Code

Bebop uses rejection sampling and TV loss optimization to maintain stable MTP acceptance rates during RL training and accelerates rollouts by up to 1.8x.

Lumi AI News
