GLM-5.2: Chinese Open-Weights Model with 753 Billion Parameters

18. June 2026
AI Models, Claude Code

GLM-5.2 ranks as the leading open language model on the Artificial Analysis Index with a score of 51 and places 2nd in the Code Arena WebDev Leaderboard, but produces significantly more output tokens than competing models.

Share on:

LoopCoder-v2: Two Loops as the Optimum for Efficient Model Computation in Programming

17. June 2026
AI Models, Claude Code

LoopCoder-v2 with two loops substantially improves code reasoning benchmarks (SWE-bench Verified: 43.0 → 64.4 points), while three or more loops become counterproductive due to growing position errors.

Share on:

Grammar-Constrained Decoding Enables LLM Jailbreak for Malware Generation

11. June 2026
AI Models, Claude Code, Cybersecurity

Grammar-Constrained Decoding (GCD), a technique for ensuring syntactically correct code, opens a new jailbreak method for attackers with a success rate over 30 percentage points higher than previous approaches.

Share on:

Anthropic’s Arbor: AI Agents Conduct Autonomous Research Cycles

11. June 2026
AI Models, Claude AI

Arbor coordinates autonomous AI agents via persistent hypothesis trees and achieved 2.5× better results than Codex and Claude Code on six research tasks.

Share on:

Socratic-SWE: Self-Learning AI Agents for Code Repair

8. June 2026
AI Models, Claude Code

A self-learning framework for code-repair agents leverages their solution traces directly to generate targeted training tasks, achieving higher accuracy than previous approaches.

Share on:

GLM-5.2: Chinese Open-Weights Model with 753 Billion Parameters

LoopCoder-v2: Two Loops as the Optimum for Efficient Model Computation in Programming

Grammar-Constrained Decoding Enables LLM Jailbreak for Malware Generation

Anthropic’s Arbor: AI Agents Conduct Autonomous Research Cycles

Socratic-SWE: Self-Learning AI Agents for Code Repair

Lumi AI News

Legal

Topics