Trusted access for the next era of cyber defense
OpenAI introduces GPT-5.4-Cyber to vetted cybersecurity defenders through an expanded Trusted Access program. The move aims to strengthen AI-driven cyber defense while maintaining strict safeguards.
Le meilleur de l'écosystème IA et MCP, sélectionné chaque jour.
Yesterday saw a significant push toward agentic autonomy and developer productivity, most notably with Cursor proving that multi-agent systems can now effectively optimize low-level GPU kernels, achieving a massive 38% speedup. This signals a shift where agents aren't just writing high-level code, but are mastering hardware-specific performance.
Anthropic also doubled down on the developer experience with a redesign of the Claude Code desktop app and the introduction of "routines," enabling repeatable workflows for common tasks like PR reviews. Architecturally, Anthropic is further maturing agent deployment by decoupling reasoning from execution, a critical step for scaling managed agents with reliability. Meanwhile, OpenAI expanded its Trusted Access program, deploying GPT-5.4-Cyber specifically for cybersecurity defense.
Today's stories:
The day's theme is clear: AI is moving from "chatting about code" to "autonomously managing the entire stack," from CUDA kernels to PR workflows.
OpenAI introduces GPT-5.4-Cyber to vetted cybersecurity defenders through an expanded Trusted Access program. The move aims to strengthen AI-driven cyber defense while maintaining strict safeguards.

Anthropic has redesigned the Claude Code desktop app to support running multiple tasks concurrently. This update focuses on improving productivity for developers managing complex coding workflows.

Anthropic explores the architectural decoupling of agent reasoning (the brain) from execution (the hands) to enable better scaling and reliability in managed agents. This approach allows for more robust control and observability when deploying agents at scale.

Claude Code now supports routines, allowing developers to define repeatable AI workflows for tasks like backlog grooming and PR reviews. This brings a level of automation to the developer inner loop.

Cursor demonstrated a multi-agent system that autonomously optimized 235 CUDA kernels for NVIDIA Blackwell 200 GPUs. The approach achieved a 38% geomean speedup over baselines in just three weeks, showcasing the power of agentic optimization for low-level performance.