AI News

The best of the AI and MCP ecosystem, curated daily.

Daily · Jun 8 | Weekly · Jun 1 – 7 | Monthly · May 2026 | Quarterly · Jan – Mar 2026

Hugging Face Blog · Oct 7, 2025

BigCodeArena: Judging code generations end to end with code executions

BigCodeArena introduces a method for judging LLM code generation by executing the code end-to-end. This provides a more rigorous evaluation metric than token-based similarity.

evals, code-generation, huggingface, llm

Read the original

Claude Blog · Oct 1, 2025

Claude and Slack

Claude is now natively integrated with Slack, allowing teams to get help directly in channels and threads without leaving their workspace. You can also connect your Slack workspace to Claude.ai so it can search conversations and pull relevant context when needed. A significant step for enterprise teams already living in Slack.

claude, new-feature, slack, integration

Read the original

Hugging Face Blog · Oct 1, 2025

Introducing RTEB: A New Standard for Retrieval Evaluation

Hugging Face introduces RTEB, a new benchmark standard for evaluating retrieval systems. This provides developers with a more robust framework to measure and improve the accuracy of AI retrieval pipelines.

retrieval, evaluation, benchmark, huggingface

Read the original

Anthropic Engineering · Sep 29, 2025

Effective context engineering for AI agents

Anthropic shares practical techniques for managing what goes into an agent's context window: what to include, what to summarise, what to drop, and how to structure information so agents reason more reliably. Context engineering is increasingly recognised as a core skill for agent developers, and this is one of the clearest practical guides on the topic.

agents, context-engineering, developer-tools, patterns

Read the original

Hugging Face Blog · Sep 29, 2025

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

A new optimization technique using depth-pruned draft models significantly accelerates Qwen3-8B agent performance on Intel Core Ultra processors. This makes high-capability AI agents more viable for local edge deployment.

qwen3, intel, inference-optimization, local-ai

Read the original