Ettin Suite: SoTA Paired Encoders and Decoders
A new release of state-of-the-art paired encoders and decoders via the Ettin Suite. Useful for developers working on complex embedding and generation tasks.
Le meilleur de l'écosystème IA et MCP, sélectionné chaque jour.
Sources
A new release of state-of-the-art paired encoders and decoders via the Ettin Suite. Useful for developers working on complex embedding and generation tasks.
Hugging Face is transitioning its Hub from Git LFS to Xet to improve performance and scalability for large datasets. This move aims to optimize how massive AI models and datasets are stored and retrieved.
Kimina-Prover leverages test-time reinforcement learning search to enhance the formal reasoning capabilities of large language models. The research focuses on improving accuracy in complex mathematical and logical proofs.
Research on improving robotic efficiency by decoupling action prediction from execution. Relevant for those building high-performance agentic physical systems.

Introduction of ScreenEnv, a framework for deploying full-stack desktop agents. Provides the necessary environment for agents to interact with OS-level interfaces.

Hugging Face introduces a new MCP server, expanding the Model Context Protocol ecosystem by allowing AI agents to interact more deeply with the HF Hub. This enables better discovery and integration of open-source models and datasets within agentic workflows.
Hugging Face introduces Gradio MCP Servers, allowing LLMs to interact directly with Gradio apps. This expands the MCP ecosystem by bridging the gap between interactive ML demos and agentic tool-use.

Hugging Face introduces SmolLM3, a compact multilingual model designed for efficient reasoning with long-context support. Ideal for on-device deployment and lightweight agentic tasks.
Hugging Face introduces a streamlined MultiModal Data Pipeline (MMDP) designed to optimize the handling and processing of diverse data types for AI training. The pipeline focuses on efficiency and scalability for multimodal model development.
A technical guide on implementing sparse embedding models using Sentence Transformers. This is highly relevant for developers building advanced RAG systems and information retrieval pipelines.
NVIDIA has released the Llama Nemotron Nano VLM on Hugging Face, providing a compact vision-language model for efficient edge deployment and specialized AI applications.

Anthropic launched Desktop Extensions — a one-click installation system for MCP servers directly in Claude Desktop, eliminating the need to manually edit JSON config files. Users can browse, install, and manage MCP servers from a UI without touching the terminal. A major UX improvement that significantly lowers the barrier to MCP adoption.
Google's Gemma 3n is now fully available in the open-source ecosystem. This release continues the push for democratizing high-performance AI models for the developer community.
SGLang now integrates a Transformers backend, enhancing the flexibility and compatibility of the serving framework. This allows developers to more easily deploy and optimize a wider range of models.

Anthropic walks through the engineering decisions behind Claude's Research feature — a multi-agent system that parallelises web search, synthesises findings across agents, and produces long-form research reports. The post covers orchestration architecture, reliability challenges, and how they handled conflicting information between sub-agents. Valuable real-world reference for anyone building research or RAG-style multi-agent systems.
Analysis of how long prompts can block concurrent LLM requests and degrade system performance. Essential reading for developers optimizing throughput and latency in production AI environments.

Introduction to the Hugging Face Kernel Hub, providing a fast track for developers to explore and deploy kernels. A useful resource for those looking to extend HF's computational capabilities.

Hugging Face integrates Featherless AI as an inference provider, expanding options for deploying and serving open-source models. This move simplifies access to high-performance inference for developers building with the HF ecosystem.
Hugging Face and NVIDIA have launched Training Cluster as a Service, simplifying the deployment of massive compute resources for model training. This collaboration lowers the barrier for developers to scale their training infrastructure efficiently.
ScreenSuite is a new comprehensive evaluation framework specifically designed for GUI agents. It provides a robust set of benchmarks to measure how effectively AI agents can navigate and interact with graphical user interfaces.