AI News — March 10, 2026

The latest from the AI and MCP ecosystem, curated daily.

OpenAI introduces IH-Challenge, a training approach that teaches models to prioritise instructions from trusted sources over injected or untrusted ones. It improves resistance to prompt injection and makes models more steerable by operators, with direct implications for building safer agentic systems.

Today's stories:

Improving instruction hierarchy in frontier LLMs (OpenAI News): OpenAI introduces IH-Challenge, a training approach that teaches models to prioritise instructions from trusted sources over injected or untrusted ones. It improves resistance to prompt injection and makes models more steerable by operators, with direct implications for building safer agentic systems.
Introducing Storage Buckets on the Hugging Face Hub (Hugging Face Blog): Hugging Face launched Storage Buckets, a new Hub feature for storing arbitrary files (datasets, checkpoints, outputs) alongside models and datasets in a unified workspace. It removes the need for separate S3/GCS buckets for ML artefacts, keeping everything in the HF ecosystem, which is a practical improvement for teams with large training pipelines.
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries (Hugging Face Blog): A comparative analysis of 16 open-source reinforcement learning libraries for LLM training, covering throughput, async rollout support, scalability, and ease of use. It cuts through the increasingly crowded RL-for-LLMs space with concrete benchmarks and recommendations for different training scales, making it a useful reference for teams choosing a post-training RL stack.
Gemini Embedding 2: Our first natively multimodal embedding model (Google AI Blog): Google launched Gemini Embedding 2, its first natively multimodal embedding model, capable of embedding text, images, and mixed content into a unified vector space. It enables semantic search and RAG pipelines that work across modalities without a separate embedding model per content type, a significant addition for developers building multimodal retrieval systems.

Google AI Blog · Mar 10, 2026

Gemini Embedding 2: Our first natively multimodal embedding model

Google releases Gemini Embedding 2, a natively multimodal embedding model that maps text, images, video, audio and documents into a single vector space. This simplifies retrieval and multimodal search pipelines for developers building applications that need unified semantic representations across modalities.
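
To make the "single vector space" idea concrete, here is a minimal sketch of cross-modal retrieval by cosine similarity. It does not call Gemini Embedding 2 or any real API: the three-dimensional vectors, the item names, and the `search` helper are hypothetical stand-ins, on the assumption that a multimodal model would return comparable vectors for every content type.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical, hand-written vectors standing in for model output.
# A real embedding model would produce far higher-dimensional vectors.
index = [
    ("text", "caption: a cat on a sofa", [0.9, 0.1, 0.0]),
    ("image", "photo_of_cat.png", [0.8, 0.2, 0.1]),
    ("audio", "engine_noise.wav", [0.0, 0.1, 0.9]),
]


def search(query_vector, items, top_k=2):
    """Rank items of any modality against one query vector."""
    scored = [
        (cosine_similarity(query_vector, vector), modality, name)
        for modality, name, vector in items
    ]
    scored.sort(key=lambda entry: entry[0], reverse=True)
    return scored[:top_k]


# Hypothetical query vector, e.g. the embedding of the text "cat".
query = [1.0, 0.0, 0.0]
results = search(query, index)
for score, modality, name in results:
    print(f"{score:.3f}  {modality:<5}  {name}")
```

Because every item lives in the same space, one ranking function serves text, image and audio entries alike, which is the simplification the summary describes.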

Tags: google, gemini, embeddings, multimodal


OpenAI News · Mar 10, 2026

Improving instruction hierarchy in frontier LLMs

Tags: openai, research, prompt-injection, safety


Hugging Face Blog · Mar 10, 2026

Introducing Storage Buckets on the Hugging Face Hub

Hugging Face launched Storage Buckets, a new Hub feature for storing arbitrary files (datasets, checkpoints, outputs) alongside models and datasets in a unified workspace. It removes the need for separate S3/GCS buckets for ML artefacts, keeping everything in the HF ecosystem, which is a practical improvement for teams with large training pipelines.

Tags: huggingface, platform, new-feature, developer-tools


Hugging Face Blog · Mar 10, 2026

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

A comparative analysis of 16 open-source reinforcement learning libraries for LLM training, covering throughput, async rollout support, scalability, and ease of use. It cuts through the increasingly crowded RL-for-LLMs space with concrete benchmarks and recommendations for different training scales, making it a useful reference for teams choosing a post-training RL stack.

Tags: research, reinforcement-learning, open-source, training

