
from ai-toolkit155
Guidelines and patterns for implementing prompt caching with Anthropic APIs to reduce input-token costs and latency, including TTLs, breakpoints, and hit-rate m
This skill documents practical patterns for implementing prompt caching when building with Anthropic (Claude) APIs. It explains cache mechanics, recommended TTLs, how to structure cached blocks and breakpoints, anti-patterns that cause cache misses, and methods for measuring cache hit rate. Concrete code examples in Python and TypeScript show how to mark cached content and keep dynamic data outside cached prefixes.
Use this skill when you need to dramatically cut API input-token costs or reduce latency by caching stable parts of prompts. It's useful during production integration, performance tuning, or any high-volume loop where repeated stable prefixes exist (system prompts, tool definitions, large reference docs). Avoid for one-shot calls or very small prompts.
Likely used by agents and tooling that call Anthropic/Claude APIs (Claude-opus/Haiku), and by developer-facing tools (Claude Code, Cursor, Copilot integrations) that need token-cost optimizations.
Pure knowledge/documentation skill with no scripts. Covers Anthropic prompt caching mechanics, layering patterns, anti-patterns, and hit-rate measurement with code examples in Python and TypeScript. Well-structured reference content — clear tables, practical guidance, and specific threshold numbers (0.7 hit-rate target, 1024/2048 min tokens). No security concerns whatsoever.
Clean knowledge-only skill. Would benefit from a references/ directory linking to official Anthropic docs and related skills. The skill_md_body already references the docs URL inline which is good. User-invocable is false, so it's designed as a passive reference skill triggered by keywords like 'prompt caching' and 'cache_control'.
Ubiquitous Language Glossary
Extract a DDD-style ubiquitous language glossary from conversation: identify domain terms, flag ambiguities, propose canonical terms, and write the result to UB
Memory Search
Search past coding sessions and observations using natural-language queries to retrieve decisions, notes, and context.
Documentation Generator
Generate and update README, API docs, architecture notes, changelogs and knowledge-base entries from a codebase, with templates and review guidance.
Design Engineering
Guidelines and patterns for UI animation, micro-interactions, and component polish to build performant, accessible, and delightful interfaces.
Kotlin Rules
Project-specific Kotlin coding standards covering naming, null-safety, coroutines, testing, frameworks (Ktor, Spring), and security best practices.