
from ai-infra-auto-driven-skills186
Guided, PR-backed manual for auditing, debugging, and extending the Qwen3 Coder tool parser in vLLM—focuses on schema edge cases, tool-parser regressions, and v
This skill provides a focused, evidence-backed optimization dossier for the Qwen3 Coder tool parser in the vLLM runtime. It documents landed PRs, runtime surfaces, and a validation plan so an agent (e.g., Codex or a code-focused assistant) can audit, diagnose, and patch regressions related to JSON-schema edge cases (anyOf/oneOf), nullable parameters, and Responses API tool calls. The content is built from diffs and PR notes to ensure recommendations are traceable.
Use this skill when an agent must: reproduce or investigate a regression in vLLM's Qwen3 Coder tool parsing; create or review PRs that change tool-parser behavior; validate tool-call integrity under streaming or speculative decoding; or prepare test lanes that exercise complex schema combinations. It is intended for engineering review, QA automation, and PR triage workflows.
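To make the schema edge cases named above concrete (anyOf/oneOf and nullable parameters), here is a minimal sketch of the kind of check a test lane might run. It is not vLLM's implementation: `candidate_types` and `coerce_param` are hypothetical helpers introduced only for illustration, and the coercion rules are assumptions about what "correct" handling of a raw string argument could look like.

```python
import json
from typing import Any


def candidate_types(schema: dict) -> list[str]:
    """Collect the JSON types a parameter schema allows (hypothetical helper).

    Looks through `type` (string or list), `anyOf`/`oneOf` branches,
    and the OpenAPI-style `nullable` flag.
    """
    types: list[str] = []
    declared = schema.get("type")
    if isinstance(declared, str):
        types.append(declared)
    elif isinstance(declared, list):
        types.extend(declared)
    for key in ("anyOf", "oneOf"):
        for branch in schema.get(key, []):
            types.extend(candidate_types(branch))
    if schema.get("nullable"):
        types.append("null")
    return types


def coerce_param(raw: str, schema: dict) -> Any:
    """Convert a raw string argument to a typed value (illustrative rules only)."""
    types = candidate_types(schema)
    text = raw.strip()
    # Only treat the literal "null" as None when the schema permits null.
    if "null" in types and text == "null":
        return None
    if "integer" in types:
        try:
            return int(text)
        except ValueError:
            pass
    if "number" in types:
        try:
            return float(text)
        except ValueError:
            pass
    if "boolean" in types and text in ("true", "false"):
        return text == "true"
    if "object" in types or "array" in types:
        try:
            parsed = json.loads(text)
            if isinstance(parsed, (dict, list)):
                return parsed
        except json.JSONDecodeError:
            pass
    # Fall back to the untouched string when no branch matches.
    return raw


nullable_int = {"anyOf": [{"type": "integer"}, {"type": "null"}]}
print(coerce_param("42", nullable_int))
print(coerce_param("null", nullable_int))
print(coerce_param("null", {"type": "string"}))
```

The last call shows the case such a test is meant to catch: a parameter declared as a plain string must keep the text `null` as a string, while the same text under a nullable schema becomes a real null.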
references/ directory with PR history and validation notes.
references/pr-history.md, model-pr-optimization-history/...
Best suited for code-capable agents (Codex-family, GPT-code assistants, Claude Code) and any workflow that can read PR diffs and run validation test lanes.
This skill has not been reviewed by our automated audit pipeline yet.