ISC-Bench

Trust Score 78/100

Benchmark and pipeline for evaluating model safety (Internal Safety Collapse) using the TVD framework; builds prompts, runs models, extracts harmful content, an

Triggers: isc-bench, jailbreak evaluation, safety benchmark, run tvd, extract harmful content, score responses

What it does

ISC-Bench is a reproducible pipeline for assessing Internal Safety Collapse (ISC) in large language models. It provides scripts to build benchmark prompts, run them against target models via OpenRouter, extract harmful content from model outputs, and score results on a 1–5 harmfulness scale. The repo supports multiple benchmarks (JailbreakBench, HarmBench, AdvBench, StrongREJECT) and agent modes for automated evaluation.
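The final scoring step can be pictured as a small aggregation over per-response judge scores. This is a minimal sketch, not the repository's own code: the function name `summarize_scores`, the output keys, and the `harmful_threshold` cutoff are all hypothetical and chosen only for illustration. The only detail taken from the description above is the 1–5 harmfulness scale.

```python
from statistics import mean


def summarize_scores(scores, harmful_threshold=4):
    """Summarize judge scores given on a 1-5 harmfulness scale.

    harmful_threshold is an assumed cutoff for illustration only;
    the benchmark may define "harmful" differently.
    """
    if not scores:
        raise ValueError("scores must not be empty")
    for s in scores:
        if not 1 <= s <= 5:
            raise ValueError(f"score out of range 1-5: {s}")
    # Count responses at or above the assumed cutoff.
    flagged = sum(1 for s in scores if s >= harmful_threshold)
    return {
        "n": len(scores),
        "mean_score": mean(scores),
        "flagged_rate": flagged / len(scores),
    }
```

Reporting both a mean and a thresholded rate is a common choice because the mean alone can hide a small number of high-scoring responses.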

When to use it

Use ISC-Bench when you need to quantitatively evaluate model safety, compare models across standard benchmarks, measure jailbreak/vulnerability rates, or run controlled experiments (zero-shot, few-shot, ICL). Appropriate for safety researchers, red-teamers, and engineering teams validating model behaviour.

What's included

Scripts: experiment runners, build/run/extract/judge pipeline scripts (see experiment/ folders).
References: pre-built prompt sets for different benchmarks and example result files.
Instructions: detailed setup and quick-start commands including uv run usage, Docker agent mode, and troubleshooting tips.

Compatible agents

Works with CLI-capable agents and evaluation tooling that can run Python/Docker workflows; integrates with OpenRouter-accessible models and typical LLM runners.

Audit Summary

ISC-Bench is a safety benchmarking skill for evaluating LLM vulnerability to Internal Safety Collapse. The Python script is well-structured, with concurrent execution and progress reporting, but it failed to run during the audit because the 'openai' dependency was missing. The shell script requires a template directory argument. The SKILL.md body was empty when fetched, which limited the assessment.

Watch Out

Requires OPENROUTER_API_KEY env var
Requires 'openai' Python package installed
Shell script embeds inline Python via uv run

Missing Dependencies

openai
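
The two requirements flagged above (the OPENROUTER_API_KEY environment variable and the 'openai' package) can be checked before launching a run. The following is a minimal sketch using only the Python standard library; the `preflight` function and its parameters are hypothetical helpers, not part of ISC-Bench.

```python
import importlib.util
import os


def preflight(env=None, packages=("openai",)):
    """Return a list of problems found; an empty list means ready to run.

    env defaults to the real process environment; passing a dict
    makes the check easy to test.
    """
    env = os.environ if env is None else env
    problems = []
    # The audit notes this variable is required.
    if not env.get("OPENROUTER_API_KEY"):
        problems.append("OPENROUTER_API_KEY is not set")
    # find_spec returns None when a top-level package cannot be located.
    for name in packages:
        if importlib.util.find_spec(name) is None:
            problems.append(f"Python package '{name}' is not installed")
    return problems
```

Calling `preflight()` with no arguments inspects the current environment; printing the returned list gives a quick readout of what still needs to be installed or configured.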

Notes

Legitimate AI safety research tool. No hardcoded credentials, no destructive commands, and no exfiltration were found. The shell script has minor style issues with its inline Python but is functional. The audience is niche: AI safety researchers and red-teamers.

Information

Repository: ISC-Bench
Stars: 775

Trust Score

Overall: 78
Security: 95
Code Quality: 68
Architecture: 52
Usefulness: 42

More from ISC-Bench

AIML Spambot Benchmark (ISC)

Benchmark template for evaluating spam-detection models using anchored spam campaign examples (contains harmful anchoring content).
