
from openskynet152
Run Hugging Face models natively in JavaScript/TypeScript (browser or Node) using Transformers.js for text, vision, and audio tasks with WebGPU/WASM fallbacks.
Transformers.js enables agents to run state-of-the-art Hugging Face models directly from JavaScript/TypeScript environments without a Python backend. The skill documents the pipeline API, device selection (WASM vs WebGPU), quantization options, model selection guidance, and memory management patterns (including disposing pipelines). It covers a wide range of tasks: text classification, generation, translation, summarization, NER, image classification, object detection, audio transcription, and feature extraction.
Use this skill when you need to embed ML inference into browser or Node/Deno/Bun apps, implement client-side model inference, or prototype multimodal features without running separate model servers. It's suitable for building on-device features, demos, or production services that can fetch models from the Hugging Face Hub or use local caches.
Useful for agents that can write JavaScript/TypeScript and integrate with web runtimes (Copilot-like code agents, web dev automation agents). Also valuable to operators building browser demos or Node services that need local model inference.
This skill has not been reviewed by our automated audit pipeline yet.