
from mkurman
OpenAI CLIP (Contrastive Language–Image Pre-training) learns joint text–image representations by contrastive pre-training on image–caption pairs. The shared embedding space enables zero-shot image classification, image–text similarity scoring, and cross-modal retrieval without task-specific fine-tuning.
Includes example code for loading a CLIP model, preprocessing images, tokenizing text, and computing similarity or classification scores.
pip install openai-clip
This skill has not been reviewed by our automated audit pipeline yet.