This repository provides a PyTorch implementation of the ICML 2023 paper *On the Generalization of Multi-modal Contrastive Learning* by Qi Zhang\*, Yifei Wang\*, and Yisen Wang.
In this repository, we consider four strategies for leveraging CLIP to improve self-supervised contrastive learning with SimCLR. ImageNet linear-probing results are summarized below.
| Method | Baseline (SimCLR) | AddNewPositive | DropFalsePositive | DropFalseNegative | DropEasyNegative |
|---|---|---|---|---|---|
| Linear accuracy (%) | 61.2 | 67.4 (+6.2) | 61.8 (+0.6) | 61.4 (+0.2) | 62.3 (+1.1) |
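The strategy names suggest the common idea: use CLIP's image embeddings as a semantic similarity signal to re-label positives and negatives inside the SimCLR InfoNCE loss. As a rough illustration, here is a minimal, self-contained sketch of one variant (dropping "easy" negatives). This is not the repository's exact code: the names `info_nce_drop_easy_negative`, `clip_feats`, and `drop_frac` are hypothetical.

```python
import torch
import torch.nn.functional as F

def info_nce_drop_easy_negative(z1, z2, clip_feats, temperature=0.5, drop_frac=0.1):
    """InfoNCE over two augmented views (z1, z2: n x d encoder outputs);
    candidates that CLIP rates least similar to the anchor ("easy"
    negatives) are masked out of the denominator. Illustrative sketch only."""
    n = z1.size(0)
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)   # 2n x d
    sim = z @ z.t() / temperature                        # encoder similarities
    sim.fill_diagonal_(float('-inf'))                    # drop self-pairs

    # Index of each anchor's positive: the other view of the same image.
    pos = torch.arange(2 * n, device=z.device).roll(n)

    # CLIP similarities between the n source images, tiled to 2n x 2n.
    c = F.normalize(clip_feats, dim=1)
    clip_sim = (c @ c.t()).repeat(2, 2)
    clip_sim.fill_diagonal_(float('inf'))                # never drop self-pairs...
    clip_sim[torch.arange(2 * n, device=z.device), pos] = float('inf')  # ...or the positive

    # Mask the drop_frac least CLIP-similar candidates per anchor.
    k = int(drop_frac * (2 * n - 2))
    if k > 0:
        easy = clip_sim.topk(k, dim=1, largest=False).indices
        sim.scatter_(1, easy, float('-inf'))

    return F.cross_entropy(sim, pos)
```

In the repository, the analogous behavior is selected with `--method drop_easy_negative` (see the training commands below); the other three methods modify the positive/negative sets in the same spirit.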
Create a Python environment with the provided config file and miniconda:
```bash
conda env create -f environment.yml
conda activate simclr_pytorch
export IMAGENET_PATH=...   # if you have enough RAM, using /dev/shm usually speeds up data loading
export EXMAN_PATH=...      # a path for logs
```
Install the official CLIP repository and download the official CLIP models:
```bash
pip install ftfy regex tqdm
pip install git+https://github.com/openai/CLIP.git
```
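As a quick sanity check of the install, the sketch below loads a CLIP model and computes a unit-normalized image embedding (the kind of feature the strategies above consume). The model name `ViT-B/32` and the file `example.jpg` are placeholders, not the repository's exact choices.

```python
import clip                  # the package installed above
import torch
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)  # downloads weights on first use

image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)
with torch.no_grad():
    feats = model.encode_image(image)                 # 1 x 512 for ViT-B/32
    feats = feats / feats.norm(dim=-1, keepdim=True)  # unit-normalize
```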
Model training consists of two steps: (1) self-supervised encoder pretraining and (2) classifier learning with the encoder representations. Both steps are done with the `train.py` script.
The configs `imagenet_params_epochs*_bs*.yaml` contain the parameters to reproduce the ImageNet results. The pretraining command is:
```bash
python train.py --config configs/imagenet_train_epochs100_bs512.yaml --method <Method>
```
Here `<Method>` is one of `simclr`, `new_positive`, `drop_false_positive`, `drop_false_negative`, or `drop_easy_negative`.
To train a linear classifier on top of the pretrained encoder, run the following command:
```bash
python train.py --config configs/cifar_eval.yaml --encoder_ckpt <path-to-encoder>
```
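`train.py` handles this step end to end; conceptually, a linear evaluation trains a single linear layer on frozen encoder features. The sketch below is illustrative only: `encoder`, `loader`, `feat_dim`, and the hyperparameters are placeholders, not the repository's settings.

```python
import torch
import torch.nn as nn

def linear_probe(encoder, loader, feat_dim=2048, num_classes=1000,
                 epochs=10, device="cuda"):
    """Train a linear classifier on frozen encoder features (sketch)."""
    encoder.eval()                                  # freeze the encoder
    clf = nn.Linear(feat_dim, num_classes).to(device)
    opt = torch.optim.SGD(clf.parameters(), lr=0.1, momentum=0.9)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for x, y in loader:
            x, y = x.to(device), y.to(device)
            with torch.no_grad():                   # no gradients through the encoder
                h = encoder(x)
            loss = loss_fn(clf(h), y)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return clf
```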
If you find our code useful, please cite:

```bibtex
@inproceedings{zhang2023generalization,
  title     = {On the Generalization of Multi-modal Contrastive Learning},
  author    = {Qi Zhang and Yifei Wang and Yisen Wang},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
}
```
Our code borrows the SimCLR implementation from https://github.com/AndrewAtanov/simclr-pytorch.