SiT: Self-supervised vIsion Transformer
Paper
•
2104.03602
•
Published
SiT is a self-supervised learning model that combines masked image modeling and contrastive learning. The model is trained on ImageNet-1K.
Sara Atito, Muhammad Awais, Josef Kittler
from modeling_sit import ViTSiTForPreTraining
# reload
model = ViTSiTForPreTraining.from_pretrained("erow/SiT")
@inproceedings{atito2023sit,
title={SiT is all you need},
author={Atito, Sara and Awais, Muhammed and Nandam, Srinivasa and Kittler, Josef},
booktitle={2023 IEEE International Conference on Image Processing (ICIP)},
pages={2125--2129},
year={2023},
organization={IEEE}
}