Image Classification
Transformers
SPTNet / README.md
whj363636's picture
Add pipeline tag and library name; include link to Github repository (#1)
235eaa2 verified
metadata
license: cc-by-nc-sa-4.0
library_name: transformers
pipeline_tag: image-classification

SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning (ICLR 2024)

This repository contains the model described in https://arxiv.org/abs/2403.13684.

Code: https://github.com/Visual-AI/SPTNet

SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning
By Hongjun Wang, Sagar Vaze, and Kai Han.

[05.2024] We update the results of SPTNet with DINOv2 on CUB, please check our latest version in Arxiv

All Old New
CUB (DINO) 65.8 68.8 65.1
CUB (DINOv2) 76.3 79.5 74.6

Results

Generic results:

All Old New
CIFAR-10 97.3 95.0 98.6
CIFAR-100 81.3 84.3 75.6
ImageNet-100 85.4 93.2 81.4

Fine-grained results:

All Old New
CUB 65.8 68.8 65.1
Stanford Cars 59.0 79.2 49.3
FGVC-Aircraft 59.3 61.8 58.1
Herbarium19 43.4 58.7 35.2

Citing this work

If you find this repo useful for your research, please consider citing our paper:

@inproceedings{wang2024sptnet,
    author    = {Wang, Hongjun and Vaze, Sagar and Han, Kai},
    title     = {SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning},
    booktitle = {International Conference on Learning Representations (ICLR)},
    year      = {2024}
}