naver-ai/swin_rope_axial_small_patch4_window7_224

Image Classification

bhheo committed on Oct 16, 2024

Commit e5d167a · verified · 1 Parent(s): 01088fc

Update README.md

Files changed (1)

README.md +27 -3

@@ -1,3 +1,27 @@
----
-license: bsd-3-clause
----

+---
+license: bsd-3-clause
+datasets:
+- ILSVRC/imagenet-1k
+---
+# Model Card
+<!-- Provide a quick summary of what the model is/does. -->
+ImageNet-1k Swin-Transformer pre-trained model with Rotary Position Embedding
+## Rotary Position Embedding for Vision Transformer [ECCV 2024]
+- **Repository:** https://github.com/naver-ai/rope-vit
+- **Paper:** https://arxiv.org/abs/2403.13298
+## Citation
+```
+@inproceedings{heo2024ropevit,
+    title={Rotary Position Embedding for Vision Transformer},
+    author={Heo, Byeongho and Park, Song and Han, Dongyoon and Yun, Sangdoo},
+    year={2024},
+    booktitle={European Conference on Computer Vision (ECCV)},
+}
+```