Papers
arxiv:2304.11968

Track Anything: Segment Anything Meets Videos

Published on Apr 24, 2023
Authors:
,
,
,
,
,

Abstract

The Track Anything Model (TAM) achieves high-performance interactive tracking and segmentation in videos with minimal human interaction.

AI-generated summary

Recently, the Segment Anything Model (SAM) gains lots of attention rapidly due to its impressive segmentation performance on images. Regarding its strong ability on image segmentation and high interactivity with different prompts, we found that it performs poorly on consistent segmentation in videos. Therefore, in this report, we propose Track Anything Model (TAM), which achieves high-performance interactive tracking and segmentation in videos. To be detailed, given a video sequence, only with very little human participation, i.e., several clicks, people can track anything they are interested in, and get satisfactory results in one-pass inference. Without additional training, such an interactive design performs impressively on video object tracking and segmentation. All resources are available on https://github.com/gaomingqi/Track-Anything. We hope this work can facilitate related research.

Community

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2304.11968 in a dataset README.md to link it from this page.

Spaces citing this paper 19

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.