ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
In this work, we investigate a novel cross-modality transfer learning setting, namely parameter-efficient image-to-video transfer learning. To solve this problem, we propose a new Spatio