Rethinking Alignment in Video Super-Resolution Transformers

Our experiments show that: (i) video super-resolution (VSR) Transformers can directly utilize multi-frame information from unaligned videos, and (ii) existing alignment methods are sometimes harmful to VSR Transformers. We propose a new and efficient alignment method called patch alignment, which aligns image patches instead of pixels. VSR Transformers equipped with patch alignment achieve state-of-the-art (SoTA) performance.
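To make "aligning image patches instead of pixels" concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the released implementation: it assumes a per-pixel optical flow is already available, that each patch is moved by the mean of its flow vectors rounded to whole pixels, and that out-of-frame coordinates are clamped to the border. The function name `patch_align`, its parameters, and the flow layout are hypothetical.

```python
def patch_align(support, flow, patch=2):
    """Shift whole patches of a supporting frame toward the reference frame.

    Illustrative sketch only; the names and the flow layout are assumptions.

    support: H x W list of lists holding the supporting frame's pixel values.
    flow:    H x W list of (dy, dx) tuples; flow[y][x] points from reference
             pixel (y, x) to its estimated match in `support`.
    patch:   side length of the square patches.
    """
    h, w = len(support), len(support[0])
    aligned = [[0] * w for _ in range(h)]
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            ys = range(top, min(top + patch, h))
            xs = range(left, min(left + patch, w))
            n = len(ys) * len(xs)
            # One motion vector per patch: the mean flow, rounded to whole pixels
            # (assumption: integer shifts, so no sub-pixel interpolation is needed).
            dy = round(sum(flow[y][x][0] for y in ys for x in xs) / n)
            dx = round(sum(flow[y][x][1] for y in ys for x in xs) / n)
            for y in ys:
                for x in xs:
                    # Every pixel in the patch uses the same shift, so the patch
                    # moves as a block; source coordinates are clamped at the border.
                    sy = min(max(y + dy, 0), h - 1)
                    sx = min(max(x + dx, 0), w - 1)
                    aligned[y][x] = support[sy][sx]
    return aligned


if __name__ == "__main__":
    frame = [[r * 4 + c for c in range(4)] for r in range(4)]
    flow = [[(0, 1)] * 4 for _ in range(4)]  # every pixel matches one column to the right
    for row in patch_align(frame, flow, patch=2):
        print(row)
```

Because all pixels in a patch share one integer shift, the pixels inside each patch keep their relative arrangement, in contrast to per-pixel warping where every pixel is resampled independently.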

Resources

Newest Version (arXiv)
Video [Bilibili]
Code

Citation

If you find our work inspiring, please cite it:

@article{shi2022rethinking,
  title={Rethinking Alignment in Video Super-Resolution Transformers},
  author={Shi, Shuwei and Gu, Jinjin and Xie, Liangbin and Wang, Xintao and Yang, Yujiu and Dong, Chao},
  journal={Advances in Neural Information Processing Systems},
  year={2022}
}


Neural Information Processing Systems (NeurIPS), 2022

Shuwei Shi^, Jinjin Gu^, Liangbin Xie, Xintao Wang^*, Yujiu Yang, Chao Dong
