[Journal]
  • IEEE Transactions on Multimedia, 2023, vol. 25, no. 1

Abstract: Recent transformer-based models, especially patch-based methods, have shown great potential in vision tasks. However, splitting the input features into fixed-size patches ignores the fact that...
