中国科学技术信息研究所--国家工程技术数字图书馆

EAPT: Efficient Attention Pyramid Transformer for Image Processing

[期刊]

《IEEE transactions on multimedia》 2023年25卷1期

原文获取收藏分享

摘要 : Recent transformer-based models, especially patch-based methods, have shown huge potentiality in vision tasks. However, the split fixed-size patches divide the input features into the same size patches, which ignores the fact that... 展开

作者	Xiao Lin Shuzhou Sun Wei Huang Bin Sheng Ping Li David Dagan Feng
作者单位	Department of Computer Science Shanghai Normal University Shanghai China\|Shanghai Engineering Research Center of Intelligent Education and Bigdata Shanghai China Department of Computer Science and Engineering University of Shanghai for Science and Technology Shanghai China Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China Department of Computing The Hong Kong Polytechnic University Kowloon Hong Kong Biomedical and Multimedia Information Technology Research Group School of Information Technologies The University of Sydney Sydney NSW Australia
期刊名称	《IEEE transactions on multimedia 》
页码/总页数	50-61 / 12
语种/中图分类号	英语 / TP37
关键词	Transformers Encoding Task analysis Semantics Feature extraction Costs Convolutional neural networks
DOI	10.1109/TMM.2021.3120873
馆藏号	IELEP0172