摘要 : Recent methods for action recognition always apply 3D Convolutional Neural Networks (CNNs) to extract spatiotemporal features and introduce optical flows to present motion features. Although achieving state-of-the-art performance,... 展开
作者 | Mengmeng Wang Jiazheng Xing Jing Su Jun Chen Yong Liu |
---|---|
作者单位 | |
期刊名称 | 《IEEE Transactions on Pattern Analysis and Machine Intelligence 》 |
页码/总页数 | 3347-3362 / 16 |
语种/中图分类号 | 英语 / TP391 |
关键词 | Spatiotemporal phenomena Feature extraction Optical flow Videos Training Three-dimensional displays Convolution |
DOI | 10.1109/TPAMI.2022.3173658 |
馆藏号 | IELEP0261 |