Fig. 3
From: Multipath Attention and Adaptive Gating Network for Video Action Recognition

The illustration of SDM. Our SDM operates based on intermediate feature differences and increases module robustness through a multi-layer perceptron. Eventually, it is fused with the original input by the residual operation to reduce noise interference