11 KiB

Raw Blame History

ResNeSt

A ResNeSt is a variant on a ResNet, which instead stacks Split-Attention blocks. The cardinal group representations are then concatenated along the channel dimension: V = \text{Concat}{V^{1},V^{2},\cdots{V}^{K}}. As in standard residual blocks, the final output Y of otheur Split-Attention block is produced using a shortcut connection: Y=V+X, if the input and output feature-map share the same shape. For blocks with a stride, an appropriate transformation \mathcal{T} is applied to the shortcut connection to align the output shapes: Y=V+\mathcal{T}(X). For example, \mathcal{T} can be strided convolution or combined convolution-with-pooling.

{% include 'code_snippets.md' %}

How do I train this model?

You can follow the timm recipe scripts for training a new model afresh.

Citation

@misc{zhang2020resnest,
      title={ResNeSt: Split-Attention Networks}, 
      author={Hang Zhang and Chongruo Wu and Zhongyue Zhang and Yi Zhu and Haibin Lin and Zhi Zhang and Yue Sun and Tong He and Jonas Mueller and R. Manmatha and Mu Li and Alexander Smola},
      year={2020},
      eprint={2004.08955},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

11 KiB Raw Blame History

ResNeSt

How do I train this model?

Citation

11 KiB

Raw Blame History