
ResNeSt

A ResNeSt is a variant of a ResNet that stacks Split-Attention blocks in place of standard residual blocks. Within each block, the cardinal group representations are concatenated along the channel dimension: $V = \text{Concat}\{V^{1}, V^{2}, \cdots, V^{K}\}$. As in standard residual blocks, the final output $Y$ of the Split-Attention block is produced using a shortcut connection: $Y = V + X$, if the input and output feature maps share the same shape. For blocks with a stride, an appropriate transformation $\mathcal{T}$ is applied to the shortcut connection to align the output shapes: $Y = V + \mathcal{T}(X)$. For example, $\mathcal{T}$ can be a strided convolution or a combined convolution-with-pooling.
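The concatenation and shortcut rules above can be illustrated with a small, framework-agnostic NumPy sketch. This is not the timm implementation: the function names (`concat_cardinal_groups`, `avg_pool2d`, `conv1x1`, `shortcut`), the `(N, C, H, W)` layout, and the choice of average pooling followed by a 1x1 convolution for $\mathcal{T}$ are assumptions made for illustration, and the group outputs are random placeholders rather than real Split-Attention outputs.

```python
import numpy as np

def concat_cardinal_groups(groups):
    """V = Concat{V^1, ..., V^K}: join the K group outputs along the channel axis."""
    return np.concatenate(groups, axis=1)  # assumed layout: (N, C, H, W)

def avg_pool2d(x, stride):
    """Non-overlapping average pooling over stride x stride windows."""
    n, c, h, w = x.shape
    x = x[:, :, : h - h % stride, : w - w % stride]  # drop any ragged border
    x = x.reshape(n, c, h // stride, stride, w // stride, stride)
    return x.mean(axis=(3, 5))

def conv1x1(x, weight):
    """1x1 convolution as a per-pixel linear map; weight has shape (C_out, C_in)."""
    return np.einsum("oc,nchw->nohw", weight, x)

def shortcut(x, v, weight=None, stride=1):
    """Y = V + X when shapes match, otherwise Y = V + T(X)."""
    if x.shape == v.shape:
        return v + x
    if weight is None:
        raise ValueError("shapes differ, so a projection weight for T is required")
    # T(X): pooling followed by a 1x1 convolution (an assumed choice of T)
    return v + conv1x1(avg_pool2d(x, stride), weight)

rng = np.random.default_rng(0)
x = rng.standard_normal((2, 64, 8, 8))                           # block input X
groups = [rng.standard_normal((2, 64, 4, 4)) for _ in range(2)]  # placeholder V^1, V^2 (K = 2)
v = concat_cardinal_groups(groups)                               # shape (2, 128, 4, 4)
w = rng.standard_normal((128, 64))                               # hypothetical projection weight
y = shortcut(x, v, weight=w, stride=2)
print(y.shape)  # (2, 128, 4, 4)
```

Here the block halves the spatial size and doubles the channel count, so the identity shortcut cannot be used and $\mathcal{T}$ has to adjust both the resolution and the number of channels before the addition.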

{% include 'code_snippets.md' %}

How do I train this model?

You can follow the timm recipe scripts for training a new model afresh.

Citation

@misc{zhang2020resnest,
      title={ResNeSt: Split-Attention Networks}, 
      author={Hang Zhang and Chongruo Wu and Zhongyue Zhang and Yi Zhu and Haibin Lin and Zhi Zhang and Yue Sun and Tong He and Jonas Mueller and R. Manmatha and Mu Li and Alexander Smola},
      year={2020},
      eprint={2004.08955},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}