pytorch-image-models/timm/models/layers/patch_embed.py

""" Image to Patch Embedding using Conv2d

A convolution based approach to patchifying a 2D image w/ embedding projection.

Based on the impl in https://github.com/google-research/vision_transformer

Hacked together by / Copyright 2020 Ross Wightman
"""
from torch import nn as nn

from .helpers import to_2tuple
from .trace_utils import _assert


class PatchEmbed(nn.Module):
    """ 2D Image to Patch Embedding
    """
    def __init__(
            self,
            img_size=224,
            patch_size=16,
            in_chans=3,
            embed_dim=768,
            norm_layer=None,
            flatten=True,
            bias=True,
    ):
        super().__init__()
        img_size = to_2tuple(img_size)
        patch_size = to_2tuple(patch_size)
        self.img_size = img_size
        self.patch_size = patch_size
        self.grid_size = (img_size[0] // patch_size[0], img_size[1] // patch_size[1])
        self.num_patches = self.grid_size[0] * self.grid_size[1]
        self.flatten = flatten

        self.proj = nn.Conv2d(in_chans, embed_dim, kernel_size=patch_size, stride=patch_size, bias=bias)
        self.norm = norm_layer(embed_dim) if norm_layer else nn.Identity()

    def forward(self, x):
        B, C, H, W = x.shape
        _assert(H == self.img_size[0], f"Input image height ({H}) doesn't match model ({self.img_size[0]}).")
        _assert(W == self.img_size[1], f"Input image width ({W}) doesn't match model ({self.img_size[1]}).")
        x = self.proj(x)
        if self.flatten:
            x = x.flatten(2).transpose(1, 2)  # BCHW -> BNC
        x = self.norm(x)
        return x
Move Mlp and PatchEmbed modules into layers. Being used in lots of models now... 4 years ago			`""" Image to Patch Embedding using Conv2d`

			`A convolution based approach to patchifying a 2D image w/ embedding projection.`

			`Based on the impl in https://github.com/google-research/vision_transformer`

			`Hacked together by / Copyright 2020 Ross Wightman`
			`"""`
			`from torch import nn as nn`

			`from .helpers import to_2tuple`
Fix #954 by bringing traceable _assert into timm to allow compat w/ PyTorch < 1.8 3 years ago			`from .trace_utils import _assert`
Move Mlp and PatchEmbed modules into layers. Being used in lots of models now... 4 years ago

			`class PatchEmbed(nn.Module):`
			`""" 2D Image to Patch Embedding`
			`"""`
Adding support for fine-tune CLIP LAION-2B image tower weights for B/32, L/14, H/14 and g/14. Still WIP 2 years ago			`def __init__(`
			`self,`
			`img_size=224,`
			`patch_size=16,`
			`in_chans=3,`
			`embed_dim=768,`
			`norm_layer=None,`
			`flatten=True,`
			`bias=True,`
			`):`
Move Mlp and PatchEmbed modules into layers. Being used in lots of models now... 4 years ago			`super().__init__()`
			`img_size = to_2tuple(img_size)`
			`patch_size = to_2tuple(patch_size)`
			`self.img_size = img_size`
			`self.patch_size = patch_size`
Rethink name of patch embed grid info 4 years ago			`self.grid_size = (img_size[0] // patch_size[0], img_size[1] // patch_size[1])`
			`self.num_patches = self.grid_size[0] * self.grid_size[1]`
Add levit, levit_c, and visformer model defs. Largely untested and not finished cleanup. 4 years ago			`self.flatten = flatten`
Move Mlp and PatchEmbed modules into layers. Being used in lots of models now... 4 years ago
Adding support for fine-tune CLIP LAION-2B image tower weights for B/32, L/14, H/14 and g/14. Still WIP 2 years ago			`self.proj = nn.Conv2d(in_chans, embed_dim, kernel_size=patch_size, stride=patch_size, bias=bias)`
Move Mlp and PatchEmbed modules into layers. Being used in lots of models now... 4 years ago			`self.norm = norm_layer(embed_dim) if norm_layer else nn.Identity()`

			`def forward(self, x):`
			`B, C, H, W = x.shape`
Fix #954 by bringing traceable _assert into timm to allow compat w/ PyTorch < 1.8 3 years ago			`_assert(H == self.img_size[0], f"Input image height ({H}) doesn't match model ({self.img_size[0]}).")`
			`_assert(W == self.img_size[1], f"Input image width ({W}) doesn't match model ({self.img_size[1]}).")`
Add levit, levit_c, and visformer model defs. Largely untested and not finished cleanup. 4 years ago			`x = self.proj(x)`
			`if self.flatten:`
			`x = x.flatten(2).transpose(1, 2) # BCHW -> BNC`
Move Mlp and PatchEmbed modules into layers. Being used in lots of models now... 4 years ago			`x = self.norm(x)`
			`return x`