Jan 11, 2023 Update ConvNeXt ImageNet-12k pretrain series w/ two new fine-tuned weights (and pre FT .in12k tags) convnext_nano.in12k_ft_in1k - 82.3 @ 224, 82.9 @ 288 (previously released) convnext_tiny.in12k_ft_in1k - 84.2 @ 224, 84.5 @ 288 convnext_small.in12k_ft_in1k - 85.2 @ 224, 85.3 @ 288 Jan 6, 2023 Finally got around to adding --model-kwargs and --opt-kwargs to scripts to pass through rare args directly to model classes from cmd line train.py /imagenet --model resnet50 --amp --model-kwargs output_stride=16 act_layer=silu train.py /imagenet --model vit_base_patch16_clip_224 --img-size 240 --amp --model-kwargs img_size=240 patch_size=12 Cleanup some popular models to better support arg passthrough / merge with model configs, mo...
More weights for 3rd party ViT / ViT-CNN hybrids that needed remapping / re-hosting EfficientFormer ...
Weights from https://github.com/naver-ai/pit Copyright 2021-present NAVER Corp. Rehosted here for ea...
Weights for ResNet-RS models as per #554 . Ported from Tensorflow impl (https://github.com/tensorflo...
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, ...
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, ...
Minor bug fixes and a few more weights since 0.6.5 A few more weights & model defs added: darknetaa...
Minor bug fixes to HF push_to_hub, plus some more MaxVit weights Oct 10, 2022 More weights in maxxv...
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, ...
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, ...
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, ...
Changes Since 0.6.7 Sept 23, 2022 CLIP LAION-2B pretrained B/32, L/14, H/14, and g/14 image tower w...
Vision Transformer AugReg weights and model defs (https://arxiv.org/abs/2106.10270) ResMLP official ...
A wide range of mid-large sized models trained in PyTorch XLA on TPU VM instances. Demonstrating via...
CoAtNet (https://arxiv.org/abs/2106.04803) and MaxVit (https://arxiv.org/abs/2204.01697) timm traine...
Weights from https://github.com/google/automl/tree/master/efficientnetv2 Paper: EfficientNetV2: Smal...
More weights for 3rd party ViT / ViT-CNN hybrids that needed remapping / re-hosting EfficientFormer ...
Weights from https://github.com/naver-ai/pit Copyright 2021-present NAVER Corp. Rehosted here for ea...
Weights for ResNet-RS models as per #554 . Ported from Tensorflow impl (https://github.com/tensorflo...
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, ...
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, ...
Minor bug fixes and a few more weights since 0.6.5 A few more weights & model defs added: darknetaa...
Minor bug fixes to HF push_to_hub, plus some more MaxVit weights Oct 10, 2022 More weights in maxxv...
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, ...
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, ...
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, ...
Changes Since 0.6.7 Sept 23, 2022 CLIP LAION-2B pretrained B/32, L/14, H/14, and g/14 image tower w...
Vision Transformer AugReg weights and model defs (https://arxiv.org/abs/2106.10270) ResMLP official ...
A wide range of mid-large sized models trained in PyTorch XLA on TPU VM instances. Demonstrating via...
CoAtNet (https://arxiv.org/abs/2106.04803) and MaxVit (https://arxiv.org/abs/2204.01697) timm traine...
Weights from https://github.com/google/automl/tree/master/efficientnetv2 Paper: EfficientNetV2: Smal...
More weights for 3rd party ViT / ViT-CNN hybrids that needed remapping / re-hosting EfficientFormer ...
Weights from https://github.com/naver-ai/pit Copyright 2021-present NAVER Corp. Rehosted here for ea...
Weights for ResNet-RS models as per #554 . Ported from Tensorflow impl (https://github.com/tensorflo...