A big thank you to my GitHub Sponsors for their support!
In addition to the sponsors at the link above, I've received hardware and/or cloud resources from
I'm fortunate to be able to dedicate significant time and money of my own supporting this and other open source projects. However, as the projects increase in scope, outside support is needed to continue with the current trajectory of hardware, infrastructure, and electricty costs.
benchmark.py
script for bulk timm
model benchmarking of train and/or inferencetimm
cleanup/style tweaks and weights have hub download supportnfnet_l0b
->nfnet_l0
) weights 82.75 top-1 @ 288x288nfnet_l0c
->eca_nfnet_l0
) weights trained by myself.
dm_nfnet_f0
and 50% faster than nfnet_f0s
w/ 1/3 param countdm_
. They require SAME padding conv, skipinit enabled, and activation gains applied in act fn.s
variants.dm_nfnet_f6
- 86.352dm_nfnet_f5
- 86.100dm_nfnet_f4
- 85.834dm_nfnet_f3
- 85.676dm_nfnet_f2
- 85.178dm_nfnet_f1
- 84.696dm_nfnet_f0
- 83.464--clip-grad .01 --clip-mode agc
--clip-grad 1.0
--clip-grad 10. --clip-mode value
byobnet.py
byobnet.py
vgg.py
)--channels-last
and --torchscript
model training, APEX does not.ecaresnet26t
- 79.88 top-1 @ 320x320, 79.08 @ 256x256ecaresnet50t
- 82.35 top-1 @ 320x320, 81.52 @ 256x256ecaresnet269d
- 84.93 top-1 @ 352x352, 84.87 @ 320x320t
) vs tiered_narrow (tn
) ResNet model defs, all tn
changed to t
and t
models removed (seresnext26t_32x4d
only model w/ weights that was removed).test_input_size
and remove extra _320
suffix ResNet model defs that were just for test.train.py /data/tfds --dataset tfds/oxford_iiit_pet --val-split test --model resnet50 -b 256 --amp --num-classes 37 --opt adamw --lr 3e-4 --weight-decay .001 --pretrained -j 2
validate.py /data/fall11_whole.tar --model resnetv2_50x1_bitm_in21k --amp
efficientnet_em
) model trained in PyTorch, 79.3 top-1--channels-last
, --native-amp
vs --apex-amp
)PyTorch Image Models (timm
) is a collection of image models, layers, utilities, optimizers, schedulers, data-loaders / augmentations, and reference training / validation scripts that aim to pull together a wide variety of SOTA models with ability to reproduce ImageNet training results.
The work of many others is present here. I've tried to make sure all source material is acknowledged via links to github, arxiv papers, etc in the README, documentation, and code docstrings. Please let me know if I missed anything.
All model architecture families include variants with pretrained weights. There are specific model variants without any weights, it is NOT a bug. Help training new or better weights is always appreciated. Here are some example training hparams to get you started.
A full version of the list below with source links can be found in the documentation.
Several (less common) features that I often utilize in my projects are included. Many of their additions are the reason why I maintain my own set of models, instead of using others' via PIP:
get_classifier
and reset_classifier
forward_features
(see documentation)create_model(name, features_only=True, out_indices=..., output_stride=...)
out_indices
creation arg specifies which feature maps to return, these indices are 0 based and generally correspond to the C(i + 1)
feature level.output_stride
creation arg controls output stride of the network by using dilated convolutions. Most networks are stride 32 by default. Not all networks support this..feature_info
memberstep
, cosine
w/ restarts, tanh
w/ restarts, plateau
rmsprop_tf
adapted from PyTorch RMSProp by myself. Reproduces much improved Tensorflow RMSProp behaviour.radam
by Liyuan Liu (https://arxiv.org/abs/1908.03265)novograd
by Masashi Kimura (https://arxiv.org/abs/1905.11286)lookahead
adapted from impl by Liam (https://arxiv.org/abs/1907.08610)fused<name>
optimizers by name with NVIDIA Apex installedadamp
and sgdp
by Naver ClovAI (https://arxiv.org/abs/2006.08217)adafactor
adapted from FAIRSeq impl (https://arxiv.org/abs/1804.04235)adahessian
by David Samuel (https://arxiv.org/abs/2006.00719)Model validation results can be found in the documentation and in the results tables
My current documentation for timm
covers the basics.
timmdocs is quickly becoming a much more comprehensive set of documentation for timm
. A big thanks to Aman Arora for his efforts creating timmdocs.
paperswithcode is a good resource for browsing the models within timm
.
The root folder of the repository contains reference train, validation, and inference scripts that work with the included models and other features of this repository. They are adaptable for other datasets and use cases with a little hacking. See documentation for some basics and training hparams for some train examples that produce SOTA ImageNet results.
One of the greatest assets of PyTorch is the community and their contributions. A few of my favourite resources that pair well with the models and componenets here are listed below.
The code here is licensed Apache 2.0. I've taken care to make sure any third party code included or adapted has compatible (permissive) licenses such as MIT, BSD, etc. I've made an effort to avoid any GPL / LGPL conflicts. That said, it is your responsibility to ensure you comply with license here and conditions of any dependent licenses. Where applicable, I've linked the sources/references for various components in docstrings. If you think I've missed anything please create an issue.
So far all of the pretrained weights available here are pretrained on ImageNet with a select few that have some additional pretraining (see extra note below). ImageNet was released for non-commercial research purposes only (http://www.image-net.org/download-faq). It's not clear what the implications of that are for the use of pretrained weights from that dataset. Any models I have trained with ImageNet are done for research purposes and one should assume that the original dataset license applies to the weights. It's best to seek legal advice if you intend to use the pretrained weights in a commercial product.
Several weights included or references here were pretrained with proprietary datasets that I do not have access to. These include the Facebook WSL, SSL, SWSL ResNe(Xt) and the Google Noisy Student EfficientNet models. The Facebook models have an explicit non-commercial license (CC-BY-NC 4.0, https://github.com/facebookresearch/semi-supervised-ImageNet1K-models, https://github.com/facebookresearch/WSL-Images). The Google models do not appear to have any restriction beyond the Apache 2.0 license (and ImageNet concerns). In either case, you should contact Facebook or Google with any questions.
@misc{rw2019timm,
author = {Ross Wightman},
title = {PyTorch Image Models},
year = {2019},
publisher = {GitHub},
journal = {GitHub repository},
doi = {10.5281/zenodo.4414861},
howpublished = {\url{https://github.com/rwightman/pytorch-image-models}}
}
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。