Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hyperparameter request for reproducibility #134

Open
ovunctuzel-bc opened this issue Aug 16, 2024 · 2 comments
Open

Hyperparameter request for reproducibility #134

ovunctuzel-bc opened this issue Aug 16, 2024 · 2 comments

Comments

@ovunctuzel-bc
Copy link

ovunctuzel-bc commented Aug 16, 2024

I'm training the segmentation EfficientViT B1 on Cityscapes, and achieving ~0.6 mIoU, however the reported results are around 0.8 mIoU.

Would you be able to offer some guidance or share more details around the hyperparameters?

My setup is:

  • 1024 x 2048 resolution
  • Backbone starts from the ImageNet checkpoints you have provided
  • Num Epochs: 100
  • LR: 0.005 with cosine annealing to 0
  • Batch Size: 2 (I'm limited by hardware at this resolution)
  • AdamW optimizer
  • Focal Loss w/ equal class weights

Any augmentations? Anything else that can help?

@Sanath1998
Copy link

I'm training the segmentation EfficientViT B1 on Cityscapes, and achieving ~0.6 mIoU, however the reported results are around 0.8 mIoU.

Would you be able to offer some guidance or share more details around the hyperparameters?

My setup is:

  • 1024 x 2048 resolution
  • Backbone starts from the ImageNet checkpoints you have provided
  • Num Epochs: 100
  • LR: 0.005 with cosine annealing to 0
  • Batch Size: 2 (I'm limited by hardware at this resolution)
  • AdamW optimizer
  • Focal Loss w/ equal class weights

Any augmentations? Anything else that can help?

Hi @ovunctuzel-bc , this seems good hyperparameters for training. First of all one thing to ask, there is no official release of training code for segmentation EfficientViT except for SAM variant right? How did u get the code reference. If you just guide then it would be very useful for me as well.

@ovunctuzel-bc
Copy link
Author

A fairly standard pytorch training loop seems to work fine. The results are satisfactory but not quite at the level of the pretrained model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants