-
Notifications
You must be signed in to change notification settings - Fork 9.7k
[Feature] Support ViT-Adapter #9354
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: dev-3.x
Are you sure you want to change the base?
Conversation
Hi @okotaku , |
I do not know the cause of the drop in performance at this time. I was wondering about the two types of window attention implemented in the ViT-Adapter implementation. https://github.com/czczup/ViT-Adapter/blob/main/detection/mmdet_custom/models/backbones/base/vit.py#L123 However, I have decided that there is no differe 8000 nce and have implemented window attention with reference to the implementation of vitdet in detectron2. https://github.com/facebookresearch/detectron2/blob/main/detectron2/modeling/backbone/vit.py#L148 I will continue to investigate, but if there is anything you notice, please let me know. |
I found two differences.
I will fix these, train and check mAP again. |
In my experiments, layer scale made mAP worse. I recorded box AP= 45.6 and mask AP= 40.9 with the latest config, which is close to the official performance. |
configs/vitadapter/README.md
Outdated
|
||
| Backbone | Lr schd | Mem (GB) | Inf time (fps) | box AP | mask AP | Config | Download | | ||
| :------: | :-----: | :------: | :------------: | :----: | :-----: | :----------------------------------------------------: | :----------------------: | | ||
| DeiT-T | 3x | | | | | [config](./mask-rcnn_vitadapter-deit-t_fpn_3x_coco.py) | [model](<>) \| [log](<>) | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We can update this readme
Hi @okotaku , |
@ZwwWayne I understand. |
Hi @okotaku ,
|
Motivation
paper: https://arxiv.org/abs/2205.08534
code: https://github.com/czczup/ViT-Adapter
issue: #9044
Related PR
open-mmlab/mmpretrain#1209
open-mmlab/mmcv#2451
open-mmlab/mmcv#2452
Result
Checklist