
[Feature] Support ViT-Adapter #9354


Open · wants to merge 8 commits into base: dev-3.x

Conversation

@okotaku (Contributor) commented Nov 21, 2022

Motivation

paper: https://arxiv.org/abs/2205.08534
code: https://github.com/czczup/ViT-Adapter
issue: #9044

Related PR

open-mmlab/mmpretrain#1209
open-mmlab/mmcv#2451
open-mmlab/mmcv#2452

Result

| Backbone | Impl. | box AP | mask AP |
| :------: | :------: | :----: | :-----: |
| DeiT-T | official | 46.0 | 41.0 |
| DeiT-T | mmdet | 45.6 | 40.9 |
| BEiT-B | mmdet | 48.7 | 43.1 |

Checklist

  1. Pre-commit or other linting tools are used to fix the potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit tests to ensure correctness.
  3. If the modification has potential influence on downstream projects, this PR should be tested with downstream projects, like MMDet or MMCls.
  4. The documentation has been modified accordingly, like docstring or example tutorials.

@okotaku changed the title from Vitadapter to [Feature] Support ViT-Adapter on Nov 21, 2022
@ZwwWayne requested a review from hhaAndroid on November 21, 2022 02:27
@ZwwWayne added this to the 3.0.0rc5 milestone on Nov 21, 2022
@ZwwWayne (Collaborator) commented

Hi @okotaku,
Thanks for your kind PR. Overall, the code and config look good to us. May I ask why the performance drops and whether we can fix it?

@ZwwWayne assigned hhaAndroid and unassigned Czm369 on Nov 21, 2022
@okotaku (Contributor, Author) commented Nov 21, 2022

> May I ask why the performance drops and whether we can fix it?

I do not yet know the cause of the performance drop.

One thing I wondered about is the two variants of window attention in the ViT-Adapter code:

https://github.com/czczup/ViT-Adapter/blob/main/detection/mmdet_custom/models/backbones/base/vit.py#L123
https://github.com/czczup/ViT-Adapter/blob/main/detection/mmdet_custom/models/backbones/base/vit.py#L170

However, I concluded that there is no functional difference between them, and implemented window attention following the ViTDet implementation in detectron2:

https://github.com/facebookresearch/detectron2/blob/main/detectron2/modeling/backbone/vit.py#L148

I will continue to investigate, but if you notice anything, please let me know.
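
For context, the sketch below shows the window partition/unpartition scheme used by the referenced ViTDet code in detectron2; it is a simplified illustration of the technique, not this PR's actual implementation:

```python
# Minimal sketch of ViTDet-style window attention support
# (after the helpers used by detectron2's vit.py); simplified for illustration.
import torch
import torch.nn.functional as F


def window_partition(x: torch.Tensor, window_size: int):
    """Split (B, H, W, C) features into non-overlapping windows.

    Returns windows of shape (B * num_windows, window_size, window_size, C)
    and the padded size (Hp, Wp) needed to undo the padding later.
    """
    B, H, W, C = x.shape
    pad_h = (window_size - H % window_size) % window_size
    pad_w = (window_size - W % window_size) % window_size
    if pad_h or pad_w:
        # Pad the H and W axes up to multiples of window_size.
        x = F.pad(x, (0, 0, 0, pad_w, 0, pad_h))
    Hp, Wp = H + pad_h, W + pad_w
    x = x.view(B, Hp // window_size, window_size, Wp // window_size, window_size, C)
    windows = x.permute(0, 1, 3, 2, 4, 5).reshape(-1, window_size, window_size, C)
    return windows, (Hp, Wp)


def window_unpartition(windows, window_size, pad_hw, hw):
    """Inverse of window_partition; crops the padding back off."""
    Hp, Wp = pad_hw
    H, W = hw
    B = windows.shape[0] // (Hp * Wp // window_size // window_size)
    x = windows.reshape(B, Hp // window_size, Wp // window_size,
                        window_size, window_size, -1)
    x = x.permute(0, 1, 3, 2, 4, 5).reshape(B, Hp, Wp, -1)
    return x[:, :H, :W, :].contiguous()
```

In a block with window attention, the feature map is partitioned before the attention call and unpartitioned afterwards, so the two ViT-Adapter variants linked above should differ only in how they reshape, not in what the attention computes.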

@okotaku (Contributor, Author) commented Nov 23, 2022

I found two differences.

  1. The dropout rate in MultiScaleDeformableAttention: official = 0.0, mine = 0.1 (the mmcv default).
  2. Whether layer_scale is used: official = with layer_scale, mine = without layer_scale.

I will fix these, retrain, and check the mAP again; a sketch of the corresponding config changes is below.
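
For concreteness, the two settings could be pinned in the config roughly as follows. `dropout` is a real argument of mmcv's MultiScaleDeformableAttention (default 0.1), while the backbone option names (`deform_attn_cfg`, `layer_scale_init_value`) and the registry name are hypothetical, since the mmcls backbone PR is not merged yet:

```python
# Sketch only: aligning the two settings with the official ViT-Adapter code.
model = dict(
    backbone=dict(
        type='ViTAdapter',  # assumed registry name
        # 1. Match the official dropout rate inside the deformable attention.
        deform_attn_cfg=dict(  # hypothetical option name
            type='MultiScaleDeformableAttention',
            dropout=0.0),  # official = 0.0; mmcv default = 0.1
        # 2. Enable layer_scale as in the official implementation.
        layer_scale_init_value=1e-6))  # hypothetical option name
```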

@okotaku changed the title from [Feature] Support ViT-Adapter to [WIP][Feature] Support ViT-Adapter on Nov 24, 2022
@okotaku (Contributor, Author) commented Dec 1, 2022

In my experiments, layer scale made the mAP worse. With the latest config I recorded box AP = 45.6 and mask AP = 40.9, which is close to the official performance.
Once the code on the mmcls side is refactored, I will remove the WIP status.

@okotaku changed the title from [WIP][Feature] Support ViT-Adapter to [Feature] Support ViT-Adapter on Dec 20, 2022

| Backbone | Lr schd | Mem (GB) | Inf time (fps) | box AP | mask AP | Config | Download |
| :------: | :-----: | :------: | :------------: | :----: | :-----: | :----------------------------------------------------: | :----------------------: |
| DeiT-T | 3x | | | | | [config](./mask-rcnn_vitadapter-deit-t_fpn_3x_coco.py) | [model](<>) \| [log](<>) |
Collaborator (review comment on the README table above):

We can update this README.

@ZwwWayne (Collaborator) commented

Hi @okotaku,
Thanks for your kind PR. We plan to merge it this week. Would you like to simply put these files under `projects/`, as was done for ConvNeXt V2? That way you would not need to update the metafile.yaml, and the PR could be merged quickly.

@ZwwWayne assigned zwhus and unassigned hhaAndroid on Jan 16, 2023
@okotaku (Contributor, Author) commented Jan 17, 2023

@ZwwWayne I understand.
However, the mmcls PRs have not been merged yet, so we may have to wait for that.

@ZwwWayne (Collaborator) commented

> @ZwwWayne I understand. However, the mmcls PRs have not been merged yet, so we may have to wait for that.

Hi @okotaku,
Thanks for your quick response. I had missed the situation in mmcls and have reminded them. Could you also move the files into the `projects/` folder? The expected file structure looks like this:

|-- .gitignore
|-- projects
    |-- ViTAdapter
        |-- configs
            |-- mask-rcnn_beitadapter-b_fpn_3x_coco.py
        |-- README.md
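
For illustration, configs under `projects/` in mmdet 3.x usually inherit the base configs through the `mmdet::` scope and register the project's local modules via `custom_imports`. In the sketch below, the module path, backbone name, and options are assumptions, not the PR's final config:

```python
# Hypothetical projects/ViTAdapter/configs/mask-rcnn_beitadapter-b_fpn_3x_coco.py
_base_ = [
    'mmdet::_base_/models/mask-rcnn_r50_fpn.py',
    'mmdet::common/ms-poly_3x_coco-instance.py',
]

# Make the modules shipped inside projects/ViTAdapter visible to the registry.
custom_imports = dict(
    imports=['projects.ViTAdapter.vitadapter'],  # assumed package path
    allow_failed_imports=False)

model = dict(
    backbone=dict(
        _delete_=True,      # drop the ResNet-50 backbone from the base config
        type='ViTAdapter',  # assumed registry name
        # adapter-specific options would go here
    ))
```

Keeping everything inside `projects/ViTAdapter` avoids touching mmdet's model zoo metadata, which is why such a PR can be merged quickly.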
