GukehAn

Zhichao Chen GukehAn

6 followers · 64 following

Stars

LMMMEng / OverLoCK

[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

Python 197 20 Updated May 15, 2025

LMMMEng / TransXNet

[TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition

Python 195 16 Updated Apr 22, 2025

raoyongming / HorNet

[NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

Python 335 37 Updated Dec 21, 2023

naver-ai / rdnet

[ECCV2024] Official implementation of paper, "DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs".

Python 148 7 Updated Aug 8, 2024

qhfan / FAT

[NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction

Python 24 3 Updated Oct 27, 2023

qhfan / CloFormer

The official code of "Rethinking Local Perception in Lightweight Vision Transformer"

Python 87 7 Updated May 11, 2023

rayleizhu / BiFormer

[CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"

Python 544 41 Updated May 22, 2023

hhb072 / STViT

Python 140 6 Updated Jun 25, 2024

sangnekim / SMPConv

[CVPR2023] "SMPConv: Self-moving Point Representations for Continuous Convolution"

HTML 57 2 Updated May 29, 2023

microsoft / FocalNet

[NeurIPS 2022] Official code for "Focal Modulation Networks"

Python 732 63 Updated Nov 7, 2023

baofff / U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 1,006 72 Updated Mar 25, 2023

jianlong-yuan / UniNeXt

Jupyter Notebook 44 7 Updated Mar 15, 2023

mmaaz60 / EdgeNeXt

[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".

Python 375 42 Updated Jul 25, 2023

ViTAE-Transformer / ViTAE-VSA

The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"

Python 158 9 Updated Mar 17, 2023

huawei-noah / Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Python 4,218 719 Updated Mar 15, 2025

whai362 / PVT

Official implementation of PVT series

Python 1,804 251 Updated Oct 27, 2022

microsoft / Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 14,772 2,136 Updated Jul 24, 2024

qhfan / RALA

[CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention

Python 20 Updated Mar 11, 2025

maryam089 / SDViT

Official repository for "Self-Distilled Vision Transformer for Domain Generalization" (ACCV-2022 ORAL)

Python 39 7 Updated Dec 2, 2022

maclong01 / DeBiFormer

[ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"

Python 29 2 Updated Jan 8, 2025

NVlabs / GCVit

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

Python 435 50 Updated Dec 22, 2023

AFeng-x / SMT

[ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".

Python 204 15 Updated Aug 1, 2023

qhfan / RMT

(CVPR2024)RMT: Retentive Networks Meet Vision Transformer

Python 349 25 Updated Jul 29, 2024

Meshcapade / difflocks

Code for our CVPR'25 paper - "DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models"

C++ 45 1 Updated May 12, 2025

modelscope / Nexus-Gen

Python 176 10 Updated May 14, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,256 2,230 Updated Feb 1, 2025

csyxwei / Awesome-Personalized-Image-Generation

A collection of resources on personalized image generation.

142 5 Updated Apr 19, 2025

synth-forge / synthforge-generate

Dataset generation scripts for SynthForge dataset

Python 4 Updated Jun 13, 2024

synth-forge / synthforge-train

Code for SynthForge: Synthesizing High-Quality Face Dataset with Controllable 3D Generative Models

Jupyter Notebook 1 Updated Jun 13, 2024

wolo-wolo / FSFM

FSvFM: A Generalizable Face Security vision Foundation Model via Self-Supervised Facial Representation Learning (CVPR25)

Python 63 7 Updated Mar 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly