-
XTU
-
05:39
(UTC -12:00) - https://xiaolingdudu.github.io/
Lists (2)
Sort Name ascending (A-Z)
Starred repositories
A Collection of Papers and Codes in CVPR2023/2022 about low level vision
A Collection of Papers and Codes in ECCV2022 about low level vision
[ECCV‘24] Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
🔥 [ICLR 2025] FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Official implementation of Scalable Transformer for PDE surrogate modelling
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Source codes for the paper "Deep Learning Algorithms for Rotating Machinery Intelligent Diagnosis: An Open Source Benchmark Study"
An official code for paper: TFPred: Learning discriminative representations from unlabeled data for few-label rotating machinery fault diagnosis
A transfer learning fault diagnosis repository covering popular algorithms
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
[NeurIPS'23] Emergent Correspondence from Image Diffusion
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
Official repo for FaceShot: Bring Any Character into Life
A project that can generate ancient poems based on pictures, including CLIP, T5, GPT2 models
ICML 2025: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence
StyTr2 : Image Style Transfer with Transformers
Neural Style and MSG-Net