yfzhang114 (Yi-Fan Zhang) / Starred · GitHub

Organizations

@MME-Benchmarks

Python 28 1 Updated May 28, 2025

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 149 7 Updated May 9, 2025

MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

Python 38 4 Updated Apr 10, 2025

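《代码随想录》 LeetCode practice guide: a recommended order for working through 200 classic problems, about 600,000 characters of detailed illustrated explanations, video walkthroughs of the hard parts, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so that studying algorithms is no longer bewildering! 🔥🔥 Take a look and you'll wish you had found it sooner! 🚀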

Shell 56,767 12,036 Updated Jun 15, 2025

Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

Jupyter Notebook 30 Updated Mar 28, 2025

This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"

Python 71 2 Updated Apr 28, 2025

This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 199 19 Updated Feb 23, 2025

The Next Step Forward in Multimodal LLM Alignment

Python 164 5 Updated May 1, 2025

A PyTorch Library for Multi-Task Learning

Python 2,333 221 Updated May 14, 2025

This is a repository dedicated to organizing articles and related works on Unified Multimodal Model for Understanding and Generation.

1 Updated Mar 11, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

3 Updated Nov 5, 2024

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

445 20 Updated Jun 5, 2025

✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 124 8 Updated Mar 4, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,328 173 Updated Mar 28, 2025

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 159 7 Updated Dec 26, 2024

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

569 22 Updated May 8, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 3,273 609 Updated Jan 24, 2025

This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.

Python 78 3 Updated Feb 22, 2025

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

15,298 1,459 Updated Feb 13, 2023

Recent LLM-based CV and related works. Welcome to comment/contribute!

865 36 Updated Mar 8, 2025

This is an official PyTorch implementation of the NeurIPS 2023 paper "OneNet: Enhancing Time Series Forecasting Models under Concept Drift by Online Ensembling"

Python 117 16 Updated Nov 27, 2024

This is an official PyTorch implementation of the ICML 2023 paper AdaNPC and SIGKDD paper DRM.

Python 85 7 Updated Apr 16, 2024

Contrastive Learning for Domain Adaptation of Time Series

Python 88 13 Updated Apr 11, 2024

This is an official PyTorch implementation of the ICLR 2023 paper "Free Lunch for Domain Adversarial Training: Environment Label Smoothing".

Python 59 3 Updated Feb 4, 2023

Reading notes on a wide range of research covering domain generalization, domain adaptation, causality, robustness, prompts, optimization, and generative models

1,215 103 Updated Dec 14, 2023
Python 33 9 Updated Jul 6, 2022

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 2,048 4,094 Updated Jun 17, 2025

This codebase is the official implementation of Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization (NeurIPS2021, Spotlight)

Python 99 10 Updated Dec 2, 2021

DomainBed is a suite to test domain generalization algorithms

Python 1,518 312 Updated Dec 31, 2024

Transfer learning (迁移学习) / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.

Python 13,961 3,841 Updated Feb 18, 2025