8000 2811668688 (Hu Ruofan) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View 2811668688's full-sized avatar
  • Zhe Jiang University
  • China

Block or report 2811668688

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Leveraging passage embeddings for efficient listwise reranking with large language models.

Python 43 2 Updated Dec 7, 2024

ICCV 2023 Paper Global Features are All You Need for Image Retrieval and Reranking Official Repository

Python 229 17 Updated Sep 14, 2023

Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training.

Python 78 5 Updated Nov 15, 2024

Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval

Python 50 4 Updated Oct 23, 2024

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 85,789 9,992 Updated May 19, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,898 166 Updated May 28, 2025

Parsing-free RAG supported by VLMs

Python 719 56 Updated Feb 19, 2025

LLM2CLIP makes SOTA pretrained CLIP model more SOTA ever.

Python 518 24 Updated Mar 24, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 1,929 81 Updated May 21, 2025

"MiniRAG: Making RAG Simpler with Small and Free Language Models"

Python 1,121 137 Updated May 12, 2025

s1: Simple test-time scaling

Python 6,414 748 Updated May 19, 2025

R1V, trained with AI feedback, answers open-ended visual questions.

Python 13 1 Updated Apr 12, 2025

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 769 97 Updated May 30, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,723 772 Updated May 15, 2025

Fully open reproduction of DeepSeek-R1

Python 24,619 2,273 Updated May 28, 2025

[ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.

Python 14 Updated Mar 2, 2024

Witness the aha moment of VLM with less than $3.

Python 3,703 285 Updated May 19, 2025

ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities

Python 40 2 Updated Sep 3, 2024
HTML 38 3 Updated Aug 15, 2023

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,315 2,240 Updated Feb 1, 2025

Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Python 664 67 Updated Sep 19, 2024

Solve Visual Understanding with Reinforced VLMs

Python 5,025 308 Updated May 11, 2025

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 3,819 475 Updated May 28, 2025

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,073 86 Updated Apr 3, 2025

Audio Captioning datasets for PyTorch.

Python 117 8 Updated May 26, 2025

This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and continuously update our survey, we maintain this repository of rel…

70 4 Updated Jul 26, 2024

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 134 2 Updated Dec 13, 2024

code for A Large-scale Dataset for Audio-Language Representation Learning

C 13 Updated Sep 18, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 7,856 666 Updated May 30, 2025
Next
BB7
0