8000 Chen-Boxu / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Chen-Boxu's full-sized avatar

Block or report Chen-Boxu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,123 6,296 Updated Jun 12, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,507 390 Updated Jun 12, 2025

https://arxiv.org/abs/2408.02032

Python 108 6 Updated Jan 16, 2025

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Jupyter Notebook 1,895 251 Updated Jan 24, 2024

The official repo for "Where do Large Vision-Language Models Look at when Answering Questions?"

Python 37 2 Updated May 20, 2025

[NAACL 2025 Oral] 🎉 From redundancy to relevance: Enhancing explainability in multimodal large language models

Python 98 8 Updated Feb 13, 2025

A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)

Python 147 10 Updated Mar 26, 2025

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-…

Jupyter Notebook 852 110 Updated Aug 24, 2023

[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models

Python 30 2 Updated Feb 16, 2025

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 750 82 Updated Jun 2, 2025
Python 56 3 Updated Nov 5, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,281 306 Updated Feb 18, 2025

Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering

Python 57 4 Updated Nov 23, 2024

[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs

Python 122 5 Updated Nov 6, 2024

VIP cheatsheets for Stanford's CS 229 Machine Learning

18,228 4,024 Updated May 20, 2020

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

HTML 1,426 72 Updated Jun 8, 2025

A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.

Jupyter Notebook 72 15 Updated Mar 7, 2025

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python 636 30 Updated Dec 23, 2024

ECSO (Make MLLM safe without neither training nor any external models!) (https://arxiv.org/abs/2403.09572)

Python 24 1 Updated Nov 2, 2024

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

137 7 Updated May 10, 2025

An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation

Python 121 2 Updated Jan 15, 2024
Python 99 2 Updated Dec 22, 2023

Recommend new arxiv papers of your interest daily according to your Zotero libarary.

Python 1,367 1,232 Updated May 30, 2025

Preference Learning for LLaVA

Python 46 Updated Nov 9, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,055 693 Updated Jun 12, 2025

[ICCV 2017] Torch code for Grad-CAM

Lua 1,564 229 Updated Sep 17, 2022

A RLHF Infrastructure for Vision-Language Models

Python 176 7 Updated Nov 15, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 14,523 1,158 Updated Jan 18, 2025
Jupyter Notebook 95 11 Updated Feb 11, 2025

Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection

OpenEdge ABL 28 Updated Mar 13, 2025
Next
0