Stars
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications made by Transformer-based networks.
The official repo for "Where do Large Vision-Language Models Look at when Answering Questions?"
[NAACL 2025 Oral] 🎉 From redundancy to relevance: Enhancing explainability in multimodal large language models
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-…
[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models
Stanford NLP Python library for understanding and improving PyTorch models via interventions
Strong, open-source foundation models for image recognition.
Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
VIP cheatsheets for Stanford's CS 229 Machine Learning
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models
ECSO (makes MLLMs safe with neither training nor any external models!) (https://arxiv.org/abs/2403.09572)
An up-to-date curated list of state-of-the-art research, papers, and resources on hallucinations in large vision-language models
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
Recommends new arXiv papers of interest daily, based on your Zotero library.
Use PEFT or full-parameter training to run CPT/SFT/DPO/GRPO on 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
An RLHF Infrastructure for Vision-Language Models
pix2tex: Using a ViT to convert images of equations into LaTeX code (a minimal usage sketch follows this list).
Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection
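For the pix2tex entry above, here is a minimal usage sketch following the package's documented Python API; it assumes pix2tex is installed (`pip install pix2tex`), and the filename `equation.png` is a hypothetical local screenshot of a rendered equation.

```python
from PIL import Image
from pix2tex.cli import LatexOCR

# Hypothetical local screenshot of a rendered equation.
img = Image.open("equation.png")

# LatexOCR loads the pretrained ViT-encoder / Transformer-decoder checkpoint.
model = LatexOCR()

# Calling the model on a PIL image returns the predicted LaTeX source as a string.
print(model(img))
```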