Stars
My learning notes and code for ML systems.
[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).
Curated list of datasets and tools for post-training.
Minimal GRPO implementation from scratch
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
Repo for the paper https://arxiv.org/abs/2504.13837
Awesome RL Reasoning Recipes ("Triple R")
MM-IFEngine: Towards Multimodal Instruction Following
[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.
A Model Context Protocol server for searching and analyzing arXiv papers
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
Official repo of the Griffon series, including v1 (ECCV 2024), v2, and G
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Latest Advances on Long Chain-of-Thought Reasoning
Implementations of few-shot object detection benchmarks
Paper List of Inference/Test Time Scaling/Computing
SpatialLM: Large Language Model for Spatial Understanding
Up-to-date curated list of state-of-the-art research, papers, and resources on hallucination in large vision-language models
Collection of papers and repos for multimodal chain-of-thought
An Arena-style Automated Evaluation Benchmark for Detailed Captioning
Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives
A powerful tool for creating fine-tuning datasets for LLMs
Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths
Fully open data curation for reasoning models