8000 xin-li-67 (Xin Li) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View xin-li-67's full-sized avatar

Block or report xin-li-67

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

My learning notes/codes for ML SYS.

Python 2,082 126 Updated May 8, 2025

[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Python 47 2 Updated Dec 13, 2024

An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).

Python 56 6 Updated Aug 13, 2024

Curated list of datasets and tools for post-training.

3,017 262 Updated Jan 29, 2025

minimal GRPO implementation from scratch

Python 87 11 Updated Mar 14, 2025

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 217 11 Updated May 2, 2025

repo for paper https://arxiv.org/abs/2504.13837

117 5 Updated Apr 21, 2025

Awesome RL Reasoning Recipes ("Triple R")

523 31 Updated May 8, 2025

MM-IFEngine: Towards Multimodal Instruction Following

Python 80 Updated Apr 26, 2025

[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 348 6 Updated May 5, 2025

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

823 39 Updated Apr 20, 2025

🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.

Jupyter Notebook 262 22 Updated May 7, 2025

A Model Context Protocol server for searching and analyzing arXiv papers

Python 1,064 60 Updated Apr 22, 2025

Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation

Python 78 1 Updated Nov 13, 2024

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 554 29 Updated Dec 9, 2024

[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'

Python 182 7 Updated Apr 20, 2025

Official repo of Griffon series including v1(ECCV 2024), v2, and G

Python 203 10 Updated Mar 29, 2025

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 221 15 Updated Mar 21, 2025

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

370 10 Updated Apr 25, 2025

Latest Advances on Long Chain-of-Thought Reasoning

286 18 Updated Apr 13, 2025

Implementations of few-shot object detection benchmarks

Python 1,150 225 Updated Nov 21, 2023

Paper List of Inference/Test Time Scaling/Computing

Python 213 6 Updated Apr 29, 2025

SpatialLM: Large Language Model for Spatial Understanding

Python 3,148 240 Updated Mar 28, 2025

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

125 6 Updated Apr 6, 2025

Collection of papers and repos for multimodal chain-of-thought

82 3 Updated Nov 6, 2024

An Arena-style Automated Evaluation Benchmark for Detailed Captioning

Python 31 1 Updated Mar 27, 2025

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

Python 57 3 Updated Apr 2, 2025

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 6,340 666 Updated May 8, 2025

Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths

HTML 472 40 Updated Mar 27, 2025

Fully open data curation for reasoning models

Python 1,750 148 Updated Apr 7, 2025
Next
0