8000 hedes1992 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View hedes1992's full-sized avatar

Block or report hedes1992

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Solve Visual Understanding with Reinforced VLMs

Python 4,977 308 Updated May 11, 2025

R1V, trained with AI feedback, answers open-ended visual questions.

Python 13 1 Updated Apr 12, 2025

Puzzles for learning Triton, play it with minimal environment configuration!

Python 321 36 Updated Dec 3, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,186 92 Updated May 22, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,349 57 Updated May 13, 2025

A Massively Parallel Large Scale Self-Play Framework

Python 349 35 Updated Jan 9, 2023

😎 Awesome papers on token redundancy reduction

7 Updated Mar 12, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,396 6,010 Updated May 21, 2025

Efficient Triton Kernels for LLM Training

Python 5,043 327 Updated May 22, 2025

💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Python 199 19 Updated Apr 26, 2025

Repo for the Deep Reinforcement Learning Nanodegree program

Jupyter Notebook 5,048 2,374 Updated Nov 16, 2023

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

860 40 Updated Apr 20, 2025

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 368 35 Updated May 8, 2025

DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models

Python 47 Updated Apr 10, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 15,522 1,752 Updated Dec 25, 2024
Python 80 1 Updated Apr 5, 2025

A paper list of some recent works about Token Compress for Vit and VLM

476 22 Updated May 21, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 14,546 2,887 Updated May 22, 2025

Muon is Scalable for LLM Training

1,048 48 Updated Mar 28, 2025

Awesome papers & datasets specifically focused on long-term videos.

275 12 Updated Nov 15, 2024

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 2,985 228 Updated May 19, 2025

VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

Python 19 Updated Mar 26, 2025

[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention

Python 484 73 Updated Feb 28, 2024

MLLM-DataEngine: An Iterative Refinement Approach for MLLM

Python 46 5 Updated May 24, 2024

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 276 12 Updated Jun 13, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,326 1,016 Updated May 22, 2025

AllenAI's post-training codebase

Python 2,973 387 Updated May 22, 2025

SOTA Re-identification Methods and Toolbox

Python 3,620 852 Updated Jul 30, 2024

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 4,416 465 Updated May 17, 2025
Next
0