HangyuRan

ricardo HangyuRan

2 followers · 19 following

Stars

MME-Benchmarks / Video-MME

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

576 23 Updated May 8, 2025

llyx97 / TempCompass

[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Python 117 3 Updated Apr 4, 2025

google-deepmind / perception_test

Jupyter Notebook 214 15 Updated Jun 4, 2025

WXRIW / Lyricify-App

Lyricify (/lɪ'rɪsəfaɪ/), a fantastic app to provide scroll lyrics for Spotify and other apps. 一款为 Spotify 等各种应用提供滚动歌词的软件。

5,661 105 Updated Jun 20, 2025

yuweihao / MM-Vet

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)

Python 304 11 Updated Jan 20, 2025

ultralytics / ultralytics

Ultralytics YOLO11 🚀

Python 42,345 8,255 Updated Jun 25, 2025

luca-medeiros / lang-segment-anything

SAM with text prompt

Python 2,252 260 Updated May 10, 2025

NJU-SE-15-share-review / postgraduate-recommendation

南大软院保研攻略

Python 187 32 Updated May 26, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,627 528 Updated Feb 26, 2025

Paper2Poster / Paper2Poster

Open-source Multi-agent Poster Generation from Papers

Python 2,187 118 Updated Jun 17, 2025

yuezih / King-of-Pigeon

计算机保研简历与文书实用模板

2,005 88 Updated Jun 4, 2024

potamides / DeTikZify

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ

Python 1,436 73 Updated Jun 7, 2025

kingnobro / IconShop

(Siggraph Asia 2023) Code of "IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers"

Python 91 15 Updated Jan 19, 2025

ximinng / LLM4SVG

[CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.11102

Python 529 5 Updated May 22, 2025

kingnobro / Chat2SVG

(CVPR 2025) Code of "Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models"

Python 158 13 Updated Apr 2, 2025

ximinng / SVGDreamer

[CVPR 2024] Official implementation for "SVGDreamer: Text Guided SVG Generation with Diffusion Model" https://arxiv.org/abs/2312.16476

Python 380 39 Updated May 1, 2025

showlab / LayerTracer

Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"

Python 53 3 Updated Apr 1, 2025

bytedance / UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,133 68 Updated Apr 17, 2025

UX-Decoder / LLaVA-Grounding

Python 403 15 Updated Jul 29, 2024

cloudcommunity / Free-Certifications

A curated list of free courses with certifications. Also available at https://free-certifications.com/

33,774 2,404 Updated Feb 13, 2025

LayTextLLM / LayTextLLM

Jupyter Notebook 94 11 Updated Dec 23, 2024

CSU-JPG / TextAtlas

A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation

Python 70 Updated Feb 22, 2025

OmniSVG / OmniSVG

OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to in…

Python 1,797 52 Updated May 26, 2025

sxhthreo / QUIVERIF

Solidity 2 Updated Apr 27, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 4,070 496 Updated May 28, 2025

joanrod / star-vector

StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…

Python 3,918 206 Updated Apr 15, 2025

CSU-JPG / Awesome-VLM-Reasoning

15 Updated May 19, 2025

CSU-JPG / V-MAGE

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in MLLMs

Python 19 Updated May 19, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,273 711 Updated Jun 25, 2025

CyberAgentAILab / cr-renderer

Renderer for the Crello dataset

Python 9 Updated Jan 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly