zyong812

🍎

I may be slow to respond.

Yong Z zyong812

🍎

I may be slow to respond.

CS PhD

35 followers · 188 following

CUHK-SZ
Shenzhen, China
https://zyong812.github.io

Achievements

Lists (1)

Sort

✨ Inspiration

1 repository

Starred repositories

Alibaba-NLP / WebAgent

🌐 WebWalker [ACL2025] & WebDancer [Preprint]

Python 670 48 Updated May 30, 2025

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 16,024 1,649 Updated May 30, 2025

codefuse-ai / Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,541 165 Updated May 29, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 3,974 201 Updated May 5, 2025

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 9,413 935 Updated May 16, 2025

microsoft / vscode

Visual Studio Code

TypeScript 172,893 32,788 Updated May 31, 2025

voideditor / void

TypeScript 22,453 1,433 Updated May 30, 2025

langfuse / langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 11,928 1,092 Updated May 30, 2025

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 1,353 97 Updated Apr 24, 2025

ByteDance-Seed / Seed-Coder

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

473 33 Updated May 15, 2025

abilliyb / Knowledge_Injection_Survey_Papers

36 4 Updated May 10, 2025

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 4,546 478 Updated May 28, 2025

cline / cline

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 44,876 5,410 Updated May 31, 2025

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,335 170 Updated Jul 25, 2023

modelcontextprotocol / python-sdk

The official Python SDK for Model Context Protocol servers and clients

Python 13,474 1,610 Updated May 30, 2025

harishsg993010 / damn-vulnerable-MCP-server

Damn Vulnerable MCP Server

Python 1,000 61 Updated Apr 28, 2025

Wang-Xiaodong1899 / Open-R1-Video

✨First Open-Source R1-like Video-LLM [2025/02/18]

Python 342 12 Updated Feb 23, 2025

Lookuz / VidHal

Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs

Python 12 Updated Apr 19, 2025

Leon1207 / Video-RAG-master

This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 191 19 Updated Feb 23, 2025

TencentARC / SEED-Bench-R1

Python 81 1 Updated Apr 5, 2025

mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,369 112 Updated Mar 29, 2025

chao1224 / ProteinDT

A Text-guided Protein Design Framework, Nat Mach Intell 2025 (https://www.nature.com/articles/s42256-025-01011-z)

Python 80 7 Updated Jan 11, 2025

tulerfeng / Video-R1

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 545 26 Updated May 28, 2025

rese1f / aurora

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Python 105 4 Updated Apr 22, 2025

jgm / pandoc

Universal markup converter

Haskell 37,645 3,527 Updated May 29, 2025

executeautomation / mcp-playwright

Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More 🔌

TypeScript 3,692 299 Updated May 21, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,049 236 Updated May 28, 2025

keshik6 / HourVideo

[NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding

Jupyter Notebook 149 4 Updated Mar 7, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,828 1,366 Updated May 27, 2025

camel-ai / owl

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 16,742 1,967 Updated May 30, 2025

Yong Z zyong812

Lists (1)

✨ Inspiration

Starred repositories

video-understanding