zihaolucky

🕶️

Coding and doing experiment

Zihao Zheng zihaolucky

🕶️

Coding and doing experiment

Working on NLP, ASR in Dialog System

286 followers · 84 following

Alibaba Group
Hangzhou, China
http://zihaolucky.github.io

Achievements

Organizations

Stars

joojs / fairface

FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age

446 18 Updated Sep 28, 2023

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 597 80 Updated Jun 9, 2025

pat-jj / DeepRetrieval

DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning

Python 550 72 Updated May 30, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,407 57 Updated Apr 18, 2025

unslothai / unsloth

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 40,283 3,193 Updated Jun 9, 2025

Agent-RL / ReCall

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 904 61 Updated May 16, 2025

zhx47 / cursor-api

JavaScript 250 146 Updated Jun 7, 2025

wisdgod / cursor-api

Rust 256 146 Updated Apr 15, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Langua 8000 ge Models for Advanced Multimodal Understanding

Python 4,875 1,751 Updated Feb 26, 2025

THUDM / VisionReward

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 254 6 Updated Mar 26, 2025

KwaiVGI / Koala-36M

Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Python 177 5 Updated Mar 19, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,289 62 Updated Feb 8, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,735 2,288 Updated Jun 2, 2025

TIGER-AI-Lab / VLM2Vec

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]

Python 243 17 Updated Jun 4, 2025

PKU-YuanGroup / LLaVA-CoT

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 2,002 76 Updated May 13, 2025

DallasBuyer / awesome-dynamic-pricing

😎 awesome dynamic pricing 💵

102 12 Updated Oct 27, 2022

llava-rlhf / LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Python 366 27 Updated Nov 1, 2023

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,292 628 Updated May 29, 2025

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,361 154 Updated Mar 3, 2025

discus0434 / pdf-translator

pdf-translator translates English PDF files into Japanese, preserving the original layout.

Python 322 42 Updated May 7, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 26,635 2,581 Updated Apr 30, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 14,992 1,961 Updated Jun 9, 2025

Unbabel / COMET

A Neural Framework for MT Evaluation

Python 609 92 Updated May 21, 2025

punica-ai / punica

Serving multiple LoRA finetuned LLM as one

Python 1,063 52 Updated May 8, 2024

pymc-labs / CausalPy

A Python package for causal inference in quasi-experimental settings

Python 1,002 75 Updated Jun 9, 2025

chatchat-space / Langchain-Chatchat

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 35,248 5,910 Updated Mar 25, 2025

DayuanJiang / giant_ja-en_parallel_corpus

This directory includes a giant Japanese-English subtitle corpus. The raw data comes from the Stanford’s JESC project.

Python 5 Updated Aug 4, 2019

sunzeyeah / item-alignment

ccks2022 task9 subtask2 商品同款识别

Jupyter Notebook 43 13 Updated Feb 9, 2023

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,746 2,505 Updated Aug 12, 2024

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 10,197 1,196 Updated Jun 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zihao Zheng zihaolucky

Achievements

Achievements

Organizations

Block or report zihaolucky

Stars

joojs / fairface

allenai / reward-bench

pat-jj / DeepRetrieval

policy-gradient / GRPO-Zero

unslothai / unsloth

Agent-RL / ReCall

zhx47 / cursor-api

wisdgod / cursor-api

deepseek-ai / DeepSeek-VL2

THUDM / VisionReward

KwaiVGI / Koala-36M

EvolvingLMMs-Lab / open-r1-multimodal

huggingface / open-r1

TIGER-AI-Lab / VLM2Vec

PKU-YuanGroup / LLaVA-CoT

DallasBuyer / awesome-dynamic-pricing

llava-rlhf / LLaVA-RLHF

OpenGVLab / InternVL

THUDM / CogVLM2

discus0434 / pdf-translator

hpcaitech / Open-Sora

sgl-project / sglang

Unbabel / COMET

punica-ai / punica

pymc-labs / CausalPy

chatchat-space / Langchain-Chatchat

DayuanJiang / giant_ja-en_parallel_corpus

sunzeyeah / item-alignment

haotian-liu / LLaVA

huggingface / text-generation-inference