8000 BinZhu-ece (Bin Zhu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View BinZhu-ece's full-sized avatar
  • BeiJing
  • 14:22 (UTC -12:00)

Block or report BinZhu-ece

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 786 56 Updated Apr 17, 2025

可循环值守和多人录制的直播录制软件,支持抖音、TikTok、Youtube、快手、虎牙、斗鱼、B站、小红书、pandatv、sooplive、flextv、popkontv、twitcasting、winktv、百度、微博、酷狗、17Live、Twitch、Acfun、CHZZK、shopee等40+平台直播录制

Python 6,662 863 Updated Mar 20, 2025

Record some basic training on the stable diffusion series, including Lora, Controlnet, IP-adapter, and a bit of fun AIGC play!

Python 32 2 Updated Aug 14, 2024

MAGI-1: Autoregressive Video Generation at Scale

Python 2,974 159 Updated May 8, 2025

📌 [Arxiv2025] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"

168 3 Updated Apr 1, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,622 78 Updated Feb 11, 2025
Python 106 5 Updated Feb 28, 2025
Python 3 Updated Apr 2, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,228 2,232 Updated Feb 1, 2025

Wav2Lip UHQ extension for Automatic1111

Python 1,364 181 Updated Jun 14, 2024

High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN

Python 451 96 Updated Mar 27, 2024

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 11,857 2,499 Updated Apr 19, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,705 435 Updated Feb 27, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,602 833 Updated Jul 18, 2024

easy_clash_tool是一个clash的python库,可以很便捷的自动切换可用节点

Python 9 2 Updated Apr 17, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,966 866 Updated Apr 27, 2025

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 683 33 Updated Apr 22, 2025

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,979 75 Updated Apr 13, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,827 479 Updated Mar 22, 2025

[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

6BCD
Python 137 6 Updated May 11, 2025

Python based web automation tool. Powerful and elegant.

Python 9,700 881 Updated Mar 25, 2025
Python 3 Updated Oct 31, 2024

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,074 63 Updated Feb 7, 2025

[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 217 6 Updated Oct 16, 2024

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 53,766 16,893 Updated May 11, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,240 1,493 Updated Apr 29, 2025

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,919 291 Updated Dec 21, 2024
Next
0