8000 jhcao23 (John Cao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View jhcao23's full-sized avatar
💭
Deep Diving into AI coding
💭
Deep Diving into AI coding

Block or report jhcao23

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SkyReels-V2: Infinite-length Film Generative model

Python 2,460 283 Updated May 16, 2025

chat log tool, easily use your own chat data. 聊天记录工具,轻松使用自己的聊天数据

Go 4,518 550 Updated Apr 19, 2025

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Python 4,500 718 Updated Mar 7, 2025

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 24,160 6,253 Updated May 21, 2025

Presentation Slides for Developers

TypeScript 37,993 1,566 Updated May 19, 2025

The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.

Python 28,738 890 Updated May 15, 2025

🚀 Strapi is the leading open-source headless CMS. It’s 100% JavaScript/TypeScript, fully customizable, and developer-first.

TypeScript 66,716 8,564 Updated May 21, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 16,569 1,946 Updated May 21, 2025

A Training-free Iterative Framework for Long Story Visualization

Python 887 125 Updated Jan 18, 2025

Code Implementation of "PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data"

Python 387 26 Updated Apr 23, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 8,819 739 Updated May 19, 2025

An open-sourced end-to-end VLM-based GUI Agent

Python 947 74 Updated Apr 4, 2025

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 14,085 1,163 Updated May 21, 2025

Your AI Operator for Web, Android, Automation & Testing.

TypeScript 8,916 535 Updated May 21, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 12,446 870 Updated May 21, 2025

AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording

TypeScript 14,748 1,076 Updated May 20, 2025

Memory-Guided Diffusion for Expressive Talking Video Generation

Python 862 95 Updated Jan 24, 2025

AI model that understands text & humanoids.

Python 107 31 Updated May 17, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,764 447 Updated Feb 27, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,512 173 Updated May 8, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,167 221 Updated Mar 10, 2025

An LLM-based Web Navigating Agent (KDD'24)

Python 859 76 Updated Sep 27, 2024

微信机器人,可接入DeepSeek、Gemini、ChatGPT、ChatGLM、讯飞星火、Tigerbot等大模型。微信 hook WeChat Robot Hook.

C++ 6,321 1,291 Updated May 4, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 61,014 6,770 Updated May 21, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 53,089 5,085 Updated May 21, 2025

SOTA Open Source TTS

Python 21,163 1,695 Updated Apr 12, 2025

UI Library for Design Engineers. Animated components and effects you can copy and paste into your apps. Free. Open Source.

MDX 16,720 692 Updated May 20, 2025

One UI is all done with chatgpt web, midjourney, gpts,suno,luma,runway,viggle,flux,ideogram,realtime,pika,udio; Simultaneous support Web / PWA / Linux / Win / MacOS platform

JavaScript 6,206 1,551 Updated May 18, 2025
Next
0