- shanghai.china
- http://weibo.com/maksim9
Starred repositories
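PPTGenius is an intelligent slide generation system based on large language models. The "Genius" in the project name stands for intelligence and creativity, highlighting the project's technical depth and automation capabilities. From simple input, it quickly generates professional slide content.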
Python based web automation tool. Powerful and elegant.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A simple, cross platform, enterprise desktop software development framework
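AigcPanel is a simple, easy-to-use, all-in-one AI digital human system. It supports video synthesis, voice synthesis, and voice cloning, simplifies local model management, and offers one-click import and use of AI models.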
H5 Page Maker, H5 Editor, LowCode. Make H5 as easy as building blocks. Easily build H5 pages, H5 websites, PC websites, and LowCode platforms.
Visualize Your Ideas With Code
Generate high-definition short videos with one click using an AI LLM.
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
real time face swap and one-click video deepfake with only a single image
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team
Python APIs for web automation, testing, and bypassing bot-detection.
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web without worrying about infrastructure.
Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.
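Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments.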
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audio input.