Stars
vits2 backbone with bert
基于Bert-VITS2做的表情、动画测试. Animation testing based on Bert-VITS2.
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
FlowGram is a node-based flow building engine that helps developers quickly create workflows in either fixed layout or free connection layout modes
vits2 backbone with multilingual-bert
An open-source cross-platform alternative to AirDrop
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
🚀🚀🚀一款漂亮易用的在线设计器,支持PSD导入、PSD解析,可用于海报设计器、广告设计器、logo设计器、AI创作图片合成器等。常用于生成二维码海报,图片海报,二维码推广海报,图片处理,名片设计,电商产品图,节假日海报等。http://gzm-design-doc.guozimi.cn/
很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。
AI模型接口管理与分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude等格式,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.
Python tool for converting files and office documents to Markdown.
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
跨平台视频提取工具:支持流媒体下载、视频下载、m3u8 下载及 B站视频下载,提供 Windows 和 Mac 桌面客户端。Cross-platform video extraction tool: Supports streaming download, video download, m3u8 download, and Bilibili video download, with des…
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Install guide of ROCm and Tensorflow on Ubuntu for the RX580
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Quick exploration into fine tuning florence 2
Demo desktop apps built with Python & Qt. With examples for PyQt6, PySide6, PyQt5 & PySide2
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
AirLLM 70B inference with single 4GB GPU
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Large Language Model Text Generation Inference
A Q&A platform software for teams at any scales. Whether it's a community forum, help center, or knowledge management platform, you can always count on Apache Answer.