AI-in-work
One click templates for inferencing Language Models
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Fast and Simple Face Swap Extension Node for ComfyUI
Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
📹 A more flexible framework that can generate videos at any resolution and create videos from images.
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
The official repo of MiniMax-Text-01 and MiniMax-VL-01, a large language model and a vision-language model based on Linear Attention
Training Large Language Model to Reason in a Continuous Latent Space
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A set of nodes to edit videos using the Hunyuan Video model
Synchronized translation and dubbing for videos.
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
Riona AI Agent is built using Node.js and TypeScript and designed for seamless job execution. It's lightweight, efficient, and still evolving; exciting new features are coming soon!
File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
An open source deep research clone. AI Agent that reasons over large amounts of web data extracted with Firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Beating the GAIA benchmark with Transformers Agents.
🤗 smolagents: a barebones library for agents that think in code.