Lists (6)
Sort Name ascending (A-Z)
Stars
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
🌈 React for interactive command-line apps
View and Interact with PDFs in React, SolidJS, Svelte and JavaScript apps
A simple screen parsing tool towards pure vision based GUI agent
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Build realtime multimodal AI agents with Node.js
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Installation dialog for Progressive Web Application. Provides a more convenient user experience and fixes the lack of native dialogs in some browsers.
first base model for full-duplex conversational audio
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Open-source implementation of MobilePoser: Real-Time Full-Body Pose Estimation and 3D Human Translation from IMUs in Mobile Consumer Devices.
Typescript/React Library for AI Chat💬🚀
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…
Convert any PDF into a podcast episode!
real time face swap and one-click video deepfake with only a single image
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Experts.js is the easiest way to create and deploy OpenAI's Assistants and link them together as Tools to create advanced Multi AI Agent Systems with expanded memory and attention to detail.
Implementation of the GPT architecture in Rust 🦀 + Burn 🔥
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
OpenUI let's you describe UI using your imagination, then see it rendered live.