Stars
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
哔哩哔哩-API收集整理【不断更新中....】
一个桌宠形式的mcp client,可以对接任意mcp server,配合测试的mcp server 开源地址:https://github.com/shijianzhong/mcp-server-for-pc
An automated pipeline for evaluating LLMs for role-playing.
An Open-Ended Embodied Agent with Large Language Models
Live2D Library for Python (C++ Wrapper): Supports model loading, lip-sync and basic face rigging, precise click test.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice, the LLM models are QWen2.5-0.5B/1.5B, and there are three …
Create Minecraft bots with a powerful, stable, and high level JavaScript API.
Master HTML / CSS / JavaScript with Fun Projects
Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.
Retrieval and Retrieval-augmented LLMs
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
Solve Visual Understanding with Reinforced VLMs
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的mcp框架。
🤖 可 DIY 的 多模态 AI 聊天机器人 | 🚀 快速接入 微信、 QQ、Telegram、等聊天平台 | 🦈支持DeepSeek、Grok、Claude、Ollama、Gemini、OpenAI | 工作流系统、网页搜索、AI画图、人设调教、虚拟女仆、语音对话 |
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
kagari-bi / UmaChat
Forked from katboi01/UmaViewerAsset Viewer for Uma Musume
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
A PixiJS plugin to display Live2D models of any kind.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.