Stars
Real-time webcam demo with SmolVLM and llama.cpp server
[ICLR 2025] CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding
This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
🚀 The fast, Pythonic way to build MCP servers and clients
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
A Conversational Speech Generation Model
[IEEE J-BHI-2024] A Convolutional Transformer to decode mental states from Electroencephalography (EEG) for Brain-Computer Interfaces (BCI)
EEG Transformer 2.0. i. Convolutional Transformer for EEG Decoding. ii. Novel visualization - Class Activation Topography.
A dataset containing 23,270 time-locked (0.7s) word-level EEG recordings acquired from participants who read both text that was semantically relevant and irrelevant to self-selected topics
[ICLR 2025] NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals
This repository contains the original Python code for "Spatial Distillation-based Distribution Alignment (SDDA) for Cross-Headset EEG Classification". The full code will be open-sourced in the future.
[ACL 2024] Contrastive EEG-Text Masked Autoencoder Learn Transferable representations
toLLMatch🔪: Context-aware LLM-based simultaneous translation
(unofficial) Googletrans: Free and Unlimited Google translate API for Python. Translates totally free of charge.
VietTTS: An Open-Source Vietnamese Text to Speech
zero-shot voice conversion & singing voice conversion, with real-time support
Easily train a good VC model with voice data <= 10 mins!
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…