-
Qwen2.5-Omni Public
Forked from QwenLM/Qwen2.5-Omni
Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and of performing real-time speech generation.
Jupyter Notebook Apache License 2.0 Updated Mar 29, 2025 -
yoloe Public
Forked from THU-MIG/yoloe
YOLOE: Real-Time Seeing Anything
Python GNU Affero General Public License v3.0 Updated Mar 21, 2025 -
olmocr Public
Forked from allenai/olmocr
Toolkit for linearizing PDFs for LLM datasets/training
Python Apache License 2.0 Updated Mar 10, 2025 -
Umi-OCR Public
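Forked from hiroi-sora/Umi-OCR
OCR software, free and offline. Open-source, free offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in language packs for multiple languages.
Python MIT License Updated Mar 10, 2025 -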
OrbbecSDK_ROS2 Public
Forked from orbbec/OrbbecSDK_ROS2
OrbbecSDK ROS2 wrapper
C++ Apache License 2.0 Updated Jan 4, 2025 -
ollama Public
Forked from ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models. 8000
Go MIT License Updated Jan 1, 2025 -
ollama-python Public
Forked from ollama/ollama-python
Ollama Python library
Python MIT License Updated Dec 29, 2024 -
DeepSeek-V3 Public
Forked from deepseek-ai/DeepSeek-V3
very nice!!!
Python MIT License Updated Dec 27, 2024 -
3D-Speaker Public
Forked from modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Python Apache License 2.0 Updated Dec 24, 2024 -
install Public
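Forked from fishros/install
One-click installer. Everyone is welcome to submit code and join Xiaoyu in installing with one click and no longer wasting time.
Python Updated Nov 19, 2024 -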
mini-omni Public
Forked from gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Python MIT License Updated Nov 5, 2024 -
segment-anything-2 Public
Forked from facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Jupyter Notebook Apache License 2.0 Updated Aug 14, 2024 -
ollama-chatbot-ui Public
Forked from mckaywrigley/chatbot-ui
AI chat for any model.
TypeScript MIT License Updated Aug 3, 2024 -
paho.mqtt.python Public
Forked from eclipse-paho/paho.mqtt.python
paho.mqtt.python
Python Other Updated Aug 1, 2024 -
GitHub520 Public
Forked from 521xueweihan/GitHub520
😘 Makes you "love" GitHub by fixing broken images and slow loading when accessing it. (No installation required)
Python Updated Jul 11, 2024 -
CosyVoice_For_Windows Public
Forked from v3ucn/CosyVoice_For_Windows
A version of CosyVoice for use on Windows
Python Apache License 2.0 Updated Jul 9, 2024 -
CosyVoice Public
Forked from FunAudioLLM/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Python Apache License 2.0 Updated Jul 7, 2024 -
SenseVoice Public
Forked from FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Python MIT License Updated Jul 5, 2024 -
FunAudioLLM-APP Public
Forked from FunAudioLLM/FunAudioLLM-APP
Python MIT License Updated Jul 5, 2024 -
ffmpeg-python Public
Forked from kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
Python Apache License 2.0 Updated Jun 26, 2024 -
usb_cam Public
Forked from ros-drivers/usb_cam
A ROS Driver for V4L2 USB Cameras
C++ Other Updated Jun 22, 2024 -
insightface Public
Forked from deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Python Updated Jun 10, 2024 -
OpenDevin Public
Forked from All-Hands-AI/OpenHands
🐚 OpenDevin: Code Less, Make More
Python MIT License Updated Jun 10, 2024 -
Qwen2b Public
Forked from QwenLM/Qwen3
Qwen2 is the large language model series developed by the Qwen team at Alibaba Cloud.
Shell Updated Jun 7, 2024 -
ragapp Public
Forked from ragapp/ragapp
The easiest way to use Agentic RAG in any enterprise
TypeScript Apache License 2.0 Updated Jun 6, 2024 -
deforum-stable-diffusion Public
Forked from deforum-art/deforum-stable-diffusion
Python Other Updated May 25, 2024 -
edge-tts Public
Forked from rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Python GNU General Public License v3.0 Updated May 22, 2024 -
stablediffusion Public
Forked from Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Python MIT License Updated May 18, 2024 -
sd-webui-controlnet Public
Forked from Mikubill/sd-webui-controlnet
WebUI extension for ControlNet
Python GNU General Public License v3.0 Updated May 18, 2024 -
streamlit-file-browser Public
Forked from pragmatic-streamlit/streamlit-file-browser
Streamlit file browser
Python Updated Apr 23, 2024