10000 shengxingdong (sindo) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View shengxingdong's full-sized avatar

Block or report shengxingdong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[CVPR 2025 Highlight] Official Code Release Volumetrically Consistent 3D Gaussian Rasterization

Python 40 2 Updated Apr 17, 2025

Ultralytics implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 5 Updated May 11, 2025

Share a single keyboard and mouse between multiple computers.

C++ 17,509 4,018 Updated May 10, 2025

如果想体验小智项目,或者开发server端测试的同志,可以使用这个web端damo 体验下。 语音端已经完成,文字端完成,可以语音加文字输出。 等迭代慢慢完善。欢迎PR

HTML 104 43 Updated Mar 30, 2025

A PyTorch implementation of the paper "EDGS: Eliminating Densification for Efficient Convergence of 3DGS"

Jupyter Notebook 412 24 Updated Apr 23, 2025

一个基于小智、xiaozhi-server的Android、IOS语音对话应用,支持实时语音交互和文字对话。现在是flutter版本,打通IOS、Android端。请同志们动动小手,点点小星星,予以鼓励。

Dart 603 154 Updated May 6, 2025

python版本的小智ai,主要帮助那些没有硬件却想体验小智功能的人,如果可以请点个小星星!

Python 1,332 269 Updated May 10, 2025

本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.

Python 4,286 1,515 Updated May 10, 2025

The python library for real-time communication

JavaScript 3,852 331 Updated Apr 23, 2025

[CVPR 2025 Oral] VGGT: Visual Geometry Grounded Transformer

Python 6,464 637 Updated May 9, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 31,576 3,546 Updated May 5, 2025

Ultimate camera streaming application with support RTSP, RTMP, HTTP-FLV, WebRTC, MSE, HLS, MP4, MJPEG, HomeKit, FFmpeg, etc.

Go 8,839 649 Updated May 2, 2025

Build your own AI friend

C++ 12,875 2,504 Updated May 10, 2025

The official release of paper "SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting"

Python 36 3 Updated May 10, 2025

Trae official

532 18 Updated Apr 17, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 11,782 1,662 Updated May 5, 2025

Spark-TTS Inference Code

Python 9,212 961 Updated Apr 9, 2025

Agent Framework / shim to use Pydantic with LLMs

Python 9,326 823 Updated May 10, 2025

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Python 4,297 738 Updated May 6, 2025
Python 83 2 Updated Mar 22, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 24,966 2,217 Updated May 8, 2025
Python 331 34 Updated Apr 9, 2025

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,556 204 Updated May 5, 2025

🤗 smolagents: a barebones library for agents that think in python code.

Python 18,385 1,594 Updated May 9, 2025

3D Reconstruction for all

Rust 1,701 72 Updated May 9, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,387 1,401 Updated Mar 3, 2025

tiny vision language model

Python 7,928 616 Updated Apr 14, 2025

This repository collects papers on VLLM applications. We will update new papers irregularly.

119 12 Updated Apr 25, 2025

SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving

Cuda 161 9 Updated Apr 2, 2025
Next
0