xxl007 / Starred · GitHub

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

Python 347 18 Updated Apr 20, 2025

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.

Python 1,755 194 Updated Jan 16, 2025

[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,572 513 Updated Feb 27, 2025

Official implementation of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation

Python 227 15 Updated Mar 30, 2025

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 244 12 Updated May 28, 2025

TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Python 179 13 Updated May 28, 2025

Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges

Python 68 2 Updated Feb 27, 2025

The model, data and code for the visual GUI Agent SeeClick

HTML 382 19 Updated Nov 22, 2024

ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)

Python 460 46 Updated Nov 25, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,406 709 Updated Jun 5, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,869 780 Updated May 15, 2025

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 12,495 1,082 Updated Apr 28, 2025