dvdface (DingYi) / Starred · GitHub
  • Wuhan, Hubei, CHINA


[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Python 328 19 Updated Mar 7, 2025

👓 A curated list of awesome Android learning resources for Android app developers.

Kotlin 1,838 268 Updated May 11, 2024

Google Research

Jupyter Notebook 35,923 8,133 Updated Jun 30, 2025

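Source code and dataset for an image segmentation system for workstation-area employee presence detection and behavior monitoring: improved yolo11-ODConv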

Python 2 Updated Nov 18, 2024

Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations between selected general UI elements and their text labels. A…

28 3 Updated Jun 27, 2024

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

Jupyter Notebook 841 112 Updated Apr 3, 2025

Official implementation of "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Python 33 1 Updated Jul 3, 2025

GUI Grounding for Professional High-Resolution Computer Use

Python 225 25 Updated Jul 3, 2025

Gaze tracking software

Python 377 37 Updated Jul 1, 2025
C++ 9 1 Updated May 5, 2019

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,570 1,896 Updated Mar 26, 2025

It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item Selection (VIS) data. Both datasets are stored as TFRecords.

44 4 Updated Aug 2, 2021

This repository contains all the code examples, projects, and resources used in "The Complete Hugging Face Blueprint" book. The book provides a comprehensive guide to using Hugging Face's ecosystem…

Python 3 Updated Apr 5, 2025
Python 527 45 Updated Jul 26, 2024

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.

Python 38,111 3,132 Updated Jul 5, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 2,392 258 Updated May 26, 2025

Prompt Declaration Language (PDL) is a declarative prompt programming language.

Python 173 34 Updated Jul 3, 2025
Jupyter Notebook 964 112 Updated Jul 2, 2025

MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration

Python 3,246 373 Updated Jun 30, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,377 847 Updated Aug 12, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 13 Updated May 5, 2025

An MIT-licensed, deployable starter kit for building and customizing your own version of AI town, a virtual town where AI characters live, chat and socialize. Chinese edition of AI Town.

TypeScript 16 Updated Jun 19, 2024

Training an AI to play Honor of Kings with a policy-gradient reinforcement learning method

Python 1,729 401 Updated Nov 16, 2021

Building an AI that plays Honor of Kings with Resnet101+GPT

Python 2,895 752 Updated Aug 8, 2021

LLM-based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 22,201 2,924 Updated Jul 4, 2025

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 16,885 2,334 Updated Jul 4, 2025

Generative Agents: Interactive Simulacra of Human Behavior

19,240 2,584 Updated Aug 5, 2024

Agent that empowers software testing with LLMs; industrial-first in China

Python 619 71 Updated Mar 4, 2024

StyleShot: A SnapShot on Any Style. A model that can transfer any style to any content and generates high-quality, personalized stylized images without per-image fine-tuning!

Python 405 34 Updated Jun 30, 2025