8000 jin-s13 (Jas) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View jin-s13's full-sized avatar
  • University of Hong Kong
  • Hong Kong

Block or report jin-s13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Python 67 3 Updated May 20, 2025

Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"

Python 110 12 Updated Oct 19, 2024

A python parametric CAD scripting framework based on OCCT

Python 3,809 339 Updated Jun 27, 2025

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Python 112 6 Updated Jun 2, 2025

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 46 2 Updated Jun 10, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,313 277 Updated Jun 28, 2025
Python 118 2 Updated Jun 27, 2025

AIOS: AI Agent Operating System

Python 4,289 533 Updated Jun 11, 2025

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Python 1,921 129 Updated Feb 23, 2024

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 3,434 277 Updated Dec 27, 2024

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 10,607 953 Updated Jun 23, 2025

[CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation

Python 75 6 Updated Jun 6, 2025

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,936 113 Updated Jun 16, 2025

OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.

Python 3,233 586 Updated Jul 14, 2024

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 37,418 3,066 Updated Jun 28, 2025
8000 Python 11 2 Updated Mar 14, 2025

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Python 1,287 85 Updated Jun 29, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 928 52 Updated Mar 20, 2025

[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Python 133 3 Updated May 21, 2025

Vision agent

Python 4,896 547 Updated Jun 30, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,629 531 Updated Feb 26, 2025

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 1,377 119 Updated Jun 26, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 59,706 1,698 Updated Jun 30, 2025

[Lumina Embodied AI Community] 具身智能技术指南 Embodied-AI-Guide

6,020 390 Updated Jun 29, 2025

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 977 65 Updated Jan 31, 2025

Build resilient language agents as graphs.

Python 14,951 2,558 Updated Jun 30, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 17,232 2,015 Updated Jun 26, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 13,123 1,412 Updated Jun 30, 2025
Python 100 10 Updated Apr 22, 2025

[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Python 282 14 Updated Apr 14, 2025
Next
0