10000 Jingkang50 (Jingkang Yang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Jingkang50's full-sized avatar
🥝
Today's Fruit
🥝
Today's Fruit

Block or report Jingkang50

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official code of the paper "EgoExOR: An Egocentric–Exocentric Operating Room Dataset for Comprehensive Understanding of Surgical Activities" submitted at NeurIPS 2025 Datasets & Benchmarks Track.

Jupyter Notebook 7 Updated May 15, 2025
Python 70 6 Updated May 4, 2025

This is the project repo for 'PSG-4D-LLM'.

CSS 9 Updated May 27, 2025

[ICLR' 25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Python 62 3 Updated Mar 19, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 46,143 8,063 Updated May 27, 2025

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 284 17 Updated Mar 19, 2025

A fork to add multimodal model training to open-r1

Python 1,275 61 Updated Feb 8, 2025

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,247 214 Updated Jul 31, 2024

Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.

Python 132 6 Updated Jan 24, 2025

Official Implementation of ReALFRED (ECCV'24)

Python 40 2 Updated Oct 11, 2024

[ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"

Python 14 1 Updated May 27, 2025

Official Code for "Digital Life Project: Autonomous 3D Characters with Social Intelligence"

37 Updated Sep 9, 2024

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024]

88 3 Updated Jan 30, 2025

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,165 80 Updated Jan 23, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 15,962 1,703 Updated May 3, 2025
Python 17 1 Updated Jul 23, 2024

Long Context Transfer from Language to Vision

Python 375 18 Updated Mar 18, 2025

Code for FreeTraj, a tuning-free method for trajectory-controllable video generation

Python 104 3 Updated Jul 24, 2024

Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.

Python 22 Updated Jan 6, 2025
Python 146 3 Updated Jan 16, 2025

Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains (CVPR 2024)

Jupyter Notebook 8 1 Updated May 12, 2024

4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)

Python 109 3 Updated Mar 13, 2025

[CVPR 2024] Official repository of the paper "Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly"

Python 77 22 Updated Jan 15, 2025

Turn any glasses into AI-powered smart glasses

C 3,649 474 Updated Aug 14, 2024
Python 3,867 361 Updated May 24, 2025

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

299 17 Updated May 27, 2025

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 725 49 Updated Sep 27, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 8,025 493 Updated May 18, 2025

Visual Studio Code

TypeScript 172,745 32,703 Updated May 27, 2025
Next
0