Jingkang Yang Jingkang50

🥝

Today's Fruit

MMLab@NTU PhD Student

290 followers · 71 following

Nanyang Technological University
Singapore
jingkang50.github.io
@JingkangY

Achievements

x3 x2

Lists (1)

mmlab

14 repositories

Starred repositories

ardamamur / EgoExOR

Official code of the paper "EgoExOR: An Egocentric–Exocentric Operating Room Dataset for Comprehensive Understanding of Surgical Activities" submitted at NeurIPS 2025 Datasets & Benchmarks Track.

Jupyter Notebook 7 Updated May 15, 2025

EvolvingLMMs-Lab / Aero-1

Python 70 6 Updated May 4, 2025

EvolvingLMMs-Lab / multimodal-search-r1

Python 102 7 Updated Apr 8, 2025

ChocoWu / PSG-4D-LLM

This is the project repo for 'PSG-4D-LLM'.

CSS 9 Updated May 27, 2025

yukangcao / AvatarGO

[ICLR' 25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Python 62 3 Updated Mar 19, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 46,143 8,063 Updated May 27, 2025

EvolvingLMMs-Lab / EgoLife

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 284 17 Updated Mar 19, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,275 61 Updated Feb 8, 2025

octo-models / octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,247 214 Updated Jul 31, 2024

EvolvingLMMs-Lab / multimodal-sae

Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.

Python 132 6 Updated Jan 24, 2025

snumprlab / realfred

Official Implementation of ReALFRED (ECCV'24)

Python 40 2 Updated Oct 11, 2024

lorjul / fair-psgg

[ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"

Python 14 1 Updated May 27, 2025

caizhongang / digital_life_project

Official Code for "Digital Life Project: Autonomous 3D Characters with Social Intelligence"

37 Updated Sep 9, 2024

AtsuMiyai / Awesome-OOD-VLM

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024]

88 3 Updated Jan 30, 2025

DAMO-NLP-SG / VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,165 80 Updated Jan 23, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 15,962 1,703 Updated May 3, 2025

ztyang23 / BACON

Python 17 1 Updated Jul 23, 2024

EvolvingLMMs-Lab / LongVA

Long Context Transfer from Language to Vision

Python 375 18 Updated Mar 18, 2025

arthur-qiu / FreeTraj

Code for FreeTraj, a tuning-free method for trajectory-controllable video generation

Python 104 3 Updated Jul 24, 2024

egeozsoy / ORacle

Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.

Python 22 Updated Jan 6, 2025

mutonix / Vript

Python 146 3 Updated Jan 16, 2025

Edw2n / ImageNet-ES

Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains (CVPR 2024)

Jupyter Notebook 8 1 Updated May 12, 2024

Jingkang50 / PSG4D

4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)

Python 109 3 Updated Mar 13, 2025

fesvhtr / CUVA

[CVPR 2024] Official repository of the paper "Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly"

Python 77 22 Updated Jan 15, 2025

BasedHardware / OpenGlass

Turn any glasses into AI-powered smart glasses

C 3,649 474 Updated Aug 14, 2024

LLaVA-VL / LLaVA-NeXT

Python 3,867 361 Updated May 24, 2025

ziqihuangg / Awesome-Evaluation-of-Visual-Generation

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

299 17 Updated May 27, 2025

jzhang38 / EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 725 49 Updated Sep 27, 2024

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 8,025 493 Updated May 18, 2025

microsoft / vscode

Visual Studio Code

TypeScript 172,745 32,703 Updated May 27, 2025
