- Nanyang Technological University
- Singapore
- jingkang50.github.io
- @JingkangY
Starred repositories
Official code of the paper "EgoExOR: An Egocentric–Exocentric Operating Room Dataset for Comprehensive Understanding of Surgical Activities" submitted at NeurIPS 2025 Datasets & Benchmarks Track.
[ICLR 2025] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
No fortress, purely open ground. OpenManus is Coming.
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
A fork to add multimodal model training to open-r1
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
[ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"
Official Code for "Digital Life Project: Autonomous 3D Characters with Social Intelligence"
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024]
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Long Context Transfer from Language to Vision
Code for FreeTraj, a tuning-free method for trajectory-controllable video generation
Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.
Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains (CVPR 2024)
4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)
[CVPR 2024] Official repository of the paper "Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly"
Turn any glasses into AI-powered smart glasses
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…