-
Toyota Technological Institute
- Nagoya city, Japan
-
15:18
(UTC +09:00) - https://iminthemiddle.github.io/
- @ImlntheMiddle
- in/ImIntheMiddle
- https://scholar.google.co.jp/citations?user=HkFAVbcAAAAJ
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Code accompanying the BABEL dataset (CVPR 2021).
[CVPR 2025 Oral] TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
Official implementation of the paper "Following Is All You Need: Robot Crowd Navigation Using People As Planners"
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos
[ACM MM 2023] Lightweight Super-Resolution Head for Human Pose Estimation
Flops counter for neural networks in pytorch framework
Lightweight coding agent that runs in your terminal
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase
This repo is the official implementation of "MART: MultiscAle Relational Transformer Networks for Trajectory Prediction", ECCV 2024.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
High-resolution models for human tasks.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
A simple training-free approach adapting DUSt3R for dynamic scenes.
NeurIPS 2024 | 🏃♂️ SMPL Visual Annotation Tool
[CVPR 2025] Multi-modal Knowledge Distillation-based Human Trajectory Forecasting
NeurIPS 2024 | 🤸♂️💥🚗Pedestrian-Centric 3D Pre-collision Pose and Shape Estimation from Dashcam Perspective
Self-Critical Sequence Training (SCST) and various multi-head attention mechanisms.