8000 statjuns2 (SeongJun Jeong) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View statjuns2's full-sized avatar
  • Seoul National University

Highlights

  • Pro

Block or report statjuns2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Implementation of ReALFRED (ECCV'24)

Python 40 2 Updated Oct 11, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,740 260 Updated Jan 14, 2025

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

Python 695 90 Updated Feb 20, 2025

[ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs

Python 43 3 Updated May 31, 2025
Jupyter Notebook 12 Updated Oct 18, 2024
Python 1 Updated Apr 22, 2025

A Best-of-list of Robot Simulators, re-generated weekly on Wednesdays

672 44 Updated Jun 2, 2025

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 8,861 945 Updated Feb 5, 2025

OpenEQA Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 285 25 Updated Sep 20, 2024

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 4,147 371 Updated Apr 8, 2024

The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models

Python 15 2 Updated Oct 4, 2024

[Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations

Python 140 7 Updated Jun 22, 2024

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 834 58 Updated May 19, 2025

An open source implementation of CLIP.

Python 11,878 1,110 Updated Jun 3, 2025

[CVPR 2023] CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation

Python 130 7 Updated Oct 29, 2023
Python 28 Updated May 26, 2025

Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation" Paper:https://arxiv.org/abs/2310.07968 Video:https://www.youtube.com/watch?v=rN5S8QIhhQc

Python 31 1 Updated Jun 18, 2024

ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings. NeurIPS 2022

Python 75 9 Updated Jan 31, 2023
Python 33 1 Updated Jan 24, 2025

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook 4,136 707 Updated Jun 22, 2024

Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

18 1 Updated Dec 10, 2024

Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges

Python 68 2 Updated Feb 27, 2025

A repository accompanying the PARTNR benchmark for using Large Planning Models (LPMs) to solve Human-Robot Collaboration or Robot Instruction Following tasks in the Habitat simulator.

Python 296 36 Updated Apr 17, 2025

Extending the existing benchmark VideoQA datasets

Python 4 Updated Nov 6, 2024

code for downloading videos from HowTo100M dataset

Python 16 2 Updated May 13, 2021

A Datasette instance for searching WebVid-10M

Shell 13 1 Updated Sep 30, 2022

Large-scale text-video dataset. 10 million captioned short videos.

Python 639 40 Updated Aug 14, 2024
Python 144 8 Updated Mar 29, 2025

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Python 3,824 410 Updated Jun 20, 2024
Next
0