8000 LimGeunTaekk (Lim Geun Taek) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View LimGeunTaekk's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report LimGeunTaekk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 3,839 359 Updated May 6, 2025

[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding

Python 62 4 Updated Apr 22, 2025

[ACL 2025 Main] EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 267 20 Updated May 16, 2025
Python 16 1 Updated May 12, 2025

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Python 2,181 310 Updated May 16, 2025

Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation

27 1 Updated Apr 28, 2025

VELOCITI Benchmark Evaluation and Visualisation Code

Python 6 Updated Apr 18, 2025

Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs

Python 777 42 Updated Apr 27, 2025

ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"

Python 87 4 Updated May 16, 2025

Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).

Python 125 8 Updated Apr 29, 2025

[CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"

Python 423 71 Updated Apr 9, 2025

[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Python 75 3 Updated Apr 12, 2025

A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding

8 Updated Jan 3, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,079 813 Updated Aug 12, 2024
108 1 Updated Mar 14, 2024

Code for "Predicate Hierarchies Improve Few-Shot State Classification" , ICLR 2025

Python 4 Updated Mar 24, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,143 618 Updated Apr 27, 2025

[ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Python 33 1 Updated Apr 7, 2025

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 367 34 Updated May 8, 2025

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 580 32 Updated Oct 6, 2024
Python 44 1 Updated Apr 5, 2025

Easily create large video dataset from video urls

Python 610 70 Updated Jul 30, 2024

[CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"

Python 8 Updated Apr 25, 2025
Python 5 2 Updated Oct 15, 2021

Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"

Python 11 Updated Feb 14, 2025
Python 333 11 Updated Jan 27, 2024

Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"

Python 60 8 Updated Apr 29, 2025

Awesome papers & datasets specifically focused on long-term videos.

275 12 Updated Nov 15, 2024

High Quality Video Reasoning Segmentation

Python 23 2 Updated May 6, 2025
Next
0