8000 j-min (Jaemin Cho) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View j-min's full-sized avatar

Highlights

  • Pro

Organizations

@PyTorchKR @PyTorchKorea

Block or report j-min

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-source unified multimodal model

Python 3,731 261 Updated May 30, 2025

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.

54,947 16,827 Updated May 31, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,776 775 Updated May 15, 2025
Python 38 4 Updated May 19, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 25,210 2,262 Updated Jun 3, 2025

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 601 36 Updated Oct 16, 2024

PhD Dissertation Template for UNC Computer Science

TeX 5 2 Updated Feb 7, 2023

Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"

Python 95 12 Updated Jan 21, 2024

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,477 381 Updated Jun 4, 2025

Jupyter notebook server extension to proxy web services.

Python 367 151 Updated Jun 2, 2025

PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models

Python 22 2 Updated Jul 22, 2024

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,482 235 Updated May 17, 2025

A reading list of video generation

579 37 Updated Jun 3, 2025

LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR team.

JavaScript 438 37 Updated Feb 11, 2025
Python 3,882 250 Updated Mar 15, 2024

Official implementation of SEED-LLaMA (ICLR 2024).

Python 612 33 Updated Sep 21, 2024

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR 2025 Oral)

Python 440 14 Updated Feb 11, 2025

4M: Massively Multimodal Masked Modeling

Python 1,727 105 Updated Jun 2, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 8,114 498 Updated May 18, 2025

Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)

Python 84 12 Updated Sep 28, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,284 282 Updated May 4, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,549 1,165 Updated Jun 2, 2025

Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)

Python 31 2 Updated Jul 13, 2024

Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data

Python 34 1 Updated Mar 12, 2024

Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)

Python 167 12 Updated May 3, 2024

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Python 226 12 Updated Apr 3, 2024

Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models

131 7 Updated Feb 18, 2025

[CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"

Python 63 10 Updated May 1, 2024
Next
0