Stars
Lightweight PDF Q&A tool powered by RAG (Retrieval-Augmented Generation), with MCP (Model Context Protocol) support.
Scripts and functions shared for the OPUS-PALA article and the LOTUS software. All functions are usable with their owners' agreement.
RF-ULM: Ultrasound Localization Microscopy Learned from Radio-Frequency Wavefronts
Chinese Traffic Police Gesture Recognizer (中国交通警察指挥手势识别), PyTorch version
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.
This is the official implementation of our publication "Deep learning enables fast and dense single-molecule localization with high accuracy" (Nature Methods)
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Official repository of NeXt-TDNN for speaker verification
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
MiniWoB++: a web interaction benchmark for reinforcement learning
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
An Autonomous LLM Agent for Complex Task Solving
DeepSeek Coder: Let the Code Write Itself
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks.
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"