8000 jhCOR (JUNG JIHYEOK) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View jhCOR's full-sized avatar

Highlights

  • Pro

Block or report jhCOR

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Lightweight PDF Q&A tool powered by RAG (Retrieval-Augmented Generation) with MCP (Model Context Protocol) Support.

Python 10 1 Updated Jun 18, 2025
Python 3,931 371 Updated Jun 13, 2025

Let us control diffusion models!

Python 32,573 2,910 Updated Feb 25, 2024
JavaScript 3,306 1,329 Updated Jun 21, 2024

Sharing scripts and functions for OPUS-PALA article, and LOTUS Software. All functions are usable with agreement from their owner.

MATLAB 74 26 Updated Apr 18, 2024

RF-ULM: Ultrasound Localization Microscopy Learned from Radio-Frequency Wavefronts

Python 28 8 Updated Sep 5, 2024

中国交通警察指挥手势识别 Chinese Traffic Police Gesture Recognizer, pytorch version

Python 98 21 Updated Jan 11, 2022

HAnd Gesture Recognition Image Dataset

Python 790 113 Updated Feb 27, 2025

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,882 381 Updated Mar 14, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,051 794 Updated May 15, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,988 446 Updated Aug 7, 2024
Jupyter Notebook 20 2 Updated Apr 5, 2024

[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering

HTML 1,085 68 Updated Apr 29, 2025

The official Talk2Car dataset repo

Python 82 8 Updated May 29, 2025

[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.

Python 193 4 Updated Nov 1, 2024

This is the official implementation of our publication "Deep learning enables fast and dense single-molecule localization with high accuracy" (Nature Methods)

Python 105 28 Updated Jun 22, 2023

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Python 34 1 Updated Nov 10, 2024

Official repository of NeXt-TDNN for speaker verification

Python 72 7 Updated Oct 10, 2024

An open-sourced end-to-end VLM-based GUI Agent

Python 972 75 Updated Apr 4, 2025

An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.

Jupyter Notebook 1,207 127 Updated Jul 23, 2024

Grok open release

Python 50,291 8,356 Updated Aug 30, 2024

Integer FFT(Fast Fourier Transform) in Python

Python 12 4 Updated Nov 14, 2023

MiniWoB++: a web interaction benchmark for reinforcement learning

HTML 323 53 Updated May 5, 2025

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,670 115 Updated Aug 20, 2024

An Autonomous LLM Agent for Complex Task Solving

Python 8,372 884 Updated Aug 12, 2024

DeepSeek Coder: Let the Code Write Itself

Python 21,735 2,499 Updated May 21, 2024

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,543 408 Updated Jun 19, 2025

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Python 447 34 Updated May 19, 2025
Next
2988
0