8000 morninghut (morninghut) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View morninghut's full-sized avatar
🎯
Focusing
🎯
Focusing
  • China

Highlights

  • Pro

Block or report morninghut

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

📚 从零开始的大语言模型原理与实践教程

5,477 384 Updated Jun 28, 2025

An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPRW 2024].

Python 218 22 Updated Jun 6, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 48,653 4,040 Updated Jul 2, 2025

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,077 360 Updated Aug 7, 2024

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

Python 699 83 Updated Apr 15, 2022

Papers about Explainable AI (Deep Learning-based)

25 2 Updated Jun 28, 2025

GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)

Python 193 31 Updated May 9, 2023

An Arena-style Automated Evaluation Benchmark for Detailed Captioning

Python 50 2 Updated Jun 1, 2025
Python 8 Updated Jan 5, 2025

[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation

Jupyter Notebook 1,628 140 Updated Oct 3, 2024

RIME 词库增强

452 52 Updated Dec 26, 2020

PiliPlus

Dart 3,231 79 Updated Jul 1, 2025

Access large language models from the command-line

Python 8,757 532 Updated Jun 20, 2025

A final sanity checklist to help your CS paper get accepted, not desk rejected.

1,343 127 Updated May 7, 2025

Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)

Python 1,867 129 Updated Jun 7, 2025

An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation

Python 124 3 Updated Jan 15, 2024
JavaScript 7 Updated May 3, 2024

集找番、追番、看番的一站式弹幕追番平台,云收藏同步 (Bangumi),离线缓存,BitTorrent,弹幕云过滤。100% Kotlin/Compose Multiplatform

Kotlin 8,009 196 Updated Jun 29, 2025

Minimal and annotated implementations of key ideas from modern deep learning research.

Python 907 78 Updated Jun 29, 2025

中文翻译的 Hands-On-Large-Language-Models (hands-on-llms),动手学习大模型

Jupyter Notebook 1,104 125 Updated Jun 21, 2025

Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives

10 Updated Mar 23, 2025

CLIPScore EMNLP code

Python 226 26 Updated Dec 16, 2022

11 Lessons to Get Started Building AI Agents

Jupyter Notebook 28,539 7,987 Updated Jun 17, 2025

12 Weeks, 24 Lessons, 63E6 AI for All!

Jupyter Notebook 38,356 7,244 Updated Jun 25, 2025

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 830 54 Updated Jun 3, 2025

b站硬核会员考试llm自动答题

Python 10 1 Updated Jan 8, 2025

Brain tumor images classification with ResNet, EfficientNet, EfficientNet_V2 and Compact Convolutional Transformers architectures with PyTorch

Python 10 Updated Jan 5, 2023

Chrome 多窗口管理器是一款Chrome浏览器多窗口管理工具。它可以帮助用户轻松管理多个 Chrome 窗口,实现窗口批量打开、排列以及之间的同步操作,大大提高交互效率。

Python 493 253 Updated May 12, 2025

Uses an llm to generate ffmpeg commands

Shell 482 14 Updated Jan 19, 2025

Training A Small Emotional Vision Language Model for Visual Art Comprehension

Python 16 Updated Jul 26, 2024
Next
0