8000 jason0000100007 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View jason0000100007's full-sized avatar

Block or report jason0000100007

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)

Python 237 30 Updated Mar 26, 2025

ConceptAttention: A method for interpreting multi-modal diffusion transformers.

Jupyter Notebook 271 10 Updated Apr 14, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,064 6,293 Updated Jun 10, 2025
Jupyter Notebook 41 5 Updated Dec 13, 2024

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 24,625 2,123 Updated Jun 11, 2025

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,683 2,940 Updated Sep 2, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,625 1,038 Updated Nov 18, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,768 2,506 Updated Aug 12, 2024

这是一个clip-pytorch的模型,可以训练自己的数据集。

Python 230 29 Updated Apr 5, 2023

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 5,272 501 Updated Aug 6, 2024
0