8000 Cherishnoobs (TeaQwQTea) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Cherishnoobs's full-sized avatar
🎯
less is more.
🎯
less is more.

Highlights

  • Pro

Organizations

@cczu-osa

Block or report Cherishnoobs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 185 8 Updated Apr 14, 2025

Audio Large Language Models

Python 529 30 Updated Mar 9, 2025

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

HTML 325 33 Updated Dec 26, 2023

Maximize the potential of Cursor best practices for Automatic Rule and Custom Agent Generation and Agile Workflows

Batchfile 1,758 249 Updated Apr 25, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,544 1,316 Updated May 17, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,678 769 Updated May 19, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,777 277 Updated May 15, 2025
Jupyter Notebook 3 Updated Feb 18, 2025

Fully open reproduction of DeepSeek-R1

Python 24,497 2,254 Updated May 21, 2025

🎨 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reasoning (based on LaTeX AST).

Jupyter Notebook 1,220 236 Updated Jun 11, 2024

Train a 1B LLM with 1T tokens from scratch by personal

Jupyter Notebook 654 69 Updated Apr 27, 2025

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,037 355 Updated Aug 7, 2024

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

Jupyter Notebook 7,985 512 Updated Apr 29, 2025
Python 3,837 359 Updated May 6, 2025

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,299 102 Updated May 4, 2025

🔥🔥MLVU: Multi-task Long Video Understanding Benchmark

Python 199 1 Updated Mar 24, 2025

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,239 262 Updated Jan 18, 2025

Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'

56 Updated Dec 26, 2024

This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 188 19 Updated Feb 23, 2025

[ICCV2023] DETRDistill: A Universal Knowledge Distillation Framework for DETR-families

Jupyter Notebook 53 6 Updated Nov 3, 2023

[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences

Jupyter Notebook 39 1 Updated Mar 11, 2025

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,401 60 Updated Apr 28, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,217 377 Updated Jan 27, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,566 660 Updated Feb 10, 2025

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Python 537 135 Updated Dec 21, 2022

Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"

Python 101 5 Updated Sep 11, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,351 463 Updated Nov 6, 2024

搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)

Python 3,378 392 Updated May 21, 2025

Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation

Python 43 1 Updated Apr 24, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,525 757 Updated May 15, 2025
Next
0