-
Meituan
- Beijing
-
16:39
(UTC +08:00)
Highlights
- Pro
Starred repositories
The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Papers about training data quality management for ML models.
Large-Scale Multimodal Dataset of Astronomical Data
LLM2CLIP makes SOTA pretrained CLIP model more SOTA ever.
A collection for math word problem (MWP) works, including datasets, algorithms and so on.
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
A generative speech model for daily dialogue.
🚀 Power Your World with AI - Explore, Extend, Empower.
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
CoreNet: A library for training deep neural networks
A local chatbot fine-tuned by bilibili user comments.
Python tools for WhisperKit: Model conversion, optimization and evaluation
A high-throughput and memory-efficient inference and serving engine for LLMs
Instant voice cloning by MIT and MyShell. Audio foundation model.
Bark Voice Cloning and Voice Cloning for Chinese Speech
Easily train a good VC model with voice data <= 10 mins!
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
List of Computer Science courses with video lectures.