Stars
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A curated list of foundation models for vision and language tasks
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Development repository for the Triton language and compiler
[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
GitHub Action to setup `ssh-agent` with a private key
Collection of papers, datasets, code and other resources for object tracking and detection using deep learning
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Turi Create simplifies the development of custom machine learning models.
Azure Quickstart Templates
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
A curated list of awesome Deep Learning tutorials, projects and communities.