dfan (David Fan) / Starred · GitHub
✌️
working hard

Organizations

@princetoneclub @facebookresearch @princetonscioly


Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).

Python 133 8 Updated Apr 29, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,197 63 Updated May 28, 2025

Official code for the CVPR 2025 paper "Navigation World Models".

Python 193 12 Updated Apr 10, 2025

Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning

Python 181 7 Updated Apr 19, 2025
Python 558 38 Updated Apr 12, 2025

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

Python 6,975 242 Updated Jun 6, 2025

GenEval: An object-focused framework for evaluating text-to-image alignment

HTML 292 21 Updated Mar 3, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 8,149 499 Updated May 18, 2025

Load tensorboard event logs as pandas DataFrames for scientific plotting; Supports both PyTorch and TensorFlow

Python 198 3 Updated Aug 16, 2024

FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷

C 10,432 992 Updated Feb 1, 2025
Jupyter Notebook 2 Updated Nov 18, 2022

This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection

Python 81 20 Updated Apr 14, 2023

Unofficial implementation of: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics

Python 555 82 Updated Nov 1, 2021

An end-to-end PyTorch framework for image and video classification

Python 1,604 275 Updated Jun 27, 2024

Inflate DenseNet and ResNet as per I3D with ImageNet weight transfer

Python 150 22 Updated Apr 28, 2021

Rotary Transformer

Python 963 55 Updated Mar 21, 2022

Out of time: automated lip sync in the wild

Python 771 171 Updated Jan 23, 2024

Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples" https://arxiv.org/abs/…

Python 488 67 Updated Apr 28, 2023

Official DeiT repository

Python 4,211 571 Updated Mar 15, 2024

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 6,887 953 Updated Jul 3, 2024

Cross-model active contrastive coding

Python 22 7 Updated Mar 17, 2021

Best Practices, code samples, and documentation for Computer Vision.

Jupyter Notebook 9,695 1,192 Updated Feb 16, 2024

A script to check for vaccine availability at a Safeway near you

Python 2 Updated Apr 16, 2021

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,363 4,921 Updated Jun 7, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 35,409 3,899 Updated Jun 6, 2025

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

Python 719 85 Updated Aug 25, 2021

The AWS Cloud Development Kit is a framework for defining cloud infrastructure in code

TypeScript 12,219 4,156 Updated Jun 6, 2025

TransNet V2: Shot Boundary Detection Neural Network

Python 638 109 Updated Dec 4, 2023