Stars
Tool for generating high-quality synthetic datasets
Tools for merging pretrained large language models.
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Official MegEngine implementation of CREStereo (CVPR 2022 Oral).
[3DV 2024 Oral] DeDoDe 🎶 Detect, Don't Describe --- Describe, Don't Detect, for Local Feature Matching
Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Best Student Paper Award)
[CVPR2023] Official implementation of Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions
Schedule-Free Optimization in PyTorch
Implementation of some unbalanced losses like focal_loss, dice_loss, DSC Loss, GHM Loss, etc.
[IEEE J-BHI-2024] The PyTorch implementation of MASA-TCN
Official repository for the paper "Multimodal emotion recognition with modality-pairwise unsupervised contrastive loss"
Foundational model for human-like, expressive TTS
Deep Multi-Magnification Network for multi-class tissue segmentation of whole slide images
A generative model for programmable protein design
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Hackable and optimized Transformers building blocks, supporting a composable construction.
Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning
Code and Documentation for the first place solution in 2023 Abdominal Trauma Detection Competition hosted by RSNA on Kaggle.
2nd Place Solution for the RSNA 2023 Abdominal Trauma Detection Kaggle Competition
⚡ Finetune Wav2vec 2.0 For Speech Recognition