Stars
GRAFT-XPCI: Dataset of synchrotron X-ray images for detection of acute cellular rejection after heart transplantation
The Google Scholar PDF Reader browser extension, now with annotations!
This repo contains codes, scripts, documents, etc, that are deliverables and helpers for the synthetic satellite imagery project from Microsoft AI-for-good lab.
High accuracy RAG for answering questions from scientific documents with citations
SSL4EO-S12: a large-scale dataset for self-supervised learning in Earth observation
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
Official implementation of “LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion”, MICCAI 2023
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>
The multi-domain FAS work with SiW-Mv2 dataset (ECCV 2022 oral)
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
🌟 ChatGenTitle:使用百万arXiv论文信息在LLaMA模型上进行微调的论文题目生成模型
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Dynamic Token Expansion with Continual Transformers, accepted at CVPR 2022
Machine learning metrics for distributed, scalable PyTorch applications.
Image augmentation for machine learning experiments.
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
A PyTorch implementation of EfficientNet
[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
Single-Side Domain Generalization for Face Anti-Spoofing, CVPR2020
[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution
Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers. (ICCV 2021 Oral)
This is an official repository of End-to-end Lane Shape Prediction with Transformers.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Unsupervised Depth Learning in Challenging Indoor Video: Weak Rectification to Rescue
Unsupervised Scale-consistent Depth Learning from Video (IJCV2021 & NeurIPS 2019)
hengxyz / bts
Forked from cleinc/btsFrom Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation