thuthesis Public
Forked from tuna/thuthesis
LaTeX Thesis Template for Tsinghua University
TeX LaTeX Project Public License v1.3c Updated May 21, 2025
kinetics-downloader Public
Forked from piaxar/kinetics-downloader
Simple tool to download videos from kinetics dataset.
Python Updated Apr 1, 2025
Swin-Transformer-Object-Detection Public
Forked from SwinTransformer/Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
Python Apache License 2.0 Updated Nov 27, 2024
WebTextCleaner Public
Forked from shjwudp/c4-dataset-script
Inspired by Google's C4, a series of Colossal Clean data cleaning scripts focused on CommonCrawl data processing, including Chinese data processing and the cleaning methods in MassiveText.
Python MIT License Updated Nov 27, 2024
deduplicate-text-datasets Public
Forked from google-research/deduplicate-text-datasets
Shell Apache License 2.0 Updated Nov 27, 2024
mmsegmentation Public
Forked from open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Python Apache License 2.0 Updated Nov 27, 2024
Video-Swin-Transformer Public
Forked from SwinTransformer/Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
dino Public
Forked from facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Python Apache License 2.0 Updated Nov 27, 2024
datacomp Public
Forked from mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
Python Other Updated Mar 28, 2024
Xwin-LM Public
Forked from Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
Python Updated Nov 15, 2023
RedPajama-Data Public
Forked from togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Python Apache License 2.0 Updated Apr 18, 2023
text-dedup Public
Forked from ChenghaoMou/text-dedup
All-in-one text de-duplication
Jupyter Notebook Apache License 2.0 Updated Apr 7, 2023
bigcode-dataset Public
Forked from bigcode-project/bigcode-dataset
Jupyter Notebook Apache License 2.0 Updated Mar 26, 2023
fairseq Public
Forked from facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License Updated Sep 15, 2022
unilm_official Public
Forked from microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License Updated Sep 3, 2022
deit Public
Forked from facebookresearch/deit
Official DeiT repository
Python Apache License 2.0 Updated Jul 30, 2022
apex Public
Forked from NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
esvit Public
Forked from microsoft/esvit
EsViT: Efficient self-supervised Vision Transformers
Python MIT License Updated May 12, 2022
coco-caption Public
Forked from LuoweiZhou/coco-caption
kdexd/coco-caption@de6f385
Jupyter Notebook Other Updated Apr 17, 2022
Dassl.pytorch Public
Forked from KaiyangZhou/Dassl.pytorch
A PyTorch toolbox for domain adaptation and semi-supervised learning.
Python MIT License Updated Nov 12, 2021
DownloadConceptualCaptions Public
Forked from igorbrigadir/DownloadConceptualCaptions
Reliably download millions of images efficiently
Jupyter Notebook MIT License Updated Apr 15, 2021
CLIP-pytorch Public archive
A non-JIT implementation / replication of OpenAI's CLIP in PyTorch
bottom-up-attention Public
Forked from einsiedler0408/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Jupyter Notebook MIT License Updated Jan 14, 2021
Active-Perception Public
Deep Reinforcement Learning for Robotic Pushing and Picking in Cluttered Environment