8000 JanineCHEN / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View JanineCHEN's full-sized avatar

Block or report JanineCHEN

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

SE-VGAE: Unsupervised Disentangled Representation Learning for Interpretable Architectural Layout Design Graph Generation

Python 2 Updated Mar 23, 2025

A generative speech model for daily dialogue.

Python 36,751 3,989 Updated May 23, 2025

SOTA Open Source TTS

Python 21,656 1,759 Updated Jun 7, 2025

FP4S: Floor plan image segmentation via scribble-based semi-weakly-supervised learning

Python 5 Updated Sep 12, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,528 1,117 Updated May 14, 2025

Curated list of project-based tutorials

231,301 30,217 Updated Aug 15, 2024

This project is established for real-time training of the RWKV model.

Python 49 4 Updated May 17, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 79,495 8,795 Updated Jun 12, 2025

WebUI extension for ControlNet

Python 17,668 2,017 Updated Aug 12, 2024

Stable Diffusion web UI

Python 153,414 28,542 Updated May 3, 2025

Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"

Python 165 6 Updated Jul 6, 2023
Python 1,008 132 Updated Oct 3, 2022

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Python 1,609 154 Updated Jun 9, 2025

[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"

Python 399 31 Updated Nov 10, 2023

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,061 763 Updated Dec 17, 2024

Code for ALBEF: a new vision-language pre-training method

Python 1,663 207 Updated Sep 20, 2022

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,386 1,284 Updated Aug 31, 2024

[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models

287 17 Updated Feb 15, 2025
Python 3 Updated Jul 27, 2023

Lightweight nudity detection

Python 2,062 383 Updated Jul 31, 2024

Official PyTorch implementation of BlobGAN: Spatially Disentangled Scene Representations

Python 173 12 Updated Aug 18, 2022

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Python 997 57 Updated Oct 24, 2024

[ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP

Python 37 3 Updated Jul 11, 2022

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

HTML 4,094 571 Updated May 30, 2023

Pytorch Implementation for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"

Python 28 1 Updated Jul 31, 2022

Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"

Python 37 3 Updated Apr 13, 2022

[NeurIPS'23] Parts of Speech–Grounded Subspaces in Vision-Language Models

Jupyter Notebook 28 2 Updated Feb 25, 2024

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

Jupyter Notebook 320 34 Updated Nov 27, 2022
Next
0