vimicro
Shanghai
Stars
Open-Sora: Democratizing Efficient Video Production for All
FlagPerf is an open-source software platform for benchmarking AI chips.
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
High-Resolution Image Synthesis with Latent Diffusion Models
The code for "Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions (AAAI 2022)"
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Fay is an MCP and agent framework that helps digital humans (2.5D, 3D, mobile, PC, web) or large language models (OpenAI-compatible, DeepSeek) connect to business systems.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
An arbitrary face-swapping framework on images and videos with one single trained model!
PyTorch implementation of CVPR 2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"
DeepFaceLab is the leading software for creating deepfakes.
Official PyTorch implementation of "CLIPstyler: Image Style Transfer with a Single Text Condition" (CVPR 2022)
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
pbaylies / stylegan-encoder
Forked from Puzer/stylegan-encoder
StyleGAN Encoder - converts real images to latent space
Simplest working implementation of StyleGAN2, a state-of-the-art generative adversarial network, in PyTorch, enabling everyone to experience disentanglement
StyleGAN2 - Official TensorFlow Implementation
Here is a series of face generators based on StyleGAN2
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)