Stars
GenEval: An object-focused framework for evaluating text-to-image alignment
CogView4, CogView3-Plus and CogView3(ECCV 2024)
DaVinci toolkit aims at high-quality multimedia content creation which plays an important role in modern work and life. The targeted features can include both low-level image and video enhancement …
Example of converting HDR video to SDR in Android.在Android如何实现HDR视频转SDR的实践
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
VideoGen-Eval: Agent-based System for Video Generation Evaluation
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Processed / Cleaned Data for Paper Copilot
Sora Prompt Collection, a repository dedicated to inspiring AI-driven video creation with Sora.
The official GitHub page for the survey paper "A Survey of Large Language Models".
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
[CSUR] A Survey on Video Diffusion Models
优质稳定的OpenAI的API接口-For企业和开发者。OpenAI的api proxy,支持ChatGPT的API调用,支持openai的API接口,支持:gpt-4,gpt-3.5。不需要openai Key, 不需要买openai的账号,不需要美元的银行卡,通通不用的,直接调用就行,稳定好用!!智增增
DCP-o-matic repository: main is the development branch (where v2.18.x versions are being made)
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
A linear estimator on top of clip to predict the aesthetic quality of pictures
CLIP+MLP Aesthetic Score Predictor
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
对 ansj 编写的 Word2VEC_java 的进一步包装,同时实现了常用的词语相似度和句子相似度计算。