Starred repositories
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
情感分析、文本分类、词典、bayes、sentiment analysis、TextCNN、classification、tensorflow、BERT、CNN、text classification
[ICML2025] VARSR: Visual Autogressive Modeling for Image Super Resolution
Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023]
[CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from V…
PyTorch codes for "Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors", ACM MM2022 (Oral)
[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior
[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filte…
RetinaFace (Single-stage Dense Face Localisation in the Wild, 2019) implemented (ResNet50, MobileNetV2 trained on single GPU) in Tensorflow 2.0+. This is an unofficial implementation. With Colab.
State-of-the-art 2D and 3D Face Analysis Project
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"
A highly optimized LLM inference acceleration engine for Llama and its variants.
ESRGAN (Enhanced Super-Resolution Generative Adversarial Networks, published in ECCV 2018) implemented in Tensorflow 2.0+. This is an unofficial implementation. With Colab.
The first high-definition cloth retouching dataset CRHD-3K.
[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
[ACM MM'24 Oral] RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining