A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,733 195 Updated Apr 9, 2025

VamosC / CLIP4STR

An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".

Python 142 21 Updated Mar 12, 2025

fh2019ustc / DocTr-Plus

The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.

Python 459 47 Updated Nov 4, 2024

facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,496 616 Updated Feb 21, 2025

Tencent-Hunyuan / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,165 349 Updated Jan 13, 2025

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,428 657 Updated May 31, 2024

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,946 189 Updated May 19, 2025

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,766 1,504 Updated Jun 18, 2025

lichao-sun / Mora

Mora: More like Sora for Generalist Video Generation

Python 1,560 106 Updated Oct 10, 2024

lichao-sun / SoraReview

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

497 20 Updated Mar 21, 2024

csslc / CCSR

Official codes of CCSRv2 and CCSRv1: Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution

Python 543 43 Updated Dec 18, 2024

zsyOAOA / ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)

Python 1,208 63 Updated Dec 31, 2024

Ree1s / IDM

Python 320 23 Updated Sep 16, 2023

Delgan / loguru

Python logging made (stupidly) simple

Python 21,922 737 Updated Jun 15, 2025

seaweedfs / seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC ac…

Go 24,829 2,417 Updated Jun 17, 2025